NeurIPS
Multi-Agent Reinforcement Learning is A Sequence Modeling Problem
Abstract: Large sequence model (SM) such as GP …
Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning
Abstract: Setting up a well-designed reward fu …
A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning
Abstract: Gradient-based Meta-RL (GMRL) refers …
A Unified Diversity Measure for Multiagent Reinforcement Learning
Abstract: Promoting behavioural diversity is o …
Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation
Abstract: Learning new task-specific skills fr …
Constrained Update Projection Approach to Safe Policy Optimization
Abstract: Safe reinforcement learning (RL) stu …