NeurIPS
Multi-Agent Reinforcement Learning is A Sequence Modeling Problem
Abstract: Large sequence model (SM) such as GP …
Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning
Abstract: Setting up a well-designed reward fu …
A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning
Abstract: Gradient-based Meta-RL (GMRL) refers …