Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning Download Abstract: Setting up a well-designed reward fu … Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning 查看全文 »