菜单

Research
- Research Index
- Demo
Blog
About
Career

NeurIPS

Interactive Visual Reasoning Under Uncertainty

Active Reasoning in an Open-World Environment

Evaluating and Inducing Personality in Pre-trained Language Models

Learning non-Markovian Decision-Making from State-only Sequences

Multi-Agent Reinforcement Learning is A Sequence Modeling Problem

Download Abstract: Large sequence model (SM) such as GP …

Multi-Agent Reinforcement Learning is A Sequence Modeling Problem 查看全文 »

Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning

Download Abstract: Setting up a well-designed reward fu …

Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning 查看全文 »

A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning

Download Abstract: Gradient-based Meta-RL (GMRL) refers …

A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning 查看全文 »

A Unified Diversity Measure for Multiagent Reinforcement Learning

Download Abstract: Promoting behavioural diversity is o …

A Unified Diversity Measure for Multiagent Reinforcement Learning 查看全文 »

Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation

Download Abstract: Learning new task-specific skills fr …

Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation 查看全文 »

Constrained Update Projection Approach to Safe Policy Optimization

Download Abstract: Safe reinforcement learning (RL) stu …

Constrained Update Projection Approach to Safe Policy Optimization 查看全文 »

文章导航

1 2 后一页 →

Copyright © 2023 BIGAI:Beijing Institute for General Artificial Intelligence

Facebook-f Twitter Google-plus-g Pinterest

Scroll to Top