菜单

Research
- Research Index
- Demo
Blog
About
Career

NeurIPS

JARVIS-1: Open-World Multi-task Agents with Memory-Augmented Multimodal Language Models

An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding

PhyRecon: Physically Plausible Neural Scene Reconstruction

Interactive Visual Reasoning Under Uncertainty

Active Reasoning in an Open-World Environment

Evaluating and Inducing Personality in Pre-trained Language Models

Learning non-Markovian Decision-Making from State-only Sequences

Multi-Agent Reinforcement Learning is A Sequence Modeling Problem

Download Abstract: Large sequence model (SM) such as GP …

Multi-Agent Reinforcement Learning is A Sequence Modeling Problem 查看全文 »

Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning

Download Abstract: Setting up a well-designed reward fu …

Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning 查看全文 »

A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning

Download Abstract: Gradient-based Meta-RL (GMRL) refers …

A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning 查看全文 »

文章分页

← 前一页 1 2 3 4 后一页 →

Copyright © 2023 BIGAI:Beijing Institute for General Artificial Intelligence

Facebook-f Twitter Google-plus-g Pinterest

Scroll to Top