ICLR
Proactive Multi-Camera Collaboration for 3D Human Pose Estimation
Download Abstract: This paper presents a multi-agent reinforcement learning (MARL) scheme for proactive Multi-Camera Collaboration in 3D Human Pose Estimation in dynamic human crowds. Traditional fixed-viewpoint multi-camera solutions for human motion capture (MoCap) are limited in capture space and susceptible to dynamic occlusions. Active camera approaches proactively control camera poses to find optimal viewpoints for …
Proactive Multi-Camera Collaboration for 3D Human Pose Estimation Read More »
SQA3D: Situated Question Answering in 3D Scenes
Download Abstract: We propose a new task to benchmark scene understanding of embodied agents: Situated Question Answering in 3D Scenes (SQA3D). Given a scene context (e.g., 3D scan), SQA3D requires the tested agent to first understand its situation (position, orientation, etc.) in the 3D scene as described by text, then reason about its surrounding environment …
A Minimalist Dataset for Systematic Generalization of Perception, Syntax, and Semantics
Download Abstract: Inspired by humans’ exceptional ability to master arithmetic and generalize to new problems, we present a new dataset, HINT, to examine machines’ capability of learning generalizable concepts at three levels: perception, syntax, and semantics. In HINT, machines are tasked with learning how concepts are perceived from raw signals such as images (i.e., perception), …
A Minimalist Dataset for Systematic Generalization of Perception, Syntax, and Semantics Read More »
Unsupervised Object-Centric Learning with Bi-level Optimized Query Slot Attention
Download Abstract: The ability to decompose complex natural scenes into meaningful object-centric abstractions lies at the core of human perception and reasoning. In the recent culmination of unsupervised object-centric learning, the Slot-Attention module has played an important role with its simple yet effective design and fostered many powerful variants. These methods, however, have been exceedingly …
Unsupervised Object-Centric Learning with Bi-level Optimized Query Slot Attention Read More »