ICLR

Proactive Multi-Camera Collaboration for 3D Human Pose Estimation

Abstract: This paper presents a multi-agent reinforcement learning (MARL) scheme for proactive Multi-Camera Collaboration in 3D Human Pose Estimation in dynamic human crowds. Traditional fixed-viewpoint multi-camera solutions for human motion capture (MoCap) are limited in capture space and susceptible to dynamic occlusions. Active camera approaches proactively control camera poses to find optimal viewpoints for …

A Minimalist Dataset for Systematic Generalization of Perception, Syntax, and Semantics

Abstract: Inspired by humans’ exceptional ability to master arithmetic and generalize to new problems, we present a new dataset, HINT, to examine machines’ capability of learning generalizable concepts at three levels: perception, syntax, and semantics. In HINT, machines are tasked with learning how concepts are perceived from raw signals such as images (i.e., perception), …

Unsupervised Object-Centric Learning with Bi-level Optimized Query Slot Attention

Abstract: The ability to decompose complex natural scenes into meaningful object-centric abstractions lies at the core of human perception and reasoning. In the recent culmination of unsupervised object-centric learning, the Slot-Attention module has played an important role with its simple yet effective design and fostered many powerful variants. These methods, however, have been exceedingly …
