Native Active Perception as Reasoning for Omni-Modal Understanding Paper • 2606.19341 • Published 9 days ago • 17
ACE-Ego-0: Unifying Egocentric Human and Robotic Data for VLA Pretraining Paper • 2606.17200 • Published 11 days ago • 49
MoVerse: Real-Time Video World Modeling with Panoramic Gaussian Scaffold Paper • 2606.13376 • Published 15 days ago • 14
World Model Self-Distillation: Training World Models to Solve General Tasks Paper • 2606.12072 • Published 16 days ago • 14
OpenHA Collection A Series of Open-Source Hierarchical Agentic Models & Datasets in Minecraft • 10 items • Updated Sep 21, 2025 • 3
Chat2Workflow: A Benchmark for Generating Executable Visual Workflows with Natural Language Paper • 2604.19667 • Published Apr 21 • 23