WorldReasonBench: Human-Aligned Stress Testing of Video Generators as Future World-State Predictors Paper • 2605.10434 • Published 3 days ago • 27
World Model for Robot Learning: A Comprehensive Survey Paper • 2605.00080 • Published 14 days ago • 14
World Action Models: The Next Frontier in Embodied AI Paper • 2605.12090 • Published 2 days ago • 54
HumanNet: Scaling Human-centric Video Learning to One Million Hours Paper • 2605.06747 • Published 7 days ago • 48
ExoActor: Exocentric Video Generation as Generalizable Interactive Humanoid Control Paper • 2604.27711 • Published 14 days ago • 41
Map2World: Segment Map Conditioned Text to 3D World Generation Paper • 2605.00781 • Published 13 days ago • 25
UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors Paper • 2605.00658 • Published 13 days ago • 81
Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling Paper • 2604.28185 • Published 14 days ago • 89
GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents Paper • 2604.26752 • Published 15 days ago • 106
From Skills to Talent: Organising Heterogeneous Agents as a Real-World Company Paper • 2604.22446 • Published 20 days ago • 121
A Survey on LLM-based Conversational User Simulation Paper • 2604.24977 • Published 17 days ago • 8
PhyCo: Learning Controllable Physical Priors for Generative Motion Paper • 2604.28169 • Published 14 days ago • 13
MoCapAnything V2: End-to-End Motion Capture for Arbitrary Skeletons Paper • 2604.28130 • Published 14 days ago • 22
Synthetic Computers at Scale for Long-Horizon Productivity Simulation Paper • 2604.28181 • Published 14 days ago • 18
Length Value Model: Scalable Value Pretraining for Token-Level Length Modeling Paper • 2604.27039 • Published 15 days ago • 24
Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation Paper • 2604.24763 • Published 17 days ago • 70
AutoResearchBench: Benchmarking AI Agents on Complex Scientific Literature Discovery Paper • 2604.25256 • Published 16 days ago • 29
DV-World: Benchmarking Data Visualization Agents in Real-World Scenarios Paper • 2604.25914 • Published 16 days ago • 41
Building a Precise Video Language with Human-AI Oversight Paper • 2604.21718 • Published 22 days ago • 16