Causal-rCM: A Unified Teacher-Forcing and Self-Forcing Open Recipe for Autoregressive Diffusion Distillation in Streaming Video Generation and Interactive World Models Paper • 2606.25473 • Published 2 days ago • 17
Qwen-AgentWorld: Language World Models for General Agents Paper • 2606.24597 • Published 3 days ago • 110
ENPIRE: Agentic Robot Policy Self-Improvement in the Real World Paper • 2606.19980 • Published 8 days ago • 14
EgoCS-400K: An Egocentric Gameplay Dataset for World Models Paper • 2606.18180 • Published 10 days ago • 15
GameCraft-Bench: Can Agents Build Playable Games End-to-End in a Real Game Engine? Paper • 2606.17861 • Published 10 days ago • 55
Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients Paper • 2606.18216 • Published 10 days ago • 61
ActiveMimic: Egocentric Video Pretraining with Active Perception Paper • 2606.06194 • Published 22 days ago • 2
MBench: A Comprehensive Benchmark on Memory Capability for Video World Models Paper • 2606.00793 • Published 18 days ago • 11
Beyond Scalar Rewards by Internalizing Reasoning into Score Distributions Paper • 2606.09076 • Published 18 days ago • 61
LoomVideo: Unifying Multimodal Inputs into Video Generation and Editing Paper • 2606.06042 • Published 22 days ago • 24
Echo-Infinity: Learning Evolving Memory for Real-Time Infinite Video Generation Paper • 2606.04527 • Published 23 days ago • 28
Humanoid-GPT: Scaling Data and Structure for Zero-Shot Motion Tracking Paper • 2606.03985 • Published 24 days ago • 41
VideoMLA: Low-Rank Latent KV Cache for Minute-Scale Autoregressive Video Diffusion Paper • 2605.30351 • Published 29 days ago • 26
Representation Forcing for Bottleneck-Free Unified Multimodal Models Paper • 2605.31604 • Published 28 days ago • 61