Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models Paper • 2601.19834 • Published 3 days ago • 24
PAN: A World Model for General, Interactable, and Long-Horizon World Simulation Paper • 2511.09057 • Published Nov 12, 2025 • 79
Depth Anything 3: Recovering the Visual Space from Any Views Paper • 2511.10647 • Published Nov 13, 2025 • 98
Bee: A High-Quality Corpus and Full-Stack Suite to Unlock Advanced Fully Open MLLMs Paper • 2510.13795 • Published Oct 15, 2025 • 58
RLVR-World: Training World Models with Reinforcement Learning Paper • 2505.13934 • Published May 20, 2025 • 16
Vid2World: Crafting Video Diffusion Models to Interactive World Models Paper • 2505.14357 • Published May 20, 2025 • 27