FineWeb: decanting the web for the finest text data at scale 🍷 • Explore and download the FineWeb web‑text dataset • 1.34k
GigaWorld-Policy: An Efficient Action-Centered World-Action Model Paper • 2603.17240 • Published Mar 18 • 26
World-R1: Reinforcing 3D Constraints for Text-to-Video Generation Paper • 2604.24764 • Published 11 days ago • 116
Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation Paper • 2604.24763 • Published 11 days ago • 68
ReVSI: Rebuilding Visual Spatial Intelligence Evaluation for Accurate Assessment of VLM 3D Reasoning Paper • 2604.24300 • Published 11 days ago • 65
Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond Paper • 2604.22748 • Published 14 days ago • 224
A Comprehensive Survey of Mixture-of-Experts: Algorithms, Theory, and Applications Paper • 2503.07137 • Published Mar 10, 2025 • 2
AnyRecon: Arbitrary-View 3D Reconstruction with Video Diffusion Model Paper • 2604.19747 • Published 17 days ago • 39
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model Paper • 2604.20796 • Published 16 days ago • 239