Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models Paper • 2603.17051 • Published 11 days ago • 105
WorldCam: Interactive Autoregressive 3D Gaming Worlds with Camera Pose as a Unifying Geometric Representation Paper • 2603.16871 • Published 11 days ago • 60
MOOSE-Star: Unlocking Tractable Training for Scientific Discovery by Breaking the Complexity Barrier Paper • 2603.03756 • Published 24 days ago • 89
Mode Seeking meets Mean Seeking for Fast Long Video Generation Paper • 2602.24289 • Published 29 days ago • 41
Solaris: Building a Multiplayer Video World Model in Minecraft Paper • 2602.22208 • Published about 1 month ago • 28
LTX-2: Efficient Joint Audio-Visual Foundation Model Paper • 2601.03233 • Published Jan 6 • 172
StoryMem: Multi-shot Long Video Storytelling with Memory Paper • 2512.19539 • Published Dec 22, 2025 • 19
In-Video Instructions: Visual Signals as Generative Control Paper • 2511.19401 • Published Nov 24, 2025 • 32
Video-As-Prompt: Unified Semantic Control for Video Generation Paper • 2510.20888 • Published Oct 23, 2025 • 50
MoGA: Mixture-of-Groups Attention for End-to-End Long Video Generation Paper • 2510.18692 • Published Oct 21, 2025 • 41
Stable Video Infinity: Infinite-Length Video Generation with Error Recycling Paper • 2510.09212 • Published Oct 10, 2025 • 18
Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation Paper • 2510.08673 • Published Oct 9, 2025 • 127
Diffusion Transformers with Representation Autoencoders Paper • 2510.11690 • Published Oct 13, 2025 • 170
STream3R: Scalable Sequential 3D Reconstruction with Causal Transformer Paper • 2508.10893 • Published Aug 14, 2025 • 31