Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Generation Paper • 2602.02214 • Published 2 days ago • 22
Language-based Trial and Error Falls Behind in the Era of Experience Paper • 2601.21754 • Published 6 days ago • 16
Towards Pixel-Level VLM Perception via Simple Points Prediction Paper • 2601.19228 • Published 8 days ago • 16
Motion 3-to-4: 3D Motion Reconstruction for 4D Synthesis Paper • 2601.14253 • Published 15 days ago • 10
ActionMesh: Animated 3D Mesh Generation with Temporal 3D Diffusion Paper • 2601.16148 • Published 13 days ago • 12
V-DPM: 4D Video Reconstruction with Dynamic Point Maps Paper • 2601.09499 • Published 21 days ago • 9
3AM: Segment Anything with Geometric Consistency in Videos Paper • 2601.08831 • Published 22 days ago • 34
LTX-2: Efficient Joint Audio-Visual Foundation Model Paper • 2601.03233 • Published 29 days ago • 146
NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos Paper • 2601.00393 • Published Jan 1 • 130
Dream2Flow: Bridging Video Generation and Open-World Manipulation with 3D Object Flow Paper • 2512.24766 • Published Dec 31, 2025 • 9
Pretraining Frame Preservation in Autoregressive Video Memory Compression Paper • 2512.23851 • Published Dec 29, 2025 • 24
SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time Paper • 2512.25075 • Published Dec 31, 2025 • 15
LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation Paper • 2512.23576 • Published Dec 29, 2025 • 65
Yume-1.5: A Text-Controlled Interactive World Generation Model Paper • 2512.22096 • Published Dec 26, 2025 • 60
Spatia: Video Generation with Updatable Spatial Memory Paper • 2512.15716 • Published Dec 17, 2025 • 33
DreaMontage: Arbitrary Frame-Guided One-Shot Video Generation Paper • 2512.21252 • Published Dec 24, 2025 • 35