MosaicMem: Hybrid Spatial Memory for Controllable Video World Models Paper • 2603.17117 • Published 4 days ago • 82
Test-Time Training with KV Binding Is Secretly Linear Attention Paper • 2602.21204 • Published 26 days ago • 30
Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding Paper • 2409.03757 • Published Sep 5, 2024 • 3
Track, Inpaint, Resplat: Subject-driven 3D and 4D Generation with Progressive Texture Infilling Paper • 2510.23605 • Published Oct 27, 2025 • 6
DenseDPO: Fine-Grained Temporal Preference Optimization for Video Diffusion Models Paper • 2506.03517 • Published Jun 4, 2025 • 13
Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion Models Paper • 2411.05005 • Published Nov 7, 2024 • 13
Floating No More: Object-Ground Reconstruction from a Single Image Paper • 2407.18914 • Published Jul 26, 2024 • 20
Multi-task View Synthesis with Neural Radiance Fields Paper • 2309.17450 • Published Sep 29, 2023 • 3