Repurposing Geometric Foundation Models for Multi-view Diffusion Paper • 2603.22275 • Published 3 days ago • 40
Grounding World Simulation Models in a Real-World Metropolis Paper • 2603.15583 • Published 10 days ago • 148
WorldCam: Interactive Autoregressive 3D Gaming Worlds with Camera Pose as a Unifying Geometric Representation Paper • 2603.16871 • Published 9 days ago • 59
Visual Representation Alignment for Multimodal Large Language Models Paper • 2509.07979 • Published Sep 9, 2025 • 84
Exploring Conditions for Diffusion models in Robotic Control Paper • 2510.15510 • Published Oct 17, 2025 • 40
Lookahead Anchoring: Preserving Character Identity in Audio-Driven Human Animation Paper • 2510.23581 • Published Oct 27, 2025 • 42