Repurposing Geometric Foundation Models for Multi-view Diffusion Paper • 2603.22275 • Published 1 day ago • 24
SpatialBoost: Enhancing Visual Representation through Language-Guided Reasoning Paper • 2603.22057 • Published 1 day ago • 39
Running GLD - Geometric Latent Diffusion 🌐 Generate new views and 3D point cloud from multi‑view images
WorldCam: Interactive Autoregressive 3D Gaming Worlds with Camera Pose as a Unifying Geometric Representation Paper • 2603.16871 • Published 7 days ago • 58
Grounding World Simulation Models in a Real-World Metropolis Paper • 2603.15583 • Published 8 days ago • 145
Lookahead Anchoring: Preserving Character Identity in Audio-Driven Human Animation Paper • 2510.23581 • Published Oct 27, 2025 • 42
Visual Representation Alignment for Multimodal Large Language Models Paper • 2509.07979 • Published Sep 9, 2025 • 84