MVTrack4Gen: Multi-View Point Tracking as Geometric Supervision for 4D Video Generation • Paper • 2606.26087 • Published 2 days ago • 29
CAMEO: Correspondence-Attention Alignment for Multi-View Diffusion Models • Paper • 2512.03045 • Published Dec 2, 2025 • 3
CANVAS: A Benchmark for Vision-Language Models on Tool-Based User Interface Design • Paper • 2511.20737 • Published Nov 25, 2025 • 3
Vision-aligned Latent Reasoning for Multi-modal Large Language Model • Paper • 2602.04476 • Published Feb 4 • 14