MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling Paper • 2606.13473 • Published 14 days ago • 90
Benchmarking Visual State Tracking in Multimodal Video Understanding Paper • 2606.03920 • Published 23 days ago • 50
4DThinker: Thinking with 4D Imagery for Dynamic Spatial Understanding Paper • 2605.05997 • Published May 7 • 18
LLaVA-OneVision-2: Towards Next-Generation Perceptual Intelligence Paper • 2605.25979 • Published May 25 • 27
From Pixels to Words -- Towards Native One-Vision Models at Scale Paper • 2605.28820 • Published 29 days ago • 75