Confidence-Aware Tool Orchestration for Robust Video Understanding Paper • 2606.26904 • Published 2 days ago • 6
When and How Much to Imagine: Adaptive Test-Time Scaling with World Models for Visual Spatial Reasoning Paper • 2602.08236 • Published Feb 9 • 9
Reliable and Responsible Foundation Models: A Comprehensive Survey Paper • 2602.08145 • Published Feb 4 • 8
AnchorWeave: World-Consistent Video Generation with Retrieved Local Spatial Memories Paper • 2602.14941 • Published Feb 16 • 6
EgoMemReason: A Memory-Driven Reasoning Benchmark for Long-Horizon Egocentric Video Understanding Paper • 2605.09874 • Published May 11 • 2
PhyMotion: Structured 3D Motion Reward for Physics-Grounded Human Video Generation Paper • 2605.14269 • Published May 14 • 9
VGGT-Edit: Feed-forward Native 3D Scene Editing with Residual Field Prediction Paper • 2605.15186 • Published May 14 • 26
Can Large Language Models Keep Up? Benchmarking Online Adaptation to Continual Knowledge Streams Paper • 2603.07392 • Published Mar 8 • 18
MA-EgoQA: Question Answering over Egocentric Videos from Multiple Embodied Agents Paper • 2603.09827 • Published Mar 10 • 30