ClawMark: A Living-World Benchmark for Multi-Turn, Multi-Day, Multimodal Coworker Agents Paper • 2604.23781 • Published 11 days ago • 33
MM-ACT: Learn from Multimodal Parallel Generation to Act Paper • 2512.00975 • Published Nov 30, 2025 • 6
ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning Paper • 2512.05111 • Published Dec 4, 2025 • 50