Unhackable Temporal Rewarding for Scalable Video MLLMs Paper • 2502.12081 • Published Feb 17, 2025 • 1
Perception-R1: Pioneering Perception Policy with Reinforcement Learning Paper • 2504.07954 • Published Apr 10, 2025
Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning Paper • 2507.05255 • Published Jul 7, 2025 • 75
Step-3 is Large yet Affordable: Model-system Co-design for Cost-effective Decoding Paper • 2507.19427 • Published Jul 25, 2025 • 19
NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale Paper • 2508.10711 • Published Aug 14, 2025 • 145