Rewards as Labels: Revisiting RLVR from a Classification Perspective • Paper 2602.05630 • Published Feb 5
Video-Thinker: Sparking "Thinking with Videos" via Reinforcement Learning • Paper 2510.23473 • Published Oct 27, 2025