GTR-Turbo: Merged Checkpoint is Secretly a Free Teacher for Agentic VLM Training Paper • 2512.13043 • Published 11 days ago • 3
GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training Paper • 2503.08525 • Published Mar 11 • 17