GTR-Turbo: Merged Checkpoint is Secretly a Free Teacher for Agentic VLM Training Paper • 2512.13043 • Published 11 days ago • 3
GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training Paper • 2503.08525 • Published Mar 11 • 17
WALL-E: World Alignment by Rule Learning Improves World Model-based LLM Agents Paper • 2410.07484 • Published Oct 9, 2024 • 51