Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe Paper • 2603.21972 • Published Mar 23 • 5
Rex-Thinker: Grounded Object Referring via Chain-of-Thought Reasoning Paper • 2506.04034 • Published Jun 4, 2025 • 4
IDEA-Research/Rex-Thinker-GRPO-7B Zero-Shot Object Detection • 8B • Updated Jun 9, 2025 • 137 • 9
IDEA-Research/Rex-Thinker-GRPO-7B Zero-Shot Object Detection • 8B • Updated Jun 9, 2025 • 137 • 9