Boosting Tool Use of Large Language Models via Iterative Reinforced Fine-Tuning Paper • 2501.09766 • Published Jan 15, 2025 • 1
Tool Zero: Training Tool-Augmented LLMs via Pure RL from Scratch Paper • 2511.01934 • Published Nov 2, 2025 • 1
Precision over Diversity: High-Precision Reward Generalizes to Robust Instruction Following Paper • 2601.04954 • Published 8 days ago • 1