Doc2Agent: Scalable Generation of Tool-Using Agents from API Documentation • Paper • 2506.19998 • Published Jun 24, 2025 • 1
Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning • Paper • 2502.03275 • Published Feb 5, 2025 • 18
DAPO: An Open-Source LLM Reinforcement Learning System at Scale • Paper • 2503.14476 • Published Mar 18, 2025 • 144