DIVE: Scaling Diversity in Agentic Task Synthesis for Generalizable Tool Use • Paper • 2603.11076 • Published 3 days ago • 4
HER: Human-like Reasoning and Reinforcement Learning for LLM Role-playing • Paper • 2601.21459 • Published Jan 29 • 10