ResearchMath-14K: Scaling Research-Level Mathematics via Agents • Paper • 2605.28003 • Published May 27 • 50
How to Fine-Tune a Reasoning Model? A Teacher-Student Cooperation Framework to Synthesize Student-Consistent SFT Data • Paper • 2604.14164 • Published Mar 23 • 35