LLMs on the Line: Data Determines Loss-to-Loss Scaling Laws Paper • 2502.12120 • Published Feb 17, 2025
MATH-Beyond: A Benchmark for RL to Expand Beyond the Base Model Paper • 2510.11653 • Published Oct 13, 2025