MathSmith: Towards Extremely Hard Mathematical Reasoning by Forging Synthetic Problems with a Reinforced Policy Paper • 2508.05592 • Published Aug 7, 2025