LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning Paper • 2603.21065 • Published 3 days ago • 63
Escape Sky-high Cost: Early-stopping Self-Consistency for Multi-step Reasoning Paper • 2401.10480 • Published Jan 19, 2024