LoHoSearch: Benchmarking Long-Horizon Search Agents Beyond the Human Difficulty Ceiling Paper • 2606.12837 • Published 7 days ago • 5
LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning Paper • 2603.21065 • Published Mar 22 • 78