Alibaba-Apsara/Superior-Reasoning-SFT-gpt-oss-120b Viewer • Updated 11 days ago • 306k • 20.1k • 271
Think Twice: Enhancing LLM Reasoning by Scaling Multi-round Test-time Thinking Paper • 2503.19855 • Published Mar 25, 2025 • 29