Numina-Lean-Agent: An Open and General Agentic Reasoning System for Formal Mathematics Paper • 2601.14027 • Published 9 days ago • 12
The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution Paper • 2510.25726 • Published Oct 29, 2025 • 46
F1-Reasoner/general-reasoner-noent-f1v1-qwen3-4b-base_F1_Training_qwen3-4b-base 4B • Updated Sep 11, 2025 • 1
F1-Reasoner/general-reasoner-noent-f1v1-qwen3-4b-base_F1_Training_qwen3-4b-base 4B • Updated Sep 11, 2025 • 1