AIMO AI-MO/NuminaMath-7B-TIR Text Generation • 7B • Updated Aug 14, 2024 • 137 • 350 Running 418 Reward Bench Leaderboard 📐 418 Display and analyze reward model evaluation results KTO: Model Alignment as Prospect Theoretic Optimization Paper • 2402.01306 • Published Feb 2, 2024 • 21
KTO: Model Alignment as Prospect Theoretic Optimization Paper • 2402.01306 • Published Feb 2, 2024 • 21
Fater.ai mistralai/Mistral-7B-Instruct-v0.1 Text Generation • 7B • Updated Jul 24, 2025 • 334k • 1.82k Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking Paper • 2403.09629 • Published Mar 14, 2024 • 79 Improve Mathematical Reasoning in Language Models by Automated Process Supervision Paper • 2406.06592 • Published Jun 5, 2024 • 29
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking Paper • 2403.09629 • Published Mar 14, 2024 • 79
Improve Mathematical Reasoning in Language Models by Automated Process Supervision Paper • 2406.06592 • Published Jun 5, 2024 • 29
ARC 01-ai/Yi-Coder-9B-Chat Text Generation • 9B • Updated Sep 12, 2024 • 7.79k • 212 AI-MO/NuminaMath-7B-TIR Text Generation • 7B • Updated Aug 14, 2024 • 137 • 350
AIMO AI-MO/NuminaMath-7B-TIR Text Generation • 7B • Updated Aug 14, 2024 • 137 • 350 Running 418 Reward Bench Leaderboard 📐 418 Display and analyze reward model evaluation results KTO: Model Alignment as Prospect Theoretic Optimization Paper • 2402.01306 • Published Feb 2, 2024 • 21
KTO: Model Alignment as Prospect Theoretic Optimization Paper • 2402.01306 • Published Feb 2, 2024 • 21
ARC 01-ai/Yi-Coder-9B-Chat Text Generation • 9B • Updated Sep 12, 2024 • 7.79k • 212 AI-MO/NuminaMath-7B-TIR Text Generation • 7B • Updated Aug 14, 2024 • 137 • 350
Fater.ai mistralai/Mistral-7B-Instruct-v0.1 Text Generation • 7B • Updated Jul 24, 2025 • 334k • 1.82k Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking Paper • 2403.09629 • Published Mar 14, 2024 • 79 Improve Mathematical Reasoning in Language Models by Automated Process Supervision Paper • 2406.06592 • Published Jun 5, 2024 • 29
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking Paper • 2403.09629 • Published Mar 14, 2024 • 79
Improve Mathematical Reasoning in Language Models by Automated Process Supervision Paper • 2406.06592 • Published Jun 5, 2024 • 29