AIMO - a ell-hol Collection

ell-hol 's Collections

AIMO

updated Dec 11, 2024

AI-MO/NuminaMath-7B-TIR

Text Generation • 7B • Updated Aug 14, 2024 • 223 • 351
Running

Agents

432

Reward Bench Leaderboard

📐

432

Explore and compare model scores on RewardBench benchmarks
KTO: Model Alignment as Prospect Theoretic Optimization

Paper • 2402.01306 • Published Feb 2, 2024 • 22