Shorter but not Worse: Frugal Reasoning via Easy Samples as Length Regularizers in Math RLVR Paper • 2511.01937 • Published Nov 2, 2025 • 16
Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control +2 danaaubakirova, Molbap, mshukor, cadene • Feb 4, 2025 • 192
Article Open-R1: a fully open reproduction of DeepSeek-R1 +1 eliebak, lvwerra, lewtun • Jan 28, 2025 • 889
Article TerjamaBench: A Cultural Benchmark for English-Darija Machine Translation imomayiz • Jan 10, 2025 • 34