od2961/qwen2.5-Math-1.5b-drgrpo-readmeflash-a6000-5epoch-seed43 Reinforcement Learning • Updated Apr 15
od2961/qwen2.5-Math-1.5b-drgrpo-readmeflash-a6000-5epoch-seed44 Reinforcement Learning • Updated Apr 15
Max-and-Omnis/Nemotron-3-Super-64B-A12B-Math-REAP-FP8 Text Generation • 64B • Updated Apr 23 • 1.72k • 1