Qwen2.5-1.5B-Open-R1-GRPO_Math / eval_results.json

Commit History

End of training
053c5b7
verified

Dongwei committed on