Llama-3.2-3B-Open-R1-Distill-GRPO / trainer_state.json

Commit History

Model save
6dbd991
verified

rkumar1999 commited on

Model save
5fb39bd
verified

rkumar1999 commited on