Qwen2.5-1.5B-R1-Distill-GRPO-Math / trainer_state.json
Mingsmilet's picture
Model save
052c39e verified
File too large to display, you can check the raw version instead.