Qwen2.5-3B-Open-R1-Code-GRPO / train_results.json
Yukang's picture
Model save
80cf234 verified
raw
history blame
204 Bytes
{
"total_flos": 0.0,
"train_loss": 1.3396084333325358e-07,
"train_runtime": 69.6904,
"train_samples": 35735,
"train_samples_per_second": 3673.392,
"train_steps_per_second": 7.175
}