Light-R1-SFTData / all_results.json
sedrickkeh's picture
End of training
39505e8 verified
{
"epoch": 2.9786802030456854,
"total_flos": 4.709134058959929e+18,
"train_loss": 0.42409751356625164,
"train_runtime": 40618.2962,
"train_samples_per_second": 2.327,
"train_steps_per_second": 0.005
}