multiple_samples_none_numina_aime / train_results.json
End of training
1058391 verified
{
    "epoch": 2.9725490196078432,
    "total_flos": 121055548211200.0,
    "train_loss": 0.7961960165273576,
    "train_runtime": 4088.556,
    "train_samples_per_second": 2.986,
    "train_steps_per_second": 0.031
}
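
The throughput fields above can be combined into a few rough derived quantities. The sketch below is illustrative only. It assumes the keys follow the Hugging Face Trainer convention, which the file itself does not state: `train_runtime` in seconds and the two rates averaged over the whole run. The helper name `derive` and the output keys are hypothetical. Because the rates are rounded to three decimals, the results are approximate.

```python
import json

# Metrics copied verbatim from train_results.json above.
RAW = """
{
    "epoch": 2.9725490196078432,
    "total_flos": 121055548211200.0,
    "train_loss": 0.7961960165273576,
    "train_runtime": 4088.556,
    "train_samples_per_second": 2.986,
    "train_steps_per_second": 0.031
}
"""


def derive(metrics):
    """Return rough quantities implied by the runtime and throughput fields.

    Hypothetical helper: assumes train_runtime is in seconds and that the
    per-second rates are averages over the full run.
    """
    runtime = metrics["train_runtime"]
    samples_per_s = metrics["train_samples_per_second"]
    steps_per_s = metrics["train_steps_per_second"]
    return {
        "runtime_minutes": runtime / 60,
        # rate * duration gives an approximate count
        "approx_samples_processed": runtime * samples_per_s,
        "approx_steps": runtime * steps_per_s,
        # ratio of the two rates approximates samples consumed per step
        "approx_samples_per_step": samples_per_s / steps_per_s,
    }


metrics = json.loads(RAW)
derived = derive(metrics)
for name, value in derived.items():
    print(f"{name}: {value:.1f}")
```

Under those assumptions, the ratio of the two rates suggests an effective batch of somewhat under 100 samples per step. Treat it as an estimate, since `train_steps_per_second` carries only two significant digits.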