train_math_qa_456_1760637839 / train_results.json
Commit 971a4ac (verified) by rbelanec: End of training
{
"epoch": 20.0,
"num_input_tokens_seen": 77891968,
"total_flos": 3.507564542108369e+18,
"train_loss": 1.0954747395889015,
"train_runtime": 26429.8341,
"train_samples_per_second": 20.32,
"train_steps_per_second": 5.081
}
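
The figures above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original run: it embeds the same JSON as a string and derives a few secondary quantities. The function name `derive_throughput` and the derived field names are hypothetical, and the reading of `train_samples_per_second / train_steps_per_second` as "samples per optimizer step" is an assumption about how these metrics were logged.

```python
import json

# The metrics exactly as they appear in train_results.json.
RAW_RESULTS = """
{
    "epoch": 20.0,
    "num_input_tokens_seen": 77891968,
    "total_flos": 3.507564542108369e+18,
    "train_loss": 1.0954747395889015,
    "train_runtime": 26429.8341,
    "train_samples_per_second": 20.32,
    "train_steps_per_second": 5.081
}
"""


def derive_throughput(results: dict) -> dict:
    """Derive secondary quantities from the logged training metrics.

    All outputs are approximations, because the per-second rates in the
    file are rounded.
    """
    runtime = results["train_runtime"]  # assumed to be in seconds
    return {
        # Wall-clock time in hours.
        "runtime_hours": runtime / 3600.0,
        # Input tokens processed per second of training.
        "tokens_per_second": results["num_input_tokens_seen"] / runtime,
        # Assumption: ratio of the two rates = samples per optimizer step.
        "samples_per_step": results["train_samples_per_second"]
        / results["train_steps_per_second"],
        # Approximate number of optimizer steps over the whole run.
        "approx_total_steps": results["train_steps_per_second"] * runtime,
        # Approximate number of samples seen per epoch.
        "approx_samples_per_epoch": results["train_samples_per_second"]
        * runtime
        / results["epoch"],
        # Average floating-point operations per second.
        "flops_per_second": results["total_flos"] / runtime,
    }


results = json.loads(RAW_RESULTS)
derived = derive_throughput(results)

for name, value in derived.items():
    print(f"{name}: {value:.4g}")
```

Under these assumptions the run lasted a little over seven hours, and the ratio of the two rates comes out very close to 4 samples per step.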