train_math_qa_42_1760637607 / train_results.json
End of training
13fdbf3 verified
{
    "epoch": 20.0,
    "num_input_tokens_seen": 77902976,
    "total_flos": 3.514797523668566e+18,
    "train_loss": 0.18788277705373982,
    "train_runtime": 32107.5081,
    "train_samples_per_second": 16.727,
    "train_steps_per_second": 4.182
}
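
The fields above can be combined into a few derived figures that are often easier to compare across runs. The following is a minimal sketch, not part of the original repository: the helper name `derive_metrics` and the output keys are hypothetical. It assumes that `train_runtime` is in seconds and that both per-second rates were measured over that same runtime.

```python
import json

# Values copied verbatim from train_results.json above.
RESULTS_JSON = """
{
    "epoch": 20.0,
    "num_input_tokens_seen": 77902976,
    "total_flos": 3.514797523668566e+18,
    "train_loss": 0.18788277705373982,
    "train_runtime": 32107.5081,
    "train_samples_per_second": 16.727,
    "train_steps_per_second": 4.182
}
"""


def derive_metrics(results: dict) -> dict:
    """Compute derived throughput figures (hypothetical helper).

    Assumes train_runtime is in seconds and that the per-second
    rates were averaged over the full runtime.
    """
    runtime = results["train_runtime"]
    return {
        # Wall-clock duration in hours.
        "runtime_hours": runtime / 3600.0,
        # Average input tokens processed per second.
        "tokens_per_second": results["num_input_tokens_seen"] / runtime,
        # Ratio of the two reported rates; approximates the number of
        # samples consumed per optimizer step.
        "samples_per_step": results["train_samples_per_second"]
        / results["train_steps_per_second"],
        # Approximate total optimizer steps over the whole run.
        "approx_total_steps": results["train_steps_per_second"] * runtime,
        # Average floating-point operations per second.
        "flops_per_second": results["total_flos"] / runtime,
    }


metrics = derive_metrics(json.loads(RESULTS_JSON))

if __name__ == "__main__":
    for name, value in metrics.items():
        print(f"{name}: {value:.4g}")
```

Under these assumptions the ratio of the two reported rates comes out at roughly 4 samples per step, and the run lasted a little under 9 hours. These are back-of-the-envelope figures derived from rounded rates, not values logged by the training run itself.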