tiny_bert_rand_100_v2 / train_results.json
Hartunka's picture
End of training
0b07b8e verified
{
"epoch": 25.0,
"total_flos": 3.058354515764736e+17,
"train_loss": 9.169899891581883,
"train_runtime": 12212.2898,
"train_samples": 228639,
"train_samples_per_second": 468.051,
"train_steps_per_second": 4.876
}