distilbert_rand_5_v1_qnli / train_results.json
End of training
b66e1c6 verified
{
  "epoch": 7.0,
  "total_flos": 4.856261458098893e+16,
  "train_loss": 0.4447608894587394,
  "train_runtime": 689.5547,
  "train_samples": 104743,
  "train_samples_per_second": 7594.974,
  "train_steps_per_second": 29.729
}
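
The metrics above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original repository: it embeds the values inline (rather than reading the file) and assumes that `train_runtime` is in seconds and that every epoch covers all `train_samples`. The names `samples_seen` and `observed_sps` are illustrative only.

```python
import json

# Metrics copied verbatim from train_results.json above.
metrics = json.loads("""
{
  "epoch": 7.0,
  "total_flos": 4.856261458098893e+16,
  "train_loss": 0.4447608894587394,
  "train_runtime": 689.5547,
  "train_samples": 104743,
  "train_samples_per_second": 7594.974,
  "train_steps_per_second": 29.729
}
""")

# Assumption: each completed epoch processes the whole training set once.
samples_seen = metrics["epoch"] * metrics["train_samples"]

# Throughput implied by the epochs actually completed (samples per second,
# assuming train_runtime is measured in seconds).
observed_sps = samples_seen / metrics["train_runtime"]

# How far the reported throughput is from the implied one.
ratio = metrics["train_samples_per_second"] / observed_sps

print(f"samples seen:        {samples_seen:.0f}")
print(f"implied samples/sec: {observed_sps:.1f}")
print(f"reported / implied:  {ratio:.2f}")
```

Under these assumptions the implied throughput comes out lower than the reported `train_samples_per_second`. One possible explanation, which the source does not confirm, is that the reported rate was computed against a longer planned schedule than the 7 epochs actually run (for example, if training stopped early).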