distilbert_rand_5_v1_qqp / train_results.json
Hartunka's picture
End of training
b02d8bb verified
{
"epoch": 7.0,
"total_flos": 1.686920659598684e+17,
"train_loss": 0.26815341997750197,
"train_runtime": 2417.3773,
"train_samples": 363846,
"train_samples_per_second": 7525.635,
"train_steps_per_second": 29.412
}