distilbert_rand_5_v2 / train_results.json
{
    "epoch": 25.0,
    "total_flos": 7.68154255975296e+17,
    "train_loss": 6.938646208973716,
    "train_runtime": 16943.8632,
    "train_samples": 228639,
    "train_samples_per_second": 337.348,
    "train_steps_per_second": 3.515
}
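
The reported fields can be cross-checked against one another. The following is a minimal sketch, not part of the original run: it assumes `train_runtime` is in seconds and that the two throughput fields are averages over the whole run. The variable names are illustrative only.

```python
import json

# Values copied verbatim from train_results.json above.
results = json.loads("""
{
    "epoch": 25.0,
    "total_flos": 7.68154255975296e+17,
    "train_loss": 6.938646208973716,
    "train_runtime": 16943.8632,
    "train_samples": 228639,
    "train_samples_per_second": 337.348,
    "train_steps_per_second": 3.515
}
""")

# Assumption: train_runtime is wall-clock seconds for the full run.
runtime_hours = results["train_runtime"] / 3600

# Samples processed over the run, estimated from average throughput,
# then expressed as passes over the training set.
samples_seen = results["train_runtime"] * results["train_samples_per_second"]
epochs_from_throughput = samples_seen / results["train_samples"]

# Average number of samples consumed per optimizer step. This may reflect
# per-device batch size x gradient accumulation x device count, but the
# file itself does not say which.
samples_per_step = (
    results["train_samples_per_second"] / results["train_steps_per_second"]
)

# Approximate number of optimizer steps over the run.
total_steps = results["train_runtime"] * results["train_steps_per_second"]

print(f"runtime: {runtime_hours:.2f} h")
print(f"epochs implied by throughput: {epochs_from_throughput:.2f}")
print(f"samples per step: {samples_per_step:.1f}")
print(f"approx. total steps: {total_steps:.0f}")
```

Under these assumptions the figures are self-consistent: the throughput implies about 25 passes over the 228,639 training samples, matching the reported `epoch`, with roughly 96 samples per optimizer step over about 4.7 hours.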