distilbert_rand_20_v1_sst2 / train_results.json
End of training
c084ccd verified
{
    "epoch": 6.0,
    "total_flos": 2.676464049624883e+16,
    "train_loss": 0.17497296766801315,
    "train_runtime": 377.617,
    "train_samples": 67349,
    "train_samples_per_second": 8917.634,
    "train_steps_per_second": 34.956
}
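
The rate fields can be cross-checked against the other values in the file. The sketch below is a minimal sanity check, not part of the original repository. It assumes that `train_samples_per_second` is total samples seen divided by `train_runtime`, and that `train_steps_per_second` is total optimizer steps divided by `train_runtime`. The helper names (`implied_epochs`, `throughput_over_completed_epochs`, `implied_total_steps`) are hypothetical and introduced only for illustration.

```python
import json

# Values copied verbatim from train_results.json.
results = json.loads("""
{
    "epoch": 6.0,
    "total_flos": 2.676464049624883e+16,
    "train_loss": 0.17497296766801315,
    "train_runtime": 377.617,
    "train_samples": 67349,
    "train_samples_per_second": 8917.634,
    "train_steps_per_second": 34.956
}
""")


def implied_epochs(r):
    """Epoch count implied by the reported samples/s.

    Assumption: samples/s = train_samples * epochs / train_runtime.
    """
    return r["train_samples_per_second"] * r["train_runtime"] / r["train_samples"]


def throughput_over_completed_epochs(r):
    """Samples/s if only the epochs recorded in "epoch" were processed."""
    return r["train_samples"] * r["epoch"] / r["train_runtime"]


def implied_total_steps(r):
    """Step count implied by the reported steps/s (assumption: steps / runtime)."""
    return r["train_steps_per_second"] * r["train_runtime"]


if __name__ == "__main__":
    print(f"epochs implied by samples/s: {implied_epochs(results):.2f}")
    print(f"samples/s over completed epochs: {throughput_over_completed_epochs(results):.1f}")
    print(f"steps implied by steps/s: {implied_total_steps(results):.0f}")
```

Under these assumptions the reported samples/s corresponds to roughly 50 epochs rather than the 6 recorded in `epoch`, and throughput over the completed epochs alone is roughly 1070 samples/s. One possible explanation is that the rates were computed against a longer planned schedule and training stopped early. The file itself does not confirm this, so treat both rate fields with caution when comparing runs.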