bert_base_rand_50_v1_sst2 / train_results.json
Hartunka
End of training
08fda71 verified
{
    "epoch": 6.0,
    "total_flos": 5.316079940232192e+16,
    "train_loss": 0.17441612662691058,
    "train_runtime": 662.508,
    "train_samples": 67349,
    "train_samples_per_second": 5082.882,
    "train_steps_per_second": 19.924
}
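
The throughput fields can be cross-checked against the other values with a little arithmetic. The sketch below is only an illustration: it embeds the metrics as a string rather than reading the file, and the variable names are hypothetical. Multiplying each rate by `train_runtime` gives the sample and step totals the rates imply, which can then be compared with `train_samples` times `epoch`. One possible reading, which is an assumption and not something this file confirms, is that the rates were computed over a longer configured schedule than the 6 epochs recorded here, for example if the run stopped early.

```python
import json

# Metrics copied from train_results.json above (embedded so the sketch is self-contained).
raw = """
{
    "epoch": 6.0,
    "total_flos": 5.316079940232192e+16,
    "train_loss": 0.17441612662691058,
    "train_runtime": 662.508,
    "train_samples": 67349,
    "train_samples_per_second": 5082.882,
    "train_steps_per_second": 19.924
}
"""
metrics = json.loads(raw)

runtime = metrics["train_runtime"]  # seconds

# Totals implied by the reported rates.
implied_samples = metrics["train_samples_per_second"] * runtime
implied_steps = metrics["train_steps_per_second"] * runtime

# How many passes over the training set the reported sample rate corresponds to.
implied_epochs = implied_samples / metrics["train_samples"]

# Sample rate based only on the epochs actually recorded in this file.
# Assumption: every recorded epoch covered all train_samples.
observed_rate = metrics["train_samples"] * metrics["epoch"] / runtime

print(f"samples implied by reported rate: {implied_samples:,.0f}")
print(f"steps implied by reported rate:   {implied_steps:,.0f}")
print(f"epochs implied by reported rate:  {implied_epochs:.2f}")
print(f"samples/s over recorded epochs:   {observed_rate:.1f}")
```

If the implied epoch count differs from `epoch`, the reported `train_samples_per_second` should probably not be read as the measured hardware throughput of this run; the rate over the recorded epochs is the more conservative figure.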