GPT2_AR_200 / train_results.json
Add checkpoint
183e8c8
{
  "epoch": 10.0,
  "total_flos": 2.3783323336704e+17,
  "train_loss": 0.40263440499196784,
  "train_runtime": 24635.8124,
  "train_samples": 91022,
  "train_samples_per_second": 36.947,
  "train_steps_per_second": 1.155
}
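
The throughput fields above can be cross-checked against the raw counts. The sketch below is a minimal consistency check, not part of the checkpoint: it assumes that `train_runtime` is in seconds and that `train_samples_per_second` was computed as `train_samples * epoch / train_runtime`. The helper name `derived_throughput` is hypothetical, and the samples-per-step figure it derives is only an estimate, since `train_steps_per_second` is rounded in the file.

```python
import json

# Values copied verbatim from train_results.json.
RESULTS_JSON = """
{
  "epoch": 10.0,
  "total_flos": 2.3783323336704e+17,
  "train_loss": 0.40263440499196784,
  "train_runtime": 24635.8124,
  "train_samples": 91022,
  "train_samples_per_second": 36.947,
  "train_steps_per_second": 1.155
}
"""


def derived_throughput(results):
    """Recompute throughput figures from the raw counts (hypothetical helper).

    Assumes train_runtime is in seconds and that every epoch visits all
    train_samples exactly once.
    """
    total_samples = results["train_samples"] * results["epoch"]
    runtime = results["train_runtime"]
    # Estimated number of optimizer steps, from the rounded steps/second value.
    estimated_steps = results["train_steps_per_second"] * runtime
    return {
        "runtime_hours": runtime / 3600,
        "samples_per_second": total_samples / runtime,
        "estimated_steps": estimated_steps,
        # Approximate number of samples consumed per optimizer step.
        "samples_per_step": total_samples / estimated_steps,
    }


results = json.loads(RESULTS_JSON)
derived = derived_throughput(results)

for name, value in derived.items():
    print(f"{name}: {value:.3f}")
```

If the recomputed `samples_per_second` agrees with the reported value to within rounding, the assumption about how that field was derived is consistent with the file. The `samples_per_step` estimate only suggests an effective batch size; the file itself does not state one.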