{
    "epoch": 4.9976479443033215,
    "total_flos": 8.981559806648648e+17,
    "train_loss": 0.048681382634620886,
    "train_runtime": 37834.6736,
    "train_samples_per_second": 1.405,
    "train_steps_per_second": 0.088
}
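
The reported metrics can be turned into a few easier-to-read quantities with simple arithmetic. The following is a minimal Python sketch, not part of the original results. It assumes `train_runtime` is in seconds and that `train_steps_per_second` counts optimizer steps. The name `summarize` and the output keys are hypothetical. Both rates are rounded to three decimals in the file, so every derived value is only an estimate.

```python
import json

# Metrics copied verbatim from the results above.
RESULTS_JSON = """
{
    "epoch": 4.9976479443033215,
    "total_flos": 8.981559806648648e+17,
    "train_loss": 0.048681382634620886,
    "train_runtime": 37834.6736,
    "train_samples_per_second": 1.405,
    "train_steps_per_second": 0.088
}
"""


def summarize(results: dict) -> dict:
    """Derive rough, human-readable estimates from the raw metrics.

    Hypothetical helper: assumes train_runtime is measured in seconds.
    """
    runtime = results["train_runtime"]
    samples_per_s = results["train_samples_per_second"]
    steps_per_s = results["train_steps_per_second"]
    return {
        # Wall-clock training time in hours.
        "runtime_hours": runtime / 3600,
        # Rate * time; approximate because the rate is rounded.
        "approx_samples_seen": samples_per_s * runtime,
        "approx_steps": steps_per_s * runtime,
        # Ratio of the two rates, i.e. samples consumed per step.
        "approx_samples_per_step": samples_per_s / steps_per_s,
        # Average throughput implied by the reported FLOs estimate.
        "approx_flos_per_second": results["total_flos"] / runtime,
    }


summary = summarize(json.loads(RESULTS_JSON))
for name, value in summary.items():
    print(f"{name}: {value:.4g}")
```

Under these assumptions the run lasted about 10.5 hours, and the ratio of the two rates suggests roughly 16 samples per step. Because the rates are rounded, that ratio is a hint at the effective batch size, not a confirmed setting.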