ryan_model / train_results.json
rshrott's picture
🍻 cheers
3e7e79d verified
raw
history blame contribute delete
205 Bytes
{
"epoch": 4.0,
"total_flos": 1.85987442622464e+17,
"train_loss": 0.7916242197940224,
"train_runtime": 235.103,
"train_samples_per_second": 10.208,
"train_steps_per_second": 0.647
}