all_tasks_combined_8b_sft / train_results.json
hlillemark's picture
End of training
214abf1 verified
raw
history blame contribute delete
219 Bytes
{
"epoch": 2.9953574744661093,
"total_flos": 292826854195200.0,
"train_loss": 0.2790517607955389,
"train_runtime": 12257.2359,
"train_samples_per_second": 4.217,
"train_steps_per_second": 0.132
}