adaptive_3B_low / epoch_13 /train_results.json
cuong1692001's picture
Upload epoch_13
b91303a verified
{
"epoch": 1.0,
"total_flos": 14046435803136.0,
"train_loss": 0.1244462939529595,
"train_runtime": 3028.8415,
"train_samples_per_second": 2.905,
"train_steps_per_second": 1.453
}