adaptive_3B_low / epoch_8 / train_results.json
cuong1692001
Upload epoch_8
31e1341 verified
{
    "epoch": 1.0,
    "total_flos": 14494654857216.0,
    "train_loss": 0.135268345028162,
    "train_runtime": 3053.5657,
    "train_samples_per_second": 2.882,
    "train_steps_per_second": 1.441
}