train_runtime,train_samples_per_second,train_steps_per_second,total_flos,train_loss,epoch,step 151.7765,95.786,1.502,6774231968658432.0,0.6172169467859101,3.0,228