train_runtime,train_samples_per_second,train_steps_per_second,total_flos,train_loss,epoch,step
69.4212,279.223,4.379,2550095244619776.0,0.6192449268541838,4.0,304