QWQ-JAVA_DS-JAVA_final_model_sft / train_results.json
End of training
6197c9f verified
{
    "epoch": 1.8842105263157896,
    "total_flos": 1.0170584361074688e+16,
    "train_loss": 1.1789321388517107,
    "train_runtime": 367.3964,
    "train_samples_per_second": 2.058,
    "train_steps_per_second": 0.038
}
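
The throughput fields above can be cross-checked against each other with a little arithmetic. The following is a minimal sketch, not part of the original repository. It assumes that `train_runtime` is in seconds and that the two `*_per_second` values are averages over the whole run, which is how the Hugging Face `Trainer` usually reports them. The helper name `derive_throughput_summary` is hypothetical. Because the rates are rounded to three decimals in the file, the derived values are rough estimates only.

```python
import json

# Metrics copied verbatim from train_results.json.
RAW_METRICS = """
{
    "epoch": 1.8842105263157896,
    "total_flos": 1.0170584361074688e+16,
    "train_loss": 1.1789321388517107,
    "train_runtime": 367.3964,
    "train_samples_per_second": 2.058,
    "train_steps_per_second": 0.038
}
"""


def derive_throughput_summary(metrics: dict) -> dict:
    """Derive rough totals from the reported averages (hypothetical helper).

    Assumes train_runtime is in seconds and the per-second rates are
    whole-run averages. Inputs are rounded, so outputs are approximate.
    """
    runtime = metrics["train_runtime"]
    samples_per_second = metrics["train_samples_per_second"]
    steps_per_second = metrics["train_steps_per_second"]
    return {
        # rate * duration recovers an approximate count
        "approx_samples": runtime * samples_per_second,
        "approx_steps": runtime * steps_per_second,
        # ratio of the two rates approximates samples consumed per optimizer step
        "samples_per_step": samples_per_second / steps_per_second,
        # average floating-point operations per second over the run
        "flos_per_second": metrics["total_flos"] / runtime,
    }


if __name__ == "__main__":
    summary = derive_throughput_summary(json.loads(RAW_METRICS))
    for name, value in summary.items():
        print(f"{name}: {value:.4g}")
```

Under these assumptions the run works out to roughly 14 optimizer steps over about 756 processed samples, or around 54 samples per step. The per-step figure is sensitive to the rounding of `train_steps_per_second`, so treat it as an order-of-magnitude check rather than the configured batch size.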