Qwen2.5-3B_difficulty_based_data / train_results.json
{
    "epoch": 3.0,
    "total_flos": 6.849963317700592e+18,
    "train_loss": 0.09753237076913879,
    "train_runtime": 6368.1203,
    "train_samples_per_second": 7.066,
    "train_steps_per_second": 0.442
}
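
The fields above are related by simple arithmetic, so a few unreported quantities can be estimated from them. The sketch below is a rough consistency check, not part of the original file: it assumes `train_runtime` is in seconds and that the two throughput fields are averages over that whole runtime. The helper name `derive_stats` and its output keys are hypothetical. Because the rates are rounded to three decimals, the results are approximations only.

```python
import json

# Values copied verbatim from train_results.json above.
RESULTS_JSON = """
{
    "epoch": 3.0,
    "total_flos": 6.849963317700592e+18,
    "train_loss": 0.09753237076913879,
    "train_runtime": 6368.1203,
    "train_samples_per_second": 7.066,
    "train_steps_per_second": 0.442
}
"""


def derive_stats(results: dict) -> dict:
    """Estimate quantities the file does not report directly.

    Assumes train_runtime is in seconds and both rates are averages
    over the full run (an assumption, not stated in the file).
    """
    runtime = results["train_runtime"]
    total_samples = runtime * results["train_samples_per_second"]
    total_steps = runtime * results["train_steps_per_second"]
    return {
        # Optimizer steps over the whole run.
        "approx_total_steps": total_steps,
        # Samples seen per epoch, i.e. the approximate training-set size.
        "approx_samples_per_epoch": total_samples / results["epoch"],
        # Samples consumed per optimizer step (effective batch size).
        "approx_samples_per_step": (
            results["train_samples_per_second"]
            / results["train_steps_per_second"]
        ),
        "runtime_hours": runtime / 3600,
    }


results = json.loads(RESULTS_JSON)
stats = derive_stats(results)

if __name__ == "__main__":
    for name, value in stats.items():
        print(f"{name}: {value:.2f}")
```

Under these assumptions the run comes out at roughly 1.77 hours, about 2,800 optimizer steps, close to 15,000 samples per epoch, and an effective batch size near 16. How that batch size splits between per-device batch size, gradient accumulation, and device count cannot be recovered from this file.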