hp_ablations_gemma_epoch3 / train_results.json
sedrickkeh's picture
End of training
fb3d9f1 verified
raw
history blame contribute delete
221 Bytes
{
"epoch": 2.9991537376586743,
"total_flos": 5063956896940032.0,
"train_loss": 0.5478912142544783,
"train_runtime": 62248.4199,
"train_samples_per_second": 10.934,
"train_steps_per_second": 0.021
}