train_record_1745950246 / train_results.json
Commit b0f3ef0 (verified) by rbelanec: End of training
{
    "epoch": 1.2803277639075603,
    "num_input_tokens_seen": 55002224,
    "total_flos": 2.3032929154239283e+17,
    "train_loss": 0.8202396101236343,
    "train_runtime": 83869.5715,
    "train_samples_per_second": 1.908,
    "train_steps_per_second": 0.477
}