sft_ctx16384 / all_results.json
Last commit by Rano23: End of training (d155c78, verified)
223 Bytes
{
    "epoch": 4.9988942130482865,
    "total_flos": 6.700673317784781e+16,
    "train_loss": 0.4786904600803903,
    "train_runtime": 98131.1995,
    "train_samples_per_second": 3.317,
    "train_steps_per_second": 0.207
}
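
The raw fields above are easier to interpret after a few conversions. The following is a minimal sketch, not part of the original repository, that parses the same JSON and derives wall-clock hours, an approximate step count, and an approximate number of samples per optimizer step. The helper name `derive_metrics` and the output keys are hypothetical. It assumes `train_runtime` is in seconds, and because the two throughput fields are rounded to three decimals, every derived value is an estimate.

```python
import json

# Values copied verbatim from all_results.json above.
RESULTS_JSON = """
{
    "epoch": 4.9988942130482865,
    "total_flos": 6.700673317784781e+16,
    "train_loss": 0.4786904600803903,
    "train_runtime": 98131.1995,
    "train_samples_per_second": 3.317,
    "train_steps_per_second": 0.207
}
"""


def derive_metrics(results: dict) -> dict:
    """Derive rough summary figures from the reported metrics (hypothetical helper).

    Assumes train_runtime is in seconds. The throughput fields are rounded
    in the source file, so the derived numbers are approximations only.
    """
    runtime_s = results["train_runtime"]
    samples_per_s = results["train_samples_per_second"]
    steps_per_s = results["train_steps_per_second"]

    approx_steps = steps_per_s * runtime_s
    approx_samples = samples_per_s * runtime_s

    return {
        "runtime_hours": runtime_s / 3600.0,
        # Estimated optimizer steps over the whole run.
        "approx_steps": approx_steps,
        # Estimated samples seen over the whole run (all epochs combined).
        "approx_samples": approx_samples,
        # Ratio of the two rates: a rough effective batch size per step.
        "samples_per_step": samples_per_s / steps_per_s,
        # Estimated samples per epoch, i.e. a rough dataset size.
        "approx_samples_per_epoch": approx_samples / results["epoch"],
    }


results = json.loads(RESULTS_JSON)
derived = derive_metrics(results)

for name, value in derived.items():
    print(f"{name}: {value:,.2f}")
```

With these inputs the run works out to a little over 27 hours, and the ratio of the two throughput figures is close to 16 samples per step. That ratio is only a hint at the effective batch size. The actual per-device batch size, gradient accumulation setting, and device count are not recorded in this file.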