finewebedu-49K-base / train_results.json
{
    "epoch": 1.0,
    "total_flos": 750871727308800.0,
    "train_loss": 3.4398411965957303,
    "train_runtime": 7768.4482,
    "train_samples": 3180839,
    "train_samples_per_second": 409.456,
    "train_steps_per_second": 1.6
}
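
The throughput fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original repository. It assumes that train_runtime is in seconds and that train_samples counts the examples seen in one epoch. The helper names derived_samples_per_second and implied_batch_size are hypothetical, and the batch-size figure is only an estimate because train_steps_per_second is reported rounded to one decimal.

```python
import json

# The metrics exactly as reported in train_results.json.
RESULTS_JSON = """
{
    "epoch": 1.0,
    "total_flos": 750871727308800.0,
    "train_loss": 3.4398411965957303,
    "train_runtime": 7768.4482,
    "train_samples": 3180839,
    "train_samples_per_second": 409.456,
    "train_steps_per_second": 1.6
}
"""

results = json.loads(RESULTS_JSON)


def derived_samples_per_second(r: dict) -> float:
    """Recompute throughput from sample count, epochs, and runtime.

    Assumes train_runtime is in seconds and train_samples is per epoch.
    """
    return r["train_samples"] * r["epoch"] / r["train_runtime"]


def implied_batch_size(r: dict) -> float:
    """Estimate samples per optimizer step from the two reported rates.

    This is an estimate only: train_steps_per_second is rounded in the file.
    """
    return r["train_samples_per_second"] / r["train_steps_per_second"]


if __name__ == "__main__":
    print(f"derived samples/s: {derived_samples_per_second(results):.3f}")
    print(f"reported samples/s: {results['train_samples_per_second']}")
    print(f"implied samples per step: {implied_batch_size(results):.1f}")
```

The recomputed rate agrees with the reported train_samples_per_second to within rounding, and the ratio of the two rates suggests an effective batch of roughly 256 samples per step. That batch figure is an inference from rounded values, not a setting confirmed by the file.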