c1_math_0d_1s_10k / train_results.json
ryanmarten's picture
Upload model
82bcbde verified
{
"epoch": 4.992,
"total_flos": 1.878482348849234e+18,
"train_loss": 0.37849522767922816,
"train_runtime": 19752.9737,
"train_samples_per_second": 2.531,
"train_steps_per_second": 0.02
}