Deepseek-R1-Distill-14B-Code / train_results.json
realYinkaIyiola's picture
Upload folder using huggingface_hub
ca78b15 verified
raw
history blame contribute delete
204 Bytes
{
"epoch": 3.0,
"total_flos": 3000194903834624.0,
"train_loss": 0.4290710630871001,
"train_runtime": 89946.115,
"train_samples_per_second": 0.597,
"train_steps_per_second": 0.037
}