eac123's picture
Upload LoRA adapter from student SFT run (seed 42)
dee3e3b verified
raw
history blame contribute delete
183 Bytes
{
"train_runtime": 14297.3191,
"train_samples_per_second": 6.461,
"train_steps_per_second": 0.108,
"total_flos": 5.5096995344541696e+17,
"train_loss": 0.06416236311802036
}