aetheris / stage2_metadata.json
Update Stage 2 metadata: COMPLETE, best loss=2.7305
022db6b verified
{
  "stage": 2,
  "method": "KL distillation",
  "best_loss": 2.7305,
  "total_steps": 20000,
  "temperature": 2.0,
  "alpha": 0.7,
  "lr": 0.0005,
  "status": "COMPLETE",
  "teacher": "CohereLabs/tiny-aya-global",
  "student_params_m": 721.6,
  "languages": 67
}
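
The metadata above records "method": "KL distillation" with "temperature": 2.0 and "alpha": 0.7, but not the loss formula itself. The following is a minimal pure-Python sketch of one common way these two values are combined, not the project's actual training code. It assumes that alpha weights the temperature-softened KL term, that (1 - alpha) weights an ordinary cross-entropy term, and that the KL term is scaled by temperature squared. The function names `softmax`, `kl_divergence`, and `distillation_loss` are hypothetical.

```python
import math

TEMPERATURE = 2.0  # "temperature" from the metadata
ALPHA = 0.7        # "alpha" from the metadata


def softmax(logits, temperature=1.0):
    """Numerically stable softmax over a list of logits."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0.0)


def distillation_loss(student_logits, teacher_logits, target_index,
                      temperature=TEMPERATURE, alpha=ALPHA):
    """Blend a soft (teacher) term with a hard (label) term.

    ASSUMPTION: alpha multiplies the KL term and (1 - alpha) the
    cross-entropy term. The metadata does not say which way round it is.
    """
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    # Scaling by temperature^2 is a common convention that keeps the
    # soft-term gradients comparable across temperatures (assumed here).
    soft = kl_divergence(p_teacher, p_student) * temperature ** 2
    # Hard term: cross-entropy of the unsoftened student distribution.
    hard = -math.log(softmax(student_logits)[target_index])
    return alpha * soft + (1.0 - alpha) * hard


if __name__ == "__main__":
    # Toy three-token vocabulary; the logits are made up for illustration.
    print(distillation_loss([1.0, 0.5, -0.2], [2.0, 0.1, -1.0], target_index=0))
```

When the student and teacher logits are identical the KL term is zero, so under these assumptions only the (1 - alpha) cross-entropy term remains.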