c424 / train_state.json
wind77's picture
Upload folder using huggingface_hub
df26a75 verified
{
"step": 40,
"eval_sparse_kl": 0.0796971321105957,
"teacher": "Qwen/Qwen3.5-35B-A3B",
"data_position": 2204
}