ci-random-qwen2-moe-a3b / tokenizer.json

Commit History

Revert to e6fb385 — restore random init (distillation moved to hyper-accel/ci-random-qwen2-moe-a3b-distilled)
0f32a92
verified

ELutris commited on

KD-distilled 2-layer Qwen2-MoE student against Qwen1.5-MoE-A2.7B teacher (alpaca-cleaned, 1500 steps, T=2.0, KL loss)
ca82f0e
verified

ELutris commited on

Upload tokenizer from qwen2_moe
e6fb385
verified

junsoo999 commited on