iter72: add num_local_experts/num_shared_experts aliases (validator MoE accounting) 8ae45d5 verified unconst committed 25 days ago
iter58 new_king_a0999 alpha=0.999 perturbation of RLStepone/distil-success-h11 b890d5a verified unconst committed 26 days ago