262 kB
kdirgul's picture
faz3_train: Muon optimizer entegrasyonu (--muon: 2D-Linear Muon + embed/norm AdamW; cklu-opt state + WSD-carpan; geriye uyumlu)
563bb36 verified