Guokai Ma
delock
AI & ML interests
None yet
Recent Activity
commentedon an article 2 months ago
Muon vs MuonClip vs Muon+AdamW for Fine-Tuning new activity 3 months ago
moonshotai/Moonlight-16B-A3B:fix(modeling): add training-path MoE dispatch and KV cache API compat updated a model 3 months ago
delock/Moonlight-16B-A3B-finetune-fixedOrganizations
None yet