morriszjm's picture
training-free expert prune K=128/256 (PR=50%) via routing-mass calibration on 64 AI4Code/general/multilingual prompts
70748f6 verified