MiniMax-M2.5-tiny-24e / generation_config.json
morriszjm's picture
training-free expert prune K=24/32 (PR=25%) via routing-mass calibration
51d7883 verified
raw
history blame contribute delete
166 Bytes
{
"bos_token_id": 200019,
"do_sample": true,
"eos_token_id": 200020,
"temperature": 1.0,
"top_p": 0.95,
"top_k": 40,
"transformers_version": "4.46.1"
}