MiniMax-M2.7-8bit / generation_config.json
mlavkin's picture
RTN W8A16 INT8 from operationrange/MiniMax-M2.7-BF16 (group=128 sym, ignore=lm_head+router+gate+embed, no calibration)
b637d38 verified
raw
history blame contribute delete
144 Bytes
{
"bos_token_id": 200019,
"do_sample": true,
"eos_token_id": 200020,
"top_k": 40,
"top_p": 0.95,
"transformers_version": "4.57.6"
}