mattbucci's picture
add quantization_config.ignore=['lm_head'] (downstream audit fix)
91382f5 verified