Qwen2.5-VL-7B-Instruct-INT4-quantized / generation_config.json
Azaz666's picture
INT4 quantization of Qwen/Qwen2.5-VL-7B-Instruct (LLM backbone quantized, vision encoder fp16)
544728e verified
{
"bos_token_id": 151643,
"do_sample": true,
"eos_token_id": [
151645,
151643
],
"pad_token_id": 151643,
"repetition_penalty": 1.05,
"temperature": 1e-06,
"transformers_version": "5.3.0"
}