Qwen2.5-VL-7B-Instruct-INT4-quantized / generation_config.json

Commit History

INT4 quantization of Qwen/Qwen2.5-VL-7B-Instruct (LLM backbone quantized, vision encoder fp16)
544728e
verified

Azaz666 committed on