Qwen2-VL-2B-Instruct-autoround-4bit / quantization_config.json

Upload AutoRound 4-bit quantized model

fd474f3 verified 10 days ago

240 Bytes

{
	"bits": 4,
	"group_size": 128,
	"sym": true,
	"data_type": "int",
	"autoround_version": "0.9.7",
	"block_name_to_quantize": "model.language_model.layers",
	"quant_method": "auto-round",
	"packing_format": "auto_round:auto_gptq"
}
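
To make the fields above concrete, here is a minimal Python sketch that parses the same configuration and derives two quantities from it: the number of integer levels implied by "bits", and a rough estimate of storage per weight once per-group metadata is counted. The helper effective_bits_per_weight is hypothetical and not part of AutoRound. Its defaults (one 16-bit scale and one zero point of "bits" width per group) are assumptions about a GPTQ-style packed layout, not values read from this file.

```python
import json

# The configuration shown above, embedded as a string so the sketch is self-contained.
CONFIG_TEXT = """
{
  "bits": 4,
  "group_size": 128,
  "sym": true,
  "data_type": "int",
  "autoround_version": "0.9.7",
  "block_name_to_quantize": "model.language_model.layers",
  "quant_method": "auto-round",
  "packing_format": "auto_round:auto_gptq"
}
"""


def effective_bits_per_weight(bits, group_size, scale_bits=16, zero_bits=None):
    """Estimate storage per quantized weight, including per-group metadata.

    Hypothetical helper. scale_bits and zero_bits are assumptions about the
    packed layout (one scale and one zero point per group); they do not come
    from the config file.
    """
    if zero_bits is None:
        zero_bits = bits  # assumption: zero points are stored at the weight bit width
    return bits + (scale_bits + zero_bits) / group_size


config = json.loads(CONFIG_TEXT)

# A 4-bit integer code can take 2**4 distinct values.
levels = 2 ** config["bits"]

# Each group of `group_size` weights shares its scale and zero point,
# so the metadata cost is spread across the whole group.
bpw = effective_bits_per_weight(config["bits"], config["group_size"])

print(f"quant_method:   {config['quant_method']}")
print(f"symmetric:      {config['sym']}")
print(f"integer levels: {levels}")
print(f"approx. bits per quantized weight: {bpw}")
```

Under these assumptions the per-group overhead is (16 + 4) / 128 bits per weight, which is why a larger "group_size" lowers storage cost while giving each scale more weights to cover. The estimate applies only to the quantized layers; "block_name_to_quantize" indicates that only the modules under model.language_model.layers are targeted, so other parts of the model are not covered by this figure.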