anon22342134
/

test-3-awq

8-bit precision

Model card Files Files and versions

test-3-awq / hf_quant_config.json

anon22342134's picture

Upload quantized model

b5b70ad verified about 1 year ago

history blame contribute delete

338 Bytes

	{
	"producer": {
	"name": "modelopt",
	"version": "0.19.0"
	},
	"quantization": {
	"quant_algo": "W4A16_AWQ",
	"kv_cache_quant_algo": null,
	"group_size": 128,
	"has_zero_point": false,
	"pre_quant_scale": true,
	"exclude_modules": [
	"lm_head"
	]
	}
	}