OPENZEKA
/

Qwen3-4B-Instruct-2507-NVFP4

8-bit precision

Model card Files Files and versions

Qwen3-4B-Instruct-2507-NVFP4 / hf_quant_config.json

birolkuyumcu's picture

Upload 11 files

1a7c709 verified 5 months ago

history blame contribute delete

284 Bytes

	{
	"producer": {
	"name": "modelopt",
	"version": "0.41.0.dev5+g47c04d6b3"
	},
	"quantization": {
	"quant_algo": "NVFP4",
	"kv_cache_quant_algo": "FP8",
	"group_size": 16,
	"exclude_modules": [
	"lm_head"
	]
	}
	}