michaelfeil
/

llama-3-405B-fp8-modelopt

Model card Files Files and versions

llama-3-405B-fp8-modelopt / hf_quant_config.json

michaelfeil's picture

Add files using upload-large-folder tool

0de24bb verified 11 months ago

history blame contribute delete

240 Bytes

	{
	"producer": {
	"name": "modelopt",
	"version": "0.25.0"
	},
	"quantization": {
	"quant_algo": "FP8",
	"kv_cache_quant_algo": "FP8",
	"exclude_modules": [
	"lm_head"
	]
	}
	}