vllm-project-org / FLUX.1-dev-AutoRound-w4a16 / transformer / quantization_config.json

Upload folder using huggingface_hub

ede6b4d verified about 2 months ago

291 Bytes

{
  "bits": 4,
  "data_type": "int",
  "group_size": 128,
  "sym": true,
  "batch_size": 1,
  "iters": 0,
  "autoround_version": "0.12.0",
  "block_name_to_quantize": "transformer_blocks,single_transformer_blocks",
  "quant_method": "auto-round",
  "packing_format": "auto_round:auto_gptq"
}
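
The fields above can be read programmatically. The following is a minimal Python sketch, not part of the repository, that parses the same JSON and derives two quantities: the list of block names to quantize and the number of 4-bit values that fit in one 32-bit word. The helper name `summarize_quant_config` is hypothetical, and the 32-bit packing width is an assumption based on the GPTQ-style format named in `packing_format`; the file itself does not state it.

```python
import json

# Copy of the config shown above, embedded so the sketch is self-contained.
CONFIG_TEXT = """
{
  "bits": 4,
  "data_type": "int",
  "group_size": 128,
  "sym": true,
  "batch_size": 1,
  "iters": 0,
  "autoround_version": "0.12.0",
  "block_name_to_quantize": "transformer_blocks,single_transformer_blocks",
  "quant_method": "auto-round",
  "packing_format": "auto_round:auto_gptq"
}
"""


def summarize_quant_config(config_text: str) -> dict:
    """Hypothetical helper: parse the config and derive a few values."""
    cfg = json.loads(config_text)

    # "block_name_to_quantize" is a comma-separated string in this file.
    blocks = [
        name.strip()
        for name in cfg["block_name_to_quantize"].split(",")
        if name.strip()
    ]

    return {
        "bits": cfg["bits"],
        "group_size": cfg["group_size"],
        "symmetric": cfg["sym"],
        "blocks": blocks,
        # Assumption: weights are packed into 32-bit words (GPTQ-style).
        "values_per_int32": 32 // cfg["bits"],
    }


summary = summarize_quant_config(CONFIG_TEXT)
for key, value in summary.items():
    print(f"{key}: {value}")
```

Keeping the derived values in a plain dictionary makes it easy to compare this config against another checkpoint's `quantization_config.json` before loading either one.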