Azaz666
/

FastVLM-0.5B-torchao-W4A8

vision-language-model

Model card Files Files and versions

FastVLM-0.5B-torchao-W4A8 / quant_meta.json

Azaz666's picture

Upload torchao-W4A8 quantized model

6d06a01 verified about 1 month ago

history blame contribute delete

282 Bytes

	{
	"model_id": "apple/FastVLM-0.5B",
	"family": "fastvlm",
	"method": "torchao_w4a8",
	"bits_weight": 4,
	"bits_activation": 8,
	"group_size": 128,
	"skip_vision": true,
	"load_time_s": 5.3,
	"quant_time_s": 0.1,
	"quant_method": "Int8DynamicActivationInt4Weight (W4A8)"
	}