efficient-cube3d / quant_config.json
TrNi's picture
Upload quant_config.json with huggingface_hub
81617a5 verified
Raw
History Blame Contribute Delete
189 Bytes
{
"w_bit": 4,
"group_size": 128,
"method": "RTN",
"kernel": "torchao_int4_weight_only (_weight_int4pack_mm)",
"skipped": [
"shape_proj (in=16)",
"lm_head (out=4099)"
]
}