gemma-3-12b-it-INT4-quantized / quantization_config.json
INT4 quantization of google/gemma-3-12b-it (LLM backbone quantized, vision encoder fp16)
{
  "base_model": "google/gemma-3-12b-it",
  "quantization": "int4_per_group_symmetric",
  "group_size": 128,
  "bits": 4,
  "method": "static_int4_dequantized",
  "description": "INT4 per-group symmetric quantization of LLM backbone weights. Vision encoder kept in fp16. Weights stored as dequantized fp16 for maximum compatibility.",
  "quantized_layers": 337,
  "skipped_vision_layers": 162
}
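
The `description` field says the backbone weights were quantized to INT4 per group and then stored as dequantized fp16. A minimal NumPy sketch of what such a quantize-then-dequantize step could look like is given here. It is not the script used to produce this checkpoint. The function name `fake_quantize_int4`, the choice to group along the input dimension, the clipping range [-7, 7], and the max-magnitude scale are all assumptions made for illustration; only `group_size = 128` and `bits = 4` come from the config.

```python
import numpy as np


def fake_quantize_int4(weight: np.ndarray, group_size: int = 128) -> np.ndarray:
    """Quantize to symmetric INT4 per group, then dequantize back to fp16.

    Hypothetical helper: grouping axis, range, and scale rule are assumptions.
    """
    out_features, in_features = weight.shape
    if in_features % group_size != 0:
        raise ValueError("in_features must be divisible by group_size")

    # Assumption: each group is a contiguous run along the input dimension.
    groups = weight.astype(np.float32).reshape(out_features, -1, group_size)

    # Assumption: symmetric range [-7, 7] with one scale per group,
    # taken from the largest magnitude in that group (no zero point).
    max_abs = np.abs(groups).max(axis=-1, keepdims=True)
    scale = np.where(max_abs > 0, max_abs / 7.0, 1.0)

    # Round to the nearest integer level, then map back to real values.
    q = np.clip(np.round(groups / scale), -7, 7)
    dequantized = q * scale

    return dequantized.reshape(out_features, in_features).astype(np.float16)


# Toy weight matrix: 4 output rows, 256 inputs, so 2 groups per row.
rng = np.random.default_rng(0)
w = rng.standard_normal((4, 256)).astype(np.float32)
w_dq = fake_quantize_int4(w, group_size=128)
```

Under these assumptions each group of 128 values collapses to at most 15 distinct levels, while the returned tensor still occupies fp16 storage, which matches the config's note that weights are kept as dequantized fp16 for compatibility.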