Upload INT4 quantized model with bfloat16 compute, extreme shrinkage and modifications. 1910067 verified jnjj committed on Apr 24, 2025
Upload INT4 quantized Gemma‑3‑1B‑IT QAT with bfloat16 compute, extreme shrinkage (100% weight prune, only weights saved), and extensive unconventional modifications including GPTQ/AWQ flags (bfloat16 compute) de2a4ab verified jnjj committed on Apr 24, 2025
Upload INT4 quantized Gemma‑3‑1B‑IT QAT with bfloat16 compute, extreme shrinkage (100% weight prune, only weights saved), and extensive unconventional modifications including GPTQ/AWQ flags (bfloat16 compute) b3b3372 verified jnjj committed on Apr 24, 2025