Update config.json with quantization information 2621a7e verified SamMikaelson commited on 10 days ago
Upload GPTQ 4-bit packed model (75.0% size reduction, 4.0x compression) 42a1727 verified SamMikaelson commited on 11 days ago