ar-llm-browser/ALLaM-7B-SparseGPT-50
7B • Updated • 7
Collection of compressed ALLaM (by humain-ai) models. Methods include multiple quantization formats and pruning with SparseGPT and Wanda.
Note QLoRA quantization (int4)
Note LLM.int8() quantization (int8)