ALLaM (7B) Compressed - a ar-llm-browser Collection

ar-llm-browser 's Collections

Qwen3 (8B) Compressed

ALLaM (7B) Compressed

Small Pruned Models

ALLaM (7B) Compressed

updated 22 days ago

Collection of compressed ALLaM (by humain-ai) models. Methods include multiple quantization formats and pruning with SparseGPT and Wanda.

ar-llm-browser/ALLaM-7B-SparseGPT-50

7B • Updated Apr 25 • 7
ar-llm-browser/ALLaM-7B-SparseGPT-50-2of4

5B • Updated Apr 25 • 2
ar-llm-browser/ALLaM-7B-Wanda-50

7B • Updated about 1 month ago • 8
ar-llm-browser/ALLaM-7B-int4

7B • Updated 22 days ago • 26

Note QLoRA quantization (int4)
ar-llm-browser/ALLaM-7B-int8

7B • Updated 22 days ago • 33

Note LLM.int8() quantization (int8)