Models (851)

Active filters: 4bit

benyamini/DSR1-8B-llmc-awq-w4

Text Generation • 2B • Updated Aug 29, 2025 • 1

Sai2076/LLLMA_FINETUNED_PROJEN

Updated Aug 30, 2025

edge-inference/DSR1-8B-llmc-awq-w4

Text Generation • 2B • Updated Aug 30, 2025 • 1

edge-inference/DSR1-14B-llmc-awq-w4

Text Generation • 3B • Updated Aug 30, 2025 • 1

edge-inference/DSR1-32B-llmc-awq-w4

Text Generation • 6B • Updated Aug 30, 2025 • 4

UAB-NLP/ProjGen_Finetuned_llama

Updated Aug 30, 2025

edge-inference/DSR1-1.5B-llmc-awq-w4

Text Generation • 2B • Updated Aug 30, 2025 • 1 • 1

brabooObrabo/Qwen3-4B-Instruct-2507-MLX-4bit-GS32-embed-8bit-GS32

Text Generation • 0.8B • Updated Oct 7, 2025 • 45

marcusmi4n/abeja-qwen2.5-7b-japanese-quantized

Text Generation • 8B • Updated Sep 1, 2025 • 4

TroglodyteDerivations/MLX_DeepSeek_V3_1_4bit

Text Generation • Updated Sep 5, 2025

NangWeiLun/MiMo-VL-7B-SFT-2508-bnb-4bit-fp4

Image-Text-to-Text • 8B • Updated Sep 9, 2025 • 12

NangWeiLun/MiMo-VL-7B-RL-2508-bnb-4bit-fp4

Image-Text-to-Text • 8B • Updated Sep 9, 2025 • 303

2imi9/Qwen3-1.7b-gptq-int4

Text Generation • 2B • Updated Sep 12, 2025 • 4

zooai/nano-1

Text Generation • 0.6B • Updated Sep 12, 2025 • 127 • 1

sweatSmile/DeepSeek-R1-Distill-Qwen-1.5B-Alpaca-Instruct

2B • Updated Sep 20, 2025 • 3 • 1

analystgatitu/economist_model_v3

Text Generation • Updated Sep 30, 2025 • 1

analystgatitu/economist_model_v4

Text Generation • 3B • Updated Oct 2, 2025 • 1

KavinduHansaka/phi4-mini-bnb-4bit

Text Generation • 4B • Updated Oct 1, 2025 • 12

mlx-community/Apriel-1.5-15b-Thinker-3bit-MLX

Image-Text-to-Text • Updated Oct 3, 2025 • 10

aciklab/kubernetes-ai-4bit

12B • Updated Oct 3, 2025 • 2 • 2

Dhana8907/Llama-3.1-8B-Instruct-4bit

Text Generation • 8B • Updated Oct 10, 2025 • 12

huggingface-lc/Qwen3-Coder-30B-A3B-Instruct-MLX-4bit-mxfp4

Text Generation • 31B • Updated Oct 10, 2025 • 547 • 1

SiddhJagani/gpt-oss-20b-no-think-mlx-Q4

Text Generation • 21B • Updated Oct 16, 2025 • 36

ellyfantina/llama3-medquad-lora

Updated Oct 15, 2025 • 2

MightyOctopus/qwen3-0.6B-lora-medical

Updated Oct 17, 2025

iMiW/Giga-Embeddings-instruct-4bit-nf4

Feature Extraction • 4B • Updated Oct 16, 2025 • 430

ModelCloud/GLM-4.6-GPTQMODEL-W4A16-v1

Text Generation • 357B • Updated Oct 28, 2025 • 10

ModelCloud/GLM-4.6-GPTQMODEL-W4A16-v2

Text Generation • 357B • Updated Oct 28, 2025 • 3 • 1

MidnightPhreaker/KAT-Dev-72B-Exp-GPTQ-INT4-gs32-0.01

75B • Updated Oct 22, 2025 • 1 • 1

MidnightPhreaker/KAT-Dev-72B-Exp-GPTQ-INT4-gs32

75B • Updated Oct 22, 2025 • 1