Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

47,233

Base only

Active filters: 4-bit

solarkyle/GLM-4.7-Flash-GGUF

Text Generation • 30B • Updated Jan 20 • 177 • 5

nota-ai/Solar-Open-100B-NotaMoEQuant-Int4

Text Generation • Updated Jan 26 • 200 • 47

openbmb/MiniCPM-o-4_5-awq

Any-to-Any • 9B • Updated Jun 2 • 47.5k • 22

lmstudio-community/Qwen3-Coder-Next-MLX-4bit

80B • Updated Feb 2 • 167k • 24

AtomGradient/Qwen3-TTS-0.6B-CustomVoice-4bit-pruned-vocab-lite

0.2B • Updated Feb 17 • 64 • 2

QuantTrio/Qwen3.5-122B-A10B-AWQ

Image-Text-to-Text • 125B • Updated Feb 26 • 20.7k • 29

aufklarer/Qwen3-ASR-0.6B-MLX-4bit

0.3B • Updated Apr 12 • 159k • 5

mlx-community/Qwen3.5-9B-MLX-4bit

Image-Text-to-Text • 2B • Updated Mar 23 • 20.9k • 144

mlx-community/Qwen3.5-2B-4bit

Image-Text-to-Text • 0.6B • Updated Mar 2 • 3.35k • 4

Qwen/Qwen3.5-35B-A3B-GPTQ-Int4

Image-Text-to-Text • 36B • Updated Apr 24 • 849k • 93

QuantTrio/Qwen3.5-9B-AWQ

Image-Text-to-Text • 10B • Updated Mar 4 • 642k • 24

lukey03/Qwen3.5-9B-abliterated-MLX-4bit

Text Generation • 1B • Updated Mar 3 • 1.74k • 17

mlx-community/Qwen3.5-27B-Claude-4.6-Opus-Distilled-MLX-4bit

27B • Updated Mar 6 • 15.7k • 233

mlx-community/Qwen3.5-0.8B-OptiQ-4bit

Text Generation • 0.2B • Updated 2 days ago • 2.29k • 21

mlx-community/Qwen3.5-4B-OptiQ-4bit

Text Generation • 0.9B • Updated 2 days ago • 6.67k • 25

pessini/Tucano2-qwen-3.7B-Think-MLX-4bit

0.6B • Updated Mar 14 • 410 • 1

happypatrick/Qwen3.5-122B-A10B-heretic-int4-AutoRound

Image-Text-to-Text • 19B • Updated Mar 15 • 4.7k • 13

TheCluster/Qwen3.5-9B-Uncensored-HauhauCS-Aggressive-MLX-mxfp4

Image-Text-to-Text • 9B • Updated Mar 17 • 3.3k • 13

mlx-community/Huihui-Qwen3.5-4B-Claude-4.6-Opus-abliterated-4bit

Image-Text-to-Text • 1.0B • Updated Mar 17 • 524 • 5

adpena/Vertigo-Qwen3.5-4B-v0.5-4bit

0.7B • Updated Mar 25 • 117 • 1

prabhal/mistral-clinical-simplifier

Text Generation • 7B • Updated Mar 27 • 8 • 1

Jackrong/MLX-Qwopus3.5-9B-v3-4bit

Text Generation • 1B • Updated Mar 31 • 2.87k • 34

unsloth/gemma-4-E4B-it-UD-MLX-4bit

Image-Text-to-Text • 2B • Updated Apr 13 • 1.02k • 42

mlx-community/gemma-4-26b-a4b-it-4bit

Image-Text-to-Text • 5B • Updated 11 days ago • 34.1k • 76

unsloth/gemma-4-31B-it-unsloth-bnb-4bit

Image-Text-to-Text • 31B • Updated May 4 • 546k • 21

deadbydawn101/gemma-4-E2B-Heretic-Uncensored-mlx-4bit

Image-Text-to-Text • 1B • Updated Apr 9 • 3.81k • 14

Intel/gemma-4-26B-A4B-it-int4-AutoRound

5B • Updated Apr 3 • 18.9k • 14

lmstudio-community/gemma-4-26B-A4B-it-MLX-4bit

Image-Text-to-Text • 5B • Updated Apr 10 • 176k • 9

deadbydawn101/gemma-4-E4B-mlx-4bit

Image-Text-to-Text • 2B • Updated Apr 9 • 260 • 11

jason-schulz/Gemma-4-26B-A4B-Hermes-VLM-MLX

Image-Text-to-Text • 5B • Updated Apr 6 • 714 • 6