Active filters: 4-bit
Qwen/Qwen3-235B-A22B-MLX-4bit • Text Generation • 33B • 279 • 16
AngelSlim/Qwen3-32B_int4_gptq • 33B • 29.4k • 1
mlx-community/SmolLM3-3B-4bit • Text Generation • 2.39k • 5
mlx-community/LFM2-350M-4bit • Text Generation • 55.4M • 577 • 6
unsloth/LFM2-350M-unsloth-bnb-4bit • Text Generation • 0.4B • 99 • 1
QuantTrio/Qwen3-235B-A22B-Instruct-2507-GPTQ-Int4-Int8Mix • Text Generation • 248B • 439 • 3
QuantTrio/Qwen3-Coder-480B-A35B-Instruct-GPTQ-Int4-Int8Mix • Text Generation • 534B • 43 • 7
QuantTrio/Qwen3-235B-A22B-Thinking-2507-GPTQ-Int4-Int8Mix • Text Generation • 253B • 21 • 3
fengpeisheng1/Peach-2.0-9B-8k-Roleplay-mlx-4Bit • Text Generation • 1B • 71 • 1
lmstudio-community/Qwen3-Coder-30B-A3B-Instruct-MLX-4bit • Text Generation • 31B • 160k • 14
mlx-community/Qwen3-4B-Instruct-2507-4bit • Text Generation • 0.6B • 13.4k • 8
steampunque/gpt-oss-20b-Hybrid-GGUF • 21B • 19 • 1
unsloth/Qwen3-4B-Instruct-2507-unsloth-bnb-4bit • Text Generation • 4B • 244k • 13
mlx-community/Qwen3-Coder-30B-A3B-Instruct-4bit-dwq-v2 • Text Generation • 487 • 7
openbmb/MiniCPM-V-4_5-AWQ • Image-Text-to-Text • 9B • 3.31k • 13
lmstudio-community/Hermes-4-405B-MLX-4bit • Text Generation • 406B • 120 • 1
Intel/Seed-OSS-36B-Instruct-int4-AutoRound • 2B • 77 • 14
driaforall/mem-agent-mlx-4bit • Text Generation • 0.6B • 46 • 5
Intel/Qwen3-Next-80B-A3B-Thinking-int4-mixed-AutoRound • Text Generation • 232 • 32
mlx-community/Qwen3-Next-80B-A3B-Thinking-4bit • Text Generation • 29.7k • 4
mlx-community/Jinx-gpt-oss-20b-mxfp4-mlx • Text Generation • 21B • 109 • 3
nightmedia/LIMI-Air-mxfp4-mlx • Text Generation • 107B • 19 • 2
QuantTrio/Qwen3-VL-235B-A22B-Instruct-AWQ • Text Generation • 236B • 6.91k • 13
QuantTrio/Qwen3-VL-30B-A3B-Instruct-AWQ • Text Generation • 31B • 467k • 37
mku64/CheapResearch-4B-Thinking-mlx-4Bit • 0.6B • 11 • 1
mlx-community/LFM2-8B-A1B-4bit • Text Generation • 1B • 537 • 7
unsloth/Qwen3-VL-8B-Thinking-bnb-4bit • Image-Text-to-Text • 9B • 473 • 2
lmstudio-community/Qwen3-VL-8B-Instruct-MLX-4bit • Image-Text-to-Text • 166k • 4
mlx-community/Qwen3-VL-2B-Thinking-4bit • Image-Text-to-Text • 96 • 1
QuantTrio/Qwen3-VL-32B-Instruct-AWQ • Image-Text-to-Text • 33B • 86.3k • 9