Active filters: 4-bit
Text Generation • 770k downloads • 22 likes
openbmb/MiniCPM-o-4_5-awq • Any-to-Any • 9B • 230 downloads • 9 likes
nota-ai/Solar-Open-100B-NotaMoEQuant-Int4 • Text Generation • 417 downloads • 41 likes
inferencerlabs/Kimi-K2.5-MLX-3.6bit • Text Generation • 1T • 3.03k downloads • 13 likes
mlx-community/GLM-4.7-Flash-4bit • Text Generation • 30B • 21.1k downloads • 52 likes
mlx-community/Step-3.5-Flash-4bit • Text Generation • 197B • 1.81k downloads • 6 likes
mlx-community/Qwen3-ASR-0.6B-4bit • 0.3B • 217 downloads • 4 likes
lmstudio-community/Qwen3-Coder-Next-MLX-4bit • 80B • 108k downloads • 4 likes
unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit • Text Generation • 8B • 250k downloads • 95 likes
unsloth/gemma-3-12b-it-bnb-4bit • Image-to-Text • 13B • 30k downloads • 34 likes
mlx-community/NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4 • Text Generation • 32B • 340 downloads • 7 likes
Text Generation • 2B • 18 downloads • 13 likes
ubaitur5/Ministral-3b-instruct-Q4-mlx • Text Generation • 55 downloads • 2 likes
MaziyarPanahi/Qwen3-14B-GGUF • Text Generation • 15B • 262k downloads • 7 likes
Intel/Qwen3-Coder-30B-A3B-Instruct-int4-AutoRound • 0.6B • 783 downloads • 8 likes
mlx-community/gpt-oss-20b-MXFP4-Q8 • Text Generation • 701k downloads • 29 likes
unsloth/Qwen3-Next-80B-A3B-Instruct-bnb-4bit • Text Generation • 87k downloads • 26 likes
Text-to-Image • 2 downloads • 6 likes
plezan/MiniMax-M2.1-REAP-50-W4A16 • Text Generation • 17B • 1.38k downloads • 5 likes
Disty0/FLUX.2-klein-4B-SDNQ-4bit-dynamic • Text-to-Image • 6.83k downloads • 6 likes
Intel/GLM-4.7-Flash-int4-AutoRound • 1B • 1.53k downloads • 5 likes
divyajot5005/Qwen3-TTS-12Hz-1.7B-Base-BNB-4bit • Text-to-Speech • 212 downloads • 2 likes
Euraika/EuroLLM-22B-Instruct-GPTQ • Text Generation • 23B • 708 downloads • 2 likes
DeathGodlike/SicariusSicariiStuff_Assistant-Pepe-8B_EXL3 • Text Generation • 2 downloads • 2 likes
Text Generation • 1B • 103 downloads • 2 likes
mlx-community/Qwen3-Coder-Next-4bit • Text Generation • 80B • 995 downloads • 2 likes
nightmedia/Qwen3-Coder-Next-mxfp4-mlx • Text Generation • 80B • 848 downloads • 2 likes
casperhansen/tinyllama-1b-awq • Text Generation • 60 downloads • 1 like
TheBloke/TinyLlama-1.1B-Chat-v0.3-AWQ • Text Generation • 1B • 81.6k downloads • 4 likes
TheBloke/deepseek-coder-1.3b-instruct-AWQ • Text Generation • 1B • 226 downloads • 5 likes