Active filters: 8-bit
RedHatAI/Qwen3-VL-235B-A22B-Instruct-NVFP4 • Text Generation • 133B • Updated • 10.6k • 11
pchamart/schematron3B-mlx-8bit • Text Generation • 0.9B • Updated • 67 • 2
mlx-community/Qwen3-VL-2B-Instruct-8bit • Image-Text-to-Text • Updated • 62 • 1
Text Generation • 5B • Updated • 1k • 1
mlx-community/DeepSeek-OCR-8bit • Image-Text-to-Text • 1B • Updated • 1.92k • 32
Firworks/Kimi-Linear-48B-A3B-Instruct-nvfp4 • 28B • Updated • 452 • 10
nightmedia/Qwen3-VLTO-8B-Thinking-160K-qx86x-hi-mlx • Text Generation • 8B • Updated • 18 • 1
NeoChen1024/gemma-3-27b-it-NVFP4 • Image-Text-to-Text • 18B • Updated • 2.72k • 2
Ex0bit/Qwen3-VLTO-32B-Instruct-NVFP4 • Text Generation • 17B • Updated • 229 • 1
Ex0bit/Qwen3-VLTO-32B-Instruct-NVFP4-256K • Text Generation • 17B • Updated • 150 • 1
mratsim/Behemoth-X-123B-v2-NVFP4 • Text Generation • 69B • Updated • 175 • 2
kldzj/gpt-oss-120b-heretic-v2 • Text Generation • 117B • Updated • 670 • 20
Ex0bit/OLMo-3-7B-Instruct-NVFP4-1M • Text Generation • 4B • Updated • 26 • 2
Firworks/GLM-4.5-Air-Derestricted-nvfp4 • 61B • Updated • 156 • 3
EricRollei/Hunyuan_Image_3_Int8 • Text-to-Image • 83B • Updated • 67 • 2
mlx-community/Ministral-3-14B-Instruct-2512-8bit • Updated • 189 • 1
mlx-community/Ministral-3-14B-Reasoning-2512-8bit • Updated • 714 • 2
RedHatAI/Qwen3-VL-32B-Instruct-NVFP4 • Text Generation • 20B • Updated • 14.3k • 3
mlx-community/mistralai_Devstral-Small-2-24B-Instruct-2512-MLX-8Bit • Text Generation • Updated • 781 • 6
cybermotaz/qwen3-vl-4b-thinking-nvfp4-w4a16 • Image-Text-to-Text • 3B • Updated • 6 • 1
cybermotaz/qwen3-vl-8b-thinking-nvfp4-w4a16 • Image-Text-to-Text • 5B • Updated • 252 • 2
Ex0bit/Elbaz-NVIDIA-Nemotron-3-Nano-30B-A3B-PRISM-NVFP4 • Text Generation • 16B • Updated • 47 • 2
Text Generation • 177B • Updated • 4.75k • 15
kriyeng/gpt-oss-120b-math • Text Generation • 120B • Updated • 33 • 1
nvidia/DeepSeek-V3.2-NVFP4 • Text Generation • 394B • Updated • 8.82k • 5
nightmedia/Qwen3-4B-Agent-Claude-Gemini-heretic-qx86-hi-mlx • Text Generation • 1B • Updated • 71 • 1
mlx-community/LFM2.5-VL-1.6B-8bit • Image-Text-to-Text • 0.7B • Updated • 101 • 1
AITRADER/Amsi-fin-o1-MLX-8bit • Image-Text-to-Text • 2B • Updated • 2 • 1
Octen/Octen-Embedding-8B-INT8 • Sentence Similarity • 8B • Updated • 181 • 4
gesong2077/GLM-4.5-Air-Derestricted-NVFP4 • 63B • Updated • 7 • 2