Inference Providers
Active filters: mlx
smdesai/paligemma-3b-mix-448-6bit
Image-Text-to-Text
• 0.7B • Updated • 3
smdesai/paligemma2-3b-pt-448-4bit
Image-Text-to-Text
• 0.6B • Updated • 6
smdesai/paligemma2-3b-pt-448-6bit
Image-Text-to-Text
• 0.8B • Updated • 6
ggbetz/Llama-3.1-Argunaut-1-8B-SFT-Q4-mlx
Text Generation
• 1B • Updated • 6
cmcmaster/Llama-3.2-3B-Q4-mlx
Text Generation
• 0.5B • Updated • 4
mlx-community/Llama-3.2-1B-Instruct-MLXTuned
Text Generation
• 1B • Updated • 737
• 5
alexander583/DeepSeek-V2-Lite-Chat-Q4-mlx
2B • Updated • 29
WaveCut/PRIME-RL_Eurus-2-7B-PRIME-Q8-mlx
2B • Updated • 10
• 1
mlx-community/Llama-3.2-1B-Instruct-mlx-FinGreyLit-finetuned
1B • Updated • 21
• 1
ubaitur5/SmallThinker-3B-Preview-Q4-mlx
Text Generation
• Updated • 2
mlx-community/DeepSeek-V3-3bit
105B • Updated • 128
• 3
mlx-community/smallthinker-3b-preview-q8
Text Generation
• Updated • 7
mlx-community/smallthinker-3b-preview-q4
Text Generation
• Updated • 6
mlx-community/DeepSeek-V3-3bit-bf16
105B • Updated • 90
• 2
mlx-community/Dolphin3.0-Llama3.1-8B-4bit
1B • Updated • 201
mlx-community/Dolphin3.0-Llama3.1-8B-8bit
2B • Updated • 91
mlx-community/Dolphin3.0-Llama3.1-8B-bf16
8B • Updated • 38
mlx-community/Mistral-Nemo-Instruct-2407-3bit
Updated • 59
• 1
mlx-community/HuatuoGPT-o1-72B-4bit
Text Generation
• 11B • Updated • 35
• 1
mlx-community/HuatuoGPT-o1-7B-4bit
Text Generation
• 1B • Updated • 11
ivanfioravanti/Phi-3.5-mini-instruct-italian-wine
Text Generation
• 4B • Updated • 10
• CuckmeisterFuller/Dolphin3.0-Qwen2.5-3b-Q4-mlx
0.5B • Updated • 44
mlx-community/Dolphin3.0-Llama3.1-8B-6bit
2B • Updated • 42
mlx-community/Qwen2.5-Coder-32B-Instruct-abliterated-4bit
Text Generation
• Updated • 185
• 1
mlx-community/Tiger-Gemma-9B-v3-Q4-mlx
Updated • 67
• 1
mlx-community/Qwen2.5-Coder-32B-Instruct-abliterated-3bit
Text Generation
• Updated • 67
• 1
prdnr/UwU-7B-Instruct-Q4-mlx
Text Generation
• 1B • Updated • 6
• 1
pcuenq/gemma-2-2b-it-4bit
Text Generation
• 0.4B • Updated • 12
pcuenq/gemma-2-2b-it-4bit-test
Text Generation
• 0.4B • Updated • 8
mlx-community/SmallThinker-3B-Preview-4bit
Text Generation
• Updated • 12
• 1