Active filters: mlx
mlx-community/JOSIE-TinyLlama-1.1B-32k-base-4bit
Text Generation
• 0.2B • Updated • 33
• 1
mlx-community/JOSIE-TinyLlama-1.1B-32k-base-8bit
Text Generation
• 0.3B • Updated • 17
• 1
mlx-community/Phi-3-small-8k-instruct-aq4_64
Text Generation
• Updated • 23
mlx-community/openchat-3.6-8b-20240522-8bit
Text Generation
• Updated • 32
• 2
mlx-community/openchat-3.6-8b-20240522-4bit
Text Generation
• Updated • 37
• 1
mlx-community/openchat-3.6-8b-20240522-2bit
Text Generation
• Updated • 23
lmbelo/OpenELM-270M-Function-Calling
0.3B • Updated • 5
cstr/Llama3-DiscoLeo-Instruct-8B-v0.1-mlx
Text Generation
• 1B • Updated • 8
dfurman/Mistral-7B-Instruct-v0.3-mlx-4bit
1B • Updated • 58
mayflowergmbh/Llama3-German-8B-4bit
Text Generation
• 2B • Updated • 12
mayflowergmbh/Llama3-German-8B-32k-4bit
Text Generation
• 2B • Updated • 24
• 1
mayflowergmbh/Llama3-DiscoLeo-Instruct-8B-v0.1-4bit
Text Generation
• 2B • Updated • 2
mayflowergmbh/Llama3-DiscoLeo-Instruct-8B-32k-v0.1-4bit
Text Generation
• 2B • Updated • 2
mlx-community/OpenELM-270M-Instruct-4bit
Updated • 13
• 1
lmbelo/Phi-3-mini-4k-instruct
Text Generation
• 4B • Updated • 7
cstr/llama3-8b-spaetzle-v33-mlx-4bit
1B • Updated • 2
mlx-community/Codestral-22B-v0.1-4bit
3B • Updated • 1.06k
• 13
mlx-community/Codestral-22B-v0.1-8bit
6B • Updated • 312
• 8
lmbelo/Phi-3-mini-4k-Function-Calling
Text Generation
• 4B • Updated • 13
mlx-community/AutoCoder-33B-4bit
Updated • 55
• 2
xiaotianxt/llama-3-chinese-8b-instruct-v3-4bit-mlx
1B • Updated • 50
mlx-community/Phi-3-small-8k-instruct-AQ4_32
Text Generation
• Updated • 31
• 2
xiaotianxt/llama-3-treehole-8b-instruct
8B • Updated • 1
xiaotianxt/llama-3-treehole-8b-v3
1B • Updated • 8
mlx-community/dolphin-2.9.2-Phi-3-Medium-4bit
Updated • 21
• 1
mlx-community/dolphin-2.9.2-Phi-3-Medium-8bit
Updated • 20
• 1
mlx-community/dolphin-2.9.2-Phi-3-Medium-2bit
ipihq/Phi-3-medium-128k-instruct
Text Generation
• 14B • Updated • 7
ipihq/Phi-3-medium-128k-instruct_q
Text Generation
• 2B • Updated • 6
meriamcherif/Llama-2-7b-chat-hf-Quantized
Text Generation
• 1B • Updated • 14