Inference Providers
Active filters: 4bit
BennyDaBall/Mistral-44B-MoE-Patched-MLX-4bit-G64
Text Generation
• 44B • Updated • 65
benyamini/DeepSeek-R1-Distill-Llama-8B-AWQ-w4g128
NangWeiLun/MiMo-VL-7B-SFT-bnb-4bit-nf4-dq
Image-Text-to-Text
• 8B • Updated • 2
NangWeiLun/MiMo-VL-7B-SFT-bnb-4bit-nf4
Image-Text-to-Text
• 8B • Updated • 2
NangWeiLun/MiMo-VL-7B-SFT-bnb-4bit-fp4
Image-Text-to-Text
• 8B • Updated • 3
benyamini/DSR1-8B-llmc-awq-w4
Text Generation
• 2B • Updated • 5
Sai2076/LLLMA_FINETUNED_PROJEN
Updated
edge-inference/DSR1-8B-llmc-awq-w4
Text Generation
• 2B • Updated • 4
edge-inference/DSR1-14B-llmc-awq-w4
Text Generation
• 3B • Updated • 2
edge-inference/DSR1-32B-llmc-awq-w4
Text Generation
• 6B • Updated • 3
UAB-NLP/ProjGen_Finetuned_llama
Updated
edge-inference/DSR1-1.5B-llmc-awq-w4
Text Generation
• 0.6B • Updated • 3
• 1
brabooObrabo/Qwen3-4B-Instruct-2507-MLX-4bit-GS32-embed-8bit-GS32
Text Generation
• 0.8B • Updated • 51
marcusmi4n/abeja-qwen2.5-7b-japanese-quantized
Text Generation
• 8B • Updated • 7
TroglodyteDerivations/MLX_DeepSeek_V3_1_4bit
Text Generation
• Updated NangWeiLun/MiMo-VL-7B-SFT-2508-bnb-4bit-fp4
Image-Text-to-Text
• 8B • Updated • 3
NangWeiLun/MiMo-VL-7B-RL-2508-bnb-4bit-fp4
Image-Text-to-Text
• 8B • Updated • 1.12k
2imi9/Qwen3-1.7b-gptq-int4
Text Generation
• 0.9B • Updated • 3
Text Generation
• 0.6B • Updated • 45
• 1
sweatSmile/DeepSeek-R1-Distill-Qwen-1.5B-Alpaca-Instruct
2B • Updated • 1
• 1
analystgatitu/economist_model_v3
Text Generation
• Updated • 5
analystgatitu/economist_model_v4
Text Generation
• 3B • Updated • 3
KavinduHansaka/phi4-mini-bnb-4bit
Text Generation
• 4B • Updated • 20
mlx-community/Apriel-1.5-15b-Thinker-3bit-MLX
Image-Text-to-Text
• Updated • 6
aciklab/kubernetes-ai-4bit
Image-Text-to-Text
• 12B • Updated • 42
• 2
Dhana8907/Llama-3.1-8B-Instruct-4bit
Text Generation
• 8B • Updated • 1
huggingface-lc/Qwen3-Coder-30B-A3B-Instruct-MLX-4bit-mxfp4
Text Generation
• 31B • Updated • 439
• 2
SiddhJagani/gpt-oss-20b-no-think-mlx-Q4
Text Generation
• 21B • Updated • 24
ellyfantina/llama3-medquad-lora
MightyOctopus/qwen3-0.6B-lora-medical
Updated