Inference Providers
Active filters: 4bit
benyamini/DSR1-8B-llmc-awq-w4
Text Generation
• 2B • Updated • 1
Sai2076/LLLMA_FINETUNED_PROJEN
Updated
edge-inference/DSR1-8B-llmc-awq-w4
Text Generation
• 2B • Updated • 1
edge-inference/DSR1-14B-llmc-awq-w4
Text Generation
• 3B • Updated • 1
edge-inference/DSR1-32B-llmc-awq-w4
Text Generation
• 6B • Updated • 4
UAB-NLP/ProjGen_Finetuned_llama
Updated
edge-inference/DSR1-1.5B-llmc-awq-w4
Text Generation
• 2B • Updated • 1
• 1
brabooObrabo/Qwen3-4B-Instruct-2507-MLX-4bit-GS32-embed-8bit-GS32
Text Generation
• 0.8B • Updated • 45
marcusmi4n/abeja-qwen2.5-7b-japanese-quantized
Text Generation
• 8B • Updated • 4
TroglodyteDerivations/MLX_DeepSeek_V3_1_4bit
Text Generation
• Updated NangWeiLun/MiMo-VL-7B-SFT-2508-bnb-4bit-fp4
Image-Text-to-Text
• 8B • Updated • 12
NangWeiLun/MiMo-VL-7B-RL-2508-bnb-4bit-fp4
Image-Text-to-Text
• 8B • Updated • 303
2imi9/Qwen3-1.7b-gptq-int4
Text Generation
• 2B • Updated • 4
Text Generation
• 0.6B • Updated • 127
• 1
sweatSmile/DeepSeek-R1-Distill-Qwen-1.5B-Alpaca-Instruct
2B • Updated • 3
• 1
analystgatitu/economist_model_v3
Text Generation
• Updated • 1
analystgatitu/economist_model_v4
Text Generation
• 3B • Updated • 1
KavinduHansaka/phi4-mini-bnb-4bit
Text Generation
• 4B • Updated • 12
mlx-community/Apriel-1.5-15b-Thinker-3bit-MLX
Image-Text-to-Text
• Updated • 10
aciklab/kubernetes-ai-4bit
12B • Updated • 2
• 2
Dhana8907/Llama-3.1-8B-Instruct-4bit
Text Generation
• 8B • Updated • 12
huggingface-lc/Qwen3-Coder-30B-A3B-Instruct-MLX-4bit-mxfp4
Text Generation
• 31B • Updated • 547
• 1
SiddhJagani/gpt-oss-20b-no-think-mlx-Q4
Text Generation
• 21B • Updated • 36
ellyfantina/llama3-medquad-lora
MightyOctopus/qwen3-0.6B-lora-medical
Updated
iMiW/Giga-Embeddings-instruct-4bit-nf4
Feature Extraction
• 4B • Updated • 430
ModelCloud/GLM-4.6-GPTQMODEL-W4A16-v1
Text Generation
• 357B • Updated • 10
ModelCloud/GLM-4.6-GPTQMODEL-W4A16-v2
Text Generation
• 357B • Updated • 3
• 1
MidnightPhreaker/KAT-Dev-72B-Exp-GPTQ-INT4-gs32-0.01
75B • Updated • 1
• 1
MidnightPhreaker/KAT-Dev-72B-Exp-GPTQ-INT4-gs32
75B • Updated • 1