Active filters: 4bit
sanchezalonsodavid17/DeepSeek-OCR-MBQ-Quantized-v1
Image-Text-to-Text
• 3B • Updated • 7
• 6
ModelCloud/MiniMax-M2-GPTQMODEL-W4A16
Text Generation
• 229B • Updated • 12
• 3
ModelCloud/Marin-32B-Base-GPTQMODEL-W4A16
Text Generation
• 33B • Updated • 3
• 1
ModelCloud/Marin-32B-Base-GPTQMODEL-AWQ-W4A16
Text Generation
• 33B • Updated • 3
• 2
ModelCloud/Granite-4.0-H-1B-GPTQMODEL-W4A16
Text Generation
• 1B • Updated • 11
• 1
ModelCloud/Granite-4.0-H-350M-GPTQMODEL-W4A16
Text Generation
• 0.3B • Updated • 6
• 1
ModelCloud/Brumby-14B-Base-GPTQMODEL-W4A16
Text Generation
• 15B • Updated • 3
• 1
ModelCloud/Brumby-14B-Base-GPTQMODEL-W4A16-v2
Text Generation
• 15B • Updated • 3
• 1
CHF0101/medquad-lora-r4-best
Updated
CHF0101/medquad-lora-r4-best-v2
CHF0101/medquad-lora-r32-best-v2
sweatSmile/Phi3-Mini-FinSight-FinancialQA
Text Generation
• 4B • Updated • 2
• 1
ikarius/Granite-3.2-8b-instruct-Abliterated-NF4
Text Generation
• 8B • Updated • 4
• 1
ikarius/NeuralDaredevil-8B-abliterated-NF4
Text Generation
• 8B • Updated • 5
• 1
ikarius/Qwen2.5-Coder-14B-Instruct-Abliterated-NF4
Text Generation
• 15B • Updated • 11
• 1
4B • Updated • 11
lunovian/Qwen2.5-Math-7B-Instruct-4bit
2B • Updated
Plurigrid/DR-Tulu-8B-MLX-4bit
1B • Updated • 2
ujjwal52/Llama-2-7b-FLASH-UK
Text Generation
• 7B • Updated • 2
• 1
Plurigrid/Olmo-3-32B-Think-MLX-4bit
Text Generation
• 32B • Updated • 1
beta3/gemma3_1b_title_generator
Sugandha-Chauhan/BioMistral-7B-SymptomDiagnosis
Text Classification
• Updated • 3
Text Generation
• 0.6B • Updated • 111
• 3
ikarius/Qwen2.5-Coder-32B-Instruct-Abliterated-NF4
33B • Updated • 4
• 1
0xSero/GLM-4.6-REAP-218B-A32B-W4A16-AutoRound
Text Generation
• 2B • Updated • 74
• 8
smkrv/Qwen3-0.6B-CoreML-4bit
Text Generation
• Updated • 124
• 2