Inference Providers
Active filters: 4bit
TheCluster/gemma-3-27b-it-uncensored-mlx-4bit
Image-Text-to-Text
• Updated • 684
• 4
Lowkey-Loki/gemma-3-12b-it-textonly-mlx-4bit
Text Generation
• 2B • Updated • 35
btbtyler09/Llama-3.1-8B-Instruct-gptq-4bit
Text Generation
• 8B • Updated • 187
IrfanHamid/ChatBot-lora-7b
Updated
Text Generation
• 8B • Updated • 1
Serione/Llama-3.2-1B-SRLQ-4bit
Updated
jjeccles/autoround-quantized-4bit
adriabama06/DeepCoder-1.5B-Preview-AWQ
Text Generation
• Updated • 4
• 2
bubblspace/Bubbl-P4-multimodal-instruct
6B • Updated • 8
• 7
TheCluster/amoral-gemma-3-12B-v2-mlx-4bit
Image-Text-to-Text
• Updated • 473
• 2
TheCluster/amoral-gemma-3-27B-v2-mlx-4bit
Image-Text-to-Text
• Updated • 52
BoltMonkey/boltmonkey_shortreasoning-8b
Text Generation
• 8B • Updated • 2
BoltMonkey/boltmonkey_shortreasoning-8b-Q5_K_M-GGUF
Text Generation
• 8B • Updated • 3
TechyCode/tinyllama-sciq-lora
Text Generation
• Updated Sumo10/Phi-4-mini-instruct-AWQ-4bit
4B • Updated • 469
• 1
Sumo10/Llama-3.2-3B-Instruct-AWQ-4bit
3B • Updated cyberandy/SEOcrate-4B_grpo_new_01
Text Generation
• 4B • Updated • 13
• 7
Chun121/qwen3-4B-rpg-roleplay
Text Generation
• 4B • Updated • 486
• 15
8B • Updated • 5
• 1
SujitShelar/llama3-medchat-8b-lora
Question Answering
• Updated boods/mistral-location-extractor-4bit
Text Generation
• 7B • Updated • 3
mradermacher/SEOcrate-4B_grpo_new_01-GGUF
Reinforcement Learning
• 4B • Updated • 166
• 1
mradermacher/SEOcrate-4B_grpo_new_01-i1-GGUF
Reinforcement Learning
• 4B • Updated • 839
vannishh/llama3-2.1B-4bit-finetuned
Programmer-RD-AI/ResearchQwen-2.5-3B-LoRA
Question Answering
• 3B • Updated • 4
CodCodingCode/DeepSeek-V2-medical
Text Generation
• Updated tripolskypetr/Plutus-Meta-Llama-3.1-8B-Instruct-bnb-4bit
Text Generation
• 8B • Updated • 129
• 1
abdou-u/MNLP_M2_quantized_model
Text Generation
• 0.6B • Updated • 1
HagalazAI/CyberDolphin-2.9.3-mistral-nemo-12b
Text Generation
• 12B • Updated • 30
• 1
HagalazAI/CyberDolphin-2.9.3-mistral-nemo-12b-GGUF
Text Generation
• 12B • Updated • 102
• 2