Inference Providers
Active filters: instruct
ModelCloud/GLM-4.6-REAP-268B-A32B-GPTQMODEL-W4A16
Text Generation
• 269B • Updated • 3
• 2
mradermacher/Gemma-2-2B-MedicalQA-Assistant-GGUF
3B • Updated • 37
godcodev/Hermes-2.5-Mistral-7B
7B • Updated • 5
• 1
ModelCloud/MiniMax-M2-GPTQMODEL-W4A16
Text Generation
• 229B • Updated • 13
• 3
allura-org/Lune-Mamba-3B-v1
3B • Updated • 3
mradermacher/Hermes-2.5-Mistral-7B-GGUF
7B • Updated • 27
• 1
mradermacher/Hermes-2.5-Mistral-7B-i1-GGUF
7B • Updated • 1.2k
• 1
ModelCloud/Marin-32B-Base-GPTQMODEL-W4A16
Text Generation
• 33B • Updated • 2
• 1
allura-org/Lune-Mamba-3B-v1-GRPO_IF
3B • Updated • 7
• 2
mradermacher/Lune-Mamba-3B-v1-GGUF
3B • Updated • 43
mradermacher/Lune-Mamba-3B-v1-GRPO_IF-GGUF
3B • Updated • 28
• 1
mradermacher/Lune-Mamba-3B-v1-i1-GGUF
3B • Updated • 43
mradermacher/Lune-Mamba-3B-v1-GRPO_IF-i1-GGUF
3B • Updated • 132
ModelCloud/Marin-32B-Base-GPTQMODEL-AWQ-W4A16
Text Generation
• 33B • Updated • 5
• 2
ModelCloud/Granite-4.0-H-1B-GPTQMODEL-W4A16
Text Generation
• 1B • Updated • 11
• 1
ModelCloud/Granite-4.0-H-350M-GPTQMODEL-W4A16
Text Generation
• 0.3B • Updated • 7
• 1
ModelCloud/Brumby-14B-Base-GPTQMODEL-W4A16
Text Generation
• 15B • Updated • 4
• 1
ModelCloud/Brumby-14B-Base-GPTQMODEL-W4A16-v2
Text Generation
• 15B • Updated • 40
• 1
Wwayu/Hermes-4-70B-mlx-4Bit
Text Generation
• Updated • 16
Image-Text-to-Text
• 4B • Updated • 6
davidcondrey/llama-3.3-70B-i-ft-rev3
Updated
mradermacher/zen-vl-4b-instruct-GGUF
Image-Text-to-Text
• 4B • Updated • 343
mradermacher/zen-vl-4b-instruct-i1-GGUF
Image-Text-to-Text
• 4B • Updated • 2.39k
NobodyExistsOnTheInternet/hermes-4-405b-e2.5
Text Generation
• 406B • Updated • 4
jimpre/Llama-3-8B-Lexi-Uncensored-Q4_K_M-GGUF
8B • Updated • 43
• 2
NobodyExistsOnTheInternet/H3-600b
Text Generation
• 807B • Updated • 2
viggovet/viggoVet-Reasoning-20B
Text Generation
• 21B • Updated riomus/Qwen3-Coder-30B-A3B-Instruct-fused
31B • Updated beebsluvr/Nous-Hermes-2-SOLAR-10.7B-Q4_0-GGUF
11B • Updated • 8
viggovet/viggoVet-Standard-32B
Text Generation
• 33B • Updated