Active filters: chat
Text Generation • Updated • 607k • 2.22k
tensorblock/Sailor-0.5B-Chat-GGUF • 0.6B • Updated • 200
mlx-community/QwQ-32B-Coder-Fusion-9010-4bit • Text Generation • Updated • 13 • 1
tensorblock/Experiment31-7B-GGUF • Text Generation • 7B • Updated • 15
chende2024/QwQ-32B-Preview-Q4_0-GGUF • 33B • Updated • 1 • 1
chende2024/Qwen2.5-1.5B-Instruct-Q4_K_M-GGUF • Text Generation • 2B • Updated • 5
zai-org/VisionReward-Image • Text Generation • Updated • 11
Audio-Text-to-Text • 3B • Updated • 492 • 286
bartowski/Hermes-3-Llama-3.2-3B-GGUF • Text Generation • Updated • 6.03k • 13
ggml-org/Qwen2.5-Coder-1.5B-32B-speculative-GGUF • Text Generation • 2B • Updated • 98 • 5
mlx-community/Hermes-3-Llama-3.2-3B-4bit • Text Generation • 0.5B • Updated • 87 • 1
mlx-community/Hermes-3-Llama-3.2-3B-8bit • Text Generation • 0.9B • Updated • 275 • 1
mlx-community/Hermes-3-Llama-3.2-3B-bf16 • Text Generation • 3B • Updated • 7
mradermacher/Holland-4B-V1-GGUF • 5B • Updated • 60 • 1
JackeyLai/Qwen2.5-3B-Instruct-Q4_0-GGUF • Text Generation • 3B • Updated • 12
JackeyLai/Qwen2.5-7B-Instruct-Q4_0-GGUF • Text Generation • 8B • Updated • 50
cphan-intersystems/Qwen2.5-Coder-32B-Instruct-Q4_K_M-GGUF • Text Generation • 33B • Updated • 10
cphan-intersystems/Qwen2.5-32B-Instruct-Q4_K_M-GGUF • Text Generation • 33B • Updated • 3
mradermacher/QwQ-32B-Preview-GGUF • 33B • Updated • 14
NikolayKozloff/Llama-DNA-1.0-8B-Instruct-Q8_0-GGUF • Text Generation • 8B • Updated • 1 • 1
NikolayKozloff/Hermes-3-Llama-3.2-3B-Q8_0-GGUF • 3B • Updated • 4 • 2
tensorblock/Sailor-1.8B-Chat-GGUF
ericliu2007/Qwen2.5-14B-Instruct-Q2_K-GGUF • Text Generation • 15B • Updated • 1
Sg-at-srijan-us-kg/Qwen2.5-Coder-32B-Instruct-128k-yarn-Q4_K_M-GGUF • Text Generation • 33B • Updated • 5
tensorblock/Llama-2-7b-ultrachat200k-GGUF • Text Generation • 7B • Updated • 7
mradermacher/Hermes-3-Llama-3.2-3B-GGUF • 3B • Updated • 19 • 2
imkebe/Qwen2.5-Coder-14B-Instruct-rk3588-1.1.4 • Text Generation • Updated • 1 • 2
redrix/nepoticide-12B-Unslop-Unleashed-Mell-RPMax-v2 • Text Generation • 12B • Updated • 9 • 7
mradermacher/Hermes-3-Llama-3.2-3B-i1-GGUF • 3B • Updated • 148 • 1
ericliu2007/Qwen2.5-32B-Instruct-Q2_K-GGUF • Text Generation • 33B • Updated • 1