Active filters: chat
mradermacher/Darkens-8B-GGUF
8B • Updated • 105
• 2
mradermacher/Baldur-8B-GGUF
8B • Updated • 171
8B • Updated • 96
mradermacher/Baldur-8B-i1-GGUF
8B • Updated • 302
mradermacher/Darkens-8B-i1-GGUF
8B • Updated • 465
• 2
mradermacher/Tor-8B-i1-GGUF
8B • Updated • 342
tensorblock/calme-3.2-instruct-3b-GGUF
Text Generation
• 3B • Updated • 8
Dracones/QwQ-32B-Preview_exl2_8.0bpw
Text Generation
• Updated • 3
Dracones/QwQ-32B-Preview_exl2_7.0bpw
Text Generation
• Updated • 5
Dracones/QwQ-32B-Preview_exl2_6.0bpw
Text Generation
• Updated • 4
• 1
Dracones/QwQ-32B-Preview_exl2_5.0bpw
Text Generation
• Updated • 3
Dracones/QwQ-32B-Preview_exl2_4.5bpw
Text Generation
• Updated • 4
Dracones/QwQ-32B-Preview_exl2_4.0bpw
Text Generation
• Updated • 3
tensorblock/calme-3.2-baguette-3b-GGUF
Text Generation
• 3B • Updated • 38
Text Generation
• 9B • Updated • 525
• 19
sail/Sailor2-20B-Chat-1203
Text Generation
• 19B • Updated • 273
• 24
Text Generation
• 1.0B • Updated • 487
• 16
bullerwins/QwQ-32B-Preview-exl2_4.5bpw
Text Generation
• Updated • 3
bullerwins/QwQ-32B-Preview-exl2_5.5bpw
Text Generation
• Updated • 3
Apel-sin/qwq-32b-coder-fusion-8020-exl2
Text Generation
• Updated
tensorblock/calme-3.3-instruct-3b-GGUF
Text Generation
• 3B • Updated • 10
RedHatAI/Qwen2.5-14B-quantized.w8a8
Text Generation
• 15B • Updated • 12
• 2
MarsupialAI/Monstral-123B-v2_GGUF
Text Generation
• 123B • Updated • 88
• 3
RedHatAI/Qwen2.5-3B-quantized.w8a8
Text Generation
• 3B • Updated • 8
• 1
NousResearch/Hermes-3-Llama-3.2-3B
Text Generation
• 3B • Updated • 8.01k
• 179
tensorblock/Hermes-3-Llama-3.1-70B-GGUF
71B • Updated • 91
• 1
AIFunOver/QwQ-32B-Preview-openvino-8bit
Text Generation
• Updated • 4
AIFunOver/QwQ-32B-Preview-openvino-4bit
Text Generation
• Updated • 6
bartowski/Monstral-123B-v2-GGUF
Text Generation
• 123B • Updated • 627
• 7
cgus/Qwen2.5-1.5B-Instruct-exl2
Text Generation
• Updated • 24