Inference Providers
Active filters: chat
mradermacher/Baldur-8B-i1-GGUF
8B • Updated • 203
mradermacher/Darkens-8B-i1-GGUF
8B • Updated • 468
• 2
mradermacher/Tor-8B-i1-GGUF
8B • Updated • 274
tensorblock/calme-3.2-instruct-3b-GGUF
Text Generation
• 3B • Updated Dracones/QwQ-32B-Preview_exl2_8.0bpw
Text Generation
• Updated • 2
Dracones/QwQ-32B-Preview_exl2_7.0bpw
Text Generation
• Updated • 5
Dracones/QwQ-32B-Preview_exl2_6.0bpw
Text Generation
• Updated • 4
• 1
Dracones/QwQ-32B-Preview_exl2_5.0bpw
Text Generation
• Updated • 2
Dracones/QwQ-32B-Preview_exl2_4.5bpw
Text Generation
• Updated • 3
Dracones/QwQ-32B-Preview_exl2_4.0bpw
Text Generation
• Updated • 3
tensorblock/calme-3.2-baguette-3b-GGUF
Text Generation
• 3B • Updated • 3
Text Generation
• 9B • Updated • 10.2k
• 19
sail/Sailor2-20B-Chat-1203
Text Generation
• 19B • Updated • 13
• 24
Text Generation
• 1.0B • Updated • 779
• 16
bullerwins/QwQ-32B-Preview-exl2_4.5bpw
Text Generation
• Updated • 3
bullerwins/QwQ-32B-Preview-exl2_5.5bpw
Text Generation
• Updated • 3
Apel-sin/qwq-32b-coder-fusion-8020-exl2
Text Generation
• Updated tensorblock/calme-3.3-instruct-3b-GGUF
Text Generation
• 3B • Updated • 2
RedHatAI/Qwen2.5-14B-quantized.w8a8
Text Generation
• 15B • Updated • 10
• 2
MarsupialAI/Monstral-123B-v2_GGUF
Text Generation
• 123B • Updated • 73
• 3
RedHatAI/Qwen2.5-3B-quantized.w8a8
Text Generation
• 3B • Updated • 7
• 1
NousResearch/Hermes-3-Llama-3.2-3B
Text Generation
• 3B • Updated • 14.1k
• 175
tensorblock/Hermes-3-Llama-3.1-70B-GGUF
71B • Updated • 27
• 1
AIFunOver/QwQ-32B-Preview-openvino-8bit
Text Generation
• Updated • 3
AIFunOver/QwQ-32B-Preview-openvino-4bit
Text Generation
• Updated • 7
bartowski/Monstral-123B-v2-GGUF
Text Generation
• 123B • Updated • 280
• 7
cgus/Qwen2.5-1.5B-Instruct-exl2
Text Generation
• Updated • 80
BigHuggyD/Monstral-123B-v2-FP8-Dynamic
Text Generation
• 123B • Updated • 5
• 1
tensorblock/calme-2.2-llama3.1-70b-GGUF
Text Generation
• 71B • Updated • 12
BigHuggyD/MarsupialAI_Monstral-123B-v2_exl2_5.0bpw_h6
Text Generation
• Updated • 12
• 2