Inference Providers
Active filters: chat
Text Generation
• 33B • Updated • 13
• 7
shashikanth-a/tinyllama-chat-4bit
Text Generation
• 0.2B • Updated • 1
shashikanth-a/llama-2-7b-chat-4bit
Text Generation
• 1B • Updated • 12
unsloth/QwQ-32B-Preview-bnb-4bit
Text Generation
• 34B • Updated • 377
• 4
unsloth/QwQ-32B-Preview-GGUF
Text Generation
• 33B • Updated • 492
• 12
RedHatAI/Qwen2.5-Coder-14B-Instruct-FP8-dynamic
Text Generation
• 15B • Updated • 683
• 1
Text Generation
• 1B • Updated • 3
• mlx-community/Qwen_QwQ-32B-Preview_MLX-8bit
Text Generation
• Updated • 15
• 4
tensorblock/ghost-8b-beta-GGUF
Text Generation
• 8B • Updated • 244
• 1
mradermacher/Llama-3.2-1B-DPO-GGUF
1B • Updated • 62
huihui-ai/QwQ-32B-Preview-abliterated
Text Generation
• 33B • Updated • 18
• • 104
mlx-community/Qwen_QwQ-32B-Preview_MLX-4bit
Text Generation
• Updated • 11
• 1
DrNicefellow/Qwen-QwQ-32B-Preview-4.25bpw-exl2
Text Generation
• Updated • 4
• 3
tensorblock/calme-2.8-qwen2-7b-GGUF
Text Generation
• 8B • Updated • 36
MarsupialAI/Monstral-123B-v2
Text Generation
• 123B • Updated • 26
• 43
async0x42/QwQ-32B-Preview-exl2_3.5bpw
Text Generation
• Updated • 7
tensorblock/magnum-v2-12b-GGUF
Text Generation
• 12B • Updated async0x42/QwQ-32B-Preview-exl2_4.5bpw
Text Generation
• Updated • 7
• 1
deltanym/QwQ-32B-Preview-abliterated-Q5_K_M-GGUF
Text Generation
• 33B • Updated • 2
waltervix/QwQ-32B-Preview-Q2_K-GGUF
Text Generation
• 33B • Updated • 5
deltanym/QwQ-32B-Preview-abliterated-Q4_K_M-GGUF
Text Generation
• 33B • Updated • 2
• 2
Hack337/QwQ-32B-Preview-abliterated-Q3_K_S-GGUF
Text Generation
• 33B • Updated • 48
• 2
async0x42/QwQ-32B-Preview-exl2_5.0bpw
Text Generation
• Updated • 4
DavidAU/L3.1-Instruct-Guru-8B-GGUF
Text Generation
• 8B • Updated • 153
• 4
huihui-ai/QwQ-32B-Coder-Fusion-9010
Text Generation
• 33B • Updated • 40
• 12
lmstudio-community/QwQ-32B-Preview-MLX-4bit
Text Generation
• 5B • Updated • 2
lmstudio-community/QwQ-32B-Preview-MLX-8bit
Text Generation
• 9B • Updated • 1
alvinrach/Qwen2.5-Coder-32B-Instruct-Q4_K_M-GGUF
Text Generation
• 33B • Updated • 5
gmonsoon/QwQ-32B-Preview-abliterated-Q4_K_M-GGUF
Text Generation
• 33B • Updated • 9
ModelCloud/QwQ-32B-Preview-gptqmodel-4bit-vortex-v1
Text Generation
• 33B • Updated • 9
• 51