Active filters: chat
Goekdeniz-Guelmez/josie-7b-v6.0-Q4_K_M-GGUF
Text Generation
• 8B • Updated • 8
• 1
Goekdeniz-Guelmez/josie-3b-v6.0-Q5_K_M-GGUF
Text Generation
• 3B • Updated • 2
nitky/Llama-3.3-SuperSwallowX-70B-RP-v0.1
Text Generation
• 71B • Updated • 4
mradermacher/Qwen2.5-32B-Instruct-abliterated-GGUF
33B • Updated • 1.53k
• 2
mradermacher/Qwen1.5-4B-Chat-GGUF
4B • Updated • 104
mradermacher/Qwen1.5-4B-Chat-i1-GGUF
4B • Updated • 153
evilfreelancer/o1_t-lite-it-1.0_gguf
Question Answering
• 8B • Updated • 47
• 2
yujiepan/qvq-preview-tiny-random
Image-Text-to-Text
• 4.9M • Updated • 112
ubaitur5/Qwen2.5-Coder-7B-Instruct-Q3-mlx
Text Generation
• Updated • 14
mradermacher/Qwen2.5-32B-Instruct-abliterated-i1-GGUF
33B • Updated • 368
• 1
mradermacher/Llama-3.3-SuperSwallowX-70B-Instruct-v0.1-GGUF
71B • Updated • 19
Text Generation
• 22B • Updated • 4
• 5
mradermacher/Mistral-portuguese-luana-7b-chat-GGUF
7B • Updated • 53
• 1
gghfez/WizardLM-2-22b-RP-GGUF
Text Generation
• 22B • Updated • 3
• 1
gghfez/WizardLM-2-22b-RP-AWQ
Text Generation
• 22B • Updated • 1
gghfez/WizardLM-2-22B-RP-exl2
Text Generation
• Updated • 2
mradermacher/Mistral-portuguese-luana-7b-chat-i1-GGUF
7B • Updated • 219
• 1
taobao-mnn/Meta-Llama-3-8B-Instruct-MNN
Text Generation
• Updated • 49
tensorblock/QwQ-32B-Coder-Fusion-9010-GGUF
Text Generation
• 33B • Updated • 5
• 2
tensorblock/Llama-3.1-SuperSwallow-70B-Instruct-v0.1-GGUF
Text Generation
• 71B • Updated • 16
tensorblock/Llama-DNA-1.0-8B-Instruct-GGUF
Text Generation
• 8B • Updated • 25
tensorblock/Qwen2.5-Coder-14B-Instruct-abliterated-GGUF
Text Generation
• 15B • Updated • 74
tensorblock/Hermes-3-Llama-3.2-3B-GGUF
3B • Updated • 5
• 1
mradermacher/WizardLM-2-22b-RP-GGUF
22B • Updated • 69
• 3
pipihand01/QwQ-32B-Preview-abliterated-linear75
Text Generation
• 33B • Updated • 7
mradermacher/WizardLM-2-22b-RP-i1-GGUF
22B • Updated • 157
• 4
tensorblock/Qwen2.5-3B-Instruct-abliterated-GGUF
Text Generation
• 3B • Updated • 23
pipihand01/QwQ-32B-Preview-abliterated-linear75-GGUF
33B • Updated • 18
akhbar/Qwen2.5-32B-Instruct-abliterated-8bit-128g-actorder_True-GPTQ
Text Generation
• 33B • Updated • 67
quantflex/SmallThinker-3B-Preview-abliterated-GGUF
Text Generation
• 3B • Updated • 57
• 2