-
-
-
-
-
-
Inference Providers
Active filters:
instruct
mradermacher/DeepHermes-3-Llama-3-3B-Preview-i1-GGUF
3B
•
Updated
•
172
•
1
DavidAU/Gemma-3-12b-it-MAX-HORROR-Imatrix-GGUF
Text Generation
•
12B
•
Updated
•
605
•
16
Triangle104/DeepHermes-3-Llama-3-3B-Preview-Q4_K_S-GGUF
3B
•
Updated
•
4
Triangle104/DeepHermes-3-Llama-3-3B-Preview-Q4_K_M-GGUF
3B
•
Updated
•
4
Triangle104/DeepHermes-3-Llama-3-3B-Preview-Q5_K_S-GGUF
3B
•
Updated
•
3
Triangle104/DeepHermes-3-Llama-3-3B-Preview-Q5_K_M-GGUF
3B
•
Updated
•
10
Triangle104/DeepHermes-3-Llama-3-3B-Preview-Q6_K-GGUF
3B
•
Updated
•
2
Triangle104/DeepHermes-3-Llama-3-3B-Preview-Q8_0-GGUF
3B
•
Updated
t83714/llama-3.1-8b-instruct-limo
Text Generation
•
8B
•
Updated
•
46
t83714/llama-3.1-8b-instruct-limo-lora-adapter
Text Generation
•
Updated
•
222
DavidAU/Llama3.2-DeepHermes-3-3B-Preview-Reasoning-MAX-NEO-Imatrix-GGUF
Text Generation
•
3B
•
Updated
•
302
•
3
empirischtech/Kiwi-1.0-0.7B-32k-Instruct
mradermacher/Kiwi-1.0-0.7B-32k-Instruct-GGUF
0.7B
•
Updated
•
46
DavidAU/Gemma-3-it-4B-Uncensored-DBL-X-GGUF
Text Generation
•
4B
•
Updated
•
2.98k
•
56
Mungert/DeepHermes-3-Llama-3-8B-Preview-GGUF
8B
•
Updated
•
499
•
6
CuckmeisterFuller/DeepHermes-3-Llama-3-3B-Preview-mlx-6Bit
Text Generation
•
0.7B
•
Updated
•
1
DavidAU/Mistral-Small-3.1-24B-Instruct-2503-MAX-NEO-Imatrix-GGUF
Text Generation
•
24B
•
Updated
•
410
•
36
Gryphe/Pantheon-RP-1.8-24b-Small-3.1
Updated
•
21
•
70
bartowski/Gryphe_Pantheon-RP-1.8-24b-Small-3.1-GGUF
Text Generation
•
Updated
•
761
•
20
lucyknada/Gryphe_Pantheon-RP-1.8-24b-Small-3.1-exl2
mradermacher/Pantheon-RP-1.8-24b-Small-3.1-GGUF
24B
•
Updated
•
136
•
1
DavidAU/Gemma-3-4b-it-MAX-HORROR-Uncensored-DBL-X-Imatrix-GGUF
Text Generation
•
4B
•
Updated
•
796
•
6
DavidAU/Gemma-3-4b-it-Uncensored-DBL-X
Text Generation
•
5B
•
Updated
•
91
•
6
mradermacher/Pantheon-RP-1.8-24b-Small-3.1-i1-GGUF
24B
•
Updated
•
145
•
8
mradermacher/Gemma-3-4b-it-Uncensored-DBL-X-GGUF
4B
•
Updated
•
434
•
3
mradermacher/Gemma-3-4b-it-Uncensored-DBL-X-i1-GGUF
4B
•
Updated
•
237
•
2
DavidAU/Reka-Flash-3-21B-Reasoning-Uncensored-MAX-NEO-Imatrix-GGUF
Text Generation
•
21B
•
Updated
•
1.04k
•
56
Cseti/PULI-LlumiX-Llama-3.1-Instruct-LoRA
Compumacy/d_hermes_24_24b
Text Generation
•
24B
•
Updated
tensorblock/DeepHermes-3-Llama-3-3B-Preview-GGUF
3B
•
Updated
•
60
•
1