Inference Providers
Active filters: instruct
mradermacher/Llama-3-portuguese-Tom-cat-8b-instruct-i1-GGUF
8B • Updated • 43
Satwik11/Llama-3.3-70B-Instruct-AutoRound-GPTQ-4bit
Text Generation
• 71B • Updated • 12
ModelCloud/QwQ-32B-Preview-gptqmodel-4bit-vortex-v2
Text Generation
• 33B • Updated • 16
• 16
numen-tech/Hermes-3-Llama-3.1-8B-w3a16g40sym
Text Generation
• Updated numen-tech/Hermes-3-Llama-3.1-8B-w4a16g128asym
Text Generation
• Updated mradermacher/BgGPT-7B-Instruct-v0.2-GGUF
7B • Updated • 18
• 1
mradermacher/BgGPT-7B-Instruct-v0.2-i1-GGUF
7B • Updated • 68
• 1
itlwas/Hermes-3-Llama-3.2-3B-Q4_K_M-GGUF
3B • Updated • 113
• 1
ModelCloud/QwQ-32B-Preview-gptqmodel-4bit-vortex-v3
Text Generation
• 33B • Updated • 5
• 14
TheBlueObserver/OpenHermes-2.5-Mistral-7B-MLX
7B • Updated • 4
mradermacher/AlphaMonarch-dora-GGUF
7B • Updated • 59
• 1
mlx-community/Qwen2.5-7B-Instruct-kowiki-qa
Text Generation
• Updated • 6
TheBlueObserver/OpenHermes-2.5-Mistral-7B-MLX-0cb1b
0.7B • Updated • 5
mradermacher/AlphaMonarch-dora-i1-GGUF
7B • Updated • 122
• 1
TheBlueObserver/OpenHermes-2.5-Mistral-7B-MLX-104ce
0.9B • Updated • 6
TheBlueObserver/OpenHermes-2.5-Mistral-7B-MLX-196c8
1B • Updated • 3
NaniDAO/Llama-3.3-70B-Instruct-ablated
Text Generation
• 71B • Updated • 14
• 21
mlx-community/Qwen2.5-7B-Instruct-kowiki-qa-8bit
Text Generation
• Updated • 2
TheBlueObserver/OpenHermes-2.5-Mistral-7B-MLX-8777b
2B • Updated • 6
mlx-community/Qwen2.5-7B-Instruct-kowiki-qa-4bit
Text Generation
• Updated • 9
itlwas/Hermes-2-Pro-Llama-3-8B-Q4_K_M-GGUF
8B • Updated • 21
itlwas/Hermes-2-Pro-Mistral-7B-Q4_K_M-GGUF
7B • Updated • 22
itlwas/Nous-Hermes-2-SOLAR-10.7B-Q4_K_M-GGUF
11B • Updated • 22
TheBlueObserver/OpenHermes-2.5-Mistral-7B-MLX-393a7
2B • Updated • 9
itlwas/Nous-Hermes-2-Yi-34B-Q4_K_M-GGUF
34B • Updated • 92
mradermacher/Llama-3.3-70B-Instruct-ablated-GGUF
71B • Updated • 19
mradermacher/Llama-3.3-70B-Instruct-ablated-i1-GGUF
71B • Updated • 1.1k
• 1
bartowski/Llama-3.3-70B-Instruct-ablated-GGUF
Text Generation
• 71B • Updated • 2.52k
• 16
ModelCloud/Falcon3-10B-Instruct-gptqmodel-4bit-vortex-v1
Text Generation
• 10B • Updated • 9
• 3
mosama/Qwen2.5-1.5B-Instruct-CoT-Reflection
Text Generation
• 2B • Updated • 12
• 1