-
-
-
-
-
-
Inference Providers
Active filters: cuda
ussoewwin/Flash-Attention-2_for_Windows
dougeeai/llama-cpp-python-wheels
ValiantLabs/Ministral-3-14B-Reasoning-2512-ShiningValiant3
Text Generation
• 14B • Updated
• 102
• 5
gravermistakes/Ministral-3-14B-Reasoning-2512-PlumEsper1.1-i1-GGUF
14B • Updated
• 3.3k
• 2
Text Generation
• Updated
• 3
• 23
CalderaAI/13B-Ouroboros-GPTQ4bit-128g-CUDA
Text Generation
• Updated
• 5
marcorez8/llama-cpp-python-windows-blackwell-cuda
ValiantLabs/Qwen3-8B-ShiningValiant3
Text Generation
• 8B • Updated
• 8
• 3
mradermacher/Qwen3-8B-ShiningValiant3-GGUF
8B • Updated
• 187
• 2
mradermacher/Qwen3-8B-ShiningValiant3-i1-GGUF
8B • Updated
• 359
• 2
ValiantLabs/Qwen3-1.7B-ShiningValiant3
Text Generation
• 2B • Updated
• 5
• 5
mradermacher/Qwen3-1.7B-ShiningValiant3-GGUF
2B • Updated
• 26
mradermacher/Qwen3-1.7B-ShiningValiant3-i1-GGUF
2B • Updated
• 182
ValiantLabs/Qwen3-4B-ShiningValiant3
Text Generation
• 4B • Updated
• 6
• 7
sequelbox/Qwen3-8B-PlumEsper
Text Generation
• 8B • Updated
• 6
sequelbox/Qwen3-4B-PlumEsper
Text Generation
• 4B • Updated
• 5
mradermacher/Qwen3-Shining-Lucy-CODER-3.5B-Brainstorm20x-e32-GGUF
3B • Updated
• 140
• 2
mradermacher/Qwen3-Shining-Lucy-CODER-2.4B-mix2-GGUF
2B • Updated
• 65
mradermacher/Qwen3-Shining-Lucy-CODER-2.4B-GGUF
2B • Updated
• 396
mradermacher/Qwen3-Shining-Lucy-CODER-2.4B-mix2-i1-GGUF
2B • Updated
• 86
• 1
mradermacher/Qwen3-Shining-Lucy-CODER-2.4B-i1-GGUF
2B • Updated
• 62
mradermacher/Qwen3-Shining-Lucy-CODER-3.5B-Brainstorm20x-e32-i1-GGUF
3B • Updated
• 194
• 1
mradermacher/Qwen3-Shining-Valiant-Instruct-Fast-CODER-Reasoning-2.4B-GGUF
2B • Updated
• 60
mradermacher/Qwen3-Shining-Valiant-Instruct-Fast-CODER-Reasoning-2.4B-i1-GGUF
2B • Updated
• 111
mradermacher/Qwen3-Shining-Valiant-Instruct-CODER-Reasoning-2.7B-GGUF
3B • Updated
• 61
mradermacher/Qwen3-Shining-Valiant-Instruct-CODER-Reasoning-2.7B-i1-GGUF
3B • Updated
• 149
mradermacher/Qwen3-Shining-Lucy-CODER-3.4B-Brainstorm20x-e32-GGUF
3B • Updated
• 174
mradermacher/Qwen3-Shining-Lucy-CODER-3.4B-Brainstorm20x-e32-i1-GGUF
3B • Updated
• 128
mradermacher/Qwen3-Shining-Lucy-CODER-2.4B-e32-mix2-GGUF
2B • Updated
• 46