Inference Providers
Active filters: cuda
Text Generation
• 8B • Updated • 74.4k
• 561
Text Generation
• 4B • Updated • 9.44k
• 28
prism-ml/Bonsai-1.7B-gguf
Text Generation
• 2B • Updated • 12.9k
• 43
ussoewwin/Flash-Attention-2_for_Windows
Multilingual-Multimodal-NLP/IndustrialCoder
Text Generation
• 32B • Updated • 1.59k
• 55
JusteLeo/Nunchaku-Zimage-Win-Wheels
Hellohal2064/vllm-dgx-spark-gb10
Text Generation
• Updated • 3
Kevletesteur/chimere-system
Text Generation
• Updated • 1
waltgrace/llama-cpp-expert-sniper
Text Generation
• Updated • 1
Text Generation
• 8B • Updated • 847
• 1
mradermacher/Convergent-7B-GGUF
8B • Updated • 794
• 1
mradermacher/Convergent-7B-i1-GGUF
8B • Updated • 5.42k
• 1
Text Generation
• Updated • 6
• 23
CalderaAI/13B-Ouroboros-GPTQ4bit-128g-CUDA
Text Generation
• Updated • 5
marcorez8/llama-cpp-python-windows-blackwell-cuda
ValiantLabs/Qwen3-8B-ShiningValiant3
Text Generation
• 8B • Updated • 8
• 3
mradermacher/Qwen3-8B-ShiningValiant3-GGUF
8B • Updated • 1.48k
• 2
mradermacher/Qwen3-8B-ShiningValiant3-i1-GGUF
8B • Updated • 438
• 2
ValiantLabs/Qwen3-1.7B-ShiningValiant3
Text Generation
• 2B • Updated • 81
• 5
mradermacher/Qwen3-1.7B-ShiningValiant3-GGUF
2B • Updated • 149
mradermacher/Qwen3-1.7B-ShiningValiant3-i1-GGUF
2B • Updated • 316
ValiantLabs/Qwen3-4B-ShiningValiant3
Text Generation
• 4B • Updated • 35
• 7
sequelbox/Qwen3-8B-PlumEsper
Text Generation
• 8B • Updated • 2
sequelbox/Qwen3-4B-PlumEsper
Text Generation
• 4B • Updated • 3
mradermacher/Qwen3-Shining-Lucy-CODER-3.5B-Brainstorm20x-e32-GGUF
3B • Updated • 211
• 2
mradermacher/Qwen3-Shining-Lucy-CODER-2.4B-mix2-GGUF
2B • Updated • 160
mradermacher/Qwen3-Shining-Lucy-CODER-2.4B-GGUF
2B • Updated • 93
mradermacher/Qwen3-Shining-Lucy-CODER-2.4B-mix2-i1-GGUF
2B • Updated • 99
• 1
mradermacher/Qwen3-Shining-Lucy-CODER-2.4B-i1-GGUF
2B • Updated • 599