Inference Providers
Active filters: nvidia
cybermotaz/qwen3-vl-2b-thinking-nvfp4-w4a16
Image-Text-to-Text
• 2B • Updated • 5
• 1
cybermotaz/qwen3-vl-4b-thinking-nvfp4-w4a16
Image-Text-to-Text
• 3B • Updated • 8
• 1
cybermotaz/qwen3-vl-8b-thinking-nvfp4-w4a16
Image-Text-to-Text
• 5B • Updated • 114
• 2
cyankiwi/Nemotron-Cascade-8B-AWQ-4bit
Text Generation
• 2B • Updated • 5
• 1
cyankiwi/Nemotron-Cascade-8B-AWQ-8bit
Text Generation
• 3B • Updated • 7
• 1
Ex0bit/Elbaz-NVIDIA-Nemotron-3-Nano-30B-A3B-PRISM
Text Generation
• 32B • Updated • 1.42k
• 27
nvidia/Nemotron-Cascade-8B-Intermediate-ckpts
Text Generation
• Updated • 12
introvoyz041/Nemotron-Cascade-8B-mlx-4Bit
Text Generation
• 1B • Updated • 20
introvoyz041/Nemotron-Cascade-14B-Thinking-mlx-4Bit
Text Generation
• 15B • Updated • 16
introvoyz041/Nemotron-Cascade-8B-Thinking-mlx-4Bit
Text Generation
• 1B • Updated • 16
Stan31/quantumflow-prototypes
Updated
introvoyz041/Cosmos-Reason1-7B-mlx-4Bit
Image-Text-to-Text
• 1B • Updated • 5
Jong-Seong/qwen3-next-gb10-guide
Updated
mradermacher/Nemotron-Cascade-8B-GGUF
8B • Updated • 66
mradermacher/Nemotron-Cascade-8B-Thinking-GGUF
8B • Updated • 74
mradermacher/Nemotron-Cascade-14B-Thinking-GGUF
15B • Updated • 33
mradermacher/Qwen3-Nemotron-235B-A22B-GenRM-i1-GGUF
235B • Updated • 122
mradermacher/Nemotron-Cascade-8B-i1-GGUF
8B • Updated • 58
mradermacher/Nemotron-Cascade-8B-Thinking-i1-GGUF
8B • Updated • 3.36k
mradermacher/Nemotron-Cascade-14B-Thinking-i1-GGUF
15B • Updated • 126
• 2
Mungert/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16-GGUF
Text Generation
• 32B • Updated • 8.46k
• 1
Edge-Quant/OpenReasoning-Nemotron-1.5B-Q4_K_M-GGUF
Text Generation
• 2B • Updated • 5
Edge-Quant/AceReason-Nemotron-1.1-7B-Q4_K_M-GGUF
Text Generation
• 8B • Updated • 27
RedHatAI/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8
Text Generation
• 32B • Updated • 1.21k
• 4
nvidia/Qwen3-VL-235B-A22B-Instruct-NVFP4
119B • Updated • 782
TevunahAi/Nemotron-3-Nano-30B-A3B-GPTQ
Text Generation
• 6B • Updated • 374
• 2
mradermacher/Cascade-Droidz-GGUF
15B • Updated • 81
• 1
mradermacher/Cascade-Droidz-i1-GGUF
15B • Updated • 50
• 1
SiddhJagani/NVIDIA-Nemotron-Nano-12B-v2-mlx-Q4
Text Generation
• 12B • Updated • 419
huihui-ai/Huihui-NVIDIA-Nemotron-Nano-9B-v2-abliterated
Text Generation
• 9B • Updated • 18
• 2