-
-
-
-
-
-
Inference Providers
Active filters:
nvidia
SiddhJagani/NVIDIA-Nemotron-Nano-12B-v2-mlx-Q6
Text Generation
•
12B
•
Updated
•
40
leonsarmiento/Nemotron-Cascade-8B-8bit-mlx
Text Generation
•
8B
•
Updated
•
12
cybermotaz/nemotron3-nano-nvfp4-w4a16
Text Generation
•
18B
•
Updated
•
13.7k
•
11
cyankiwi/Nemotron-Cascade-14B-Thinking-AWQ-4bit
Text Generation
•
4B
•
Updated
•
198
•
1
cyankiwi/Nemotron-Cascade-14B-Thinking-AWQ-8bit
Text Generation
•
5B
•
Updated
•
36
cyankiwi/Nemotron-Cascade-8B-Thinking-AWQ-4bit
Text Generation
•
2B
•
Updated
•
2
cyankiwi/Nemotron-Cascade-8B-Thinking-AWQ-8bit
Text Generation
•
3B
•
Updated
•
2
cybermotaz/qwen3-vl-2b-thinking-nvfp4-w4a16
Image-Text-to-Text
•
2B
•
Updated
•
33
•
1
cybermotaz/qwen3-vl-4b-thinking-nvfp4-w4a16
Image-Text-to-Text
•
3B
•
Updated
•
6
cybermotaz/qwen3-vl-8b-thinking-nvfp4-w4a16
Image-Text-to-Text
•
5B
•
Updated
•
260
•
1
cyankiwi/Nemotron-Cascade-8B-AWQ-4bit
Text Generation
•
2B
•
Updated
•
73
•
1
cyankiwi/Nemotron-Cascade-8B-AWQ-8bit
Text Generation
•
3B
•
Updated
•
11
nvidia/Nemotron-Cascade-8B-Intermediate-ckpts
Text Generation
•
Updated
•
11
introvoyz041/Nemotron-Cascade-8B-mlx-4Bit
Text Generation
•
1B
•
Updated
•
15
introvoyz041/Nemotron-Cascade-14B-Thinking-mlx-4Bit
Text Generation
•
15B
•
Updated
•
38
introvoyz041/Nemotron-Cascade-8B-Thinking-mlx-4Bit
Text Generation
•
1B
•
Updated
•
44
Stan31/quantumflow-prototypes
Updated
introvoyz041/Cosmos-Reason1-7B-mlx-4Bit
Image-Text-to-Text
•
1B
•
Updated
•
5
Jong-Seong/qwen3-next-gb10-guide
Updated
mradermacher/Nemotron-Cascade-8B-GGUF
8B
•
Updated
•
26
mradermacher/Nemotron-Cascade-8B-Thinking-GGUF
8B
•
Updated
•
77
mradermacher/Nemotron-Cascade-14B-Thinking-GGUF
15B
•
Updated
•
69
mradermacher/Qwen3-Nemotron-235B-A22B-GenRM-i1-GGUF
235B
•
Updated
•
180
mradermacher/Nemotron-Cascade-8B-i1-GGUF
8B
•
Updated
•
156
mradermacher/Nemotron-Cascade-8B-Thinking-i1-GGUF
8B
•
Updated
•
228
mradermacher/Nemotron-Cascade-14B-Thinking-i1-GGUF
15B
•
Updated
•
334
•
2
Mungert/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16-GGUF
Text Generation
•
32B
•
Updated
•
424
•
1
Edge-Quant/OpenReasoning-Nemotron-1.5B-Q4_K_M-GGUF
Text Generation
•
2B
•
Updated
•
7
Edge-Quant/AceReason-Nemotron-1.1-7B-Q4_K_M-GGUF
Text Generation
•
8B
•
Updated
•
1
RedHatAI/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8
Text Generation
•
32B
•
Updated
•
734
•
4