Active filters: nvidia
bartowski/nvidia_Nemotron-3-Nano-30B-A3B-GGUF • Text Generation • 32B • Updated • 2.05k • 9
leonsarmiento/Nemotron-Cascade-14B-Thinking-8bit-mlx • Text Generation • 15B • Updated • 62 • 1
leonsarmiento/Nemotron-Cascade-8B-Thinking-8bit-mlx • Text Generation • 8B • Updated • 10
bartowski/nvidia_Nemotron-Cascade-8B-GGUF • Text Generation • 8B • Updated • 591 • 2
bartowski/nvidia_Nemotron-Cascade-8B-Thinking-GGUF • Text Generation • 8B • Updated • 596 • 2
bartowski/nvidia_Nemotron-Cascade-14B-Thinking-GGUF • Text Generation • 15B • Updated • 3.38k • 8
lmstudio-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-4bit • Text Generation • 32B • Updated • 175k • 1
lmstudio-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-5bit • Text Generation • 32B • Updated • 169k
lmstudio-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-6bit • Text Generation • 32B • Updated • 169k
lmstudio-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-8bit • Text Generation • 32B • Updated • 170k • 3
mradermacher/Qwen3-Nemotron-235B-A22B-GenRM-GGUF • 235B • Updated • 1.47k
mlx-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-MXFP4 • Text Generation • 32B • Updated • 381 • 2
mlx-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-BF16 • Text Generation • 32B • Updated • 94
mlx-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-8Bit • Text Generation • 32B • Updated • 113 • 1
mlx-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-6Bit • Text Generation • 32B • Updated • 65 • 1
mlx-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-5Bit • Text Generation • 32B • Updated • 54
mlx-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-4Bit • Text Generation • 32B • Updated • 91 • 1
NikolayKozloff/Nemotron-Cascade-8B-Q8_0-GGUF • Text Generation • 8B • Updated • 7 • 1
NikolayKozloff/Nemotron-Cascade-8B-Thinking-Q8_0-GGUF • Text Generation • 8B • Updated • 8 • 1
NikolayKozloff/Nemotron-Cascade-14B-Thinking-Q5_K_S-GGUF • Text Generation • 15B • Updated • 11 • 1
NikolayKozloff/Nemotron-Cascade-14B-Thinking-Q4_K_M-GGUF • Text Generation • 15B • Updated • 24 • 1
NikolayKozloff/Nemotron-Cascade-14B-Thinking-Q4_K_S-GGUF • Text Generation • 15B • Updated • 6 • 1
smcleod/Nemotron-Cascade-14B-Thinking-mlx-6Bit • Text Generation • 15B • Updated • 18 • 1
smcleod/Nemotron-Cascade-8B-mlx-6Bit • Text Generation • 8B • Updated • 15
yueqis/NVIDIA-Nemotron-Nano-9B-v2 • Text Generation • 9B • Updated • 49
SiddhJagani/NVIDIA-Nemotron-Nano-12B-v2-mlx-Q8 • Text Generation • 12B • Updated • 24
SiddhJagani/NVIDIA-Nemotron-Nano-12B-v2-mlx-Q6 • Text Generation • 12B • Updated • 40
leonsarmiento/Nemotron-Cascade-8B-8bit-mlx • Text Generation • 8B • Updated • 12
cybermotaz/nemotron3-nano-nvfp4-w4a16 • Text Generation • 18B • Updated • 13.7k • 11
cyankiwi/Nemotron-Cascade-14B-Thinking-AWQ-4bit • Text Generation • 4B • Updated • 198 • 1