Inference Providers
Active filters: nvidia
nvidia/Nemotron-Cascade-8B
Text Generation
• Updated • 9.51k
• 65
bartowski/nvidia_Nemotron-3-Nano-30B-A3B-GGUF
Text Generation
• 32B • Updated • 10.7k
• 9
bartowski/nvidia_Nemotron-Cascade-8B-GGUF
Text Generation
• 8B • Updated • 487
• 3
bartowski/nvidia_Nemotron-Cascade-8B-Thinking-GGUF
Text Generation
• 8B • Updated • 208
• 2
bartowski/nvidia_Nemotron-Cascade-14B-Thinking-GGUF
Text Generation
• 15B • Updated • 713
• 8
lmstudio-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-4bit
Text Generation
• 32B • Updated • 183k
• 2
lmstudio-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-5bit
Text Generation
• 32B • Updated • 172k
lmstudio-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-6bit
Text Generation
• 32B • Updated • 172k
lmstudio-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-8bit
Text Generation
• 32B • Updated • 174k
• 3
moxin-org/Nemotron-3-Nano-30B-A3B-GGUF
Text Generation
• 32B • Updated • 258
• 3
mradermacher/Qwen3-Nemotron-235B-A22B-GenRM-GGUF
235B • Updated • 30
mlx-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-MXFP4
Text Generation
• 32B • Updated • 628
• 2
mlx-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-BF16
Text Generation
• 32B • Updated • 525
mlx-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-8Bit
Text Generation
• 32B • Updated • 334
• 2
mlx-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-6Bit
Text Generation
• 32B • Updated • 264
• 1
mlx-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-5Bit
Text Generation
• 32B • Updated • 349
mlx-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-4Bit
Text Generation
• 32B • Updated • 315
• 1
NikolayKozloff/Nemotron-Cascade-8B-Q8_0-GGUF
Text Generation
• 8B • Updated • 4
• 1
NikolayKozloff/Nemotron-Cascade-8B-Thinking-Q8_0-GGUF
Text Generation
• 8B • Updated • 2
• 1
NikolayKozloff/Nemotron-Cascade-14B-Thinking-Q5_K_S-GGUF
Text Generation
• 15B • Updated • 5
• 1
NikolayKozloff/Nemotron-Cascade-14B-Thinking-Q4_K_M-GGUF
Text Generation
• 15B • Updated • 14
• 1
NikolayKozloff/Nemotron-Cascade-14B-Thinking-Q4_K_S-GGUF
Text Generation
• 15B • Updated • 5
• 1
smcleod/Nemotron-Cascade-14B-Thinking-mlx-6Bit
Text Generation
• 15B • Updated • 62
• 1
smcleod/Nemotron-Cascade-8B-mlx-6Bit
Text Generation
• 8B • Updated • 15
yueqis/NVIDIA-Nemotron-Nano-9B-v2
Text Generation
• 9B • Updated • 42
SiddhJagani/NVIDIA-Nemotron-Nano-12B-v2-mlx-Q8
Text Generation
• 12B • Updated • 44
SiddhJagani/NVIDIA-Nemotron-Nano-12B-v2-mlx-Q6
Text Generation
• 12B • Updated • 199
cybermotaz/nemotron3-nano-nvfp4-w4a16
Text Generation
• 18B • Updated • 4.76k
• 14
cyankiwi/Nemotron-Cascade-14B-Thinking-AWQ-4bit
Text Generation
• 4B • Updated • 38
• 1
cyankiwi/Nemotron-Cascade-14B-Thinking-AWQ-8bit
Text Generation
• 5B • Updated • 2