Active filters: nvidia
mlx-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-BF16
Text Generation • 32B • 1.16k
mlx-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-8Bit
Text Generation • 32B • 300 • 2
mlx-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-6Bit
Text Generation • 32B • 227 • 2
mlx-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-5Bit
Text Generation • 32B • 1.05k
mlx-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-4Bit
Text Generation • 32B • 306 • 1
NikolayKozloff/Nemotron-Cascade-8B-Q8_0-GGUF
Text Generation • 8B • 1 • 1
NikolayKozloff/Nemotron-Cascade-8B-Thinking-Q8_0-GGUF
Text Generation • 8B • 1 • 1
NikolayKozloff/Nemotron-Cascade-14B-Thinking-Q5_K_S-GGUF
Text Generation • 15B • 13 • 1
NikolayKozloff/Nemotron-Cascade-14B-Thinking-Q4_K_M-GGUF
Text Generation • 15B • 8 • 1
NikolayKozloff/Nemotron-Cascade-14B-Thinking-Q4_K_S-GGUF
Text Generation • 15B • 8 • 1
smcleod/Nemotron-Cascade-14B-Thinking-mlx-6Bit
Text Generation • 15B • 40 • 1
smcleod/Nemotron-Cascade-8B-mlx-6Bit
Text Generation • 8B • 7
yueqis/NVIDIA-Nemotron-Nano-9B-v2
Text Generation • 9B • 944
SiddhJagani/NVIDIA-Nemotron-Nano-12B-v2-mlx-Q8
Text Generation • 12B • 36
SiddhJagani/NVIDIA-Nemotron-Nano-12B-v2-mlx-Q6
Text Generation • 12B • 976
cybermotaz/nemotron3-nano-nvfp4-w4a16
Text Generation • 18B • 1.92k • 13
cyankiwi/Nemotron-Cascade-14B-Thinking-AWQ-4bit
Text Generation • 4B • 22 • 1
cyankiwi/Nemotron-Cascade-14B-Thinking-AWQ-8bit
Text Generation • 5B • 4
cyankiwi/Nemotron-Cascade-8B-Thinking-AWQ-4bit
Text Generation • 2B • 1
cyankiwi/Nemotron-Cascade-8B-Thinking-AWQ-8bit
Text Generation • 3B • 4
cybermotaz/qwen3-vl-2b-thinking-nvfp4-w4a16
Image-Text-to-Text • 2B • 7 • 1
cybermotaz/qwen3-vl-4b-thinking-nvfp4-w4a16
Image-Text-to-Text • 3B • 5 • 1
cybermotaz/qwen3-vl-8b-thinking-nvfp4-w4a16
Image-Text-to-Text • 5B • 201 • 2
cyankiwi/Nemotron-Cascade-8B-AWQ-4bit
Text Generation • 2B • 78 • 1
cyankiwi/Nemotron-Cascade-8B-AWQ-8bit
Text Generation • 3B • 1 • 1
Ex0bit/Elbaz-NVIDIA-Nemotron-3-Nano-30B-A3B-PRISM
Text Generation • 32B • 1.29k • 27
Ex0bit/Elbaz-NVIDIA-Nemotron-3-Nano-30B-A3B-PRISM-NVFP4
Text Generation • 16B • 1.12k • 6
nvidia/Nemotron-Cascade-8B-Intermediate-ckpts
Text Generation • 13
introvoyz041/Nemotron-Cascade-8B-mlx-4Bit
Text Generation • 1B • 1
introvoyz041/Nemotron-Cascade-14B-Thinking-mlx-4Bit
Text Generation • 15B • 20