Inference Providers
Active filters: nvidia
nvidia/Nemotron-H-8B-Reasoning-128K
Text Generation
• 8B • Updated • 1.44k
• 27
nvidia/Qwen3-235B-A22B-FP8
Text Generation
• 235B • Updated • 2.31k
• 4
NVFP4/Qwen3-30B-A3B-Instruct-2507-FP4
Text Generation
• 16B • Updated • 75.3k
• 12
NVFP4/Qwen3-Coder-30B-A3B-Instruct-FP4
Text Generation
• 16B • Updated • 27.7k
• 19
Text Generation
• 4B • Updated • 339
• 30
cyankiwi/NVIDIA-Nemotron-Nano-9B-v2-AWQ-4bit
Text Generation
• 2B • Updated • 389
• 3
nvidia/Phi-4-multimodal-instruct-NVFP4
4B • Updated • 1.62k
• 8
nvidia/Phi-4-multimodal-instruct-FP8
6B • Updated • 806
• 6
nvidia/Phi-4-reasoning-plus-FP8
15B • Updated • 780
• 5
nvidia/Phi-4-reasoning-plus-NVFP4
8B • Updated • 1.13k
• 7
unsloth/NVIDIA-Nemotron-Nano-9B-v2
Text Generation
• 9B • Updated • 354
• 3
nvidia/NVIDIA-Nemotron-Parse-v1.1
Image-Text-to-Text
• Updated • 523k
• 162
nvidia/DeepSeek-V3.1-NVFP4
Text Generation
• 394B • Updated • 10.4k
• 14
nvidia/Qwen3-Nemotron-235B-A22B-GenRM
Text Generation
• Updated • 15.3k
• 29
nvidia/Nemotron-Cascade-14B-Thinking
Text Generation
• Updated • 1k
• 75
nvidia/Qwen3-Next-80B-A3B-Instruct-NVFP4
Text Generation
• Updated • 48.3k
• 35
nvidia/Qwen3-Next-80B-A3B-Thinking-NVFP4
Text Generation
• Updated • 111k
• 54
bartowski/nvidia_Nemotron-Cascade-8B-GGUF
Text Generation
• 8B • Updated • 478
• 3
Ex0bit/Elbaz-NVIDIA-Nemotron-3-Nano-30B-A3B-PRISM
Text Generation
• 32B • Updated • 1.64k
• 27
Ex0bit/Elbaz-NVIDIA-Nemotron-3-Nano-30B-A3B-PRISM-NVFP4
Text Generation
• 16B • Updated • 36
• 5
mradermacher/Huihui-NVIDIA-Nemotron-Nano-9B-v2-abliterated-i1-GGUF
9B • Updated • 799
• 10
nvidia/Qwen3-Coder-480B-A35B-Instruct-NVFP4
Text Generation
• 241B • Updated • 2.33k
• 9
unsloth/NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4
Text Generation
• 18B • Updated • 2.15k
• 11
Text Generation
• 5B • Updated • 295
• 1
nvidia/Nemotron-Terminal-32B
Text Generation
• 33B • Updated • 1.41k
• 35
embedl/Cosmos-Reason2-2B-W4A16-Edge2
Image-Text-to-Text
• 2B • Updated • 22.9k
• 12
unsloth/NVIDIA-Nemotron-3-Super-120B-A12B
Text Generation
• 124B • Updated • 541
• 3
unsloth/NVIDIA-Nemotron-3-Super-120B-A12B-FP8
Text Generation
• 124B • Updated • 3.26k
• 9
bartowski/nvidia_Nemotron-3-Super-120B-A12B-GGUF
Text Generation
• 121B • Updated • 21.8k
• 7
mradermacher/NVIDIA-Nemotron-3-Nano-4B-BF16-i1-GGUF
4B • Updated • 1.22k
• 2