Inference Providers
Active filters: nvidia
mradermacher/Llama-3_3-Nemotron-Super-49B-v1_5-i1-GGUF
50B • Updated • 240
• 1
cyankiwi/Llama-3_3-Nemotron-Super-49B-v1_5-AWQ-4bit
Text Generation
• 8B • Updated • 305
• 3
groxaxo/OpenCodeReasoning-Nemotron-1.1-32B-GPTQ-W8A16
Text Generation
• Updated • 1
unsloth/Llama-3_3-Nemotron-Super-49B-v1_5
Text Generation
• 50B • Updated • 28
• 3
unsloth/Llama-3_3-Nemotron-Super-49B-v1_5-GGUF
Text Generation
• 50B • Updated • 710
• 10
BitPhinix/DeepSeek-V3-0324-FP4
Text Generation
• 397B • Updated • 1
DeusImperator/Llama-3_3-Nemotron-Super-49B-v1_5_exl3_4.0bpw_H6
Text Generation
• 13B • Updated • 4
mradermacher/Llama-3_3-Nemotron-Super-49B-GenRM-GGUF
50B • Updated • 74
mradermacher/Llama-3_3-Nemotron-Super-49B-GenRM-i1-GGUF
50B • Updated • 104
nvidia/Llama-3_3-Nemotron-Super-49B-v1_5-FP8
Text Generation
• 50B • Updated • 142k
• 28
TomBombadyl/Qwen2.5-Coder-7B-Instruct-Omni1.0
Question Answering
• Updated • 3
ArtusDev/nvidia_Llama-3.1-Nemotron-Nano-8B-v1-AWQ
2B • Updated • 2
• 1
NVFP4/Qwen3-30B-A3B-Instruct-2507-FP4
Text Generation
• 16B • Updated • 553
• 12
NVFP4/Qwen3-30B-A3B-Thinking-2507-FP4
Text Generation
• 16B • Updated • 1.18k
• 4
Prince-1/OpenReasoning-Nemotron-7B
Text Generation
• Updated onnx-community/OpenReasoning-Nemotron-7B
Text Generation
• Updated onnx-community/OpenReasoning-Nemotron-1.5B
Text Generation
• Updated Prince-1/OpenReasoning-Nemotron-1.5B
Text Generation
• Updated NVFP4/Qwen3-Coder-30B-A3B-Instruct-FP4
Text Generation
• 16B • Updated • 7.27k
• 27
Text Generation
• 0.4B • Updated • 340
• 2
pamanseau/OpenReasoning-Nemotron-32B
Text Generation
• 33B • Updated • 2
tensorblock/nvidia_OpenReasoning-Nemotron-1.5B-GGUF
Text Generation
• 2B • Updated • 21
tensorblock/nvidia_AceReason-Nemotron-14B-GGUF
Text Generation
• 15B • Updated • 12
nvidia/NVIDIA-Nemotron-Nano-9B-v2
Text Generation
• 9B • Updated • 489k
• 492
imgailab/flux1-schnell-bf16-ampere
Text-to-Image
• Updated imgailab/flux1-dev-bf16-ampere
Text-to-Image
• Updated imgailab/sdxl-bf16-ampere
Text-to-Image
• Updated nvidia/NVIDIA-Nemotron-Nano-12B-v2-Base
Text Generation
• Updated • 3.57k
• 90
TomBombadyl/Qwen2.5-Coder-7B-Instruct-Omni1.1
Text Generation
• 0.2B • Updated • 14
• 2
tensorblock/nvidia_OpenCodeReasoning-Nemotron-1.1-32B-GGUF
Text Generation
• 33B • Updated • 22