Inference Providers
Active filters: nvidia
NVFP4/Qwen3-235B-A22B-Instruct-2507-FP4
Text Generation
• 118B • Updated • 274
• 4
NVFP4/Qwen3-Coder-480B-A35B-Instruct-FP4
Text Generation
• 241B • Updated • 2.48k
• 2
nvidia/Qwen3-235B-A22B-Eagle3
Text Generation
• 0.3B • Updated • 319
• 12
ArtusDev/nvidia_OpenReasoning-Nemotron-32B-EXL3
Mungert/OpenReasoning-Nemotron-32B-GGUF
Text Generation
• 33B • Updated • 64
• 3
codys12/OpenReasoning-Nemotron-32B
Text Generation
• 33B • Updated • 13
Mungert/OpenReasoning-Nemotron-7B-GGUF
Text Generation
• 8B • Updated • 222
• 4
Mungert/OpenReasoning-Nemotron-1.5B-GGUF
Text Generation
• 2B • Updated • 49
• 4
nvidia/Llama-3_3-Nemotron-Super-49B-v1_5
Text Generation
• 50B • Updated • 177k
• 228
gabriellarson/Llama-3_3-Nemotron-Super-49B-v1_5-GGUF
Text Generation
• 50B • Updated • 294
• 6
jncraton/OpenReasoning-Nemotron-1.5B-ct2-int8
Text Generation
• Updated • 12
tensorblock/nvidia_Llama-3.1-Nemotron-Nano-4B-v1.1-GGUF
Text Generation
• 5B • Updated • 32
• 1
jncraton/Llama-3.1-Nemotron-Nano-4B-v1.1-ct2-int8
Text Generation
• Updated • 14
• 1
ArtusDev/nvidia_Llama-3_3-Nemotron-Super-49B-v1_5-EXL3
Text Generation
• Updated • 1
• 6
mradermacher/Llama-3_3-Nemotron-Super-49B-v1_5-GGUF
50B • Updated • 18
• 1
NVFP4/Qwen3-235B-A22B-Thinking-2507-FP4
Text Generation
• 118B • Updated • 8
• 2
Mungert/Llama-3_3-Nemotron-Super-49B-v1_5-GGUF
Text Generation
• 50B • Updated • 132
• 7
Mungert/OpenReasoning-Nemotron-14B-GGUF
Text Generation
• 15B • Updated • 119
• 3
mradermacher/Llama-3_3-Nemotron-Super-49B-v1_5-i1-GGUF
50B • Updated • 103
• 1
cyankiwi/Llama-3_3-Nemotron-Super-49B-v1_5-AWQ-4bit
Text Generation
• 8B • Updated • 953
• 3
groxaxo/OpenCodeReasoning-Nemotron-1.1-32B-GPTQ-W8A16
Text Generation
• Updated • 1
unsloth/Llama-3_3-Nemotron-Super-49B-v1_5
Text Generation
• 50B • Updated • 295
• 3
unsloth/Llama-3_3-Nemotron-Super-49B-v1_5-GGUF
Text Generation
• 50B • Updated • 852
• 10
BitPhinix/DeepSeek-V3-0324-FP4
Text Generation
• 397B • Updated • 1
DeusImperator/Llama-3_3-Nemotron-Super-49B-v1_5_exl3_4.0bpw_H6
Text Generation
• 13B • Updated • 2
mradermacher/Llama-3_3-Nemotron-Super-49B-GenRM-GGUF
50B • Updated • 36
mradermacher/Llama-3_3-Nemotron-Super-49B-GenRM-i1-GGUF
50B • Updated • 69
TomBombadyl/Qwen2.5-Coder-7B-Instruct-Omni1.0
Question Answering
• Updated • 3
ArtusDev/nvidia_Llama-3.1-Nemotron-Nano-8B-v1-AWQ
2B • Updated • 49
• 1
NVFP4/Qwen3-30B-A3B-Thinking-2507-FP4
Text Generation
• 16B • Updated • 175
• 4