-
-
-
-
-
-
Inference Providers
Active filters:
nvidia
nvidia/Qwen3-235B-A22B-Eagle3
Text Generation
•
0.3B
•
Updated
•
3.49k
•
9
ArtusDev/nvidia_OpenReasoning-Nemotron-32B-EXL3
Updated
•
1
•
1
Mungert/OpenReasoning-Nemotron-32B-GGUF
Text Generation
•
33B
•
Updated
•
31
•
3
codys12/OpenReasoning-Nemotron-32B
Text Generation
•
33B
•
Updated
•
1
Mungert/OpenReasoning-Nemotron-7B-GGUF
Text Generation
•
8B
•
Updated
•
71
•
4
Mungert/OpenReasoning-Nemotron-1.5B-GGUF
Text Generation
•
2B
•
Updated
•
33
•
4
gabriellarson/Llama-3_3-Nemotron-Super-49B-v1_5-GGUF
Text Generation
•
50B
•
Updated
•
3.63k
•
6
jncraton/OpenReasoning-Nemotron-1.5B-ct2-int8
Text Generation
•
Updated
•
1
tensorblock/nvidia_Llama-3.1-Nemotron-Nano-4B-v1.1-GGUF
Text Generation
•
5B
•
Updated
•
68
•
1
jncraton/Llama-3.1-Nemotron-Nano-4B-v1.1-ct2-int8
Text Generation
•
Updated
•
3
ArtusDev/nvidia_Llama-3_3-Nemotron-Super-49B-v1_5-EXL3
Text Generation
•
Updated
•
6
mradermacher/Llama-3_3-Nemotron-Super-49B-v1_5-GGUF
50B
•
Updated
•
29
•
1
NVFP4/Qwen3-235B-A22B-Thinking-2507-FP4
Text Generation
•
118B
•
Updated
•
162
•
2
Mungert/Llama-3_3-Nemotron-Super-49B-v1_5-GGUF
Text Generation
•
50B
•
Updated
•
78
•
7
Mungert/OpenReasoning-Nemotron-14B-GGUF
Text Generation
•
15B
•
Updated
•
65
•
3
mradermacher/Llama-3_3-Nemotron-Super-49B-v1_5-i1-GGUF
50B
•
Updated
•
78
•
1
cyankiwi/Llama-3_3-Nemotron-Super-49B-v1_5-AWQ-4bit
Text Generation
•
8B
•
Updated
•
312
•
3
groxaxo/OpenCodeReasoning-Nemotron-1.1-32B-GPTQ-W8A16
Text Generation
•
Updated
•
1
unsloth/Llama-3_3-Nemotron-Super-49B-v1_5
Text Generation
•
50B
•
Updated
•
20
•
3
unsloth/Llama-3_3-Nemotron-Super-49B-v1_5-GGUF
Text Generation
•
50B
•
Updated
•
2.42k
•
10
BitPhinix/DeepSeek-V3-0324-FP4
Text Generation
•
397B
•
Updated
•
3
DeusImperator/Llama-3_3-Nemotron-Super-49B-v1_5_exl3_4.0bpw_H6
Text Generation
•
13B
•
Updated
•
8
mradermacher/Llama-3_3-Nemotron-Super-49B-GenRM-GGUF
50B
•
Updated
•
17
mradermacher/Llama-3_3-Nemotron-Super-49B-GenRM-i1-GGUF
50B
•
Updated
•
127
nvidia/Llama-3_3-Nemotron-Super-49B-v1_5-FP8
Text Generation
•
50B
•
Updated
•
6.17k
•
24
TomBombadyl/Qwen2.5-Coder-7B-Instruct-Omni1.0
Question Answering
•
Updated
•
3
ArtusDev/nvidia_Llama-3.1-Nemotron-Nano-8B-v1-AWQ
2B
•
Updated
•
6
•
1
NVFP4/Qwen3-30B-A3B-Instruct-2507-FP4
Text Generation
•
16B
•
Updated
•
1.33k
•
11
NVFP4/Qwen3-30B-A3B-Thinking-2507-FP4
Text Generation
•
16B
•
Updated
•
1.47k
•
4
Prince-1/OpenReasoning-Nemotron-7B
Text Generation
•
Updated