Active filters: nvidia
nvidia/Nemotron-H-4B-Base-8K • Text Generation • 4B • 4.53k • 5
bartowski/nvidia_Llama-3.1-Nemotron-Nano-8B-v1-GGUF • Text Generation • 8B • 364 • 10
mradermacher/Llama-3_3-Nemotron-Super-49B-v1-GGUF • 50B • 43
mradermacher/Llama-3_3-Nemotron-Super-49B-v1-i1-GGUF • 50B • 506 • 5
ilintar/Llama-3-1-Nemotron-Nano-8B-v1-i-GGUF • Text Generation • 8B • 61 • 1
Mungert/Llama-3.1-Nemotron-Nano-8B-v1-GGUF • Text Generation • 8B • 95 • 8
tensorblock/AceInstruct-1.5B-GGUF • Text Generation • 2B • 37
QuantFactory/Llama-3.1-Nemotron-Nano-8B-v1-GGUF • Text Generation • 8B • 158 • 4
mradermacher/Llama-3.1-Nemotron-Nano-8B-v1-GGUF • 8B • 128 • 2
mradermacher/Llama-3.1-Nemotron-Nano-8B-v1-i1-GGUF • 8B • 229 • 3
aifeifei798/Llama-3.1-Nemotron-Nano-8B-v1-bnb-4bit • Text Generation • 8B • 2
GrimsenClory/Llama-3.1-Nemotron-Nano-8B-v1-Q6_K-GGUF • Text Generation • 8B • 1
ysn-rfd/AceInstruct-1.5B-GGUF • Text Generation • 2B • 11 • 1
ysn-rfd/AceInstruct-7B-GGUF • Text Generation • 8B • 10
mradermacher/nemo_sup-GGUF • 50B • 1
Mungert/Llama-3_3-Nemotron-Super-49B-v1-GGUF • Text Generation • 50B • 50 • 5
openfree/Llama-3_3-Nemotron-Super-49B-v1-Q6_K-GGUF • Text Generation • 50B • 24 • 8
openfree/Llama-3_3-Nemotron-Super-49B-v1-Q4_K_M-GGUF • Text Generation • 50B • 12 • 5
mradermacher/Llama-3_1-Nemotron-51B-Instruct-abliterated-GGUF • 52B • 102
mradermacher/Llama-3_1-Nemotron-51B-Instruct-abliterated-i1-GGUF • 52B • 115
nvidia/Llama-3_1-Nemotron-Ultra-253B-v1 • Text Generation • 1.25k • 343
lmstudio-community/Llama-3_1-Nemotron-Ultra-253B-v1-GGUF • Text Generation • 253B • 97 • 1
nvidia/Llama-3_1-Nemotron-Ultra-253B-CPT-v1 • Text Generation • 253B • 32 • 6
nvidia/Nemotron-H-47B-Base-8K • Text Generation • 433 • 21
nvidia/Nemotron-H-56B-Base-8K • Text Generation • 10.7k • 32
mlx-community/Llama-3_3-Nemotron-Super-49B-v1-mlx-6bit • Text Generation • 11B • 11
mlx-community/Llama-3_3-Nemotron-Super-49B-v1-mlx-4bit • Text Generation • 8B • 13
unsloth/Llama-3_1-Nemotron-Ultra-253B-v1-GGUF • Text Generation • 253B • 379 • 9
FriendliAI/Llama-3_1-Nemotron-Ultra-253B-v1 • Text Generation • 253B • 9