Inference Providers
Active filters: nvidia
bartowski/nvidia_Llama-3.1-Nemotron-Nano-8B-v1-GGUF
Text Generation
• 8B • Updated • 1.43k
• 10
mradermacher/Llama-3_3-Nemotron-Super-49B-v1-GGUF
50B • Updated • 60
mradermacher/Llama-3_3-Nemotron-Super-49B-v1-i1-GGUF
50B • Updated • 224
• 5
ilintar/Llama-3-1-Nemotron-Nano-8B-v1-i-GGUF
Text Generation
• 8B • Updated • 50
• 1
Mungert/Llama-3.1-Nemotron-Nano-8B-v1-GGUF
Text Generation
• 8B • Updated • 156
• 8
tensorblock/AceInstruct-1.5B-GGUF
Text Generation
• 2B • Updated • 87
QuantFactory/Llama-3.1-Nemotron-Nano-8B-v1-GGUF
Text Generation
• 8B • Updated • 100
• 4
mradermacher/Llama-3.1-Nemotron-Nano-8B-v1-GGUF
8B • Updated • 186
• 2
mradermacher/Llama-3.1-Nemotron-Nano-8B-v1-i1-GGUF
8B • Updated • 451
• 3
aifeifei798/Llama-3.1-Nemotron-Nano-8B-v1-bnb-4bit
Text Generation
• 8B • Updated • 2
GrimsenClory/Llama-3.1-Nemotron-Nano-8B-v1-Q6_K-GGUF
Text Generation
• 8B • Updated • 11
ysn-rfd/AceInstruct-1.5B-GGUF
Text Generation
• 2B • Updated • 13
• 1
ysn-rfd/AceInstruct-7B-GGUF
Text Generation
• 8B • Updated • 10
mradermacher/nemo_sup-GGUF
50B • Updated • 22
Mungert/Llama-3_3-Nemotron-Super-49B-v1-GGUF
Text Generation
• 50B • Updated • 103
• 5
openfree/Llama-3_3-Nemotron-Super-49B-v1-Q6_K-GGUF
Text Generation
• 50B • Updated • 4
• 8
openfree/Llama-3_3-Nemotron-Super-49B-v1-Q4_K_M-GGUF
Text Generation
• 50B • Updated • 28
• 5
mradermacher/Llama-3_1-Nemotron-51B-Instruct-abliterated-GGUF
52B • Updated • 96
mradermacher/Llama-3_1-Nemotron-51B-Instruct-abliterated-i1-GGUF
52B • Updated • 118
lmstudio-community/Llama-3_1-Nemotron-Ultra-253B-v1-GGUF
Text Generation
• 253B • Updated • 53
• 1
bartowski/nvidia_Llama-3_1-Nemotron-Ultra-253B-v1-GGUF
Text Generation
• 253B • Updated • 316
• 5
nvidia/Llama-3_1-Nemotron-Ultra-253B-CPT-v1
Text Generation
• 253B • Updated • 994
• 6
nvidia/Nemotron-H-47B-Base-8K
Text Generation
• Updated • 1.21k
• 22
nvidia/Nemotron-H-56B-Base-8K
Text Generation
• Updated • 16.5k
• 33
mlx-community/Llama-3_3-Nemotron-Super-49B-v1-mlx-6bit
Text Generation
• 11B • Updated • 56
mlx-community/Llama-3_3-Nemotron-Super-49B-v1-mlx-4bit
Text Generation
• 8B • Updated • 206
• 2
unsloth/Llama-3_1-Nemotron-Ultra-253B-v1-GGUF
Text Generation
• 253B • Updated • 2.09k
• 9
nvidia/Llama-4-Scout-17B-16E-Instruct-NVFP4
56B • Updated • 179k
• 32
FriendliAI/Llama-3_1-Nemotron-Ultra-253B-v1
Text Generation
• 253B • Updated • 879
FriendliAI/Llama-3_3-Nemotron-Super-49B-v1
Text Generation
• 50B • Updated • 2