Active filters: nvidia
QuantFactory/OpenReasoning-Nemotron-7B-GGUF • Text Generation • 8B • 150 • 2
Lumia101/NVIDIA-Nemotron-Nano-9B-v2-Q4_K_M-GGUF • Text Generation • 9B • 46
mlx-community/NVIDIA-Nemotron-Nano-9B-v2-6bit • Text Generation • 23
Mungert/NVIDIA-Nemotron-Nano-12B-v2-GGUF • Text Generation • 12B • 245 • 2
nvidia/Phi-4-multimodal-instruct-FP8 • 6B • 35.4k • 4
nvidia/Phi-4-reasoning-plus-FP8 • 15B • 536 • 3
nvidia/Phi-4-reasoning-plus-NVFP4 • 8B • 7.09k • 6
nvidia/Llama-3.1-8B-Instruct-NVFP4 • 5B • 100k • 6
DBMe/Llama-3_1-Nemotron-Ultra-253B-v1-exl3-2.7bpw • Text Generation • 45B • 7
maxrubin629/Nemotron-H-8B-Reasoning-128K-6bit • Text Generation • 8B • 6
straino/NVIDIA-Nemotron-Nano-9B-v2-Base-Q4_K_M-GGUF • Text Generation • 9B • 11 • 1
NexVeridian/NVIDIA-Nemotron-Nano-9B-v2-3bit • Text Generation • 1B • 15
NexVeridian/NVIDIA-Nemotron-Nano-9B-v2-4bit • Text Generation • 1B • 12
NexVeridian/NVIDIA-Nemotron-Nano-9B-v2-6bit • Text Generation • 9B • 5
NexVeridian/NVIDIA-Nemotron-Nano-9B-v2-8bit • Text Generation • 9B • 21
NexVeridian/NVIDIA-Nemotron-Nano-12B-v2-3bit • Text Generation • 12B • 20
NexVeridian/NVIDIA-Nemotron-Nano-12B-v2-4bit • Text Generation • 12B • 18
NexVeridian/NVIDIA-Nemotron-Nano-12B-v2-5bit • Text Generation • 12B • 10
NexVeridian/NVIDIA-Nemotron-Nano-12B-v2-6bit • Text Generation • 12B • 15
NexVeridian/NVIDIA-Nemotron-Nano-12B-v2-8bit • Text Generation • 12B • 13
Text Generation • 5B • 11.8k • 13
Text Generation • 8B • 4.97k • 3
Text Generation • 8B • 17.8k • 5
Text Generation • 17B • 17k • 5
nvidia/Qwen2.5-VL-7B-Instruct-FP8 • Text Generation • 8B • 691 • 7
nvidia/Qwen2.5-VL-7B-Instruct-NVFP4 • Text Generation • 5B • 5.61k • 13
SandLogicTechnologies/OpenReasoning-Nemotron-1.5B-GGUF • Text Generation • 2B • 3
SandLogicTechnologies/OpenReasoning-Nemotron-7B-GGUF • Text Generation • 8B • 9
MorsiKK/Llama-3.1-Nemotron-Nano-8B-v1-Q4_K_M-GGUF • Text Generation • 8B • 51
mlx-community/Llama-3_3-Nemotron-Super-49B-v1_5-mlx-4Bit • Text Generation • 50B • 63 • 3