-
-
-
-
-
-
Inference Providers
Active filters:
nvidia
mradermacher/nemotron-nano-30b-a3b-clinical-instruct-GGUF
32B
•
Updated
•
849
•
2
stelterlab/NVIDIA-Nemotron-3-Nano-30B-A3B-AWQ
Text Generation
•
5B
•
Updated
•
1.28k
•
2
Text Generation
•
Updated
•
959
•
136
nvidia/Cosmos-1.0-Tokenizer-CV8x8x8
Updated
•
149
•
23
nvidia/Cosmos-1.0-Diffusion-7B-Text2World
Text-to-Video
•
Updated
•
3.08k
•
231
nvidia/AceMath-7B-Instruct
Text Generation
•
8B
•
Updated
•
804
•
•
31
nvidia/Llama-4-Scout-17B-16E-Instruct-NVFP4
56B
•
Updated
•
27.3k
•
20
nexuslrf/diffusion_renderer-inverse-svd
Updated
•
33
•
3
nvidia/Llama-3.1-Nemotron-Nano-4B-v1.1
Text Generation
•
5B
•
Updated
•
2.8k
•
112
lmstudio-community/OpenCodeReasoning-Nemotron-7B-GGUF
Text Generation
•
8B
•
Updated
•
69
•
4
nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1
Image-Text-to-Text
•
Updated
•
926k
•
175
nvidia/DeepSeek-R1-0528-NVFP4
Text Generation
•
397B
•
Updated
•
14.6k
•
41
nvidia/Cosmos-Predict2-14B-Sample-GR00T-Dreams-DROID
Updated
•
99
•
2
city96/Cosmos-Predict2-14B-Text2Image-gguf
14B
•
Updated
•
243
•
11
nvidia/Qwen3-30B-A3B-NVFP4
Text Generation
•
16B
•
Updated
•
24.1k
•
23
nvidia/Llama-3_3-Nemotron-Super-49B-v1_5
Text Generation
•
50B
•
Updated
•
53.5k
•
223
nvidia/Llama-3_3-Nemotron-Super-49B-v1_5-FP8
Text Generation
•
50B
•
Updated
•
5.53k
•
24
nvidia/gpt-oss-120b-Eagle3-long-context
Text Generation
•
0.2B
•
Updated
•
4.35k
•
58
nvidia/Phi-4-multimodal-instruct-NVFP4
4B
•
Updated
•
3.27k
•
7
Text Generation
•
15B
•
Updated
•
3.15k
•
3
nvidia/Qwen2.5-VL-7B-Instruct-NVFP4
Text Generation
•
5B
•
Updated
•
5.6k
•
13
unsloth/NVIDIA-Nemotron-Nano-9B-v2
Text Generation
•
9B
•
Updated
•
171
•
2
nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1-FP4-QAD
Image-Text-to-Text
•
6B
•
Updated
•
113
•
12
nvidia/NVIDIA-Nemotron-Nano-9B-v2-NVFP4
Text Generation
•
6B
•
Updated
•
22.2k
•
18
nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL-BF16
Image-Text-to-Text
•
13B
•
Updated
•
79.9k
•
74
Image-to-Image
•
Updated
•
508
•
14
Ex0bit/Qwen3-VLTO-32B-Instruct-NVFP4
Text Generation
•
17B
•
Updated
•
126
•
1
Ex0bit/Qwen3-VLTO-32B-Instruct-NVFP4-256K
Text Generation
•
17B
•
Updated
•
144
•
1
nvidia/KVzap-linear-Qwen3-8B
Other
•
1.18M
•
Updated
•
111
•
1
nvidia/Qwen3-Nemotron-235B-A22B-GenRM
Text Generation
•
235B
•
Updated
•
265
•
22