-
-
-
-
-
-
Inference Providers
Active filters:
nvidia
Triangle104/OpenCodeReasoning-Nemotron-7B-Q5_K_S-GGUF
Text Generation
•
8B
•
Updated
Triangle104/OpenCodeReasoning-Nemotron-7B-Q5_K_M-GGUF
Text Generation
•
8B
•
Updated
•
2
Triangle104/OpenCodeReasoning-Nemotron-7B-Q6_K-GGUF
Text Generation
•
8B
•
Updated
Triangle104/OpenCodeReasoning-Nemotron-7B-Q8_0-GGUF
Text Generation
•
8B
•
Updated
•
2
•
1
marcorez8/llama-cpp-python-windows-blackwell-cuda
nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1
Image-Text-to-Text
•
Updated
•
925k
•
175
mradermacher/Llama-3.1-Nemotron-Nano-4B-v1.1-GGUF
5B
•
Updated
•
91
•
3
mradermacher/Llama-3.1-Nemotron-Nano-4B-v1.1-i1-GGUF
5B
•
Updated
•
143
•
1
mlx-community/Llama-3.1-Nemotron-Nano-4B-v1.1-bf16
Text Generation
•
5B
•
Updated
•
16
mlx-community/Llama-3.1-Nemotron-Nano-4B-v1.1-8bit
Text Generation
•
1B
•
Updated
•
3
mlx-community/Llama-3.1-Nemotron-Nano-4B-v1.1-4bit
Text Generation
•
0.7B
•
Updated
•
46
Triangle104/OpenMath-Nemotron-14B-Q4_K_S-GGUF
15B
•
Updated
•
1
Triangle104/OpenMath-Nemotron-14B-Q4_K_M-GGUF
15B
•
Updated
•
2
Triangle104/OpenMath-Nemotron-14B-Q5_K_S-GGUF
15B
•
Updated
Triangle104/OpenMath-Nemotron-14B-Q5_K_M-GGUF
15B
•
Updated
Triangle104/OpenMath-Nemotron-14B-Q6_K-GGUF
15B
•
Updated
Triangle104/OpenMath-Nemotron-14B-Q8_0-GGUF
15B
•
Updated
nvidia/Nemotron-H-8B-Reasoning-128K
Text Generation
•
8B
•
Updated
•
94
•
25
nvidia/Nemotron-H-8B-Reasoning-128K-FP8
Text Generation
•
8B
•
Updated
•
60
•
12
nvidia/Cosmos-Predict2-14B-Sample-GR00T-Dreams-GR1
Updated
•
79
•
3
nvidia/Cosmos-Predict2-14B-Sample-GR00T-Dreams-DROID
Updated
•
92
•
2
nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1-mcore
Image-Text-to-Text
•
Updated
•
2
KalaiarasiS14/Llama-3.1-Nemotron-Nano-8B-v1-Q4_0-GGUF
Text Generation
•
8B
•
Updated
greenwich157/Llama-3.1-Minitron-4B-Width-Base-Q4_0-GGUF
Text Generation
•
5B
•
Updated
•
1
Jazco4/Llama-3.1-Nemotron-Nano-8B-v1-Q4_K_M-GGUF
Text Generation
•
8B
•
Updated
botirk/tiny-prompt-task-complexity-classifier
Text Classification
•
Updated
•
6
•
2
nvidia/OpenCodeReasoning-Nemotron-1.1-14B
Text Generation
•
15B
•
Updated
•
84
•
12
nvidia/OpenCodeReasoning-Nemotron-1.1-32B
Text Generation
•
33B
•
Updated
•
77
•
46
nvidia/OpenCodeReasoning-Nemotron-1.1-7B
Text Generation
•
8B
•
Updated
•
79
•
12
nvidia/Cosmos-Predict2-2B-Sample-Action-Conditioned
Updated
•
37
•
6