Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

1,916

Base only

Active filters: nvidia

unsloth/OpenReasoning-Nemotron-32B

Text Generation • 33B • Updated Jul 21, 2025 • 11 • 2

unsloth/OpenReasoning-Nemotron-32B-GGUF

Text Generation • 33B • Updated Jul 21, 2025 • 952 • 11

prithivMLmods/OpenReasoning-Nemotron-1.5B-F32-GGUF

Text Generation • 2B • Updated Jul 22, 2025 • 136 • 1

mlx-community/OpenReasoning-Nemotron-32B-4bit

Text Generation • Updated Jul 22, 2025 • 5

nvidia/DeepSeek-R1-0528-NVFP4-v2

Text Generation • 394B • Updated Sep 2, 2025 • 763k • 23

LogicBombaklot/OpenReasoning-Nemotron-32B-mlx-8Bit

Text Generation • 9B • Updated Jul 21, 2025 • 1

nvidia/DeepSeek-R1-NVFP4-v2

Text Generation • 394B • Updated Jul 22, 2025 • 2.4k • 8

Ericwang/OpenReasoning-Nemotron-32B-Q4_K_M-GGUF

Text Generation • 33B • Updated Jul 22, 2025 • 1

NVFP4/Qwen3-235B-A22B-Instruct-2507-FP4

Text Generation • 118B • Updated Jul 23, 2025 • 107 • 4

NVFP4/Qwen3-Coder-480B-A35B-Instruct-FP4

Text Generation • 241B • Updated Jul 23, 2025 • 29 • 3

nvidia/Qwen3-235B-A22B-Eagle3

Text Generation • 0.3B • Updated Jan 26 • 734 • 13

ArtusDev/nvidia_OpenReasoning-Nemotron-32B-EXL3

Updated Jul 24, 2025 • 2 • 1

Mungert/OpenReasoning-Nemotron-32B-GGUF

Text Generation • 33B • Updated Sep 24, 2025 • 63 • 3

codys12/OpenReasoning-Nemotron-32B

Text Generation • 33B • Updated Jul 24, 2025 • 2

Mungert/OpenReasoning-Nemotron-7B-GGUF

Text Generation • 8B • Updated Sep 24, 2025 • 261 • 4

Mungert/OpenReasoning-Nemotron-1.5B-GGUF

Text Generation • 2B • Updated Sep 24, 2025 • 211 • 4

nvidia/Llama-3_3-Nemotron-Super-49B-v1_5

Text Generation • 50B • Updated Oct 15, 2025 • 53.6k • • 234

gabriellarson/Llama-3_3-Nemotron-Super-49B-v1_5-GGUF

Text Generation • 50B • Updated Jul 26, 2025 • 198 • 6

jncraton/OpenReasoning-Nemotron-1.5B-ct2-int8

Text Generation • Updated Jul 26, 2025 • 4

tensorblock/nvidia_Llama-3.1-Nemotron-Nano-4B-v1.1-GGUF

Text Generation • 5B • Updated Jan 27 • 42 • 1

jncraton/Llama-3.1-Nemotron-Nano-4B-v1.1-ct2-int8

Text Generation • Updated Jul 26, 2025 • 4 • 1

ArtusDev/nvidia_Llama-3_3-Nemotron-Super-49B-v1_5-EXL3

Text Generation • Updated Jul 26, 2025 • 2 • 6

mradermacher/Llama-3_3-Nemotron-Super-49B-v1_5-GGUF

50B • Updated Jul 27, 2025 • 100 • 1

NVFP4/Qwen3-235B-A22B-Thinking-2507-FP4

Text Generation • 118B • Updated Jul 26, 2025 • 231 • 2

Mungert/Llama-3_3-Nemotron-Super-49B-v1_5-GGUF

Text Generation • 50B • Updated Sep 24, 2025 • 182 • 7

Mungert/OpenReasoning-Nemotron-14B-GGUF

Text Generation • 15B • Updated Sep 24, 2025 • 269 • 3

mradermacher/Llama-3_3-Nemotron-Super-49B-v1_5-i1-GGUF

50B • Updated Dec 18, 2025 • 86 • 1

cyankiwi/Llama-3_3-Nemotron-Super-49B-v1_5-AWQ-4bit

Text Generation • 8B • Updated Jul 31, 2025 • 767 • 4

groxaxo/OpenCodeReasoning-Nemotron-1.1-32B-GPTQ-W8A16

Text Generation • Updated Jul 28, 2025 • 1

unsloth/Llama-3_3-Nemotron-Super-49B-v1_5

Text Generation • 50B • Updated Jul 31, 2025 • 95 • 3