Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

1,916

Base only

Active filters: nvidia

nvidia/Qwen3-Nemotron-8B-BRRM

Text Generation • Updated Dec 18, 2025 • 141 • • 9

nvidia/Qwen3-Nemotron-14B-BRRM

Updated Dec 18, 2025 • 26 • 13

RedHatAI/NVIDIA-Nemotron-Nano-9B-v2-quantized.w4a16

Text Generation • 2B • Updated Apr 28 • 345 • 5

nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL-FP8

Image-Text-to-Text • 13B • Updated Nov 13, 2025 • 26.5k • 51

nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL-NVFP4-QAD

Image-Text-to-Text • 8B • Updated May 5 • 32.3k • 28

bourn23/nvidia-llama-3.1-nemotron-nano-8b-v1-mlx

Text Generation • 8B • Updated Oct 24, 2025 • 14

bourn23/nvidia-llama-3.1-nemotron-nano-8b-v1-mlx-4bit

Text Generation • 1B • Updated Oct 24, 2025 • 22

sommerzen/Nemotron-H-4B-Instruct-128K-Q8_0-GGUF

Text Generation • 4B • Updated Oct 25, 2025 • 34

sillykiwi/Nemotron-H-4B-Instruct-128K-Q6_K-GGUF

Text Generation • 4B • Updated Oct 27, 2025 • 73

TheStageAI/thewhisper-large-v3

Automatic Speech Recognition • 2B • Updated Dec 13, 2025 • 3 • 2

TheStageAI/thewhisper-large-v3-turbo

Automatic Speech Recognition • 0.8B • Updated 14 days ago • 1.75k • 25

GrimsenClory/NVIDIA-Nemotron-Nano-12B-v2-Q6_K-GGUF

Text Generation • 12B • Updated Oct 29, 2025 • 8

ArtusDev/nvidia_Qwen-3-Nemotron-32B-RLBFF-EXL3

Updated Oct 30, 2025

ArtusDev/nvidia_Qwen-3-Nemotron-32B-GenRM-Principle-EXL3

Updated Oct 30, 2025 • 15

ArtusDev/nvidia_Llama-3.3-Nemotron-70B-Reward-Principle-EXL3

Updated Oct 30, 2025 • 2

mradermacher/Llama-3.3-Nemotron-70B-Reward-Principle-GGUF

71B • Updated Oct 30, 2025 • 30

mradermacher/Qwen3-Nemotron-8B-BRRM-GGUF

8B • Updated Oct 30, 2025 • 89

mradermacher/Llama-3.3-Nemotron-70B-Reward-Principle-i1-GGUF

71B • Updated Dec 10, 2025 • 973

mradermacher/Qwen3-Nemotron-32B-GenRM-Principle-GGUF

33B • Updated Oct 30, 2025 • 70

mradermacher/Qwen3-Nemotron-8B-BRRM-i1-GGUF

8B • Updated Dec 10, 2025 • 124

mradermacher/Qwen3-Nemotron-32B-RLBFF-GGUF

33B • Updated Oct 30, 2025 • 84 • 5

mradermacher/Qwen3-Nemotron-32B-GenRM-Principle-i1-GGUF

33B • Updated Dec 10, 2025 • 201

mradermacher/Qwen3-Nemotron-32B-RLBFF-i1-GGUF

33B • Updated Dec 10, 2025 • 396 • 1

cyankiwi/Qwen3-Nemotron-32B-RLBFF-AWQ-4bit

Text Generation • 6B • Updated Oct 30, 2025 • 6

cyankiwi/Qwen3-Nemotron-32B-RLBFF-AWQ-8bit

Text Generation • 10B • Updated Oct 30, 2025 • 13

mradermacher/Qwen3-Nemotron-14B-BRRM-GGUF

15B • Updated Oct 31, 2025 • 60 • 1

ExaltedSlayer/Qwen3-Nemotron-32B-RLBFF-mxfp4-mlx

33B • Updated Oct 31, 2025 • 3

mradermacher/Qwen3-Nemotron-14B-BRRM-i1-GGUF

15B • Updated Dec 5, 2025 • 226 • 2

Nayana-cognitivelab/Llama_Nemotron_SectionOCR_SFT_En_Kn_15000

9B • Updated Nov 3, 2025 • 1

Ex0bit/Qwen3-VLTO-32B-Instruct-NVFP4

Text Generation • 17B • Updated Nov 3, 2025 • 1.77k • 1