Models

1,914

Active filters: nvidia

nvidia/DeepSeek-V3.2-NVFP4

Text Generation • 394B • Updated Jan 21 • 8.25k • 15

nvidia/Qwen3-235B-A22B-Thinking-2507-NVFP4

Text Generation • 120B • Updated Jan 30 • 1.54k • 8

nvidia/Qwen3-235B-A22B-Instruct-2507-NVFP4

Text Generation • 120B • Updated Jan 30 • 3.62k • 10

SiddhJagani/Nemotron-Cascade-8B-Thinking-Claude-4.5-Opus-High-Reasoning-Distill-mlx-Q6

Text Generation • 8B • Updated Dec 31, 2025 • 6

SiddhJagani/Nemotron-Cascade-8B-Thinking-Claude-4.5-Opus-High-Reasoning-Distill-mlx-Q4

Text Generation • 1B • Updated Dec 31, 2025 • 10

nvidia/Qwen2.5-CascadeRL-RM-72B

Text Generation • 71B • Updated Jan 1 • 335 • 13

HuangRT/TensorRT_Llama-3.1-Minitron-4B-Width-Base

Text Generation • 5B • Updated Jan 5 • 4

mradermacher/Huihui-NVIDIA-Nemotron-Nano-9B-v2-abliterated-i1-GGUF

9B • Updated Jan 5 • 666 • 10

cybermotaz/Falcon-H1R-7B-NVFP4

Text Generation • 4B • Updated Jan 6 • 7 • 1

RowanBird779/cosmos-reason2-8b-abliterated

Image-Text-to-Text • 9B • Updated Jan 7 • 3 • 2

icefog72/Elbaz-NVIDIA-Nemotron-3-Nano-30B-A3B-PRISM-Q5_K_M

Text Generation • 32B • Updated Jan 6 • 161 • 1

Cheeeeeeeeky/affine-pondering

Text Generation • 32B • Updated Jan 7 • 4

Cheeeeeeeeky/affine-basedmaxxing

Text Generation • 15B • Updated Jan 8 • 2

sonic-speech/parakeet-tdt-0.6b-v3

Automatic Speech Recognition • 0.6B • Updated Mar 8 • 148 • 1

mradermacher/cosmos-reason2-8b-abliterated-GGUF

8B • Updated Jan 8 • 29 • 2

mradermacher/cosmos-reason2-8b-abliterated-i1-GGUF

8B • Updated 8 days ago • 1.35k • 2

mradermacher/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16-GGUF

32B • Updated Mar 19 • 155 • 2

mradermacher/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16-i1-GGUF

32B • Updated Mar 19 • 309 • 1

yepthatsjason/gemma-3-12b-it-nvfp4

Image-Text-to-Text • 8B • Updated Jan 10 • 37

gpudad/groot-pick-place-cube-v1

Robotics • 2B • Updated Jan 10 • 5

nvidia/Nemotron-Research-GooseReason-4B-Instruct

Text Generation • 4B • Updated Mar 1 • 79 • 8

maxim-igenbergs/dave2

vipertsniper/DeepSeek-R1-Distill-Qwen-14B-NVFP4

Text Generation • 8B • Updated Jan 16 • 168

MuXodious/Nemotron-Cascade-14B-Thinking-impotent-heresy

Text Generation • 15B • Updated Jan 20 • 19 • 1

vipertsniper/Qwen3-30B-A3B-NVFP4

16B • Updated Jan 18 • 5

nvidia/Qwen3-8B-DMS-8x

8B • Updated Jan 22 • 73 • 37

mradermacher/Nemotron-Cascade-14B-Thinking-impotent-heresy-GGUF

15B • Updated Jan 20 • 656

mradermacher/Nemotron-Cascade-14B-Thinking-impotent-heresy-i1-GGUF

15B • Updated Jan 20 • 333 • 1

raipolymath/triton-windows

Updated Jan 23 • 1

amihai4by/logic-v2

Image-to-Text • 8B • Updated Jan 24 • 8