Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

1,924

Base only

Active filters: nvidia

nvidia/Cosmos-Tokenize1-CV8x8x8-720p

Updated Apr 23, 2025 • 2.05k • 13

nvidia/Cosmos-Tokenize1-DV8x16x16-720p

Updated Mar 18, 2025 • 782 • 3

nvidia/Llama-3.3-Nemotron-70B-Feedback

Text Generation • 71B • Updated Mar 18, 2025 • 52 • • 9

nvidia/Llama-3.3-Nemotron-70B-Edit

Text Generation • 71B • Updated Mar 18, 2025 • 38 • 4

nvidia/Llama-3.3-Nemotron-70B-Select

Text Generation • 71B • Updated Mar 18, 2025 • 224 • • 12

nvidia/Cosmos-UpsamplePrompt1-12B-Text2World

Updated Apr 1, 2025 • 1.98k • 2

nvidia/Cosmos-Predict1-7B-Decoder-DV8x16x16ToCV8x8x8-720p

Updated Mar 16, 2025 • 36 • 1

nvidia/Llama-3.1-Nemotron-Nano-8B-v1

Text Generation • 8B • Updated Oct 15, 2025 • 33.6k • • 220

nvidia/Cosmos-UpsamplePrompt1-12B-Transfer

Updated Mar 26, 2025 • 6

shylane/Llama-3.1-Nemotron-Nano-8B-v1-Q6_K-GGUF

Text Generation • 8B • Updated Mar 18, 2025 • 11 • 1

mradermacher/Llama-3.3-Nemotron-70B-Select-GGUF

71B • Updated Jul 31, 2025 • 58

tensorblock/Llama-3.1-Nemotron-Nano-8B-v1-GGUF

Text Generation • 8B • Updated Jan 27 • 45 • 6

mradermacher/Llama-3.3-Nemotron-70B-Select-i1-GGUF

71B • Updated Jul 11, 2025 • 178

bartowski/nvidia_Llama-3_3-Nemotron-Super-49B-v1-GGUF

Text Generation • 50B • Updated Mar 19, 2025 • 1.8k • 50

medmekk/Llama-3.1-Nemotron-70B-Instruct-HF-bnb-4bit

Text Generation • 72B • Updated Mar 19, 2025 • 6

NikolayKozloff/Llama-3.1-Nemotron-Nano-8B-v1-Q8_0-GGUF

Text Generation • 8B • Updated Mar 19, 2025 • 9 • 1

nvidia/Cosmos-Transfer1-7B-4KUpscaler

Updated Mar 20, 2025 • 3 • 10

nvidia/Cosmos-Predict1-7B-WorldInterpolator

Updated Apr 8, 2025 • 26 • 6

nvidia/Nemotron-H-8B-Base-8K

Text Generation • 8B • Updated Aug 21, 2025 • 78.7k • 58

airsmtp/tripl3sixmafia

Text Generation • Updated Mar 20, 2025

nvidia/Nemotron-H-4B-Base-8K

Text Generation • 4B • Updated Oct 24, 2025 • 4.91k • 7

bartowski/nvidia_Llama-3.1-Nemotron-Nano-8B-v1-GGUF

Text Generation • 8B • Updated Mar 20, 2025 • 1.25k • 11

mradermacher/Llama-3_3-Nemotron-Super-49B-v1-GGUF

50B • Updated Jul 31, 2025 • 349

mradermacher/Llama-3_3-Nemotron-Super-49B-v1-i1-GGUF

50B • Updated Jul 11, 2025 • 556 • 5

ilintar/Llama-3-1-Nemotron-Nano-8B-v1-i-GGUF

Text Generation • 8B • Updated Mar 21, 2025 • 15 • 1

Mungert/Llama-3.1-Nemotron-Nano-8B-v1-GGUF

Text Generation • 8B • Updated Sep 24, 2025 • 377 • 8

tensorblock/AceInstruct-1.5B-GGUF

Text Generation • 2B • Updated Jan 27 • 31

QuantFactory/Llama-3.1-Nemotron-Nano-8B-v1-GGUF

Text Generation • 8B • Updated Mar 23, 2025 • 91 • 4

mradermacher/Llama-3.1-Nemotron-Nano-8B-v1-GGUF

8B • Updated Jul 11, 2025 • 81 • 2

mradermacher/Llama-3.1-Nemotron-Nano-8B-v1-i1-GGUF

8B • Updated Jul 11, 2025 • 607 • 3