Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

1,924

Base only

Active filters: nvidia

backyardai/Llama-3.1-Nemotron-70B-Instruct-GGUF

Text Generation • 71B • Updated Dec 22, 2024 • 110

second-state/Llama-3_1-Nemotron-51B-Instruct-GGUF

Text Generation • 52B • Updated Dec 23, 2024 • 272

gaianet/Llama-3_1-Nemotron-51B-Instruct-GGUF

Text Generation • 52B • Updated Dec 24, 2024 • 241

tensorblock/Llama-3_1-Nemotron-51B-Instruct-GGUF

Text Generation • 52B • Updated Jan 27 • 62

RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-quantized.w4a16

Text Generation • 71B • Updated Jan 3, 2025 • 35

RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-quantized.w8a8

Text Generation • 71B • Updated Jan 3, 2025 • 5

mradermacher/llama-3-nvidia-ChatQA-1.5-8B-GGUF

8B • Updated Jan 4, 2025 • 54

mradermacher/llama-3-nvidia-ChatQA-1.5-8B-i1-GGUF

8B • Updated Jan 4, 2025 • 294

matatonic/Llama-3.1-Nemotron-70B-Instruct-HF-abliterated-6.5bpw-h8-exl2

Text Generation • Updated Jan 6, 2025 • 5

nvidia/Cosmos-1.0-Tokenizer-CV8x8x8

Updated May 7, 2025 • 102 • 26

nvidia/Cosmos-1.0-Tokenizer-DV8x16x16

Updated Jan 12, 2025 • 35 • 20

nvidia/Cosmos-1.0-Prompt-Upsampler-12B-Text2World

Updated Jan 10, 2025 • 90 • 14

nvidia/Cosmos-1.0-Diffusion-7B-Video2World

Updated May 7, 2025 • 1.06k • 40

nvidia/Cosmos-1.0-Diffusion-14B-Text2World

Updated May 7, 2025 • 10 • 61

nvidia/Cosmos-1.0-Diffusion-14B-Video2World

Updated May 7, 2025 • 26 • 59

nvidia/Cosmos-1.0-Autoregressive-13B-Video2World

Updated Feb 8, 2025 • 33

nvidia/Cosmos-1.0-Autoregressive-12B

Updated Feb 11, 2025 • 7 • 31

nvidia/Cosmos-1.0-Autoregressive-5B-Video2World

Updated Feb 8, 2025 • 31

nvidia/Cosmos-1.0-Diffusion-7B-Decoder-DV8x16x16ToCV8x8x8

Updated Jan 10, 2025 • 10

nvidia/Cosmos-1.0-Autoregressive-4B

Updated Feb 11, 2025 • 37 • 57

nvidia/Cosmos-1.0-Diffusion-7B-Text2World

Text-to-Video • Updated May 7, 2025 • 1.65k • 234

mradermacher/Llama-3_1-Nemotron-51B-Instruct-GGUF

52B • Updated Jan 10, 2025 • 24 • 1

mradermacher/Llama-3_1-Nemotron-51B-Instruct-i1-GGUF

52B • Updated Jan 10, 2025 • 86 • 1

mradermacher/Llama-3.2-Nemotron-3B-Instruct-GGUF

3B • Updated Jan 12, 2025 • 45

mradermacher/Llama-3.2-Nemotron-3B-Instruct-i1-GGUF

3B • Updated Jan 12, 2025 • 60

nvidia/AceMath-1.5B-Instruct

Text Generation • 2B • Updated Jan 17, 2025 • 148 • • 17

nvidia/AceMath-7B-Instruct

Text Generation • 8B • Updated Jan 17, 2025 • 411 • 32

nvidia/AceMath-72B-Instruct

Text Generation • 73B • Updated Jan 17, 2025 • 119 • • 21

nvidia/AceMath-72B-RM

Text Generation • 71B • Updated Jan 17, 2025 • 83 • 10

nvidia/AceMath-7B-RM

Text Generation • 7B • Updated Jan 17, 2025 • 110 • 7