Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

1,899

Base only

Active filters: nvidia

KnutJaegersberg/Llama3-ChatQA-2-70B-4.0bpw-exl2

Text Generation • Updated Sep 13, 2024 • 3

nvidia/Llama-3_1-Nemotron-51B-Instruct

Text Generation • 52B • Updated Jul 6, 2025 • 834 • 210

lmstudio-community/Llama-3_1-Nemotron-51B-Instruct-GGUF

Text Generation • 52B • Updated Jan 6, 2025 • 121 • 1

bartowski/Llama-3_1-Nemotron-51B-Instruct-GGUF

Text Generation • 52B • Updated Jan 1, 2025 • 1.86k • 5

nvidia/Llama-3.1-Nemotron-70B-Reward

Updated Apr 13, 2025 • 21 • 82

nvidia/Llama-3.1-Nemotron-70B-Reward-HF

71B • Updated Apr 13, 2025 • 98 • 93

nvidia/NVLM-D-72B

Image-Text-to-Text • 79B • Updated Jan 14, 2025 • 158k • 776

nvidia/OpenMath2-Llama3.1-8B

Text Generation • 8B • Updated Nov 25, 2024 • 602 • • 33

nvidia/OpenMath2-Llama3.1-70B

Text Generation • 71B • Updated Nov 25, 2024 • 248 • 22

nvidia/OpenMath2-Llama3.1-8B-nemo

Updated Nov 25, 2024 • 8

nvidia/OpenMath2-Llama3.1-70B-nemo

Updated Nov 25, 2024 • 10

mlx-community/nvidia-Llama-3.1-Nemotron-70B-Reward-HF-AQ41

Updated Oct 2, 2024 • 19

zsolx2/OpenMath2-Llama3.1-8B-Q4_0-GGUF

8B • Updated Oct 4, 2024 • 1

QuantFactory/OpenMath2-Llama3.1-8B-GGUF

8B • Updated Oct 5, 2024 • 104 • 2

QuantFactory/Llama3-ChatQA-2-8B-GGUF

Text Generation • 8B • Updated Oct 5, 2024 • 136 • 3

mradermacher/OpenMath2-Llama3.1-8B-GGUF

8B • Updated Oct 11, 2024 • 226 • 1

mradermacher/OpenMath2-Llama3.1-70B-GGUF

71B • Updated Aug 1, 2025 • 87

mradermacher/OpenMath2-Llama3.1-8B-i1-GGUF

8B • Updated Oct 11, 2024 • 192

mradermacher/OpenMath2-Llama3.1-70B-i1-GGUF

71B • Updated Oct 11, 2024 • 77

alvarobartt/NVLM-D-72B-IE-compatible

Image-Text-to-Text • 79B • Updated Nov 19, 2024 • 7

quartzermz/BroGANv2.0.0

Updated Oct 12, 2024 • 4

nvidia/Llama-3.1-Nemotron-70B-Instruct

Updated Apr 13, 2025 • 108 • 568

MarsupialAI/Llama-3.1-Nemotron-70B-Instruct_iMat_GGUF

Text Generation • 71B • Updated Oct 16, 2024 • 360 • 2

bartowski/Llama-3.1-Nemotron-70B-Instruct-HF-GGUF

Text Generation • 71B • Updated Oct 16, 2024 • 3.52k • 105

lmstudio-community/Llama-3.1-Nemotron-70B-Instruct-HF-GGUF

Text Generation • 71B • Updated Oct 15, 2024 • 3.92k • 38

poisson-fish/Llama-3.1-Nemotron-70B-Instruct-GGUF

71B • Updated Oct 16, 2024 • 164

bigstorm/Llama-3.1-Nemotron-70B-Instruct-HF-8.0bpw-8hb-exl2

Text Generation • Updated Oct 16, 2024 • 7 • 3

bigstorm/Llama-3.1-Nemotron-70B-Instruct-HF-7.0bpw-8hb-exl2

Text Generation • Updated Oct 16, 2024 • 4 • 1

mlx-community/nvidia_Llama-3.1-Nemotron-70B-Instruct-HF_4bit

Text Generation • Updated Oct 16, 2024 • 940 • 12

bigstorm/Llama-3.1-Nemotron-70B-Instruct-HF-6.0bpw-8hb-exl2

Text Generation • Updated Oct 16, 2024 • 1