Models

1,917

Active filters: nvidia

nvidia/Nemotron-H-8B-Reasoning-128K-FP8

Text Generation • 8B • Updated Aug 21, 2025 • 96 • 13

nvidia/Cosmos-Predict2-14B-Sample-GR00T-Dreams-GR1

Updated Jun 17, 2025 • 40 • 6

nvidia/Cosmos-Predict2-14B-Sample-GR00T-Dreams-DROID

Updated Jun 17, 2025 • 23 • 3

nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1-mcore

Image-Text-to-Text • Updated Jun 25, 2025 • 3

KalaiarasiS14/Llama-3.1-Nemotron-Nano-8B-v1-Q4_0-GGUF

Text Generation • 8B • Updated Jun 12, 2025 • 32 • 1

greenwich157/Llama-3.1-Minitron-4B-Width-Base-Q4_0-GGUF

Text Generation • 5B • Updated Jun 12, 2025 • 8

Jazco4/Llama-3.1-Nemotron-Nano-8B-v1-Q4_K_M-GGUF

Text Generation • 8B • Updated Jun 12, 2025 • 11

botirk/tiny-prompt-task-complexity-classifier

Text Classification • Updated Jun 12, 2025 • 35 • 3

nvidia/OpenCodeReasoning-Nemotron-1.1-14B

Text Generation • 15B • Updated Jul 8, 2025 • 79 • 13

nvidia/OpenCodeReasoning-Nemotron-1.1-32B

Text Generation • 33B • Updated Jul 8, 2025 • 103 • 49

nvidia/OpenCodeReasoning-Nemotron-1.1-7B

Text Generation • 8B • Updated Jul 8, 2025 • 80 • 13

nvidia/Cosmos-Predict2-2B-Sample-Action-Conditioned

Updated Jun 17, 2025 • 35 • 10

IntelligentEstate/Gambit-7B-Q4_K_M-GGUF

Text Generation • 8B • Updated Jun 12, 2025 • 7 • 2

city96/Cosmos-Predict2-14B-Text2Image-gguf

14B • Updated Jun 14, 2025 • 444 • 12

nvidia/AceReason-Nemotron-1.1-7B

Text Generation • 8B • Updated Jul 11, 2025 • 35.1k • 59

nvidia/NFT-7B

Text Generation • 8B • Updated Jul 15, 2025 • 54 • 3

nvidia/NFT-32B

Text Generation • 33B • Updated Jul 15, 2025 • 73 • 8

NikolayKozloff/AceReason-Nemotron-1.1-7B-Q8_0-GGUF

Text Generation • 8B • Updated Jun 17, 2025 • 6 • 1

mlx-community/AceReason-Nemotron-1.1-7B-bf16

Text Generation • Updated Jun 17, 2025 • 5

mlx-community/AceReason-Nemotron-1.1-7B-8bit

Text Generation • Updated Jun 17, 2025 • 7

mlx-community/AceReason-Nemotron-1.1-7B-4bit

Text Generation • Updated Jun 17, 2025 • 10

mradermacher/AceReason-Nemotron-1.1-7B-GGUF

8B • Updated Jul 11, 2025 • 92 • 1

bndp/AceReason-Nemotron-1.1-7B-Q4_K_M-GGUF

Text Generation • 8B • Updated Jun 17, 2025 • 5

mradermacher/AceReason-Nemotron-1.1-7B-i1-GGUF

8B • Updated Jul 11, 2025 • 221

gabriellarson/AceReason-Nemotron-1.1-7B-GGUF

Text Generation • 8B • Updated Jun 18, 2025 • 92 • 2

bartowski/nvidia_AceReason-Nemotron-1.1-7B-GGUF

Text Generation • 8B • Updated Jun 17, 2025 • 659 • 1

lmstudio-community/AceReason-Nemotron-1.1-7B-GGUF

Text Generation • 8B • Updated Jun 17, 2025 • 80 • 1

denizaybey/Cosmos-Reason1-7B

Image-Text-to-Text • 8B • Updated Jun 19, 2025 • 2

kmouratidis/Llama-3_3-Nemotron-Super-49B-v1-exl3-8bpw

Text Generation • 25B • Updated Jun 20, 2025 • 2

QuantFactory/AceReason-Nemotron-1.1-7B-GGUF

Text Generation • 8B • Updated Jun 21, 2025 • 99 • 2