Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

1,915

Base only

Active filters: nvidia

Aashraf995/Nemotron-Nano-9B

Text Generation • 9B • Updated Nov 3, 2025 • 10

Ex0bit/Qwen3-VLTO-32B-Instruct-NVFP4-256K

Text Generation • 17B • Updated Nov 4, 2025 • 178 • 1

jesusoctavioas/Llama-3_3-Nemotron-Super-49B-v1_5-mlx-4Bit

Text Generation • 50B • Updated Nov 8, 2025 • 39

jenerallee78/parakeet-tdt-1.1b-onnx

Automatic Speech Recognition • Updated Nov 10, 2025

nvidia/Llama-3_3-Nemotron-Super-49B-v1_5-NVFP4

Text Generation • 26B • Updated Nov 27, 2025 • 7.14k • 20

AIExxplorer/AIEXX_GENAI_IMAGE_TO_3D

Image-to-3D • Updated Nov 11, 2025 • 3

argmaxinc/parakeetkit-litert-pro

Automatic Speech Recognition • Updated 29 days ago • 35.2k

nvidia/NVIDIA-Nemotron-Parse-v1.1

Image-Text-to-Text • 1.0B • Updated May 7 • 611k • 171

nvidia/NVIDIA-Nemotron-Parse-v1.1-TC

Image-Text-to-Text • 1.0B • Updated Apr 15 • 510 • 13

HeshamHaroon/fintuned_PII

Token Classification • Updated Nov 19, 2025

nvidia/DeepSeek-V3.1-NVFP4

Text Generation • 394B • Updated Jan 13 • 10.5k • 15

AXONVERTEX-AI-RESEARCH/NVIDIA-Nemotron-Nano-9B-v2-Q8_0-GGUF

Text Generation • 9B • Updated Nov 26, 2025 • 104

introvoyz041/nvidia-llama-3.1-nemotron-nano-8b-v1-mlx-4bit-mlx-4Bit

Text Generation • 1B • Updated Nov 28, 2025 • 26

introvoyz041/NVIDIA-Nemotron-Nano-9B-v2-4bits-mlx-4Bit

Text Generation • 1B • Updated Nov 28, 2025 • 18

introvoyz041/Llama-3.1-Nemotron-Nano-4B-v1.1-4bit-mlx-4Bit

Text Generation • 0.7B • Updated Nov 30, 2025 • 4

wangjazz/parakeet-tdt-ja-coreml

Automatic Speech Recognition • Updated Dec 1, 2025 • 1

introvoyz041/AceReason-Nemotron-7B-8bit-mlx-8Bit

Text Generation • 8B • Updated Dec 2, 2025 • 3

introvoyz041/AceReason-Nemotron-14B-4bit-mlx-4Bit

Text Generation • 15B • Updated Dec 2, 2025 • 4

nvidia/Kimi-K2-Thinking-NVFP4

Text Generation • Updated Feb 10 • 29.4k • 29

introvoyz041/AceReason-Nemotron-1.1-7B-bf16-mlx-4Bit

Text Generation • 1B • Updated Dec 2, 2025 • 8

nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-Base-BF16

Text Generation • 32B • Updated Mar 15 • 86.4k • 128

nvidia/KVzap-linear-Qwen3-8B

Other • 1.18M • Updated Jan 21 • 21 • 2

nvidia/KVzap-mlp-Qwen3-8B

Other • 75.7M • Updated Jan 21 • 674k • 4

nvidia/KVzap-mlp-Qwen3-32B

Other • 0.2B • Updated Jan 21 • 11 • 6

nvidia/KVzap-linear-Qwen3-32B

Other • 2.62M • Updated Jan 21 • 6 • 4

nvidia/KVzap-linear-Llama-3.1-8B-Instruct

Other • 1.05M • Updated Jan 21 • 12 • 1

nvidia/KVzap-mlp-Llama-3.1-8B-Instruct

Other • 67.3M • Updated Jan 21 • 141k • 4

nvidia/Qwen3-Nemotron-235B-A22B-GenRM

Text Generation • 235B • Updated Dec 15, 2025 • 90 • 31

sainikhiljuluri2015/NVIDIA-Orchestrator-Cybersecurity-8B-Merged

Text Generation • 8B • Updated Dec 5, 2025 • 9 • 1

nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8

Text Generation • 32B • Updated Mar 15 • 350k • 353