Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

1,915

Base only

Active filters: nvidia

mradermacher/NVIDIA-Orchestrator-Cybersecurity-8B-Merged-GGUF

Text Generation • 8B • Updated Dec 6, 2025 • 339 • 2

nvidia/Nemotron-Cascade-8B-Thinking

Text Generation • 8B • Updated Jan 1 • 400 • • 41

nvidia/Nemotron-Cascade-14B-Thinking

Text Generation • 15B • Updated Jan 1 • 1.43k • • 79

unsloth/Nemotron-3-Nano-30B-A3B

Text Generation • 32B • Updated Mar 26 • 5.24k • 14

nvidia/gpt-oss-120b-Eagle3-throughput

Text Generation • 0.8B • Updated Jan 26 • 1.2k • 35

nvidia/Qwen3-Next-80B-A3B-Instruct-NVFP4

Text Generation • Updated Feb 9 • 46.6k • 41

introvoyz041/OpenMath-Nemotron-14B-Kaggle-mlx-4Bit

Text Generation • 15B • Updated Dec 12, 2025 • 2

introvoyz041/OpenMath-Mistral-7B-v0.1-hf-mlx-4Bit

1B • Updated Dec 12, 2025 • 3

introvoyz041/OpenMath2-Llama3.1-8B-mlx-4Bit

Text Generation • 1B • Updated Dec 12, 2025 • 12

introvoyz041/OpenMath-Nemotron-7B-mlx-4Bit

Text Generation • 1B • Updated Dec 12, 2025 • 2

introvoyz041/OpenMath-Nemotron-32B-mlx-4Bit

Text Generation • 33B • Updated Dec 12, 2025 • 4

introvoyz041/OpenMath-Nemotron-14B-mlx-4Bit

Text Generation • 15B • Updated Dec 12, 2025 • 3

unsloth/Nemotron-3-Nano-30B-A3B-Base

Text Generation • 32B • Updated Mar 26 • 476 • 4

unsloth/Nemotron-3-Nano-30B-A3B-FP8

Text Generation • 32B • Updated Mar 26 • 226 • 7

FriendliAI/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16

Text Generation • 32B • Updated Dec 15, 2025 • 90

FriendliAI/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8

Text Generation • 32B • Updated Dec 15, 2025 • 3

nvidia/Qwen3-235B-A22B-Thinking-2507-FP4-Eagle3

Text Generation • 0.9B • Updated Mar 10 • 51 • 1

mlx-community/NVIDIA-Nemotron-3-Nano-30B-A3B-4bit

Text Generation • Updated Dec 31, 2025 • 806 • 4

ExaltedSlayer/nvidia-nemotron-3-nano-30b-a3b-mlx-mxfp4

Text Generation • 32B • Updated Dec 16, 2025 • 128 • 1

nvidia/Nemotron-Cascade-8B

Text Generation • 8B • Updated Jan 1 • 634 • • 67

bartowski/nvidia_Nemotron-3-Nano-30B-A3B-GGUF

Text Generation • 32B • Updated Dec 19, 2025 • 2.1k • 10

bartowski/nvidia_Nemotron-Cascade-8B-GGUF

Text Generation • 8B • Updated Dec 16, 2025 • 1.21k • 4

bartowski/nvidia_Nemotron-Cascade-8B-Thinking-GGUF

Text Generation • 8B • Updated Dec 16, 2025 • 476 • 2

bartowski/nvidia_Nemotron-Cascade-14B-Thinking-GGUF

Text Generation • 15B • Updated Dec 16, 2025 • 1.72k • 8

lmstudio-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-4bit

Text Generation • 32B • Updated Dec 16, 2025 • 57.6k • 2

lmstudio-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-5bit

Text Generation • 32B • Updated Dec 16, 2025 • 50.8k

lmstudio-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-6bit

Text Generation • 32B • Updated Dec 16, 2025 • 50.8k

lmstudio-community/NVIDIA-Nemotron-3-Nano-30B-A3B-MLX-8bit

Text Generation • 32B • Updated Dec 16, 2025 • 51.6k • 3

moxin-org/Nemotron-3-Nano-30B-A3B-GGUF

Text Generation • 32B • Updated Dec 17, 2025 • 25 • 3

mradermacher/Qwen3-Nemotron-235B-A22B-GenRM-GGUF

235B • Updated Dec 20, 2025 • 127