Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

2,720

Base only

Active filters: 2-bit

garrison/GLM-4.5-Air-Derestricted-mlx-2Bit

Text Generation • 107B • Updated Nov 25, 2025 • 21

garrison/Olmo-3-32B-Think-mlx-2Bit

Text Generation • 32B • Updated Nov 26, 2025 • 9

ncls-p/INTELLECT-3-mlx-2Bit

Text Generation • 107B • Updated Nov 27, 2025 • 4 • 1

MaziyarPanahi/Olmo-3-32B-Think-GGUF

Text Generation • 32B • Updated Nov 28, 2025 • 98

MaziyarPanahi/Olmo-3-7B-Think-GGUF

Text Generation • 7B • Updated Nov 28, 2025 • 53

MaziyarPanahi/Olmo-3-7B-Instruct-GGUF

Text Generation • 7B • Updated Nov 28, 2025 • 30

MaziyarPanahi/gpt-oss-20b-Derestricted-GGUF

Text Generation • 21B • Updated Nov 28, 2025 • 37.9k • 2

MaziyarPanahi/NVIDIA-Nemotron-Nano-9B-v2-GGUF

Text Generation • 9B • Updated Nov 29, 2025 • 207 • 3

introvoyz041/Apriel-1.5-15b-Thinker-2bit-MLX-mlx-4Bit

Image-Text-to-Text • 1B • Updated Nov 30, 2025 • 2

MaziyarPanahi/Ministral-3-3B-Reasoning-2512-GGUF

Text Generation • 3B • Updated Dec 2, 2025 • 76.4k • 4

MaziyarPanahi/Ministral-3-8B-Reasoning-2512-GGUF

Text Generation • 8B • Updated Dec 2, 2025 • 124 • 2

MaziyarPanahi/Ministral-3-14B-Reasoning-2512-GGUF

Text Generation • 14B • Updated Dec 2, 2025 • 38.4k • 1

MaziyarPanahi/Trinity-Nano-Preview-GGUF

Text Generation • 6B • Updated Dec 2, 2025 • 135 • 1

MaziyarPanahi/Trinity-Mini-GGUF

Text Generation • 26B • Updated Dec 16, 2025 • 36.6k • 1

MaziyarPanahi/Nemotron-Orchestrator-8B-GGUF

Text Generation • 8B • Updated Dec 6, 2025 • 36.8k • 6

MaziyarPanahi/Hermes-4.3-36B-GGUF

Text Generation • 36B • Updated Dec 6, 2025 • 317 • 3

apthebest01931/rnj-1-instruct-mlx-2Bit

0.8B • Updated Dec 7, 2025 • 6

jesusoctavioas/Olmo-3-1125-32B-mlx-2Bit

Text Generation • 32B • Updated Dec 8, 2025 • 6

MaziyarPanahi/GLM-4.6V-Flash-GGUF

Text Generation • 9B • Updated Dec 8, 2025 • 39.6k • 6

PKU-DS-LAB/Fairy2i-W2

Text Generation • 7B • Updated Dec 12, 2025 • 96 • • 2

shubhamg2208/tomoro-ai-colqwen3-embed-4b-auto-round-w2a16

1B • Updated Dec 9, 2025 • 2

shubhamg2208/tomoro-ai-colqwen3-embed-4b-auto-round-w2a16g32

1B • Updated Dec 9, 2025 • 1

Matt300209/autoround_test

1B • Updated Dec 9, 2025 • 1

mradermacher/Fairy2i-W2-GGUF

Text Generation • 7B • Updated Dec 10, 2025 • 175

mradermacher/Fairy2i-W2-i1-GGUF

Text Generation • 7B • Updated Dec 10, 2025 • 54

alexgusevski/Ministral-3-3B-Instruct-2512-q2-mlx

Text Generation • 0.3B • Updated Dec 11, 2025 • 6

alexgusevski/Ministral-3-3B-Reasoning-2512-q2-mlx

Text Generation • 0.3B • Updated Dec 11, 2025 • 2

alexgusevski/Ministral-3-8B-Instruct-2512-q2-mlx

Text Generation • 0.8B • Updated Dec 11, 2025 • 9

alexgusevski/Ministral-3-8B-Reasoning-2512-q2-mlx

Text Generation • 0.8B • Updated Dec 11, 2025 • 6

alexgusevski/Lightning-1.7B-q2-mlx

Text Generation • 0.2B • Updated Dec 11, 2025 • 1