Hugging Face model listing (active filter: 2-bit). One model per line: repo ID • task tag (where shown) • rounded parameter count • listing counts as displayed.

DanyDA/Kevin-32B-exl3-2.0bpw • 5B • 1
BitDistiller/Llama-3.1-8B-Instruct-w2g64-gptq • 8B • 1
kaitchup/Qwen3-30B-A3B-autoround-2bit-gptq • 31B • 3
DanyDA/AM-Thinking-v1-exl3-2.0bpw • Text Generation • 5B • 1
BitDistiller/Qwen-8B-w2g64-gptq • 8B • 40
Erland/softpick-1.8B-4096-model-GPTQ-2bit • Text Generation • 2B
Erland/vanilla-1.8B-4096-model-GPTQ-2bit • Text Generation • 2B
tvpavan/sarvam-m-mlx-2Bit • Text Generation • 2B • 2
Fang77777/Llama-3.2-3B-Instruct-2bit-exllamav2 • Text Generation
steampunque/Mistral-Small-3.1-24B-Instruct-2503-Hybrid-GGUF • 24B • 14 • 1
MaziyarPanahi/DeepSeek-R1-0528-Qwen3-8B-GGUF • Text Generation • 8B • 162k • 8
(repo ID not captured) • Text Generation • 1B • 4
mlx-community-staging/gemma-3-1b-it-mlx-4Bit-dynamic • Text Generation • 3
mlx-community-staging/gemma-3-1b-it-mlx-6Bit-dynamic • Text Generation • 4
MetaphoricalCode/gemma3-27b-abliterated-dpo-exl3-2bpw-hb6 • Image-Text-to-Text • 6B
PepitaxX/qwen3-0.6b-gptq_2bit • Text Generation • 0.6B • 2
PepitaxX/qwen3-0.6B-openQA_prefinetune_deepseek210k_2bit • Text Generation • 0.6B • 4
TheS3b/Qwen3-EfficientQAT-w2g64 • 0.6B
irish-quant/01-ai-Yi-1.5-6B-Chat-2bit • 6B
irish-quant/01-ai-Yi-1.5-6B-2bit • 6B
irish-quant/01-ai-Yi-1.5-9B-Chat-2bit • 9B
irish-quant/01-ai-Yi-1.5-9B-2bit • 9B
irish-quant/HuggingFaceTB-SmolLM-1.7B-Instruct-2bit • 2B
irish-quant/HuggingFaceTB-SmolLM-1.7B-2bit • 2B
irish-quant/HuggingFaceTB-SmolLM-135M-2bit • 0.1B
irish-quant/HuggingFaceTB-SmolLM-360M-Instruct-2bit • 0.4B
irish-quant/HuggingFaceTB-SmolLM-360M-2bit • 0.4B • 1
irish-quant/meta-llama-Llama-3.1-8B-Instruct-2bit • 8B
irish-quant/meta-llama-Llama-3.1-8B-2bit • 8B
irish-quant/meta-llama-Llama-3.2-1B-Instruct-2bit • 1B