Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

7,941

Base only

Active filters: gptq

AngelSlim/Hy3-GPTQ-Int4

Text Generation • 295B • Updated 6 days ago • 432 • 6

palmfuture/Qwen3.6-35B-A3B-GPTQ-Int4

Image-Text-to-Text • 36B • Updated 15 days ago • 129k • 28

canada-quant/GLM-5.2-W4A16-MTP

Text Generation • 116B • Updated 18 days ago • 8.79k • 18

compute1/Agents-A1-GPTQ-INT4-Sym

Text Generation • 7B • Updated 9 days ago • 471 • 2

canada-quant/hy3-w4a16-mtp

Text Generation • 47B • Updated 3 days ago • 534 • 2

Qwen/Qwen2-1.5B-Instruct-GPTQ-Int4

Text Generation • 2B • Updated Aug 21, 2024 • 36.6k • 6

tencent/Hunyuan-A13B-Instruct-GPTQ-Int4

Text Generation • 80B • Updated Jul 11, 2025 • 255 • 53

Qwen/Qwen3.5-122B-A10B-GPTQ-Int4

Image-Text-to-Text • 125B • Updated Apr 24 • 246k • 44

Qwen/Qwen3.5-27B-GPTQ-Int4

Image-Text-to-Text • 28B • Updated Apr 24 • 53.8k • 57

Qwen/Qwen3.5-35B-A3B-GPTQ-Int4

Image-Text-to-Text • 36B • Updated Apr 24 • 706k • 93

canada-quant/DeepSeek-V4-Flash-W4A16-FP8

Text Generation • 44B • Updated May 25 • 807 • 17

LordNeel/DeepSeek-V4-Flash-Acti-MTP-W4A16-FP8

Text Generation • 44B • Updated 11 days ago • 3.89k • 16

canada-quant/DeepSeek-V4-Flash-W4A16-FP8-MTP

Text Generation • 51B • Updated May 30 • 3.67k • 19

TheStageAI/gemma-4-E2B-it

Image-Text-to-Text • Updated Jun 2 • 153 • 5

TheStageAI/gemma-4-E4B-it

Image-Text-to-Text • Updated Jun 2 • 120 • 10

XReyRobert/Ornith-1.0-35B-GPTQ-Pro-FOEM-4bit-g128-ns256

Text Generation • 35B • Updated 18 days ago • 5.98k • 3

elinas/alpaca-13b-lora-int4

Text Generation • Updated Apr 5, 2023 • 19 • 40

elinas/alpaca-30b-lora-int4

Text Generation • Updated Apr 5, 2023 • 16 • 68

mayaeary/pygmalion-6b-4bit-128g

Text Generation • Updated Mar 28, 2023 • 12 • 40

mayaeary/pygmalion-6b_dev-4bit-128g

Text Generation • Updated Mar 28, 2023 • 11 • 121

mayaeary/PPO_Pygway-V8p4_Dev-6b-4bit-128g

Text Generation • Updated Mar 31, 2023 • 6 • 2

mayaeary/PPO_Pygway-6b-Mix-4bit-128g

Text Generation • Updated Mar 31, 2023 • 3 • 2

elinas/vicuna-13b-4bit

Text Generation • Updated Apr 5, 2023 • 11 • 45

TheBloke/koala-7B-GPTQ

Text Generation • 7B • Updated Aug 21, 2023 • 57 • 31

TheBloke/koala-7B-HF

Text Generation • Updated Jun 5, 2023 • 131 • • 20

TheBloke/koala-13B-HF

Text Generation • Updated Jun 5, 2023 • 158 • 40

TheBloke/koala-13B-GPTQ

Text Generation • 13B • Updated Aug 21, 2023 • 48 • 38

TheBloke/galpaca-30B-GPTQ

Text Generation • Updated Aug 21, 2023 • 11 • 48

Ancestral/Dolly_Shygmalion-6b-4bit-128g

Text Generation • Updated Apr 9, 2023 • 6 • 5

Ancestral/PPO_Shygmalion-6b-4bit-128g

Text Generation • Updated Apr 9, 2023 • 3