-
-
-
-
-
-
Inference Providers
Active filters:
gptq
TheBloke/Wizard-Vicuna-30B-Uncensored-GPTQ
Text Generation
•
33B
•
Updated
•
119k
•
598
TheBloke/MythoMax-L2-13B-GPTQ
Text Generation
•
Updated
•
565
•
218
hugging-quants/Meta-Llama-3.1-8B-Instruct-GPTQ-INT4
Text Generation
•
8B
•
Updated
•
19.3k
•
41
Qwen/Qwen2.5-1.5B-Instruct-GPTQ-Int8
Text Generation
•
2B
•
Updated
•
386
•
6
fbaldassarri/meta-llama_Llama-3.2-11B-Vision-Instruct-OpenVino
Text Generation
•
Updated
•
6
•
1
AgeOfAlgorithms/Llasa-1b-GPTQ-4bit
Text Generation
•
1B
•
Updated
•
10
•
1
Qwen/Qwen3-30B-A3B-GPTQ-Int4
Text Generation
•
31B
•
Updated
•
234k
•
45
Qwen/Qwen3-1.7B-GPTQ-Int8
Text Generation
•
2B
•
Updated
•
3.52k
•
7
Qwen/Qwen3-0.6B-GPTQ-Int8
Text Generation
•
0.6B
•
Updated
•
3.66k
•
8
Qwen/Qwen3-235B-A22B-GPTQ-Int4
Text Generation
•
235B
•
Updated
•
5.58k
•
27
AngelSlim/Qwen3-32B_int4_gptq
33B
•
Updated
•
24.2k
•
1
QuantTrio/Qwen3-235B-A22B-Instruct-2507-GPTQ-Int4-Int8Mix
Text Generation
•
248B
•
Updated
•
451
•
3
QuantTrio/Qwen3-235B-A22B-Thinking-2507-GPTQ-Int4-Int8Mix
Text Generation
•
253B
•
Updated
•
20
•
3
thomasip/Qwen3-Omni-30B-A3B-Instruct-GPTQ-4bit
35B
•
Updated
•
752
•
2
tencent/HY-MT1.5-7B-GPTQ-Int4
Translation
•
8B
•
Updated
•
916
•
9
krishhx/Hymba-1.5B-Eigen-Hybrid-4bit
Jon-Nielsen/GLM-4.7-REAP-30-W4A16
Text Generation
•
2B
•
Updated
•
145
•
2
baichuan-inc/Baichuan-M3-235B-GPTQ-INT4
Text Generation
•
Updated
•
768
•
10
Ubuku/Qwen2.5-Math-72B-Instruct-GPTQ-Int4-TP2
73B
•
Updated
•
8
•
1
elinas/alpaca-13b-lora-int4
Text Generation
•
Updated
•
14
•
41
elinas/alpaca-30b-lora-int4
Text Generation
•
Updated
•
19
•
68
mayaeary/pygmalion-6b-4bit-128g
Text Generation
•
Updated
•
9
•
40
mayaeary/pygmalion-6b_dev-4bit-128g
Text Generation
•
Updated
•
9
•
121
mayaeary/PPO_Pygway-V8p4_Dev-6b-4bit-128g
Text Generation
•
Updated
•
3
•
2
mayaeary/PPO_Pygway-6b-Mix-4bit-128g
Text Generation
•
Updated
•
3
•
2
Text Generation
•
Updated
•
7
•
45
Text Generation
•
7B
•
Updated
•
40
•
31
Text Generation
•
Updated
•
1.25k
•
21
Text Generation
•
Updated
•
1.26k
•
41
Text Generation
•
13B
•
Updated
•
14
•
38