Inference Providers
Active filters: GPTQ
JunHowie/Qwen3-1.7B-GPTQ-Int4
Text Generation
• 2B • Updated • 128
• 1
JunHowie/Qwen3-1.7B-GPTQ-Int8
Text Generation
• 2B • Updated • 10
JunHowie/Qwen3-32B-GPTQ-Int4
Text Generation
• 33B • Updated • 28.3k
• 4
JunHowie/Qwen3-32B-GPTQ-Int8
Text Generation
• 33B • Updated • 228
• 4
JunHowie/Qwen3-30B-A3B-GPTQ-Int4
Text Generation
• 5B • Updated • 49
• 1
JunHowie/Qwen3-14B-GPTQ-Int8
Text Generation
• 15B • Updated • 117
• 1
JunHowie/Qwen3-14B-GPTQ-Int4
Text Generation
• 15B • Updated • 2.35k
• 4
JunHowie/Qwen3-8B-GPTQ-Int8
Text Generation
• 8B • Updated • 242
JunHowie/Qwen3-8B-GPTQ-Int4
Text Generation
• 8B • Updated • 811
• 4
JunHowie/Qwen3-4B-GPTQ-Int4
Text Generation
• 4B • Updated • 710
• 1
JunHowie/Qwen3-4B-GPTQ-Int8
Text Generation
• 4B • Updated • 139
JunHowie/Qwen3-30B-A3B-GPTQ-Int8
Text Generation
• 8B • Updated • 460
iqbalamo93/Phi-4-mini-instruct-GPTQ-4bit
Text Generation
• 4B • Updated • 534
iqbalamo93/Phi-4-mini-instruct-GPTQ-8bit
Text Generation
• 4B • Updated • 17
• 2
GusPuffy/Legion-V2.1-LLaMa-70B-GPTQ
Text Generation
• 11B • Updated • 2
QuantTrio/DeepSeek-R1-0528-Qwen3-8B-GPTQ-Int4-Int8Mix
Text Generation
• 11B • Updated • 6
• 4
RedHatAI/DeepSeek-R1-0528-quantized.w4a16
Text Generation
• 104B • Updated • 847
• 13
QuantTrio/DeepSeek-R1-0528-GPTQ-Int4-Int8Mix-Lite
Text Generation
• 721B • Updated • 56
• 2
QuantTrio/DeepSeek-R1-0528-GPTQ-Int4-Int8Mix-Compact
Text Generation
• 847B • Updated • 13
• 5
AXERA-TECH/Qwen2.5-0.5B-Instruct-CTX-Int8
QuantTrio/DeepSeek-R1-0528-GPTQ-Int4-Int8Mix-Medium
Text Generation
• 912B • Updated • 38
• 1
kxdw2580/DeepSeek-R1-0528-Qwen3-8B-catgirl-v2.5-gptqv2-8bit
Text Generation
• 8B • Updated • 9
kxdw2580/DeepSeek-R1-0528-Qwen3-8B-catgirl-v2.5-gptqv2-4bit
Text Generation
• 8B • Updated • 10
dengcao/GLM-4.1V-9B-Thinking-GPTQ-Int4-Int8Mix
Image-Text-to-Text
• 15B • Updated • 14
• 2
RedHatAI/Kimi-K2-Instruct-quantized.w4a16
Text Generation
• 1T • Updated • 636
• 12
GusPuffy/BlackSheep-24B-GPTQ
Text Generation
• 4B • Updated • 10
QuantTrio/Qwen3-235B-A22B-Instruct-2507-GPTQ-Int4-Int8Mix
Text Generation
• 248B • Updated • 77
• 4
QuantTrio/GLM-4.1V-9B-Thinking-GPTQ-Int4-Int8Mix
Text Generation
• 15B • Updated • 11
• 1
QuantTrio/Qwen3-Coder-480B-A35B-Instruct-GPTQ-Int4-Int8Mix
Text Generation
• 534B • Updated • 604
• 7
QuantTrio/Qwen3-235B-A22B-Thinking-2507-GPTQ-Int4-Int8Mix
Text Generation
• 253B • Updated • 12
• 4