Active filters: vLLM
JunHowie/Qwen3-1.7B-GPTQ-Int8 • Text Generation • 2B • Updated • 10
JunHowie/Qwen3-32B-GPTQ-Int4 • Text Generation • 33B • Updated • 27.9k • 4
JunHowie/Qwen3-32B-GPTQ-Int8 • Text Generation • 33B • Updated • 253 • 4
JunHowie/Qwen3-30B-A3B-GPTQ-Int4 • Text Generation • 5B • Updated • 47 • 1
JunHowie/Qwen3-14B-GPTQ-Int8 • Text Generation • 15B • Updated • 127 • 1
JunHowie/Qwen3-14B-GPTQ-Int4 • Text Generation • 15B • Updated • 4.57k • 4
JunHowie/Qwen3-8B-GPTQ-Int8 • Text Generation • 8B • Updated • 232
JunHowie/Qwen3-8B-GPTQ-Int4 • Text Generation • 8B • Updated • 770 • 4
JunHowie/Qwen3-4B-GPTQ-Int4 • Text Generation • 4B • Updated • 758 • 1
JunHowie/Qwen3-4B-GPTQ-Int8 • Text Generation • 4B • Updated • 131
JunHowie/Qwen3-30B-A3B-GPTQ-Int8 • Text Generation • 8B • Updated • 447
QuantTrio/Qwen3-235B-A22B-GPTQ-Int8 • Text Generation • 235B • Updated • 3
BeastyZ/Qwen2.5-3B-ConvSearch-R1-TopiOCQA • 3B • Updated • 10
QuantTrio/DeepSeek-R1-0528-Qwen3-8B-GPTQ-Int4-Int8Mix • Text Generation • 11B • Updated • 8 • 4
QuantTrio/DeepSeek-R1-0528-GPTQ-Int4-Int8Mix-Lite • Text Generation • 721B • Updated • 92 • 2
QuantTrio/DeepSeek-R1-0528-GPTQ-Int4-Int8Mix-Compact • Text Generation • 847B • Updated • 11 • 5
QuantTrio/DeepSeek-R1-0528-GPTQ-Int4-Int8Mix-Medium • Text Generation • 912B • Updated • 30 • 1
brandonbeiler/InternVL3-38B-FP8-Dynamic • Image-Text-to-Text • 38B • Updated • 9
brandonbeiler/InternVL3-78B-FP8-Dynamic • Image-Text-to-Text • 78B • Updated • 138
brandonbeiler/InternVL3-8B-FP8-Dynamic • Image-Text-to-Text • 8B • Updated • 24 • 2
dengcao/GLM-4.1V-9B-Thinking-GPTQ-Int4-Int8Mix • Image-Text-to-Text • 15B • Updated • 13 • 2
dengcao/GLM-4.1V-9B-Thinking-AWQ • Image-Text-to-Text • 10B • Updated • 236k • 1
brandonbeiler/Skywork-R1V3-38B-FP8-Dynamic • Image-Text-to-Text • 38B • Updated • 16 • 2
koushd/Qwen3-235B-A22B-Instruct-2507-AWQ • Text Generation • 235B • Updated • 1.6k • 4
QuantTrio/Qwen3-235B-A22B-Instruct-2507-GPTQ-Int4-Int8Mix • Text Generation • 248B • Updated • 80 • 4
QuantTrio/Qwen3-235B-A22B-Instruct-2507-AWQ • Text Generation • 235B • Updated • 3.78k • 10
QuantTrio/GLM-4.1V-9B-Thinking-GPTQ-Int4-Int8Mix • Text Generation • 15B • Updated • 10 • 1
QuantTrio/GLM-4.1V-9B-Thinking-AWQ • Text Generation • 10B • Updated • 481
QuantTrio/Qwen3-Coder-480B-A35B-Instruct-AWQ • Text Generation • 480B • Updated • 1.66k • 8
QuantTrio/Qwen3-Coder-480B-A35B-Instruct-GPTQ-Int4-Int8Mix • Text Generation • 534B • Updated • 183 • 7