Active filters: vLLM
EliovpAI/Qwen3-14B-FP8-KV
Text Generation
• 15B • Updated
• 4
• 2
Image-Text-to-Text
• 17B • Updated
• 680
• 19
QuantTrio/Seed-OSS-36B-Instruct-AWQ
Text Generation
• 36B • Updated
• 449
• 8
QuantTrio/Seed-OSS-36B-Instruct-GPTQ-Int8
Text Generation
• 36B • Updated
• 129
• 4
QuantTrio/Seed-OSS-36B-Instruct-GPTQ-Int4
Text Generation
• 36B • Updated
• 69
• 5
QuantTrio/Seed-OSS-36B-Instruct-GPTQ-Int3
Text Generation
• 34B • Updated
• 5
• 3
amakhov/tiny-random-llama
Text Generation
• 4.18M • Updated
• 5
Text Generation
• 41B • Updated
• 3
• 2
QuantTrio/DeepSeek-V3.1-AWQ
Text Generation
• 485B • Updated
• 487
• 5
QuantTrio/DeepSeek-V3.1-AWQ-Fp16Mix
Text Generation
• 286B • Updated
• 3
• 1
QuantTrio/DeepSeek-V3.1-AWQ-Lite
Text Generation
• 684B • Updated
• 8
• 3
JunHowie/Qwen3-4B-Instruct-2507-GPTQ-Int8
Text Generation
• 4B • Updated
• 70
JunHowie/Qwen3-4B-Thinking-2507-GPTQ-Int4
Text Generation
• 4B • Updated
• 1.32k
• 1
JunHowie/Qwen3-4B-Thinking-2507-GPTQ-Int8
Text Generation
• 4B • Updated
• 550
• 2
JunHowie/Qwen3-30B-A3B-Instruct-2507-GPTQ-Int4
Text Generation
• 31B • Updated
• 1.66k
JunHowie/Qwen3-30B-A3B-Instruct-2507-GPTQ-Int8
Text Generation
• 31B • Updated
• 3
JunHowie/Qwen3-30B-A3B-Thinking-2507-GPTQ-Int4
Text Generation
• 31B • Updated
• 217
JunHowie/Qwen2-7B-Instruct-GPTQ-Int4
Text Generation
• 8B • Updated
• 2
JunHowie/Qwen2-7B-Instruct-GPTQ-Int8
Text Generation
• 8B • Updated
• 4
EliovpAI/Deepseek-R1-0528-Qwen3-8B-FP8-KV
Text Generation
• 8B • Updated
• 2
JunHowie/Qwen3-30B-A3B-Thinking-2507-GPTQ-Int8
Text Generation
• 31B • Updated
• 1
JunHowie/Seed-OSS-36B-Instruct-GPTQ-Int4
Text Generation
• 36B • Updated
• 2
JunHowie/Seed-OSS-36B-Instruct-GPTQ-Int8
Text Generation
• 36B • Updated
• 2
QuantTrio/Qwen3-VL-235B-A22B-Instruct-AWQ
Text Generation
• 236B • Updated
• 5.94k
• 13
QuantTrio/Qwen3-VL-235B-A22B-Instruct-FP8
Text Generation
• Updated
• 105
QuantTrio/Qwen3-VL-235B-A22B-Thinking-AWQ
Text Generation
• 236B • Updated
• 1.49k
• 8
QuantTrio/Qwen3-VL-235B-A22B-Thinking-FP8
Text Generation
• 236B • Updated
• 29
QuantTrio/DeepSeek-V3.2-Exp-AWQ
Text Generation
• 486B • Updated
• 110
• 4
QuantTrio/DeepSeek-V3.2-Exp-AWQ-Lite
Text Generation
• 685B • Updated
• 28
• 4
Text Generation
• 50B • Updated
• 8.6k
• 5