Inference Providers
Active filters: awq
QuantTrio/Qwen3.5-122B-A10B-AWQ
Image-Text-to-Text
• 125B • Updated • 39.5k
• 21
QuantTrio/Qwen3.5-27B-AWQ
Image-Text-to-Text
• 28B • Updated • 241k
• 24
Image-Text-to-Text
• 10B • Updated • 144k
• 7
cybermotaz/nemotron3-nano-nvfp4-w4a16
Text Generation
• 18B • Updated • 15.2k
• 14
bullpoint/Qwen3-Coder-Next-AWQ-4bit
Text Generation
• 14B • Updated • 1.1M
• 19
Qwen/Qwen2.5-7B-Instruct-AWQ
Text Generation
• Updated • 677k
• 39
mratsim/MiniMax-M2.5-BF16-INT4-AWQ
Text Generation
• 39B • Updated • 33.6k
• 36
mratsim/MiniMax-M2.5-FP8-INT4-AWQ
Text Generation
• 39B • Updated • 10.3k
• 16
TheBloke/Llama-2-70B-Chat-AWQ
Text Generation
• 69B • Updated • 2.2k
• 24
Text Generation
• 34B • Updated • 38
• 14
hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4
Text Generation
• Updated • 195k
• 89
hugging-quants/Meta-Llama-3.1-70B-Instruct-AWQ-INT4
Text Generation
• Updated • 105k
• 108
Qwen/Qwen2.5-14B-Instruct-AWQ
Text Generation
• 15B • Updated • 1.7M
• 28
casperhansen/deepseek-r1-distill-llama-70b-awq
Updated • 69.7k
• 15
stelterlab/Mistral-Small-24B-Instruct-2501-AWQ
Text Generation
• 24B • Updated • 442k
• 27
gaunernst/gemma-3-27b-it-int4-awq
Image-Text-to-Text
• 6B • Updated • 22.4k
• 39
Orion-zhen/Qwen3-1.7B-AWQ
2B • Updated • 962
• 2
Text Generation
• 15B • Updated • 328k
• 60
QuantTrio/Qwen3-VL-30B-A3B-Instruct-AWQ
Text Generation
• 31B • Updated • 418k
• 41
QuantTrio/Qwen3-VL-32B-Instruct-AWQ
Image-Text-to-Text
• 33B • Updated • 101k
• 12
QuantTrio/MiniMax-M2.5-AWQ
Text Generation
• 229B • Updated • 90.7k
• 11
tokyotech-llm/Qwen3-Swallow-32B-RL-v0.2-AWQ-INT4
Text Generation
• 33B • Updated • 929
• 2
QuantTrio/Qwen3.5-35B-A3B-AWQ
Image-Text-to-Text
• 36B • Updated • 129k
• 11
HugJerry99/SKT-AX-4.0-Light-AWQ
7B • Updated • 373
• 1
Brooooooklyn/qwen3.5-9B-unsloth-mlx
Text Generation
• 4B • Updated • 773
• 1
Brooooooklyn/qwen3.5-35B-A3B-unsloth-mlx
Text Generation
• 6B • Updated • 178
• 1
casperhansen/mpt-7b-8k-chat-awq
Text Generation
• Updated • 10
• 3
casperhansen/falcon-7b-awq
Text Generation
• Updated • 6
• 1
casperhansen/vicuna-7b-v1.5-awq
Text Generation
• Updated • 10
• 3
casperhansen/vicuna-7b-v1.5-awq-gemv
Text Generation
• Updated • 9
• 1