Active filters: awq
QuantTrio/Qwen3.5-122B-A10B-AWQ • Image-Text-to-Text • 125B • Updated • 39.5k • 21
QuantTrio/Qwen3.5-27B-AWQ • Image-Text-to-Text • 28B • Updated • 241k • 24
Image-Text-to-Text • 10B • Updated • 144k • 7
cybermotaz/nemotron3-nano-nvfp4-w4a16 • Text Generation • 18B • Updated • 15.3k • 14
bullpoint/Qwen3-Coder-Next-AWQ-4bit • Text Generation • 14B • Updated • 1M • 19
Qwen/Qwen2.5-7B-Instruct-AWQ • Text Generation • Updated • 690k • 39
mratsim/MiniMax-M2.5-BF16-INT4-AWQ • Text Generation • 39B • Updated • 19.9k • 36
TheBloke/Llama-2-70B-Chat-AWQ • Text Generation • 69B • Updated • 2.13k • 24
Text Generation • 34B • Updated • 39 • 14
hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4 • Text Generation • Updated • 186k • 89
Qwen/Qwen2.5-14B-Instruct-AWQ • Text Generation • 15B • Updated • 1.73M • 28
casperhansen/deepseek-r1-distill-llama-70b-awq • Updated • 64.2k • 15
stelterlab/Mistral-Small-24B-Instruct-2501-AWQ • Text Generation • 24B • Updated • 453k • 27
gaunernst/gemma-3-27b-it-int4-awq • Image-Text-to-Text • 6B • Updated • 22.7k • 39
Orion-zhen/Qwen3-1.7B-AWQ • 2B • Updated • 955 • 2
Text Generation • 15B • Updated • 330k • 60
QuantTrio/MiniMax-M2.5-AWQ • Text Generation • 229B • Updated • 90.7k • 11
mratsim/MiniMax-M2.5-FP8-INT4-AWQ • Text Generation • 39B • Updated • 10.3k • 16
tokyotech-llm/Qwen3-Swallow-32B-RL-v0.2-AWQ-INT4 • Text Generation • 33B • Updated • 929 • 2
QuantTrio/Qwen3.5-35B-A3B-AWQ • Image-Text-to-Text • 36B • Updated • 129k • 11
HugJerry99/SKT-AX-4.0-Light-AWQ • 7B • Updated • 373 • 1
Brooooooklyn/qwen3.5-9B-unsloth-mlx • Text Generation • 4B • Updated • 773 • 1
Brooooooklyn/qwen3.5-35B-A3B-unsloth-mlx • Text Generation • 6B • Updated • 178 • 1
casperhansen/mpt-7b-8k-chat-awq • Text Generation • Updated • 11 • 3
casperhansen/falcon-7b-awq • Text Generation • Updated • 8 • 1
casperhansen/vicuna-7b-v1.5-awq • Text Generation • Updated • 10 • 3
casperhansen/vicuna-7b-v1.5-awq-gemv • Text Generation • Updated • 10 • 1
casperhansen/mpt-7b-8k-chat-awq-gemv • Text Generation • Updated • 9
casperhansen/opt-125m-awq • Text Generation • 0.2B • Updated • 102 • 3
casperhansen/tinyllama-1b-awq • Text Generation • Updated • 12 • 1