<8B instruct models with AWQ, GPTQ quantizations available
- Qwen/Qwen2.5-7B-Instruct
  Text Generation • 8B • 12.9M downloads • 1.39k likes
- Qwen/Qwen2.5-7B-Instruct-GPTQ-Int4
  Text Generation • 8B • 65.2k downloads • 33 likes
- Qwen/Qwen2.5-7B-Instruct-AWQ
  Text Generation • 8B • 3.52M downloads • 47 likes
- meta-llama/Llama-3.2-3B-Instruct
  Text Generation • 3B • 2.15M downloads • 2.29k likes