Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

QuantTrio
/
Qwen3.5-27B-AWQ

Image-Text-to-Text
Transformers
Safetensors
qwen3_5
vLLM
AWQ
conversational
4-bit precision
awq
Model card Files Files and versions
xet
Community
6
New discussion
Resources
  • PR & discussions documentation
  • Code of Conduct
  • Hub documentation

Request for AWQ Quant of Qwen3.5-40B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking-i1

#6 opened 1 day ago by
0xburakcelik

AWQ 4-bit version of this Opus-Distilled-v2 model?

9
#5 opened 18 days ago by
0xburakcelik

More info about data-free quantization

๐Ÿ‘ 1
#4 opened about 1 month ago by
naripok

--max-model-len 32768 seems a bit too small for agent use cases ?

3
#3 opened about 1 month ago by
edwarddukewu

This is the best quant version in the world,better than FP8

๐Ÿš€ 5
3
#2 opened about 1 month ago by
kq

My personal vLLM launch cmd on my old personal 2x3090 workstation

7
#1 opened about 2 months ago by
tclf90
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs