Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

QuantTrio
/
GLM-4.5V-AWQ

Image-Text-to-Text
Transformers
Safetensors
Chinese
English
glm4v_moe
any-to-any
AWQ
vLLM
conversational
4-bit precision
awq_marlin
Model card Files Files and versions
xet
Community
5
New discussion
Resources
  • PR & discussions documentation
  • Code of Conduct
  • Hub documentation

max model length is only 64k

#5 opened 16 days ago by
mtcl

RuntimeError: operator _C::marlin_qqq_gemm does not exist

3
#4 opened 4 months ago by
sunnykaibai

Not running ond vllm / transformer

1
#3 opened 4 months ago by
abiteddie

Keep get model type `glm4v_moe` not recognized error

1
#2 opened 4 months ago by
QiliangGoose

model is not performing as good as GLM-4.5-Air-AWQ-FP16Mix

3
#1 opened 4 months ago by
hareram241
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs