Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
QuantTrio
/
GLM-4.7-Flash-AWQ
like
2
Follow
QuantTrio
175
Text Generation
Transformers
Safetensors
English
Chinese
glm4_moe_lite
vLLM
AWQ
conversational
4-bit precision
awq
arxiv:
2508.06471
License:
mit
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
main
GLM-4.7-Flash-AWQ
/
tokenizer.json
Commit History
Add files using upload-large-folder tool
4bc5c5d
verified
JunHowie
commited on
8 days ago