Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
QuantTrio
/
GLM-4.7-Flash-AWQ
like
3
Follow
QuantTrio
182
Text Generation
Transformers
Safetensors
English
Chinese
glm4_moe_lite
vLLM
AWQ
conversational
4-bit precision
awq
arxiv:
2508.06471
License:
mit
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
main
GLM-4.7-Flash-AWQ
File size: 48 Bytes
4bc5c5d
1
{
"framework"
:
"Pytorch"
,
"task"
:
"text-generation"
}