QuantTrio/GLM-4.6-GPTQ-Int4-Int8Mix
Tags: Text Generation, Transformers, Safetensors, glm4_moe, GPTQ, vLLM, conversational, 4-bit precision, gptq_marlin
arXiv: 2508.06471
License: mit
Revision: c37a163
Total size: 8.98 GB
Contributors: 2
History: 4 commits
Latest commit: JunHowie, "Upload model-00005-of-00083.safetensors with huggingface_hub" (c37a163, verified, 3 months ago)
File                              Size           Last commit                                                   Updated
.gitattributes                    1.52 kB        initial commit                                                3 months ago
README.md                         31 Bytes       initial commit                                                3 months ago
model-00001-of-00083.safetensors  3 GB (xet)     Upload model-00001-of-00083.safetensors with huggingface_hub  3 months ago
model-00003-of-00083.safetensors  3 GB (xet)     Upload model-00003-of-00083.safetensors with huggingface_hub  3 months ago
model-00005-of-00083.safetensors  2.99 GB (xet)  Upload model-00005-of-00083.safetensors with huggingface_hub  3 months ago