Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Intel
/
GLM-5-int4-mixed-AutoRound
like
2
Follow
Intel
3.55k
Safetensors
glm_moe_dsa
text-generation-inference
4-bit precision
auto-round
arxiv:
2309.05516
Model card
Files
Files and versions
xet
Community
2
Deploy
New discussion
New pull request
Resources
PR & discussions documentation
Code of Conduct
Hub documentation
All
Discussions
Pull requests
View closed (0)
Sort: Recently created
vLLM fails to serve Intel/GLM-5-int4-mixed-AutoRound on NVIDIA DGX Spark (GB10, sm121) due to no valid MLA attention backend (qk_nope_head_dim 192)
1
#2 opened 9 days ago by
oliverjohnwilson
This model always predicts some few nonsense sequences
4
#1 opened 13 days ago by
CharlesChen2023