marksverdhei/GLM-4.7-Flash-FP8
Text Generation
Transformers
Safetensors
glm4_moe_lite
fp8
quantized
glm4
Mixture of Experts
conversational
License: mit
Branch: main (GLM-4.7-Flash-FP8), 32.2 GB, 1 contributor, History: 23 commits
Latest commit: marksverdhei, "Update README.md", 8921e2e (verified), about 7 hours ago
File                    Size            Last commit                           Updated
.gitattributes          1.57 kB         Upload FP8 quantized GLM-4.7-Flash    about 23 hours ago
README.md               2.08 kB         Update README.md                      about 7 hours ago
chat_template.jinja     3.12 kB         Upload FP8 quantized GLM-4.7-Flash    about 23 hours ago
config.json             1.25 kB         Upload folder using huggingface_hub   about 20 hours ago
model.safetensors       32.2 GB (xet)   Upload folder using huggingface_hub   about 20 hours ago
tokenizer.json          20.2 MB (xet)   Upload FP8 quantized GLM-4.7-Flash    about 23 hours ago
tokenizer_config.json   7.23 kB         Upload folder using huggingface_hub   about 20 hours ago