marksverdhei/GLM-4.7-Flash-FP8
Text Generation · Transformers · Safetensors · glm4_moe_lite · fp8 · quantized · glm4 · Mixture of Experts · conversational · License: MIT
1 contributor · History: 22 commits
Latest commit by marksverdhei: "Add vLLM fork link for MLA detection support" (5d2df64, verified, 4 months ago)
| File | Size | Last commit message | Last updated |
| --- | --- | --- | --- |
| .gitattributes | 1.57 kB | Upload FP8 quantized GLM-4.7-Flash | 4 months ago |
| README.md | 1.83 kB | Add vLLM fork link for MLA detection support | 4 months ago |
| chat_template.jinja | 3.12 kB | Upload FP8 quantized GLM-4.7-Flash | 4 months ago |
| config.json | 1.25 kB | Upload folder using huggingface_hub | 4 months ago |
| model.safetensors | 32.2 GB | Upload folder using huggingface_hub | 4 months ago |
| tokenizer.json | 20.2 MB | Upload FP8 quantized GLM-4.7-Flash | 4 months ago |
| tokenizer_config.json | 7.23 kB | Upload folder using huggingface_hub | 4 months ago |