GLM-4.7-Flash-FP8 / README.md

Commit History

Update README.md
8921e2e
verified

marksverdhei committed on

Add vLLM fork link for MLA detection support
5d2df64
verified

marksverdhei committed on

Add performance benchmarks (19.4 tok/s on 2x RTX 3090)
60a4777
verified

marksverdhei committed on

Upload folder using huggingface_hub
180a767
verified

marksverdhei committed on

Fix license to MIT
02419e3
verified

marksverdhei committed on

Update README.md
82cc0ee
verified

marksverdhei committed on

Update README.md
adc406d
verified

marksverdhei committed on

Upload FP8 quantized GLM-4.7-Flash
760b1d9
verified

marksverdhei committed on