Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
|
@@ -17,7 +17,7 @@ model_type: glm4-moe-lite
|
|
| 17 |
quantized_by: solarkyle
|
| 18 |
---
|
| 19 |
|
| 20 |
-
# GLM-4.7-Flash-GGUF
|
| 21 |
|
| 22 |
GGUF quantization of [zai-org/GLM-4.7-Flash](https://huggingface.co/zai-org/GLM-4.7-Flash) for use with [llama.cpp](https://github.com/ggml-org/llama.cpp) and compatible inference engines.
|
| 23 |
|
|
|
|
| 17 |
quantized_by: solarkyle
|
| 18 |
---
|
| 19 |
|
| 20 |
+
# GLM-4.7-Flash-GGUF (16.9GB Q4_K_M)
|
| 21 |
|
| 22 |
GGUF quantization of [zai-org/GLM-4.7-Flash](https://huggingface.co/zai-org/GLM-4.7-Flash) for use with [llama.cpp](https://github.com/ggml-org/llama.cpp) and compatible inference engines.
|
| 23 |
|