solarkyle
/

GLM-4.7-Flash-GGUF

Text Generation

Mixture of Experts

4-bit precision

Model card Files Files and versions

solarkyle commited on 2 days ago

Commit

302a97b

·

verified ·

1 Parent(s): 78f3074

Upload README.md with huggingface_hub

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -17,7 +17,7 @@ model_type: glm4-moe-lite
 quantized_by: solarkyle
 ---
-# GLM-4.7-Flash-GGUF
 GGUF quantization of [zai-org/GLM-4.7-Flash](https://huggingface.co/zai-org/GLM-4.7-Flash) for use with [llama.cpp](https://github.com/ggml-org/llama.cpp) and compatible inference engines.

 quantized_by: solarkyle
 ---
+# GLM-4.7-Flash-GGUF (16.9GB Q4_K_M)
 GGUF quantization of [zai-org/GLM-4.7-Flash](https://huggingface.co/zai-org/GLM-4.7-Flash) for use with [llama.cpp](https://github.com/ggml-org/llama.cpp) and compatible inference engines.