unsloth/GLM-4.7-Flash-FP8-Dynamic

Text Generation

compressed-tensors

danielhanchen committed on Jan 26

Commit 1174de1 · verified · 1 Parent(s): 56ee0da

Update README.md

Files changed (1)

README.md +7 -5

README.md CHANGED

@@ -10,9 +10,12 @@ library_name: transformers
 license: mit
 pipeline_tag: text-generation
 ---
-> [!NOTE]
->  Includes Unsloth **chat template fixes**! <br> For `llama.cpp`, use `--jinja`
->
 <div>
 <p style="margin-top: 0;margin-bottom: 0;">
@@ -25,13 +28,12 @@ pipeline_tag: text-generation
     <a href="https://discord.gg/unsloth">
       <img src="https://github.com/unslothai/unsloth/raw/main/images/Discord%20button.png" width="173">
     </a>
-    <a href="https://docs.unsloth.ai/">
       <img src="https://raw.githubusercontent.com/unslothai/unsloth/refs/heads/main/images/documentation%20green%20button.png" width="143">
     </a>
   </div>
 </div>
 # GLM-4.7-Flash
 <div align="center">

 license: mit
 pipeline_tag: text-generation
 ---
+# Read our How to [Run FP8 GLM-4.7-Flash Guide!](https://unsloth.ai/docs/models/glm-4.7-flash#glm-4.7-flash-in-vllm)
+## FP8 Dynamically quantized GLM-4.7-Flash
+FP8 Dynamically quantized by Unsloth for fast and premium inference.<br>You can read our [vLLM deployment guide](https://unsloth.ai/docs/models/glm-4.7-flash#glm-4.7-flash-in-vllm).
+---
 <div>
 <p style="margin-top: 0;margin-bottom: 0;">
     <a href="https://discord.gg/unsloth">
       <img src="https://github.com/unslothai/unsloth/raw/main/images/Discord%20button.png" width="173">
     </a>
+    <a href="https://unsloth.ai/docs/models/glm-4.7-flash">
       <img src="https://raw.githubusercontent.com/unslothai/unsloth/refs/heads/main/images/documentation%20green%20button.png" width="143">
     </a>
   </div>
 </div>
 # GLM-4.7-Flash
 <div align="center">