Updated W4A16 Link
Browse files
README.md
CHANGED
|
@@ -41,7 +41,7 @@ This repository provides an **AWQ quantized** build of **GLM-4.7-Flash** repacka
|
|
| 41 |
- **W4A16_GS32** — **Weight INT4**, **Activation 16-bit**, **Group Size 32** (highest fidelity among W4A16 variants)
|
| 42 |
- **W8A16_GS32** — **Weight INT8**, **Activation 16-bit**, **Group Size 32** (highest fidelity among W8A16 variants)
|
| 43 |
**Quick link:**
|
| 44 |
-
- https://huggingface.co/TheHouseOfTheDude/GLM-4.7-Flash_AWQ/tree/
|
| 45 |
- https://huggingface.co/TheHouseOfTheDude/GLM-4.7-Flash_AWQ/tree/W8A16_GS32
|
| 46 |
|
| 47 |
---
|
|
|
|
| 41 |
- **W4A16_GS32** — **Weight INT4**, **Activation 16-bit**, **Group Size 32** (highest fidelity among W4A16 variants)
|
| 42 |
- **W8A16_GS32** — **Weight INT8**, **Activation 16-bit**, **Group Size 32** (highest fidelity among W8A16 variants)
|
| 43 |
**Quick link:**
|
| 44 |
+
- https://huggingface.co/TheHouseOfTheDude/GLM-4.7-Flash_AWQ/tree/W4A16_GS32
|
| 45 |
- https://huggingface.co/TheHouseOfTheDude/GLM-4.7-Flash_AWQ/tree/W8A16_GS32
|
| 46 |
|
| 47 |
---
|