Update README.md
README.md
CHANGED
@@ -7,14 +7,14 @@ Available AWQ Quantisations:
 
 | Revision | Model Size | Group Size | w_bit |
 |--------------|------------|------------|-------|
-| `main` |
-| `gs_2_4bit` |
-| `gs_4_4bit` |
-| `gs_8_4bit` |
-| `gs_16_4bit` |
-| `gs_64_4bit` |
-| `gs_128_4bit`|
-| `gs_512_4bit`|
+| `main` | ~6.13 GB | 32 | 4 |
+| `gs_2_4bit` | ~x.xx GB | 2 | 4 |
+| `gs_4_4bit` | ~x.xx GB | 4 | 4 |
+| `gs_8_4bit` | ~x.xx GB | 8 | 4 |
+| [`gs_16_4bit`](https://huggingface.co/hivata/functionary-small-v3.2-AWQ/tree/gs_16-4bit) | ~6.68 GB | 16 | 4 |
+| `gs_64_4bit` | ~x.xx GB | 64 | 4 |
+| `gs_128_4bit`| ~x.xx GB | 128 | 4 |
+| `gs_512_4bit`| ~x.xx GB | 512 | 4 |
 
 **This model was based on [meta-llama/Meta-Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct)**
 
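The table pairs each revision with a group size, and the two filled-in sizes suggest that a smaller group size yields a larger checkpoint (`gs_16_4bit` at ~6.68 GB versus `main` with group size 32 at ~6.13 GB). A rough way to see why is to count the per-group metadata. The sketch below is not taken from the README: it assumes each group of weights stores one 16-bit scale and one `w_bit`-wide zero point, which is a common AWQ layout, and it ignores unquantised layers and file overhead. The function name `effective_bits_per_weight` and the constants are hypothetical, chosen only for illustration.

```python
def effective_bits_per_weight(w_bit, group_size, scale_bits=16, zero_bits=None):
    """Approximate storage cost, in bits, of one quantised weight.

    Assumption (not stated in the README): every group of `group_size`
    weights carries one scale of `scale_bits` bits and one zero point of
    `zero_bits` bits (defaulting to `w_bit`).
    """
    if zero_bits is None:
        zero_bits = w_bit
    # The weight itself plus its share of the group's scale and zero point.
    return w_bit + (scale_bits + zero_bits) / group_size


# Group sizes listed in the table; w_bit is 4 for every revision.
GROUP_SIZES = [2, 4, 8, 16, 32, 64, 128, 512]
BITS = {g: effective_bits_per_weight(4, g) for g in GROUP_SIZES}

if __name__ == "__main__":
    for g, bits in BITS.items():
        print(f"group_size={g:>3}: {bits:.3f} bits per quantised weight")
```

Under these assumptions the per-weight cost falls as the group size grows, which matches the direction of the two sizes reported in the table. The `~x.xx GB` placeholders are left as they are, since the estimate does not account for layers kept in higher precision.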