Update README.md
README.md
CHANGED
@@ -17,14 +17,14 @@ pipeline_tag: text-generation
 
 | | Quant type | File Size | ~Vram*|
 | -------- | ---------- | --------- | -------- |
-| [phi-4_3bpw](https://huggingface.co/cmh/phi-4_exl3/tree/3bpw) | 3.00 bits per weight | 6.08 GB | **
-| [phi-4_4bpw](https://huggingface.co/cmh/phi-4_exl3/tree/4bpw) | 4.00 bits per weight | 7.67 GB | **
-| [phi-4_5bpw](https://huggingface.co/cmh/phi-4_exl3/tree/5bpw) | 5.00 bits per weight | 9.25 GB |
-| [phi-4_6bpw](https://huggingface.co/cmh/phi-4_exl3/tree/6bpw) | 6.00 bits per weight | 10.8 GB |
+| [phi-4_3bpw](https://huggingface.co/cmh/phi-4_exl3/tree/3bpw) | 3.00 bits per weight | 6.08 GB | **9.4 GB** |
+| [phi-4_4bpw](https://huggingface.co/cmh/phi-4_exl3/tree/4bpw) | 4.00 bits per weight | 7.67 GB | **11.0 GB** |
+| [phi-4_5bpw](https://huggingface.co/cmh/phi-4_exl3/tree/5bpw) | 5.00 bits per weight | 9.25 GB | tbd |
+| [phi-4_6bpw](https://huggingface.co/cmh/phi-4_exl3/tree/6bpw) | 6.00 bits per weight | 10.8 GB | tbd |
 | [phi-4_7bpw](https://huggingface.co/cmh/phi-4_exl3/tree/7bpw) | 7.00 bits per weight | 12.4 GB | tbd |
 | [phi-4_8bpw](https://huggingface.co/cmh/phi-4_exl3/tree/8bpw) | 8.00 bits per weight | 14.0 GB | tbd |
 
-<sub>*approximate value at **
+<sub>*approximate value at **16k context**.<sup>
 
 ---------------------------------------------
 