cmh commited on
Commit
856ede3
·
verified ·
1 Parent(s): a5ab734

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -5
README.md CHANGED
@@ -17,14 +17,14 @@ pipeline_tag: text-generation
17
 
18
  | | Quant type | File Size | ~Vram*|
19
  | -------- | ---------- | --------- | -------- |
20
- | [phi-4_3bpw](https://huggingface.co/cmh/phi-4_exl3/tree/3bpw) | 3.00 bits per weight | 6.08 GB | **7.0 GB** |
21
- | [phi-4_4bpw](https://huggingface.co/cmh/phi-4_exl3/tree/4bpw) | 4.00 bits per weight | 7.67 GB | **8.6 GB** |
22
- | [phi-4_5bpw](https://huggingface.co/cmh/phi-4_exl3/tree/5bpw) | 5.00 bits per weight | 9.25 GB | **10.1 GB** |
23
- | [phi-4_6bpw](https://huggingface.co/cmh/phi-4_exl3/tree/6bpw) | 6.00 bits per weight | 10.8 GB | **11.9 GB** |
24
  | [phi-4_7bpw](https://huggingface.co/cmh/phi-4_exl3/tree/7bpw) | 7.00 bits per weight | 12.4 GB | tbd |
25
  | [phi-4_8bpw](https://huggingface.co/cmh/phi-4_exl3/tree/8bpw) | 8.00 bits per weight | 14.0 GB | tbd |
26
 
27
- <sub>*approximate value at **4k context**.<sup>
28
 
29
  ---------------------------------------------
30
 
 
17
 
18
  | | Quant type | File Size | ~Vram*|
19
  | -------- | ---------- | --------- | -------- |
20
+ | [phi-4_3bpw](https://huggingface.co/cmh/phi-4_exl3/tree/3bpw) | 3.00 bits per weight | 6.08 GB | **9.4 GB** |
21
+ | [phi-4_4bpw](https://huggingface.co/cmh/phi-4_exl3/tree/4bpw) | 4.00 bits per weight | 7.67 GB | **11.0 GB** |
22
+ | [phi-4_5bpw](https://huggingface.co/cmh/phi-4_exl3/tree/5bpw) | 5.00 bits per weight | 9.25 GB | tbd |
23
+ | [phi-4_6bpw](https://huggingface.co/cmh/phi-4_exl3/tree/6bpw) | 6.00 bits per weight | 10.8 GB | tbd |
24
  | [phi-4_7bpw](https://huggingface.co/cmh/phi-4_exl3/tree/7bpw) | 7.00 bits per weight | 12.4 GB | tbd |
25
  | [phi-4_8bpw](https://huggingface.co/cmh/phi-4_exl3/tree/8bpw) | 8.00 bits per weight | 14.0 GB | tbd |
26
 
27
+ <sub>*approximate value at **16k context**.<sup>
28
 
29
  ---------------------------------------------
30