Update README.md
README.md
CHANGED
@@ -62,7 +62,7 @@ full CPU expert offload.
 Rough performance metrics on a 9900k (128G RAM) and 4070 (12G VRAM)
 
 CPU EXP OFFLOAD | QKV | Context size| gen rate | ot config
-------------|------|-------------|-----------|------------
+----------------|------|-------------|-----------|------------
 all | F16 | 256K | 20.1 | OT="-ot exps=CPU -ngl 99"
 4-48 | F16 | 256K | 8 | OT="-ot blk\\.[4-9]\|1[0-9]\|2[0-9]\|3[0-9]\|4[0-7].*exps=CPU -ngl 99"
 7-48 | F16 | 128K | 10 | OT="-ot blk\\.[7-9]\|1[0-9]\|2[0-9]\|3[0-9]\|4[0-7].*exps=CPU -ngl 99"