Your Template Misses a Parenthesis
#4
by
qpqpqpqpqpqp
- opened
README.md
CHANGED
|
@@ -112,7 +112,7 @@ As of llama.cpp build [b4282](https://github.com/ggml-org/llama.cpp/releases/tag
|
|
| 112 |
Additionally, if you want to get slightly better quality for , you can use IQ4_NL thanks to [this PR](https://github.com/ggml-org/llama.cpp/pull/10541) which will also repack the weights for ARM, though only the 4_4 for now. The loading time may be slower but it will result in an overall speed incrase.
|
| 113 |
|
| 114 |
<details>
|
| 115 |
-
<summary>Click to view Q4_0_X_X information (deprecated</summary>
|
| 116 |
|
| 117 |
I'm keeping this section to show the potential theoretical uplift in performance from using the Q4_0 with online repacking.
|
| 118 |
|
|
|
|
| 112 |
Additionally, if you want to get slightly better quality for , you can use IQ4_NL thanks to [this PR](https://github.com/ggml-org/llama.cpp/pull/10541) which will also repack the weights for ARM, though only the 4_4 for now. The loading time may be slower but it will result in an overall speed incrase.
|
| 113 |
|
| 114 |
<details>
|
| 115 |
+
<summary>Click to view Q4_0_X_X information (deprecated)</summary>
|
| 116 |
|
| 117 |
I'm keeping this section to show the potential theoretical uplift in performance from using the Q4_0 with online repacking.
|
| 118 |
|