Update README.md
Browse files
README.md
CHANGED
|
@@ -60,13 +60,11 @@ You can play and set for your needs, eg 8-snippets a 2048t, or 28-snippets a 512
|
|
| 60 |
<li>32000t (~24000words) ~3GB VRAM usage</li>
|
| 61 |
</ul>
|
| 62 |
<br>
|
| 63 |
-
here is
|
| 64 |
-
|
| 65 |
and a Vram calculator - (you need the original model link NOT the GGUF)<br>
|
| 66 |
-
|
| 67 |
|
| 68 |
-
<br>
|
| 69 |
-
QwQ-LCoT- (7/14b) - https://huggingface.co/mradermacher/QwQ-LCoT-14B-Conversational-GGUF<br>
|
| 70 |
...
|
| 71 |
<br>
|
| 72 |
|
|
|
|
| 60 |
<li>32000t (~24000words) ~3GB VRAM usage</li>
|
| 61 |
</ul>
|
| 62 |
<br>
|
| 63 |
+
here is a tokenizer calculator<br>
|
| 64 |
+
<a href="https://quizgecko.com/tools/token-counter"> <br>
|
| 65 |
and a Vram calculator - (you need the original model link NOT the GGUF)<br>
|
| 66 |
+
https://huggingface.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator<br>
|
| 67 |
|
|
|
|
|
|
|
| 68 |
...
|
| 69 |
<br>
|
| 70 |
|