Update README.md
Browse files
README.md
CHANGED
|
@@ -62,8 +62,8 @@ Set your (Max Tokens)context-lenght 16000t main-LLM-model, set your embedder-mod
|
|
| 62 |
Your document will be embedd in x times 1024t chunks(snippets),<br>
|
| 63 |
You can receive 14-snippets a 1024t (~14000t) from your document ~10000words(10pages) and ~2000t left (from 16000t) for the answer ~1000words (2 pages)
|
| 64 |
<br>
|
| 65 |
-
You can play and set for your needs, eg 8-snippets a 2048t, or 28-snippets a 512t ... (every time you change the chunk-length the document must be embedd again). With these settings everything fits best for
|
| 66 |
-
<ul style="line-height: 1.05;"
|
| 67 |
english vs german differ 50%<br>
|
| 68 |
~5000 character is one page of a book (no matter ger/en) but words in german are longer, that means per word more token<br>
|
| 69 |
the example is english, for german you can add apox 50% more token (1000 words ~1800t)<br>
|
|
|
|
| 62 |
Your document will be embedd in x times 1024t chunks(snippets),<br>
|
| 63 |
You can receive 14-snippets a 1024t (~14000t) from your document ~10000words(10pages) and ~2000t left (from 16000t) for the answer ~1000words (2 pages)
|
| 64 |
<br>
|
| 65 |
+
You can play and set for your needs, eg 8-snippets a 2048t, or 28-snippets a 512t ... (every time you change the chunk-length the document must be embedd again). With these settings everything fits best for ONE answer, if you need more for a conversation, you should set lower and/or disable the document.
|
| 66 |
+
<ul style="line-height: 1.05;">
|
| 67 |
english vs german differ 50%<br>
|
| 68 |
~5000 character is one page of a book (no matter ger/en) but words in german are longer, that means per word more token<br>
|
| 69 |
the example is english, for german you can add apox 50% more token (1000 words ~1800t)<br>
|