Update README.md
Browse files
README.md
CHANGED
|
@@ -88,13 +88,13 @@ Now it searches for keywords or similar semantic terms in the document. if it ha
|
|
| 88 |
now a piece of text 1024token around this word “XYZ/ZYX” is cut out at this point. (In reality, it's all done with coded numbers, but dosnt matter - the principle)<br>
|
| 89 |
This text snippet is then used for your answer. <br>
|
| 90 |
<ul style="line-height: 1.05;">
|
| 91 |
-
<li>If, for example, the word “XYZ” occurs
|
| 92 |
<li>If only one snippet corresponds to your question all other snippets can negatively influence your answer because they do not fit the topic (usually 4 to 32 snippet are fine)</li>
|
| 93 |
<li>If you expect multible search results in your docs try 16-snippets or more, if you expect only 2 than dont use more!</li>
|
| 94 |
-
<li>If you use chunk-length ~
|
| 95 |
<li>A question for "summary of the document" is most time not useful, if the document has an introduction or summaries its searching there if you have luck.</li>
|
| 96 |
<li>If a book has a table of contents or a bibliography, I would delete these pages as they often contain relevant search terms but do not help answer your question.</li>
|
| 97 |
-
<li>If the documents small like 10-20 Pages, its better you copy the whole text inside the
|
| 98 |
</ul>
|
| 99 |
<br>
|
| 100 |
...
|
|
|
|
| 88 |
now a piece of text 1024token around this word “XYZ/ZYX” is cut out at this point. (In reality, it's all done with coded numbers, but dosnt matter - the principle)<br>
|
| 89 |
This text snippet is then used for your answer. <br>
|
| 90 |
<ul style="line-height: 1.05;">
|
| 91 |
+
<li>If, for example, the word “XYZ” occurs 50 times in one file, not all 50 are used for answer, only the number of snippets with a fast ranking are used</li>
|
| 92 |
<li>If only one snippet corresponds to your question all other snippets can negatively influence your answer because they do not fit the topic (usually 4 to 32 snippet are fine)</li>
|
| 93 |
<li>If you expect multible search results in your docs try 16-snippets or more, if you expect only 2 than dont use more!</li>
|
| 94 |
+
<li>If you use chunk-length ~2048(chars) you receive more content, if you use ~512chars you receive more facts BUT lower chunk-length are more chunks and need much longer time.</li>
|
| 95 |
<li>A question for "summary of the document" is most time not useful, if the document has an introduction or summaries its searching there if you have luck.</li>
|
| 96 |
<li>If a book has a table of contents or a bibliography, I would delete these pages as they often contain relevant search terms but do not help answer your question.</li>
|
| 97 |
+
<li>If the documents small like 10-20 Pages, its better you copy the whole text inside the CHAT, some options called "pin".</li>
|
| 98 |
</ul>
|
| 99 |
<br>
|
| 100 |
...
|