Commit edbeb79 (verified), committed by kalle07
1 Parent(s): 08c8155

Update README.md

Files changed (1)
  1. README.md +3 -3
README.md CHANGED
@@ -88,13 +88,13 @@ Now it searches for keywords or similar semantic terms in the document. if it ha
  now a piece of text 1024token around this word “XYZ/ZYX” is cut out at this point. (In reality, it's all done with coded numbers, but dosnt matter - the principle)<br>
  This text snippet is then used for your answer. <br>
  <ul style="line-height: 1.05;">
- <li>If, for example, the word “XYZ” occurs 100 times in one file, not all 100 are found.</li>
+ <li>If, for example, the word “XYZ” occurs 50 times in one file, not all 50 are used for answer, only the number of snippets with a fast ranking are used</li>
  <li>If only one snippet corresponds to your question all other snippets can negatively influence your answer because they do not fit the topic (usually 4 to 32 snippet are fine)</li>
  <li>If you expect multible search results in your docs try 16-snippets or more, if you expect only 2 than dont use more!</li>
- <li>If you use chunk-length ~1024t you receive more content, if you use ~256t you receive more facts BUT lower chunk-length are more chunks and need much longer time.</li>
+ <li>If you use chunk-length ~2048(chars) you receive more content, if you use ~512chars you receive more facts BUT lower chunk-length are more chunks and need much longer time.</li>
  <li>A question for "summary of the document" is most time not useful, if the document has an introduction or summaries its searching there if you have luck.</li>
  <li>If a book has a table of contents or a bibliography, I would delete these pages as they often contain relevant search terms but do not help answer your question.</li>
- <li>If the documents small like 10-20 Pages, its better you copy the whole text inside the prompt, some options called "pin".</li>
+ <li>If the documents small like 10-20 Pages, its better you copy the whole text inside the CHAT, some options called "pin".</li>
  </ul>
  <br>
  ...
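
The changed README lines describe two knobs: the chunk length in characters and the number of top-ranked snippets that are actually passed on to the answer. A minimal Python sketch of that idea follows. It is not the tool's real implementation: the function names (`split_into_chunks`, `score`, `top_snippets`) are hypothetical, and a plain keyword count stands in for the embedding-based ranking the README alludes to with "coded numbers".

```python
from typing import List, Tuple


def split_into_chunks(text: str, chunk_chars: int) -> List[str]:
    """Cut the text into consecutive pieces of at most chunk_chars characters."""
    if chunk_chars <= 0:
        raise ValueError("chunk_chars must be positive")
    return [text[i:i + chunk_chars] for i in range(0, len(text), chunk_chars)]


def score(chunk: str, keyword: str) -> int:
    """Toy relevance score: case-insensitive keyword count.

    Assumption: a real setup would rank chunks by embedding similarity instead.
    """
    return chunk.lower().count(keyword.lower())


def top_snippets(
    text: str, keyword: str, chunk_chars: int, max_snippets: int
) -> List[Tuple[int, str]]:
    """Return at most max_snippets (score, chunk) pairs, best score first."""
    chunks = split_into_chunks(text, chunk_chars)
    scored = [(score(chunk, keyword), chunk) for chunk in chunks]
    hits = [pair for pair in scored if pair[0] > 0]
    hits.sort(key=lambda pair: pair[0], reverse=True)
    return hits[:max_snippets]


# Synthetic document in which "XYZ" occurs 50 times.
document = ("filler text " * 20 + "XYZ ") * 50

for chunk_chars in (2048, 512):
    n_chunks = len(split_into_chunks(document, chunk_chars))
    picked = top_snippets(document, "XYZ", chunk_chars, max_snippets=4)
    print(f"chunk_chars={chunk_chars}: {n_chunks} chunks, {len(picked)} snippets used")
```

In this sketch a smaller `chunk_chars` produces more chunks to score, which is the extra work the README warns about, while `max_snippets` caps how many of the matches reach the answer no matter how often the keyword occurs.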