kalle07 committed
Commit 225a69e · verified · 1 Parent(s): 28b99a8

Update README.md

Files changed (1): README.md +7 -0
README.md CHANGED
68
  <br>
69
  ...
70
  <br>
71
+ Nevertheless, the main model is also important, especially in how it handles the context length, and I don't mean just the theoretical number you can set.
72
+ Some models can handle 128k tokens, but even with 16k of input their response to the same snippets is worse than that of other models.
73
+ <br>
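The practical consequence of the note above is to keep the retrieved snippets well below the model's usable context. A minimal sketch, not from the README, using the common rough heuristic of about 4 characters per token (the function name and budget values are my own assumptions):

```python
# Trim retrieved snippets so they stay within a conservative token budget.
# Assumption: ~4 characters per token, a rough heuristic; real token counts
# depend on the model's tokenizer.

def fit_to_budget(snippets, max_tokens=16_000, chars_per_token=4):
    """Keep whole snippets, in retrieval order, until the budget is spent."""
    budget = max_tokens * chars_per_token
    kept, used = [], 0
    for s in snippets:
        if used + len(s) > budget:
            break
        kept.append(s)
        used += len(s)
    return kept
```

Keeping whole snippets (rather than cutting one mid-sentence) preserves the numbered-excerpt structure that the system prompt below relies on.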
74
  <b>Important -> The system prompt (an example):</b><br>
75
  You are a helpful assistant who provides an overview of ... under the aspects of ... .
76
  You use attached excerpts from the collection to generate your answers!
 
80
  After your answer, briefly explain why you included excerpts (1 to X) in your response and justify briefly if you considered some of them unimportant!<br>
81
  (adapt it to your needs; this example works well when I consult a book about a person and a term related to them)
82
  <br>
83
+ Models that usually work well:
84
+ llama3.1, llama3.2, qwen2.5, deepseek-r1-distill, SauerkrautLM-Nemo(german) ... <br>
85
+ (llama3 and phi3.5 do not work well)
86
+
87
  ...
88
  <br>
89
  <br>
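The system prompt example above can be sketched as a chat-style message list, with each retrieved excerpt numbered so the model can cite it as "excerpts (1 to X)". A minimal sketch, not from the README; the prompt text paraphrases the example above, and the helper name, question, and snippet texts are placeholders:

```python
# Build a chat message list that pairs the system prompt with numbered
# retrieval excerpts. SYSTEM_PROMPT paraphrases the README's example.

SYSTEM_PROMPT = (
    "You are a helpful assistant who provides an overview of the topic "
    "under the given aspects. You use attached excerpts from the collection "
    "to generate your answers! Answer the user's question! After your "
    "answer, briefly explain why you included excerpts (1 to X) in your "
    "response and justify briefly if you considered some of them unimportant!"
)

def build_messages(question, excerpts):
    """Attach each retrieved excerpt with a number so the model can cite it."""
    numbered = "\n\n".join(
        f"Excerpt {i}:\n{text}" for i, text in enumerate(excerpts, start=1)
    )
    return [
        {"role": "system", "content": SYSTEM_PROMPT},
        {"role": "user", "content": f"{question}\n\n{numbered}"},
    ]

# Placeholder question and snippets for illustration.
messages = build_messages(
    "What role did this term play in the person's life?",
    ["First retrieved snippet ...", "Second retrieved snippet ..."],
)
```

This message list is the shape most chat APIs and local runners accept, so the same prompt can be tried across the models listed above.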