kalle07 committed on
Commit 8144557 · verified · 1 Parent(s): 31090e8

Update README.md

Files changed (1): README.md (+2 -0)
README.md CHANGED
@@ -15,6 +15,7 @@ tags:
 - Granite
 - BGE
 - Jina
+- gemma
 - Snowflake
 - Qwen
 - text-embeddings-inference
@@ -43,6 +44,7 @@ to see all files<br>
 
 # <b>All models tested with ALLM(AnythingLLM) with LM-Studio as server, all models should be work with ollama</b>
 <b> the setup for local documents described below is allmost the same, GPT4All has only one model (nomic), and koboldcpp and JAN(Menlo) is not build in right now but in development</b><br>
+I would always use f32 or f16 bit quality!<br>
 
 (sometimes the results are more truthful if the “chat with document only” option is used)<br>
 Incidentally, the Embedder model is only one part of a good RAG (Retrieval-Augmented Generation), but it should be tailored to your language and, if you want it to be completely accurate, also to the application, e.g. programming or medicine. <br>
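The README text above notes that the embedder model is only one part of a RAG pipeline: the embedder turns the query and the document chunks into vectors, and retrieval then ranks chunks by similarity before the LLM sees them. A minimal sketch of that ranking step, using hand-written toy 3-d vectors in place of real embedder output (the vector values and document names here are hypothetical, not from any model in this repo):

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

def top_k(query_vec, doc_vecs, k=2):
    """Return the names of the k chunks most similar to the query."""
    ranked = sorted(doc_vecs.items(),
                    key=lambda kv: cosine(query_vec, kv[1]),
                    reverse=True)
    return [name for name, _ in ranked[:k]]

# Toy vectors standing in for real embedder output (hypothetical values).
docs = {
    "doc_a": [0.9, 0.1, 0.0],
    "doc_b": [0.1, 0.9, 0.0],
    "doc_c": [0.0, 0.2, 0.9],
}
query = [0.8, 0.2, 0.1]
print(top_k(query, docs, k=1))  # doc_a is closest to the query
```

In a real setup the vectors would come from one of the embedding models listed in the tags (served e.g. via LM-Studio or ollama), and the quality of that embedder directly determines which chunks reach the LLM — which is why the README stresses matching the embedder to your language and domain.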