Update README.md

README.md CHANGED
@@ -31,7 +31,7 @@ The successors of [German_Semantic_STS_V2](https://huggingface.co/aari1995/Germa
 **Note:** To run this model properly, see "Usage".
 
-
 
 - **Flexibility:** Trained with flexible sequence lengths and embedding truncation, the model makes flexibility a core feature. Smaller dimensions bring only a minor trade-off in quality.
 - **Sequence length:** Embed up to 8192 tokens (16 times more than V2 and other models)
@@ -42,7 +42,7 @@ The successors of [German_Semantic_STS_V2](https://huggingface.co/aari1995/Germa
 - **License:** Apache 2.0
 
 
-
 
 This model has some built-in functionality that is rather hidden. To profit from it, use the code below:
 
@@ -74,7 +74,7 @@ similarities = model.similarity(embeddings, embeddings)
 
 ```
 
-
 
 ```
 SentenceTransformer(
@@ -84,7 +84,7 @@ SentenceTransformer(
 ```
 
 
-
 
 **Q: Is this Model better than V2?**
 
@@ -111,17 +111,17 @@ Another noticeable difference is that V3 has a broader cosine_similarity spectrum
 **A:** Broadly speaking, when going from 1024 to 512 dimensions, there is very little trade-off (about 1 percent). When going down to 64 dimensions, you may face a decrease of up to 3 percent.
 
 
-
 
 Storage comparison:
 
 
 Benchmarks: soon.
 
-
-German_Semantic_V3_Instruct: Guiding your embeddings towards self-selected aspects
 
-
 
 - To [jinaAI](https://huggingface.co/jinaai) for their BERT implementation that is used, especially ALiBi
 - To [deepset](https://huggingface.co/deepset) for the gbert-large, which is a really great model
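The dimension trade-off quoted in the FAQ answer above comes from embedding truncation. A minimal sketch of the mechanics, assuming Matryoshka-style training makes a truncated-and-renormalized prefix a valid embedding (the NumPy random vectors stand in for real model output):

```python
# Sketch: truncate full-size embeddings to a smaller dimension and
# re-normalize, as Matryoshka-style training allows. Random vectors
# stand in for real 1024-dim model output.
import numpy as np

def truncate_and_normalize(embeddings: np.ndarray, dim: int) -> np.ndarray:
    """Keep the first `dim` components and re-normalize to unit length."""
    truncated = embeddings[:, :dim]
    norms = np.linalg.norm(truncated, axis=1, keepdims=True)
    return truncated / norms

rng = np.random.default_rng(0)
full = rng.normal(size=(4, 1024))          # stand-in for full-size embeddings
small = truncate_and_normalize(full, 64)   # 64-dim variant (up to ~3% quality loss)
```

Because each truncated vector is unit length, plain dot products between them are cosine similarities.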
 **Note:** To run this model properly, see "Usage".
 
+# Major updates and USPs:
 
 - **Flexibility:** Trained with flexible sequence lengths and embedding truncation, the model makes flexibility a core feature. Smaller dimensions bring only a minor trade-off in quality.
 - **Sequence length:** Embed up to 8192 tokens (16 times more than V2 and other models)
 - **License:** Apache 2.0
 
 
+# Usage:
 
 This model has some built-in functionality that is rather hidden. To profit from it, use the code below:
 
 
 ```
 
+## Full Model Architecture
 
 ```
 SentenceTransformer(
 ```
 
 
+# FAQ
 
 **Q: Is this Model better than V2?**
 
 **A:** Broadly speaking, when going from 1024 to 512 dimensions, there is very little trade-off (about 1 percent). When going down to 64 dimensions, you may face a decrease of up to 3 percent.
 
 
+# Evaluation
 
 Storage comparison:
 
 
 Benchmarks: soon.
 
+# Up next:
+German_Semantic_V3_Instruct: Guiding your embeddings towards self-selected aspects (planned: 2024).
 
+# Thank You and Credits
 
 - To [jinaAI](https://huggingface.co/jinaai) for their BERT implementation that is used, especially ALiBi
 - To [deepset](https://huggingface.co/deepset) for the gbert-large, which is a really great model
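The storage comparison above is only an image link; the underlying arithmetic is straightforward. A sketch at float32 precision — the corpus size is an illustrative assumption, not a figure from the model card:

```python
# Storage needed for a corpus of embeddings at float32 (4 bytes per dimension).
BYTES_PER_FLOAT32 = 4

def storage_bytes(num_embeddings: int, dim: int) -> int:
    """Raw size of `num_embeddings` dense vectors of `dim` float32 components."""
    return num_embeddings * dim * BYTES_PER_FLOAT32

corpus = 1_000_000  # illustrative corpus size
for dim in (1024, 512, 256, 64):
    gb = storage_bytes(corpus, dim) / 1e9
    print(f"{dim:>4} dims: {gb:.3f} GB")
```

Halving the dimension halves the index size, which is why the ~1 percent quality trade-off at 512 dimensions can be attractive.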