Update README.md
Browse files
README.md
CHANGED
|
@@ -35,12 +35,12 @@ base_model:
|
|
| 35 |
- PartAI/TookaBERT-Base
|
| 36 |
---
|
| 37 |
|
| 38 |
-
#
|
| 39 |
|
| 40 |
|
| 41 |
This model is a Sentence Transformers model trained for semantic textual similarity and embedding tasks. It maps sentences and paragraphs to a dense vector space, where semantically similar texts are close together.
|
| 42 |
|
| 43 |
-
The model is trained in two sizes: **
|
| 44 |
|
| 45 |
## Usage
|
| 46 |
|
|
@@ -57,7 +57,7 @@ Then you can load this model and run inference.
|
|
| 57 |
from sentence_transformers import SentenceTransformer
|
| 58 |
|
| 59 |
# Download from the 🤗 Hub
|
| 60 |
-
model = SentenceTransformer("PartAI/
|
| 61 |
# Run inference
|
| 62 |
sentences = [
|
| 63 |
'درنا از پرندگان مهاجر با پاهای بلند و گردن دراز است.',
|
|
@@ -102,14 +102,13 @@ For *Retrieval* and *Reranking* tasks, we follow the same asymmetric structure,
|
|
| 102 |
- `"متن: "` to documents
|
| 103 |
|
| 104 |
|
| 105 |
-
| Model | #Params | Pair-Classification-Avg | Classification-Avg | Retrieval-Avg | Reranking-Avg |
|
| 106 |
-
|
| 107 |
-
| [
|
| 108 |
-
| [
|
| 109 |
-
| [jina-embeddings-v3](https://huggingface.co/jinaai/jina-embeddings-v3) | 572M | 71.88 | **79.27** | 65.18
|
| 110 |
-
|
|
| 111 |
-
|
|
| 112 |
-
| tooka-sbert-large-v2 | 353M | 80.24 | 74.73 | 59.80 | 73.44 | 72.05 |
|
| 113 |
|
| 114 |
|
| 115 |
### Task-Specific Datasets in PTEB
|
|
|
|
| 35 |
- PartAI/TookaBERT-Base
|
| 36 |
---
|
| 37 |
|
| 38 |
+
# Tooka-SBERT-V2-Small
|
| 39 |
|
| 40 |
|
| 41 |
This model is a Sentence Transformers model trained for semantic textual similarity and embedding tasks. It maps sentences and paragraphs to a dense vector space, where semantically similar texts are close together.
|
| 42 |
|
| 43 |
+
The model is trained in two sizes: **Small** and **Large**
|
| 44 |
|
| 45 |
## Usage
|
| 46 |
|
|
|
|
| 57 |
from sentence_transformers import SentenceTransformer
|
| 58 |
|
| 59 |
# Download from the 🤗 Hub
|
| 60 |
+
model = SentenceTransformer("PartAI/Tooka-SBERT-V2-Small")
|
| 61 |
# Run inference
|
| 62 |
sentences = [
|
| 63 |
'درنا از پرندگان مهاجر با پاهای بلند و گردن دراز است.',
|
|
|
|
| 102 |
- `"متن: "` to documents
|
| 103 |
|
| 104 |
|
| 105 |
+
| Model | #Params | Pair-Classification-Avg | Classification-Avg | Retrieval-Avg | Reranking-Avg | CrossTasks-Avg |
|
| 106 |
+
|--------------------------------------------------------------------------------|:-------:|-------------------------|--------------------|---------------|---------------|----------------|
|
| 107 |
+
| [Tooka-SBERT-V2-Large](https://huggingface.co/PartAI/Tooka-SBERT-V2-Large) | 353M | 80.24 | 74.73 | 59.80 | 73.44 | **72.05** |
|
| 108 |
+
| [Tooka-SBERT-V2-Small](https://huggingface.co/PartAI/Tooka-SBERT-V2-Small) | 123M | 75.69 | 72.16 | 61.24 | 73.40 | 70.62 |
|
| 109 |
+
| [jina-embeddings-v3](https://huggingface.co/jinaai/jina-embeddings-v3) | 572M | 71.88 | **79.27** | **65.18** | 64.62 | 70.24 |
|
| 110 |
+
| [multilingual-e5-base](https://huggingface.co/intfloat/multilingual-e5-base) | 278M | 70.76 | 69.71 | 63.90 | **76.01** | 70.09 |
|
| 111 |
+
| [Tooka-SBERT-V1-Large](https://huggingface.co/PartAI/Tooka-SBERT) | 353M | **81.52** | 71.54 | 45.61 | 60.44 | 64.78 |
|
|
|
|
| 112 |
|
| 113 |
|
| 114 |
### Task-Specific Datasets in PTEB
|