Update README.md
Browse files
README.md
CHANGED
|
@@ -29,8 +29,9 @@ All training and data preprocessing were performed on **a single GPU (46VRAM)**,
|
|
| 29 |
|
| 30 |
### Model Description
|
| 31 |
- **Model Type:** Sentence Transformer
|
| 32 |
-
- **Base Model:**
|
| 33 |
-
- **Maximum Sequence Length:**
|
|
|
|
| 34 |
- **Output Dimensionality:** 1024 / 512 dimensions
|
| 35 |
- **Similarity Function:** Cosine Similarity
|
| 36 |
- **Languages:** ko, en
|
|
@@ -66,21 +67,20 @@ One group is based on a specific sports regulation PDF, for which synthetic quer
|
|
| 66 |
The final group is a concatenation of all four aforementioned groups, providing a comprehensive mixed set.<br>
|
| 67 |
The following table presents the average retrieval performance across five dataset groups.
|
| 68 |
|
| 69 |
-
|
|
| 70 |
-
|
| 71 |
-
| frony-embed-medium
|
| 72 |
-
|
|
| 73 |
-
| frony-embed-
|
| 74 |
-
|
|
| 75 |
-
| frony-embed-
|
| 76 |
-
| frony-embed-
|
| 77 |
-
|
|
| 78 |
-
|
|
| 79 |
-
|
|
| 80 |
-
|
|
| 81 |
-
|
|
| 82 |
-
| openai-
|
| 83 |
-
**Transformer blocks only*
|
| 84 |
|
| 85 |
## Usage
|
| 86 |
|
|
|
|
| 29 |
|
| 30 |
### Model Description
|
| 31 |
- **Model Type:** Sentence Transformer
|
| 32 |
+
- **Base Model:** Snowflake/snowflake-arctic-embed-l-v2.0
|
| 33 |
+
- **Maximum Sequence Length:** 8192 tokens
|
| 34 |
+
> Important: The base model supports up to 8192 tokens, **but performance is only guaranteed up to 512 tokens**.
|
| 35 |
- **Output Dimensionality:** 1024 / 512 dimensions
|
| 36 |
- **Similarity Function:** Cosine Similarity
|
| 37 |
- **Languages:** ko, en
|
|
|
|
| 67 |
The final group is a concatenation of all four aforementioned groups, providing a comprehensive mixed set.<br>
|
| 68 |
The following table presents the average retrieval performance across five dataset groups.
|
| 69 |
|
| 70 |
+
| Architecture | Open/Closed | Accuracy@1 | Accuracy@3 | Accuracy@5 | Accuracy@10 |
|
| 71 |
+
|--------------------------------------------------------|-----------|-----------|-----------|-----------|------------|
|
| 72 |
+
| FronyAI/frony-embed-medium-arctic-ko-v2.5 | Open | **0.6875** | **0.8557** | 0.9044 | 0.9410 |
|
| 73 |
+
| upstage-large | Closed | 0.6323 | 0.8522 | 0.9068 | 0.9459 |
|
| 74 |
+
| FronyAI/frony-embed-medium-arctic-ko-v2.5 (half dim) | Open | **0.6715** | **0.8446** | 0.8935 | 0.9364 |
|
| 75 |
+
| dragonkue/snowflake-arctic-embed-l-v2.0-ko | Open | **0.6612** | **0.8396** | 0.8931 | 0.9390 |
|
| 76 |
+
| FronyAI/frony-embed-medium-ko-v2 | Open | **0.6806** | **0.8376** | 0.8819 | 0.9207 |
|
| 77 |
+
| FronyAI/frony-embed-medium-ko-v2 (half dim) | Open | 0.6723 | 0.8275 | 0.8712 | 0.9157 |
|
| 78 |
+
| nlpai-lab/KURE-v1 | Open | 0.6434 | 0.8240 | 0.8788 | 0.9285 |
|
| 79 |
+
| BAAI/bge-m3 | Open | 0.5849 | 0.7763 | 0.8420 | 0.8985 |
|
| 80 |
+
| intfloat/multilingual-e5-large | Open | 0.5764 | 0.7630 | 0.8267 | 0.8891 |
|
| 81 |
+
| Snowflake/snowflake-arctic-embed-l-v2.0 | Open | 0.5726 | 0.7591 | 0.8232 | 0.8917 |
|
| 82 |
+
| jinaai/jina-embeddings-v3 | Open | 0.5270 | 0.7242 | 0.7953 | 0.8644 |
|
| 83 |
+
| openai-large | Closed | 0.4903 | 0.6621 | 0.7316 | 0.8149 |
|
|
|
|
| 84 |
|
| 85 |
## Usage
|
| 86 |
|