Update README.md
README.md
CHANGED
@@ -57,9 +57,9 @@ Our model, **telepix/PIXIE-Rune-v1.0**, achieves state-of-the-art performance ac
 | Snowflake/snowflake-arctic-embed-l-v2.0 | 568M | 0.6592 | 0.6118 | 0.6542 | 0.6759 | 0.6949 |
 | BAAI/bge-m3 | 568M | 0.6573 | 0.6099 | 0.6533 | 0.6732 | 0.6930 |
 | Qwen/Qwen3-Embedding-0.6B | 595M | 0.6321 | 0.5894 | 0.6274 | 0.6455 | 0.6662 |
-
+| jinaai/jina-embeddings-v3 | 572M | 0.6293 | 0.5800 | 0.6254 | 0.6456 | 0.6665 |
+| Alibaba-NLP/gte-multilingual-base | 305M | 0.6111 | 0.5542 | 0.6089 | 0.6302 | 0.6511 |
 | openai/text-embedding-3-large | N/A | 0.6015 | 0.5466 | 0.5999 | 0.6187 | 0.6409 |
-| Salesforce/SFR-Embedding-2_R | 7111M | 0.5979 | 0.5451 | 0.5959 | 0.6158 | 0.6348 |
 
 Descriptions of the benchmark datasets used for evaluation are as follows:
 - **Ko-StrategyQA**
@@ -82,9 +82,10 @@ Our model, **telepix/PIXIE-Rune-v1.0**, achieves strong performance on a wide ra
 
 | Model Name | # params | Avg. NDCG | NDCG@1 | NDCG@3 | NDCG@5 | NDCG@10 |
 |------|:---:|:---:|:---:|:---:|:---:|:---:|
-| **telepix/PIXIE-Rune-v1.0** | 568M | **
+| **telepix/PIXIE-Rune-v1.0** | 568M | **0.5781** | **0.5691** | **0.5663** | **0.5791** | **0.5979** |
 | Snowflake/snowflake-arctic-embed-l-v2.0 | 568M | 0.5812 | 0.5725 | 0.5705 | 0.5811 | 0.6006 |
 | Qwen/Qwen3-Embedding-0.6B | 595M | 0.5558 | 0.5321 | 0.5451 | 0.5620 | 0.5839 |
+| Alibaba-NLP/gte-multilingual-base | 305M | 0.5541 | 0.5446 | 0.5426 | 0.5574 | 0.5746 |
 | BAAI/bge-m3 | 568M | 0.5318 | 0.5078 | 0.5231 | 0.5389 | 0.5573 |
 | dragonekue/BGE-m3-ko | 568M | 0.5307 | 0.5125 | 0.5174 | 0.5362 | 0.5566 |
 | nlpai-lab/KURE-v1 | 568M | 0.5272 | 0.5017 | 0.5171 | 0.5353 | 0.5548 |
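The tables in this diff list an `Avg. NDCG` column next to NDCG@1, NDCG@3, NDCG@5, and NDCG@10, but the diff does not say how the average is formed. The reported values look consistent with a plain arithmetic mean of the four cutoffs. The sketch below checks that reading against a few rows copied from the tables. The helper names `avg_ndcg` and `is_consistent` and the rounding tolerance are assumptions made for illustration, not part of the README.

```python
from statistics import mean

# Rows copied from the tables in this diff:
# (model name, reported Avg. NDCG, [NDCG@1, NDCG@3, NDCG@5, NDCG@10])
ROWS = [
    ("Snowflake/snowflake-arctic-embed-l-v2.0", 0.6592, [0.6118, 0.6542, 0.6759, 0.6949]),
    ("jinaai/jina-embeddings-v3", 0.6293, [0.5800, 0.6254, 0.6456, 0.6665]),
    ("Alibaba-NLP/gte-multilingual-base", 0.6111, [0.5542, 0.6089, 0.6302, 0.6511]),
    ("telepix/PIXIE-Rune-v1.0", 0.5781, [0.5691, 0.5663, 0.5791, 0.5979]),
]


def avg_ndcg(cutoff_scores):
    """Arithmetic mean of the per-cutoff NDCG scores (assumed definition)."""
    return mean(cutoff_scores)


def is_consistent(reported, cutoff_scores, tol=2e-4):
    """True if the reported average matches the mean of the cutoffs.

    The tolerance is an assumption: it allows for the four-decimal
    rounding of the published numbers.
    """
    return abs(avg_ndcg(cutoff_scores) - reported) <= tol


if __name__ == "__main__":
    for name, reported, scores in ROWS:
        status = "consistent" if is_consistent(reported, scores) else "mismatch"
        print(f"{name}: reported={reported:.4f} "
              f"mean={avg_ndcg(scores):.4f} ({status})")
```

A check like this only shows that the published numbers agree with a simple mean within rounding. It does not confirm how the authors actually computed the column.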