Abdelkareem commited on
Commit
da95b58
ยท
verified ยท
1 Parent(s): df3e30b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +123 -1
README.md CHANGED
@@ -18,13 +18,135 @@ pipeline_tag: sentence-similarity
18
  # Zarra Arabic Static Embedding
19
 
20
  This [Model2Vec](https://github.com/MinishLab/model2vec) model is a distilled version of a Sentence Transformer.
 
21
  It uses static embeddings, allowing text embeddings to be computed orders of magnitude faster on both GPU and CPU.
 
22
  It is designed for applications where computational resources are limited or where real-time performance is critical.
23
  Model2Vec models are the smallest, fastest, and most performant static embedders available.
24
  The distilled models are can beup to 50 times smaller and 500 times faster than traditional Sentence Transformers.
25
-
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
26
  ## Installation
27
 
 
28
  Install model2vec using pip:
29
  ```
30
  pip install model2vec
 
18
  # Zarra Arabic Static Embedding
19
 
20
  This [Model2Vec](https://github.com/MinishLab/model2vec) model is a distilled version of a Sentence Transformer.
21
+
22
  It uses static embeddings, allowing text embeddings to be computed orders of magnitude faster on both GPU and CPU.
23
+
24
  It is designed for applications where computational resources are limited or where real-time performance is critical.
25
  Model2Vec models are the smallest, fastest, and most performant static embedders available.
26
  The distilled models are can beup to 50 times smaller and 500 times faster than traditional Sentence Transformers.
27
+ ## Benchmark on Arabic
28
+
29
+ Model Evaluation Summary
30
+ โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”ณโ”โ”โ”โ”โ”โ”โ”โ”โ”ณโ”โ”โ”โ”โ”โ”โ”โ”โ”ณโ”โ”โ”โ”โ”โ”โ”โ”โ”ณโ”โ”โ”โ”โ”โ”โ”โ”โ”ณโ”โ”โ”โ”โ”โ”โ”โ”โ”ณโ”โ”โ”โ”โ”โ”โ”โ”โ”ณโ”โ”โ”โ”โ”โ”โ”โ”โ”ณโ”โ”โ”โ”โ”โ”โ”โ”โ”“
31
+ โ”ƒ Model โ”ƒ Avg โ”ƒ MIRAC โ”ƒ MLQAR โ”ƒ Massi โ”ƒ Multi โ”ƒ STS17 โ”ƒ STS22 โ”ƒ XNLI_ โ”ƒ
32
+ โ”กโ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ•‡โ”โ”โ”โ”โ”โ”โ”โ”โ•‡โ”โ”โ”โ”โ”โ”โ”โ”โ•‡โ”โ”โ”โ”โ”โ”โ”โ”โ•‡โ”โ”โ”โ”โ”โ”โ”โ”โ•‡โ”โ”โ”โ”โ”โ”โ”โ”โ•‡โ”โ”โ”โ”โ”โ”โ”โ”โ•‡โ”โ”โ”โ”โ”โ”โ”โ”โ•‡โ”โ”โ”โ”โ”โ”โ”โ”โ”ฉ
33
+ โ”‚ arabic_triplet_matryoshka_v2 โ”‚ 0.6610 โ”‚ 0.6262 โ”‚ 0.5093 โ”‚ 0.5577 โ”‚ 0.5868 โ”‚ 0.8531 โ”‚ 0.6396 โ”‚ 0.8542 โ”‚
34
+ โ”‚ muffakir_embedding โ”‚ 0.6494 โ”‚ 0.6424 โ”‚ 0.5267 โ”‚ 0.5462 โ”‚ 0.5943 โ”‚ 0.8485 โ”‚ 0.6291 โ”‚ 0.7583 โ”‚
35
+ โ”‚ arabic_retrieval_v1.0 โ”‚ 0.6473 โ”‚ 0.6159 โ”‚ 0.5674 โ”‚ 0.5832 โ”‚ 0.5993 โ”‚ 0.8002 โ”‚ 0.6254 โ”‚ 0.7393 โ”‚
36
+ โ”‚ gate_arabert-v1 โ”‚ 0.6444 โ”‚ 0.5774 โ”‚ 0.4808 โ”‚ 0.5345 โ”‚ 0.5847 โ”‚ 0.8278 โ”‚ 0.6310 โ”‚ 0.8746 โ”‚
37
+ โ”‚ get_multilingual_base โ”‚ 0.6440 โ”‚ 0.7177 โ”‚ 0.5698 โ”‚ 0.5071 โ”‚ 0.5521 โ”‚ 0.7881 โ”‚ 0.6145 โ”‚ 0.7584 โ”‚
38
+ โ”‚ arabic_sts_matryoshka โ”‚ 0.6413 โ”‚ 0.5828 โ”‚ 0.4840 โ”‚ 0.5457 โ”‚ 0.5494 โ”‚ 0.8290 โ”‚ 0.6242 โ”‚ 0.8740 โ”‚
39
+ โ”‚ silma_ai_embedding_sts_v0.1 โ”‚ 0.6138 โ”‚ 0.3799 โ”‚ 0.5011 โ”‚ 0.5600 โ”‚ 0.5749 โ”‚ 0.8559 โ”‚ 0.6122 โ”‚ 0.8125 โ”‚
40
+ โ”‚ Arabic-MiniLM-L12-v2-all-nli-triplet โ”‚ 0.5431 โ”‚ 0.2240 โ”‚ 0.3612 โ”‚ 0.4775 โ”‚ 0.5698 โ”‚ 0.8111 โ”‚ 0.5540 โ”‚ 0.8043 โ”‚
41
+ โ”‚ paraphrase-multilingual-MiniLM-L12-v2 โ”‚ 0.5208 โ”‚ 0.2191 โ”‚ 0.3496 โ”‚ 0.4515 โ”‚ 0.5573 โ”‚ 0.7916 โ”‚ 0.4908 โ”‚ 0.7859 โ”‚
42
+ โ”‚ bojji โ”‚ 0.5177 โ”‚ 0.2941 โ”‚ 0.3989 โ”‚ 0.4667 โ”‚ 0.5433 โ”‚ 0.7233 โ”‚ 0.5880 โ”‚ 0.6094 โ”‚
43
+ โ”‚ zarra โ”‚ 0.4822 โ”‚ 0.2295 โ”‚ 0.3473 โ”‚ 0.4119 โ”‚ 0.5237 โ”‚ 0.6469 โ”‚ 0.6218 โ”‚ 0.5942 โ”‚
44
+ โ”‚ potion-multilingual-128M โ”‚ 0.4699 โ”‚ 0.1658 โ”‚ 0.3150 โ”‚ 0.4285 โ”‚ 0.5338 โ”‚ 0.6511 โ”‚ 0.5951 โ”‚ 0.5999 โ”‚
45
+ โ”‚ all_minilm_l6_v2 โ”‚ 0.2843 โ”‚ 0.0005 โ”‚ 0.0064 โ”‚ 0.1905 โ”‚ 0.4934 โ”‚ 0.5089 โ”‚ 0.2518 โ”‚ 0.5384 โ”‚
46
+ โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ดโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ดโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ดโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ดโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ดโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ดโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ดโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ดโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
47
+
48
+
49
+
50
+ Sorted by STS17_main (Score)
51
+ โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”ณโ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”“
52
+ โ”ƒ Model Name โ”ƒ STS17_main โ”ƒ
53
+ โ”กโ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ•‡โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”ฉ
54
+ โ”‚ silma_ai_embedding_sts_v0.1 โ”‚ 0.856 โ”‚
55
+ โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค
56
+ โ”‚ arabic_triplet_matryoshka_v2 โ”‚ 0.853 โ”‚
57
+ โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค
58
+ โ”‚ muffakir_embedding โ”‚ 0.849 โ”‚
59
+ โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค
60
+ โ”‚ arabic_sts_matryoshka โ”‚ 0.829 โ”‚
61
+ โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค
62
+ โ”‚ gate_arabert-v1 โ”‚ 0.828 โ”‚
63
+ โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค
64
+ โ”‚ Arabic-MiniLM-L12-v2-all-nli-triplet โ”‚ 0.811 โ”‚
65
+ โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค
66
+ โ”‚ arabic_retrieval_v1.0 โ”‚ 0.800 โ”‚
67
+ โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค
68
+ โ”‚ paraphrase-multilingual-MiniLM-L12-v2 โ”‚ 0.792 โ”‚
69
+ โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค
70
+ โ”‚ get_multilingual_base โ”‚ 0.788 โ”‚
71
+ โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค
72
+ โ”‚ bojji โ”‚ 0.723 โ”‚
73
+ โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค
74
+ โ”‚ potion-multilingual-128M โ”‚ 0.651 โ”‚
75
+ โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค
76
+ โ”‚ zarra โ”‚ 0.647 โ”‚
77
+ โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค
78
+ โ”‚ all_minilm_l6_v2 โ”‚ 0.509 โ”‚
79
+ โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ดโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
80
+
81
+ Sorted by STS22.v2_main (Score)
82
+ โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”ณโ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”“
83
+ โ”ƒ Model Name โ”ƒ STS22.v2_main โ”ƒ
84
+ โ”กโ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ•‡โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”ฉ
85
+ โ”‚ arabic_triplet_matryoshka_v2 โ”‚ 0.640 โ”‚
86
+ โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค
87
+ โ”‚ gate_arabert-v1 โ”‚ 0.631 โ”‚
88
+ โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค
89
+ โ”‚ muffakir_embedding โ”‚ 0.629 โ”‚
90
+ โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค
91
+ โ”‚ arabic_retrieval_v1.0 โ”‚ 0.625 โ”‚
92
+ โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค
93
+ โ”‚ arabic_sts_matryoshka โ”‚ 0.624 โ”‚
94
+ โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค
95
+ โ”‚ zarra โ”‚ 0.622 โ”‚
96
+ โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค
97
+ โ”‚ get_multilingual_base โ”‚ 0.615 โ”‚
98
+ โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค
99
+ โ”‚ silma_ai_embedding_sts_v0.1 โ”‚ 0.612 โ”‚
100
+ โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค
101
+ โ”‚ potion-multilingual-128M โ”‚ 0.595 โ”‚
102
+ โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค
103
+ โ”‚ bojji โ”‚ 0.588 โ”‚
104
+ โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค
105
+ โ”‚ Arabic-MiniLM-L12-v2-all-nli-triplet โ”‚ 0.554 โ”‚
106
+ โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค
107
+ โ”‚ paraphrase-multilingual-MiniLM-L12-v2 โ”‚ 0.491 โ”‚
108
+ โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค
109
+ โ”‚ all_minilm_l6_v2 โ”‚ 0.252 โ”‚
110
+ โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ดโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
111
+
112
+
113
+ ## Speed
114
+
115
+ Model Benchmark Results
116
+ โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”ณโ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”ณโ”โ”โ”โ”โ”โ”โ”โ”โ”“
117
+ โ”ƒ Model โ”ƒ Speed (sentences/second) โ”ƒ Device โ”ƒ
118
+ โ”กโ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ•‡โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ•‡โ”โ”โ”โ”โ”โ”โ”โ”โ”ฉ
119
+ โ”‚ zarra โ”‚ 26893.63 โ”‚ cpu โ”‚
120
+ โ”‚ bojji โ”‚ 27478.15 โ”‚ cpu โ”‚
121
+ โ”‚ potion-multilingual-128M โ”‚ 27145.31 โ”‚ cpu โ”‚
122
+ โ”‚ paraphrase-multilingual-MiniLM-L12-v2 โ”‚ 2363.24 โ”‚ cuda โ”‚
123
+ โ”‚ silma_ai_embedding_sts_v0.1 โ”‚ 627.13 โ”‚ cuda โ”‚
124
+ โ”‚ muffakir_embedding โ”‚ 621.77 โ”‚ cuda โ”‚
125
+ โ”‚ get_multilingual_base โ”‚ 895.41 โ”‚ cuda โ”‚
126
+ โ”‚ arabic_retrieval_v1.0 โ”‚ 618.56 โ”‚ cuda โ”‚
127
+ โ”‚ arabic_triplet_matryoshka_v2 โ”‚ 610.64 โ”‚ cuda โ”‚
128
+ โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ดโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ดโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
129
+
130
+ ## Size of the model
131
+
132
+ Model Information Results
133
+ โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”ณโ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”ณโ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”ณโ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”ณโ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”“
134
+ โ”ƒ Model โ”ƒ Parameters (M) โ”ƒ Size (MB) โ”ƒ Relative to Largest (%) โ”ƒ Less than Largest (x) โ”ƒ
135
+ โ”กโ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ•‡โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ•‡โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ•‡โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ•‡โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”ฉ
136
+ โ”‚ zarra โ”‚ 64.00 โ”‚ 244.14 โ”‚ 41.92 โ”‚ 2.39 โ”‚
137
+ โ”‚ bojji โ”‚ 124.88 โ”‚ 476.40 โ”‚ 81.79 โ”‚ 1.22 โ”‚
138
+ โ”‚ potion-multilingual-128M โ”‚ 128.09 โ”‚ 488.63 โ”‚ 83.89 โ”‚ 1.19 โ”‚
139
+ โ”‚ paraphrase-multilingual-MiniLM-โ€ฆ โ”‚ 117.65 โ”‚ 448.82 โ”‚ 77.06 โ”‚ 1.30 โ”‚
140
+ โ”‚ silma_ai_embedding_sts_v0.1 โ”‚ 135.19 โ”‚ 515.72 โ”‚ 88.54 โ”‚ 1.13 โ”‚
141
+ โ”‚ muffakir_embedding โ”‚ 135.19 โ”‚ 515.72 โ”‚ 88.54 โ”‚ 1.13 โ”‚
142
+ โ”‚ arabic_retrieval_v1.0 โ”‚ 135.19 โ”‚ 515.73 โ”‚ 88.54 โ”‚ 1.13 โ”‚
143
+ โ”‚ arabic_triplet_matryoshka_v2 โ”‚ 135.19 โ”‚ 515.72 โ”‚ 88.54 โ”‚ 1.13 โ”‚
144
+ โ”‚ get_multilingual_base โ”‚ 305.37 โ”‚ 582.45 โ”‚ 100.00 โ”‚ 1.00 โ”‚
145
+ โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ดโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ดโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ดโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ดโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
146
+
147
  ## Installation
148
 
149
+
150
  Install model2vec using pip:
151
  ```
152
  pip install model2vec