ashercn97 commited on
Commit
328061d
·
verified ·
1 Parent(s): 68ec585

Upload folder using huggingface_hub

Browse files
Files changed (6) hide show
  1. README.md +2065 -0
  2. config.json +43 -0
  3. model.safetensors +3 -0
  4. special_tokens_map.json +35 -0
  5. tokenizer.json +0 -0
  6. tokenizer_config.json +55 -0
README.md ADDED
@@ -0,0 +1,2065 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model:
3
+ - Salesforce/SFR-Embedding-2_R
4
+ tags:
5
+ - bnb-my-repo
6
+ - mteb
7
+ - sentence-transformers
8
+ - transformers
9
+ model-index:
10
+ - name: Salesforce/SFR-Embedding-2_R
11
+ results:
12
+ - task:
13
+ type: Classification
14
+ dataset:
15
+ type: mteb/amazon_counterfactual
16
+ name: MTEB AmazonCounterfactualClassification (en)
17
+ config: en
18
+ split: test
19
+ revision: e8379541af4e31359cca9fbcf4b00f2671dba205
20
+ metrics:
21
+ - type: accuracy
22
+ value: 92.71641791044776
23
+ - type: ap
24
+ value: 69.47931007147756
25
+ - type: f1
26
+ value: 88.0252625393374
27
+ - task:
28
+ type: Classification
29
+ dataset:
30
+ type: mteb/amazon_polarity
31
+ name: MTEB AmazonPolarityClassification
32
+ config: default
33
+ split: test
34
+ revision: e2d317d38cd51312af73b3d32a06d1a08b442046
35
+ metrics:
36
+ - type: accuracy
37
+ value: 97.31075
38
+ - type: ap
39
+ value: 96.26693923450127
40
+ - type: f1
41
+ value: 97.31042448894502
42
+ - task:
43
+ type: Classification
44
+ dataset:
45
+ type: mteb/amazon_reviews_multi
46
+ name: MTEB AmazonReviewsClassification (en)
47
+ config: en
48
+ split: test
49
+ revision: 1399c76144fd37290681b995c656ef9b2e06e26d
50
+ metrics:
51
+ - type: accuracy
52
+ value: 61.040000000000006
53
+ - type: f1
54
+ value: 60.78646832640785
55
+ - task:
56
+ type: Retrieval
57
+ dataset:
58
+ type: mteb/arguana
59
+ name: MTEB ArguAna
60
+ config: default
61
+ split: test
62
+ revision: c22ab2a51041ffd869aaddef7af8d8215647e41a
63
+ metrics:
64
+ - type: map_at_1
65
+ value: 37.767
66
+ - type: map_at_10
67
+ value: 53.908
68
+ - type: map_at_100
69
+ value: 54.583000000000006
70
+ - type: map_at_1000
71
+ value: 54.583999999999996
72
+ - type: map_at_20
73
+ value: 54.50899999999999
74
+ - type: map_at_3
75
+ value: 49.514
76
+ - type: map_at_5
77
+ value: 52.059999999999995
78
+ - type: mrr_at_1
79
+ value: 38.26458036984353
80
+ - type: mrr_at_10
81
+ value: 54.120408001987066
82
+ - type: mrr_at_100
83
+ value: 54.780719904297406
84
+ - type: mrr_at_1000
85
+ value: 54.78174226698592
86
+ - type: mrr_at_20
87
+ value: 54.706604527160295
88
+ - type: mrr_at_3
89
+ value: 49.71550497866294
90
+ - type: mrr_at_5
91
+ value: 52.247510668563436
92
+ - type: ndcg_at_1
93
+ value: 37.767
94
+ - type: ndcg_at_10
95
+ value: 62.339999999999996
96
+ - type: ndcg_at_100
97
+ value: 64.89399999999999
98
+ - type: ndcg_at_1000
99
+ value: 64.914
100
+ - type: ndcg_at_20
101
+ value: 64.402
102
+ - type: ndcg_at_3
103
+ value: 53.33
104
+ - type: ndcg_at_5
105
+ value: 57.93899999999999
106
+ - type: precision_at_1
107
+ value: 37.767
108
+ - type: precision_at_10
109
+ value: 8.905000000000001
110
+ - type: precision_at_100
111
+ value: 0.9950000000000001
112
+ - type: precision_at_1000
113
+ value: 0.1
114
+ - type: precision_at_20
115
+ value: 4.8469999999999995
116
+ - type: precision_at_3
117
+ value: 21.456
118
+ - type: precision_at_5
119
+ value: 15.121
120
+ - type: recall_at_1
121
+ value: 37.767
122
+ - type: recall_at_10
123
+ value: 89.047
124
+ - type: recall_at_100
125
+ value: 99.502
126
+ - type: recall_at_1000
127
+ value: 99.644
128
+ - type: recall_at_20
129
+ value: 96.942
130
+ - type: recall_at_3
131
+ value: 64.36699999999999
132
+ - type: recall_at_5
133
+ value: 75.605
134
+ - task:
135
+ type: Clustering
136
+ dataset:
137
+ type: mteb/arxiv-clustering-p2p
138
+ name: MTEB ArxivClusteringP2P
139
+ config: default
140
+ split: test
141
+ revision: a122ad7f3f0291bf49cc6f4d32aa80929df69d5d
142
+ metrics:
143
+ - type: v_measure
144
+ value: 54.024325012036314
145
+ - task:
146
+ type: Clustering
147
+ dataset:
148
+ type: mteb/arxiv-clustering-s2s
149
+ name: MTEB ArxivClusteringS2S
150
+ config: default
151
+ split: test
152
+ revision: f910caf1a6075f7329cdf8c1a6135696f37dbd53
153
+ metrics:
154
+ - type: v_measure
155
+ value: 48.817300846601675
156
+ - task:
157
+ type: Reranking
158
+ dataset:
159
+ type: mteb/askubuntudupquestions-reranking
160
+ name: MTEB AskUbuntuDupQuestions
161
+ config: default
162
+ split: test
163
+ revision: 2000358ca161889fa9c082cb41daa8dcfb161a54
164
+ metrics:
165
+ - type: map
166
+ value: 66.71478959728732
167
+ - type: mrr
168
+ value: 79.07202216066482
169
+ - task:
170
+ type: STS
171
+ dataset:
172
+ type: mteb/biosses-sts
173
+ name: MTEB BIOSSES
174
+ config: default
175
+ split: test
176
+ revision: d3fb88f8f02e40887cd149695127462bbcf29b4a
177
+ metrics:
178
+ - type: cos_sim_pearson
179
+ value: 88.79517914982239
180
+ - type: cos_sim_spearman
181
+ value: 87.60440576436838
182
+ - type: euclidean_pearson
183
+ value: 87.75596873521118
184
+ - type: euclidean_spearman
185
+ value: 87.60440576436838
186
+ - type: manhattan_pearson
187
+ value: 87.74113773865973
188
+ - type: manhattan_spearman
189
+ value: 87.50560833247899
190
+ - task:
191
+ type: Classification
192
+ dataset:
193
+ type: mteb/banking77
194
+ name: MTEB Banking77Classification
195
+ config: default
196
+ split: test
197
+ revision: 0fd18e25b25c072e09e0d92ab615fda904d66300
198
+ metrics:
199
+ - type: accuracy
200
+ value: 90.02272727272727
201
+ - type: f1
202
+ value: 89.96681880265936
203
+ - task:
204
+ type: Clustering
205
+ dataset:
206
+ type: mteb/biorxiv-clustering-p2p
207
+ name: MTEB BiorxivClusteringP2P
208
+ config: default
209
+ split: test
210
+ revision: 65b79d1d13f80053f67aca9498d9402c2d9f1f40
211
+ metrics:
212
+ - type: v_measure
213
+ value: 50.75930389699286
214
+ - task:
215
+ type: Clustering
216
+ dataset:
217
+ type: mteb/biorxiv-clustering-s2s
218
+ name: MTEB BiorxivClusteringS2S
219
+ config: default
220
+ split: test
221
+ revision: 258694dd0231531bc1fd9de6ceb52a0853c6d908
222
+ metrics:
223
+ - type: v_measure
224
+ value: 46.57286439805565
225
+ - task:
226
+ type: Retrieval
227
+ dataset:
228
+ type: mteb/cqadupstack
229
+ name: MTEB CQADupstackRetrieval
230
+ config: default
231
+ split: test
232
+ revision: 65ac3a16b8e91f9cee4c9828cc7c335575432a2a
233
+ metrics:
234
+ - type: map_at_1
235
+ value: 28.056666666666665
236
+ - type: map_at_10
237
+ value: 39.61749999999999
238
+ - type: map_at_100
239
+ value: 41.00666666666666
240
+ - type: map_at_1000
241
+ value: 41.11358333333334
242
+ - type: map_at_20
243
+ value: 40.410250000000005
244
+ - type: map_at_3
245
+ value: 35.98591666666667
246
+ - type: map_at_5
247
+ value: 38.02
248
+ - type: mrr_at_1
249
+ value: 33.73950708467142
250
+ - type: mrr_at_10
251
+ value: 44.0987162763402
252
+ - type: mrr_at_100
253
+ value: 44.94302678553521
254
+ - type: mrr_at_1000
255
+ value: 44.98758207055161
256
+ - type: mrr_at_20
257
+ value: 44.61156907536121
258
+ - type: mrr_at_3
259
+ value: 41.247253732468415
260
+ - type: mrr_at_5
261
+ value: 42.84859071071954
262
+ - type: ndcg_at_1
263
+ value: 33.739666666666665
264
+ - type: ndcg_at_10
265
+ value: 46.10683333333334
266
+ - type: ndcg_at_100
267
+ value: 51.49275000000001
268
+ - type: ndcg_at_1000
269
+ value: 53.2585
270
+ - type: ndcg_at_20
271
+ value: 48.349
272
+ - type: ndcg_at_3
273
+ value: 40.12416666666667
274
+ - type: ndcg_at_5
275
+ value: 42.94783333333333
276
+ - type: precision_at_1
277
+ value: 33.739666666666665
278
+ - type: precision_at_10
279
+ value: 8.46025
280
+ - type: precision_at_100
281
+ value: 1.3215833333333333
282
+ - type: precision_at_1000
283
+ value: 0.16524999999999998
284
+ - type: precision_at_20
285
+ value: 4.9935833333333335
286
+ - type: precision_at_3
287
+ value: 19.00516666666667
288
+ - type: precision_at_5
289
+ value: 13.72141666666667
290
+ - type: recall_at_1
291
+ value: 28.056666666666665
292
+ - type: recall_at_10
293
+ value: 60.68825000000001
294
+ - type: recall_at_100
295
+ value: 83.74433333333334
296
+ - type: recall_at_1000
297
+ value: 95.62299999999999
298
+ - type: recall_at_20
299
+ value: 68.77641666666668
300
+ - type: recall_at_3
301
+ value: 44.06991666666667
302
+ - type: recall_at_5
303
+ value: 51.324999999999996
304
+ - task:
305
+ type: Retrieval
306
+ dataset:
307
+ type: mteb/climate-fever
308
+ name: MTEB ClimateFEVER
309
+ config: default
310
+ split: test
311
+ revision: 47f2ac6acb640fc46020b02a5b59fdda04d39380
312
+ metrics:
313
+ - type: map_at_1
314
+ value: 15.609
315
+ - type: map_at_10
316
+ value: 25.584
317
+ - type: map_at_100
318
+ value: 27.291999999999998
319
+ - type: map_at_1000
320
+ value: 27.471
321
+ - type: map_at_20
322
+ value: 26.497
323
+ - type: map_at_3
324
+ value: 21.61
325
+ - type: map_at_5
326
+ value: 23.76
327
+ - type: mrr_at_1
328
+ value: 34.98371335504886
329
+ - type: mrr_at_10
330
+ value: 45.73747479447807
331
+ - type: mrr_at_100
332
+ value: 46.4973410206458
333
+ - type: mrr_at_1000
334
+ value: 46.53372527933685
335
+ - type: mrr_at_20
336
+ value: 46.19978503202757
337
+ - type: mrr_at_3
338
+ value: 42.85559174809991
339
+ - type: mrr_at_5
340
+ value: 44.65038002171556
341
+ - type: ndcg_at_1
342
+ value: 34.984
343
+ - type: ndcg_at_10
344
+ value: 34.427
345
+ - type: ndcg_at_100
346
+ value: 40.908
347
+ - type: ndcg_at_1000
348
+ value: 44.118
349
+ - type: ndcg_at_20
350
+ value: 36.885
351
+ - type: ndcg_at_3
352
+ value: 29.09
353
+ - type: ndcg_at_5
354
+ value: 30.979
355
+ - type: precision_at_1
356
+ value: 34.984
357
+ - type: precision_at_10
358
+ value: 10.476
359
+ - type: precision_at_100
360
+ value: 1.748
361
+ - type: precision_at_1000
362
+ value: 0.23500000000000001
363
+ - type: precision_at_20
364
+ value: 6.313000000000001
365
+ - type: precision_at_3
366
+ value: 21.39
367
+ - type: precision_at_5
368
+ value: 16.378
369
+ - type: recall_at_1
370
+ value: 15.609
371
+ - type: recall_at_10
372
+ value: 39.619
373
+ - type: recall_at_100
374
+ value: 61.952
375
+ - type: recall_at_1000
376
+ value: 79.861
377
+ - type: recall_at_20
378
+ value: 46.489000000000004
379
+ - type: recall_at_3
380
+ value: 26.134
381
+ - type: recall_at_5
382
+ value: 31.955
383
+ - task:
384
+ type: Retrieval
385
+ dataset:
386
+ type: mteb/dbpedia
387
+ name: MTEB DBPedia
388
+ config: default
389
+ split: test
390
+ revision: c0f706b76e590d620bd6618b3ca8efdd34e2d659
391
+ metrics:
392
+ - type: map_at_1
393
+ value: 10.482
394
+ - type: map_at_10
395
+ value: 25.155
396
+ - type: map_at_100
397
+ value: 36.606
398
+ - type: map_at_1000
399
+ value: 38.617000000000004
400
+ - type: map_at_20
401
+ value: 29.676000000000002
402
+ - type: map_at_3
403
+ value: 16.881
404
+ - type: map_at_5
405
+ value: 20.043
406
+ - type: mrr_at_1
407
+ value: 76.0
408
+ - type: mrr_at_10
409
+ value: 82.5610119047619
410
+ - type: mrr_at_100
411
+ value: 82.74795937825128
412
+ - type: mrr_at_1000
413
+ value: 82.75526942226163
414
+ - type: mrr_at_20
415
+ value: 82.70580357142858
416
+ - type: mrr_at_3
417
+ value: 81.41666666666667
418
+ - type: mrr_at_5
419
+ value: 82.26666666666667
420
+ - type: ndcg_at_1
421
+ value: 63.625
422
+ - type: ndcg_at_10
423
+ value: 51.214000000000006
424
+ - type: ndcg_at_100
425
+ value: 56.411
426
+ - type: ndcg_at_1000
427
+ value: 63.429
428
+ - type: ndcg_at_20
429
+ value: 50.595
430
+ - type: ndcg_at_3
431
+ value: 54.989
432
+ - type: ndcg_at_5
433
+ value: 52.589
434
+ - type: precision_at_1
435
+ value: 76.0
436
+ - type: precision_at_10
437
+ value: 41.975
438
+ - type: precision_at_100
439
+ value: 13.26
440
+ - type: precision_at_1000
441
+ value: 2.493
442
+ - type: precision_at_20
443
+ value: 32.15
444
+ - type: precision_at_3
445
+ value: 59.0
446
+ - type: precision_at_5
447
+ value: 51.24999999999999
448
+ - type: recall_at_1
449
+ value: 10.482
450
+ - type: recall_at_10
451
+ value: 31.075000000000003
452
+ - type: recall_at_100
453
+ value: 63.119
454
+ - type: recall_at_1000
455
+ value: 85.32300000000001
456
+ - type: recall_at_20
457
+ value: 40.345
458
+ - type: recall_at_3
459
+ value: 17.916
460
+ - type: recall_at_5
461
+ value: 22.475
462
+ - task:
463
+ type: Classification
464
+ dataset:
465
+ type: mteb/emotion
466
+ name: MTEB EmotionClassification
467
+ config: default
468
+ split: test
469
+ revision: 4f58c6b202a23cf9a4da393831edf4f9183cad37
470
+ metrics:
471
+ - type: accuracy
472
+ value: 93.36500000000001
473
+ - type: f1
474
+ value: 89.89541440183861
475
+ - task:
476
+ type: Retrieval
477
+ dataset:
478
+ type: mteb/fever
479
+ name: MTEB FEVER
480
+ config: default
481
+ split: test
482
+ revision: bea83ef9e8fb933d90a2f1d5515737465d613e12
483
+ metrics:
484
+ - type: map_at_1
485
+ value: 81.948
486
+ - type: map_at_10
487
+ value: 89.47500000000001
488
+ - type: map_at_100
489
+ value: 89.66199999999999
490
+ - type: map_at_1000
491
+ value: 89.671
492
+ - type: map_at_20
493
+ value: 89.582
494
+ - type: map_at_3
495
+ value: 88.646
496
+ - type: map_at_5
497
+ value: 89.19
498
+ - type: mrr_at_1
499
+ value: 88.23882388238825
500
+ - type: mrr_at_10
501
+ value: 93.2122736083131
502
+ - type: mrr_at_100
503
+ value: 93.23908769526588
504
+ - type: mrr_at_1000
505
+ value: 93.23932393435209
506
+ - type: mrr_at_20
507
+ value: 93.23217832106207
508
+ - type: mrr_at_3
509
+ value: 92.98679867986787
510
+ - type: mrr_at_5
511
+ value: 93.16906690669056
512
+ - type: ndcg_at_1
513
+ value: 88.239
514
+ - type: ndcg_at_10
515
+ value: 92.155
516
+ - type: ndcg_at_100
517
+ value: 92.735
518
+ - type: ndcg_at_1000
519
+ value: 92.866
520
+ - type: ndcg_at_20
521
+ value: 92.39699999999999
522
+ - type: ndcg_at_3
523
+ value: 91.188
524
+ - type: ndcg_at_5
525
+ value: 91.754
526
+ - type: precision_at_1
527
+ value: 88.239
528
+ - type: precision_at_10
529
+ value: 10.903
530
+ - type: precision_at_100
531
+ value: 1.147
532
+ - type: precision_at_1000
533
+ value: 0.117
534
+ - type: precision_at_20
535
+ value: 5.5440000000000005
536
+ - type: precision_at_3
537
+ value: 34.598
538
+ - type: precision_at_5
539
+ value: 21.302
540
+ - type: recall_at_1
541
+ value: 81.948
542
+ - type: recall_at_10
543
+ value: 96.518
544
+ - type: recall_at_100
545
+ value: 98.646
546
+ - type: recall_at_1000
547
+ value: 99.399
548
+ - type: recall_at_20
549
+ value: 97.262
550
+ - type: recall_at_3
551
+ value: 93.89800000000001
552
+ - type: recall_at_5
553
+ value: 95.38600000000001
554
+ - task:
555
+ type: Retrieval
556
+ dataset:
557
+ type: mteb/fiqa
558
+ name: MTEB FiQA2018
559
+ config: default
560
+ split: test
561
+ revision: 27a168819829fe9bcd655c2df245fb19452e8e06
562
+ metrics:
563
+ - type: map_at_1
564
+ value: 32.033
565
+ - type: map_at_10
566
+ value: 53.55
567
+ - type: map_at_100
568
+ value: 55.672
569
+ - type: map_at_1000
570
+ value: 55.764
571
+ - type: map_at_20
572
+ value: 54.87800000000001
573
+ - type: map_at_3
574
+ value: 46.761
575
+ - type: map_at_5
576
+ value: 50.529
577
+ - type: mrr_at_1
578
+ value: 60.95679012345679
579
+ - type: mrr_at_10
580
+ value: 68.70835782872815
581
+ - type: mrr_at_100
582
+ value: 69.21918402444501
583
+ - type: mrr_at_1000
584
+ value: 69.23608783148705
585
+ - type: mrr_at_20
586
+ value: 69.07497388036454
587
+ - type: mrr_at_3
588
+ value: 66.76954732510285
589
+ - type: mrr_at_5
590
+ value: 67.95781893004109
591
+ - type: ndcg_at_1
592
+ value: 60.956999999999994
593
+ - type: ndcg_at_10
594
+ value: 61.766
595
+ - type: ndcg_at_100
596
+ value: 67.652
597
+ - type: ndcg_at_1000
598
+ value: 68.94500000000001
599
+ - type: ndcg_at_20
600
+ value: 64.48700000000001
601
+ - type: ndcg_at_3
602
+ value: 57.25
603
+ - type: ndcg_at_5
604
+ value: 58.64
605
+ - type: precision_at_1
606
+ value: 60.956999999999994
607
+ - type: precision_at_10
608
+ value: 17.083000000000002
609
+ - type: precision_at_100
610
+ value: 2.346
611
+ - type: precision_at_1000
612
+ value: 0.257
613
+ - type: precision_at_20
614
+ value: 9.807
615
+ - type: precision_at_3
616
+ value: 38.477
617
+ - type: precision_at_5
618
+ value: 27.962999999999997
619
+ - type: recall_at_1
620
+ value: 32.033
621
+ - type: recall_at_10
622
+ value: 69.44
623
+ - type: recall_at_100
624
+ value: 90.17500000000001
625
+ - type: recall_at_1000
626
+ value: 97.90100000000001
627
+ - type: recall_at_20
628
+ value: 77.629
629
+ - type: recall_at_3
630
+ value: 51.664
631
+ - type: recall_at_5
632
+ value: 59.565
633
+ - task:
634
+ type: Retrieval
635
+ dataset:
636
+ type: mteb/hotpotqa
637
+ name: MTEB HotpotQA
638
+ config: default
639
+ split: test
640
+ revision: ab518f4d6fcca38d87c25209f94beba119d02014
641
+ metrics:
642
+ - type: map_at_1
643
+ value: 42.741
644
+ - type: map_at_10
645
+ value: 74.811
646
+ - type: map_at_100
647
+ value: 75.508
648
+ - type: map_at_1000
649
+ value: 75.541
650
+ - type: map_at_20
651
+ value: 75.25699999999999
652
+ - type: map_at_3
653
+ value: 71.31
654
+ - type: map_at_5
655
+ value: 73.69
656
+ - type: mrr_at_1
657
+ value: 85.48278190411884
658
+ - type: mrr_at_10
659
+ value: 90.20347684425987
660
+ - type: mrr_at_100
661
+ value: 90.29734129342121
662
+ - type: mrr_at_1000
663
+ value: 90.30017606259217
664
+ - type: mrr_at_20
665
+ value: 90.27225310310567
666
+ - type: mrr_at_3
667
+ value: 89.67364393427842
668
+ - type: mrr_at_5
669
+ value: 90.02408282691847
670
+ - type: ndcg_at_1
671
+ value: 85.483
672
+ - type: ndcg_at_10
673
+ value: 81.361
674
+ - type: ndcg_at_100
675
+ value: 83.588
676
+ - type: ndcg_at_1000
677
+ value: 84.19
678
+ - type: ndcg_at_20
679
+ value: 82.42699999999999
680
+ - type: ndcg_at_3
681
+ value: 76.779
682
+ - type: ndcg_at_5
683
+ value: 79.581
684
+ - type: precision_at_1
685
+ value: 85.483
686
+ - type: precision_at_10
687
+ value: 17.113
688
+ - type: precision_at_100
689
+ value: 1.882
690
+ - type: precision_at_1000
691
+ value: 0.196
692
+ - type: precision_at_20
693
+ value: 8.899
694
+ - type: precision_at_3
695
+ value: 50.397999999999996
696
+ - type: precision_at_5
697
+ value: 32.443
698
+ - type: recall_at_1
699
+ value: 42.741
700
+ - type: recall_at_10
701
+ value: 85.564
702
+ - type: recall_at_100
703
+ value: 94.07799999999999
704
+ - type: recall_at_1000
705
+ value: 97.995
706
+ - type: recall_at_20
707
+ value: 88.98700000000001
708
+ - type: recall_at_3
709
+ value: 75.598
710
+ - type: recall_at_5
711
+ value: 81.107
712
+ - task:
713
+ type: Classification
714
+ dataset:
715
+ type: mteb/imdb
716
+ name: MTEB ImdbClassification
717
+ config: default
718
+ split: test
719
+ revision: 3d86128a09e091d6018b6d26cad27f2739fc2db7
720
+ metrics:
721
+ - type: accuracy
722
+ value: 96.80320000000002
723
+ - type: ap
724
+ value: 94.98856145360044
725
+ - type: f1
726
+ value: 96.80287885839178
727
+ - task:
728
+ type: Retrieval
729
+ dataset:
730
+ type: mteb/msmarco
731
+ name: MTEB MSMARCO
732
+ config: default
733
+ split: dev
734
+ revision: c5a29a104738b98a9e76336939199e264163d4a0
735
+ metrics:
736
+ - type: map_at_1
737
+ value: 22.539
738
+ - type: map_at_10
739
+ value: 35.109
740
+ - type: map_at_100
741
+ value: 36.287000000000006
742
+ - type: map_at_1000
743
+ value: 36.335
744
+ - type: map_at_20
745
+ value: 35.838
746
+ - type: map_at_3
747
+ value: 31.11
748
+ - type: map_at_5
749
+ value: 33.455
750
+ - type: mrr_at_1
751
+ value: 23.15186246418338
752
+ - type: mrr_at_10
753
+ value: 35.70532018920268
754
+ - type: mrr_at_100
755
+ value: 36.815167506137584
756
+ - type: mrr_at_1000
757
+ value: 36.85695349443505
758
+ - type: mrr_at_20
759
+ value: 36.39500867880642
760
+ - type: mrr_at_3
761
+ value: 31.81232091690535
762
+ - type: mrr_at_5
763
+ value: 34.096704871060155
764
+ - type: ndcg_at_1
765
+ value: 23.152
766
+ - type: ndcg_at_10
767
+ value: 42.181999999999995
768
+ - type: ndcg_at_100
769
+ value: 47.847
770
+ - type: ndcg_at_1000
771
+ value: 48.988
772
+ - type: ndcg_at_20
773
+ value: 44.767
774
+ - type: ndcg_at_3
775
+ value: 34.088
776
+ - type: ndcg_at_5
777
+ value: 38.257999999999996
778
+ - type: precision_at_1
779
+ value: 23.152
780
+ - type: precision_at_10
781
+ value: 6.678000000000001
782
+ - type: precision_at_100
783
+ value: 0.9530000000000001
784
+ - type: precision_at_1000
785
+ value: 0.105
786
+ - type: precision_at_20
787
+ value: 3.881
788
+ - type: precision_at_3
789
+ value: 14.518
790
+ - type: precision_at_5
791
+ value: 10.831
792
+ - type: recall_at_1
793
+ value: 22.539
794
+ - type: recall_at_10
795
+ value: 63.965
796
+ - type: recall_at_100
797
+ value: 90.129
798
+ - type: recall_at_1000
799
+ value: 98.721
800
+ - type: recall_at_20
801
+ value: 74.00999999999999
802
+ - type: recall_at_3
803
+ value: 42.004999999999995
804
+ - type: recall_at_5
805
+ value: 52.028
806
+ - task:
807
+ type: Classification
808
+ dataset:
809
+ type: mteb/mtop_domain
810
+ name: MTEB MTOPDomainClassification (en)
811
+ config: en
812
+ split: test
813
+ revision: d80d48c1eb48d3562165c59d59d0034df9fff0bf
814
+ metrics:
815
+ - type: accuracy
816
+ value: 98.5750113999088
817
+ - type: f1
818
+ value: 98.41576079230245
819
+ - task:
820
+ type: Classification
821
+ dataset:
822
+ type: mteb/mtop_intent
823
+ name: MTEB MTOPIntentClassification (en)
824
+ config: en
825
+ split: test
826
+ revision: ae001d0e6b1228650b7bd1c2c65fb50ad11a8aba
827
+ metrics:
828
+ - type: accuracy
829
+ value: 91.29502963976289
830
+ - type: f1
831
+ value: 74.84400169335184
832
+ - task:
833
+ type: Classification
834
+ dataset:
835
+ type: mteb/amazon_massive_intent
836
+ name: MTEB MassiveIntentClassification (en)
837
+ config: en
838
+ split: test
839
+ revision: 31efe3c427b0bae9c22cbb560b8f15491cc6bed7
840
+ metrics:
841
+ - type: accuracy
842
+ value: 85.96839273705447
843
+ - type: f1
844
+ value: 82.43129186593926
845
+ - task:
846
+ type: Classification
847
+ dataset:
848
+ type: mteb/amazon_massive_scenario
849
+ name: MTEB MassiveScenarioClassification (en)
850
+ config: en
851
+ split: test
852
+ revision: 7d571f92784cd94a019292a1f45445077d0ef634
853
+ metrics:
854
+ - type: accuracy
855
+ value: 90.60860793544047
856
+ - type: f1
857
+ value: 89.79415994859477
858
+ - task:
859
+ type: Clustering
860
+ dataset:
861
+ type: mteb/medrxiv-clustering-p2p
862
+ name: MTEB MedrxivClusteringP2P
863
+ config: default
864
+ split: test
865
+ revision: e7a26af6f3ae46b30dde8737f02c07b1505bcc73
866
+ metrics:
867
+ - type: v_measure
868
+ value: 46.661892807041355
869
+ - task:
870
+ type: Clustering
871
+ dataset:
872
+ type: mteb/medrxiv-clustering-s2s
873
+ name: MTEB MedrxivClusteringS2S
874
+ config: default
875
+ split: test
876
+ revision: 35191c8c0dca72d8ff3efcd72aa802307d469663
877
+ metrics:
878
+ - type: v_measure
879
+ value: 44.17598473858937
880
+ - task:
881
+ type: Reranking
882
+ dataset:
883
+ type: mteb/mind_small
884
+ name: MTEB MindSmallReranking
885
+ config: default
886
+ split: test
887
+ revision: 59042f120c80e8afa9cdbb224f67076cec0fc9a7
888
+ metrics:
889
+ - type: map
890
+ value: 31.260919294024603
891
+ - type: mrr
892
+ value: 32.37049108835034
893
+ - task:
894
+ type: Retrieval
895
+ dataset:
896
+ type: mteb/nfcorpus
897
+ name: MTEB NFCorpus
898
+ config: default
899
+ split: test
900
+ revision: ec0fa4fe99da2ff19ca1214b7966684033a58814
901
+ metrics:
902
+ - type: map_at_1
903
+ value: 6.672000000000001
904
+ - type: map_at_10
905
+ value: 15.972
906
+ - type: map_at_100
907
+ value: 20.94
908
+ - type: map_at_1000
909
+ value: 22.877
910
+ - type: map_at_20
911
+ value: 17.986
912
+ - type: map_at_3
913
+ value: 11.161
914
+ - type: map_at_5
915
+ value: 13.293
916
+ - type: mrr_at_1
917
+ value: 53.56037151702786
918
+ - type: mrr_at_10
919
+ value: 61.915696103002595
920
+ - type: mrr_at_100
921
+ value: 62.4130902631107
922
+ - type: mrr_at_1000
923
+ value: 62.45228087711845
924
+ - type: mrr_at_20
925
+ value: 62.1983715004112
926
+ - type: mrr_at_3
927
+ value: 60.31991744066049
928
+ - type: mrr_at_5
929
+ value: 61.27966976264191
930
+ - type: ndcg_at_1
931
+ value: 50.929
932
+ - type: ndcg_at_10
933
+ value: 41.336
934
+ - type: ndcg_at_100
935
+ value: 38.586999999999996
936
+ - type: ndcg_at_1000
937
+ value: 48.155
938
+ - type: ndcg_at_20
939
+ value: 38.888
940
+ - type: ndcg_at_3
941
+ value: 47.0
942
+ - type: ndcg_at_5
943
+ value: 44.335
944
+ - type: precision_at_1
945
+ value: 53.251000000000005
946
+ - type: precision_at_10
947
+ value: 31.146
948
+ - type: precision_at_100
949
+ value: 10.040000000000001
950
+ - type: precision_at_1000
951
+ value: 2.432
952
+ - type: precision_at_20
953
+ value: 23.421
954
+ - type: precision_at_3
955
+ value: 45.098
956
+ - type: precision_at_5
957
+ value: 39.071
958
+ - type: recall_at_1
959
+ value: 6.672000000000001
960
+ - type: recall_at_10
961
+ value: 20.764
962
+ - type: recall_at_100
963
+ value: 40.759
964
+ - type: recall_at_1000
965
+ value: 75.015
966
+ - type: recall_at_20
967
+ value: 25.548
968
+ - type: recall_at_3
969
+ value: 12.328
970
+ - type: recall_at_5
971
+ value: 15.601999999999999
972
+ - task:
973
+ type: Retrieval
974
+ dataset:
975
+ type: mteb/nq
976
+ name: MTEB NQ
977
+ config: default
978
+ split: test
979
+ revision: b774495ed302d8c44a3a7ea25c90dbce03968f31
980
+ metrics:
981
+ - type: map_at_1
982
+ value: 50.944
983
+ - type: map_at_10
984
+ value: 67.565
985
+ - type: map_at_100
986
+ value: 68.10300000000001
987
+ - type: map_at_1000
988
+ value: 68.109
989
+ - type: map_at_20
990
+ value: 67.973
991
+ - type: map_at_3
992
+ value: 64.176
993
+ - type: map_at_5
994
+ value: 66.39699999999999
995
+ - type: mrr_at_1
996
+ value: 57.01042873696408
997
+ - type: mrr_at_10
998
+ value: 69.76629605105849
999
+ - type: mrr_at_100
1000
+ value: 70.09927347130204
1001
+ - type: mrr_at_1000
1002
+ value: 70.10309675839956
1003
+ - type: mrr_at_20
1004
+ value: 70.02288627712392
1005
+ - type: mrr_at_3
1006
+ value: 67.46813441483191
1007
+ - type: mrr_at_5
1008
+ value: 68.93105446118189
1009
+ - type: ndcg_at_1
1010
+ value: 57.010000000000005
1011
+ - type: ndcg_at_10
1012
+ value: 73.956
1013
+ - type: ndcg_at_100
1014
+ value: 75.90299999999999
1015
+ - type: ndcg_at_1000
1016
+ value: 76.03999999999999
1017
+ - type: ndcg_at_20
1018
+ value: 75.17
1019
+ - type: ndcg_at_3
1020
+ value: 68.13900000000001
1021
+ - type: ndcg_at_5
1022
+ value: 71.532
1023
+ - type: precision_at_1
1024
+ value: 57.010000000000005
1025
+ - type: precision_at_10
1026
+ value: 10.91
1027
+ - type: precision_at_100
1028
+ value: 1.2
1029
+ - type: precision_at_1000
1030
+ value: 0.121
1031
+ - type: precision_at_20
1032
+ value: 5.753
1033
+ - type: precision_at_3
1034
+ value: 29.828
1035
+ - type: precision_at_5
1036
+ value: 19.971
1037
+ - type: recall_at_1
1038
+ value: 50.944
1039
+ - type: recall_at_10
1040
+ value: 90.754
1041
+ - type: recall_at_100
1042
+ value: 98.699
1043
+ - type: recall_at_1000
1044
+ value: 99.701
1045
+ - type: recall_at_20
1046
+ value: 95.148
1047
+ - type: recall_at_3
1048
+ value: 76.224
1049
+ - type: recall_at_5
1050
+ value: 83.872
1051
+ - task:
1052
+ type: Retrieval
1053
+ dataset:
1054
+ type: mteb/quora
1055
+ name: MTEB QuoraRetrieval
1056
+ config: default
1057
+ split: test
1058
+ revision: e4e08e0b7dbe3c8700f0daef558ff32256715259
1059
+ metrics:
1060
+ - type: map_at_1
1061
+ value: 71.856
1062
+ - type: map_at_10
1063
+ value: 86.077
1064
+ - type: map_at_100
1065
+ value: 86.696
1066
+ - type: map_at_1000
1067
+ value: 86.708
1068
+ - type: map_at_20
1069
+ value: 86.493
1070
+ - type: map_at_3
1071
+ value: 83.176
1072
+ - type: map_at_5
1073
+ value: 85.008
1074
+ - type: mrr_at_1
1075
+ value: 82.74000000000001
1076
+ - type: mrr_at_10
1077
+ value: 88.68947222222207
1078
+ - type: mrr_at_100
1079
+ value: 88.78196949571182
1080
+ - type: mrr_at_1000
1081
+ value: 88.78223256200576
1082
+ - type: mrr_at_20
1083
+ value: 88.76455636228219
1084
+ - type: mrr_at_3
1085
+ value: 87.85833333333316
1086
+ - type: mrr_at_5
1087
+ value: 88.43933333333311
1088
+ - type: ndcg_at_1
1089
+ value: 82.74000000000001
1090
+ - type: ndcg_at_10
1091
+ value: 89.583
1092
+ - type: ndcg_at_100
1093
+ value: 90.652
1094
+ - type: ndcg_at_1000
1095
+ value: 90.711
1096
+ - type: ndcg_at_20
1097
+ value: 90.203
1098
+ - type: ndcg_at_3
1099
+ value: 86.967
1100
+ - type: ndcg_at_5
1101
+ value: 88.43299999999999
1102
+ - type: precision_at_1
1103
+ value: 82.74000000000001
1104
+ - type: precision_at_10
1105
+ value: 13.617
1106
+ - type: precision_at_100
1107
+ value: 1.542
1108
+ - type: precision_at_1000
1109
+ value: 0.157
1110
+ - type: precision_at_20
1111
+ value: 7.217999999999999
1112
+ - type: precision_at_3
1113
+ value: 38.163000000000004
1114
+ - type: precision_at_5
1115
+ value: 25.05
1116
+ - type: recall_at_1
1117
+ value: 71.856
1118
+ - type: recall_at_10
1119
+ value: 96.244
1120
+ - type: recall_at_100
1121
+ value: 99.773
1122
+ - type: recall_at_1000
1123
+ value: 99.99900000000001
1124
+ - type: recall_at_20
1125
+ value: 98.221
1126
+ - type: recall_at_3
1127
+ value: 88.715
1128
+ - type: recall_at_5
1129
+ value: 92.88499999999999
1130
+ - task:
1131
+ type: Clustering
1132
+ dataset:
1133
+ type: mteb/reddit-clustering
1134
+ name: MTEB RedditClustering
1135
+ config: default
1136
+ split: test
1137
+ revision: 24640382cdbf8abc73003fb0fa6d111a705499eb
1138
+ metrics:
1139
+ - type: v_measure
1140
+ value: 62.91969510127886
1141
+ - task:
1142
+ type: Clustering
1143
+ dataset:
1144
+ type: mteb/reddit-clustering-p2p
1145
+ name: MTEB RedditClusteringP2P
1146
+ config: default
1147
+ split: test
1148
+ revision: 385e3cb46b4cfa89021f56c4380204149d0efe33
1149
+ metrics:
1150
+ - type: v_measure
1151
+ value: 72.74201090913765
1152
+ - task:
1153
+ type: Retrieval
1154
+ dataset:
1155
+ type: mteb/scidocs
1156
+ name: MTEB SCIDOCS
1157
+ config: default
1158
+ split: test
1159
+ revision: f8c2fcf00f625baaa80f62ec5bd9e1fff3b8ae88
1160
+ metrics:
1161
+ - type: map_at_1
1162
+ value: 5.8229999999999995
1163
+ - type: map_at_10
1164
+ value: 15.152
1165
+ - type: map_at_100
1166
+ value: 17.936
1167
+ - type: map_at_1000
1168
+ value: 18.292
1169
+ - type: map_at_20
1170
+ value: 16.526
1171
+ - type: map_at_3
1172
+ value: 10.294
1173
+ - type: map_at_5
1174
+ value: 12.794
1175
+ - type: mrr_at_1
1176
+ value: 28.599999999999998
1177
+ - type: mrr_at_10
1178
+ value: 40.68206349206347
1179
+ - type: mrr_at_100
1180
+ value: 41.673752995361795
1181
+ - type: mrr_at_1000
1182
+ value: 41.71500072915374
1183
+ - type: mrr_at_20
1184
+ value: 41.28552805166964
1185
+ - type: mrr_at_3
1186
+ value: 36.84999999999998
1187
+ - type: mrr_at_5
1188
+ value: 39.19999999999995
1189
+ - type: ndcg_at_1
1190
+ value: 28.599999999999998
1191
+ - type: ndcg_at_10
1192
+ value: 24.866
1193
+ - type: ndcg_at_100
1194
+ value: 34.597
1195
+ - type: ndcg_at_1000
1196
+ value: 39.994
1197
+ - type: ndcg_at_20
1198
+ value: 28.309
1199
+ - type: ndcg_at_3
1200
+ value: 22.749
1201
+ - type: ndcg_at_5
1202
+ value: 20.502000000000002
1203
+ - type: precision_at_1
1204
+ value: 28.599999999999998
1205
+ - type: precision_at_10
1206
+ value: 13.089999999999998
1207
+ - type: precision_at_100
1208
+ value: 2.7119999999999997
1209
+ - type: precision_at_1000
1210
+ value: 0.39899999999999997
1211
+ - type: precision_at_20
1212
+ value: 8.53
1213
+ - type: precision_at_3
1214
+ value: 21.099999999999998
1215
+ - type: precision_at_5
1216
+ value: 18.22
1217
+ - type: recall_at_1
1218
+ value: 5.8229999999999995
1219
+ - type: recall_at_10
1220
+ value: 26.522000000000002
1221
+ - type: recall_at_100
1222
+ value: 55.003
1223
+ - type: recall_at_1000
1224
+ value: 80.977
1225
+ - type: recall_at_20
1226
+ value: 34.618
1227
+ - type: recall_at_3
1228
+ value: 12.848
1229
+ - type: recall_at_5
1230
+ value: 18.477
1231
+ - task:
1232
+ type: STS
1233
+ dataset:
1234
+ type: mteb/sickr-sts
1235
+ name: MTEB SICK-R
1236
+ config: default
1237
+ split: test
1238
+ revision: 20a6d6f312dd54037fe07a32d58e5e168867909d
1239
+ metrics:
1240
+ - type: cos_sim_pearson
1241
+ value: 80.72562067620224
1242
+ - type: cos_sim_spearman
1243
+ value: 77.00710192931953
1244
+ - type: euclidean_pearson
1245
+ value: 78.65843289108192
1246
+ - type: euclidean_spearman
1247
+ value: 77.00710077709005
1248
+ - type: manhattan_pearson
1249
+ value: 78.48859522905846
1250
+ - type: manhattan_spearman
1251
+ value: 76.8213740840866
1252
+ - task:
1253
+ type: STS
1254
+ dataset:
1255
+ type: mteb/sts12-sts
1256
+ name: MTEB STS12
1257
+ config: default
1258
+ split: test
1259
+ revision: a0d554a64d88156834ff5ae9920b964011b16384
1260
+ metrics:
1261
+ - type: cos_sim_pearson
1262
+ value: 81.15015325911659
1263
+ - type: cos_sim_spearman
1264
+ value: 75.67268325741222
1265
+ - type: euclidean_pearson
1266
+ value: 75.54004763633206
1267
+ - type: euclidean_spearman
1268
+ value: 75.67262179635058
1269
+ - type: manhattan_pearson
1270
+ value: 75.80681616893116
1271
+ - type: manhattan_spearman
1272
+ value: 75.93721016401406
1273
+ - task:
1274
+ type: STS
1275
+ dataset:
1276
+ type: mteb/sts13-sts
1277
+ name: MTEB STS13
1278
+ config: default
1279
+ split: test
1280
+ revision: 7e90230a92c190f1bf69ae9002b8cea547a64cca
1281
+ metrics:
1282
+ - type: cos_sim_pearson
1283
+ value: 81.71651874476737
1284
+ - type: cos_sim_spearman
1285
+ value: 82.39667472464997
1286
+ - type: euclidean_pearson
1287
+ value: 82.28256504757712
1288
+ - type: euclidean_spearman
1289
+ value: 82.39663674872656
1290
+ - type: manhattan_pearson
1291
+ value: 82.3192873176068
1292
+ - type: manhattan_spearman
1293
+ value: 82.41915252757059
1294
+ - task:
1295
+ type: STS
1296
+ dataset:
1297
+ type: mteb/sts14-sts
1298
+ name: MTEB STS14
1299
+ config: default
1300
+ split: test
1301
+ revision: 6031580fec1f6af667f0bd2da0a551cf4f0b2375
1302
+ metrics:
1303
+ - type: cos_sim_pearson
1304
+ value: 81.222967367593
1305
+ - type: cos_sim_spearman
1306
+ value: 79.92685877403252
1307
+ - type: euclidean_pearson
1308
+ value: 79.95053542861498
1309
+ - type: euclidean_spearman
1310
+ value: 79.9268858850991
1311
+ - type: manhattan_pearson
1312
+ value: 79.90485851323321
1313
+ - type: manhattan_spearman
1314
+ value: 79.93878025669312
1315
+ - task:
1316
+ type: STS
1317
+ dataset:
1318
+ type: mteb/sts15-sts
1319
+ name: MTEB STS15
1320
+ config: default
1321
+ split: test
1322
+ revision: ae752c7c21bf194d8b67fd573edf7ae58183cbe3
1323
+ metrics:
1324
+ - type: cos_sim_pearson
1325
+ value: 85.27539130156643
1326
+ - type: cos_sim_spearman
1327
+ value: 85.81645767911826
1328
+ - type: euclidean_pearson
1329
+ value: 85.5488615685444
1330
+ - type: euclidean_spearman
1331
+ value: 85.81647022566916
1332
+ - type: manhattan_pearson
1333
+ value: 85.6358149547879
1334
+ - type: manhattan_spearman
1335
+ value: 85.96347118567043
1336
+ - task:
1337
+ type: STS
1338
+ dataset:
1339
+ type: mteb/sts16-sts
1340
+ name: MTEB STS16
1341
+ config: default
1342
+ split: test
1343
+ revision: 4d8694f8f0e0100860b497b999b3dbed754a0513
1344
+ metrics:
1345
+ - type: cos_sim_pearson
1346
+ value: 83.43727336154858
1347
+ - type: cos_sim_spearman
1348
+ value: 84.50468882202796
1349
+ - type: euclidean_pearson
1350
+ value: 83.23576727105372
1351
+ - type: euclidean_spearman
1352
+ value: 84.50468882202796
1353
+ - type: manhattan_pearson
1354
+ value: 83.28843314503176
1355
+ - type: manhattan_spearman
1356
+ value: 84.60383766214322
1357
+ - task:
1358
+ type: STS
1359
+ dataset:
1360
+ type: mteb/sts17-crosslingual-sts
1361
+ name: MTEB STS17 (en-en)
1362
+ config: en-en
1363
+ split: test
1364
+ revision: faeb762787bd10488a50c8b5be4a3b82e411949c
1365
+ metrics:
1366
+ - type: cos_sim_pearson
1367
+ value: 88.86589365166874
1368
+ - type: cos_sim_spearman
1369
+ value: 88.93117996163835
1370
+ - type: euclidean_pearson
1371
+ value: 89.12271565981082
1372
+ - type: euclidean_spearman
1373
+ value: 88.93117996163835
1374
+ - type: manhattan_pearson
1375
+ value: 88.94419759325545
1376
+ - type: manhattan_spearman
1377
+ value: 88.63073561731899
1378
+ - task:
1379
+ type: STS
1380
+ dataset:
1381
+ type: mteb/sts22-crosslingual-sts
1382
+ name: MTEB STS22 (en)
1383
+ config: en
1384
+ split: test
1385
+ revision: 6d1ba47164174a496b7fa5d3569dae26a6813b80
1386
+ metrics:
1387
+ - type: cos_sim_pearson
1388
+ value: 67.96578378422929
1389
+ - type: cos_sim_spearman
1390
+ value: 67.10257461424345
1391
+ - type: euclidean_pearson
1392
+ value: 67.51317866195149
1393
+ - type: euclidean_spearman
1394
+ value: 67.10257461424345
1395
+ - type: manhattan_pearson
1396
+ value: 67.74940912013754
1397
+ - type: manhattan_spearman
1398
+ value: 67.46694183937207
1399
+ - task:
1400
+ type: STS
1401
+ dataset:
1402
+ type: mteb/stsbenchmark-sts
1403
+ name: MTEB STSBenchmark
1404
+ config: default
1405
+ split: test
1406
+ revision: b0fddb56ed78048fa8b90373c8a3cfc37b684831
1407
+ metrics:
1408
+ - type: cos_sim_pearson
1409
+ value: 83.55433725920493
1410
+ - type: cos_sim_spearman
1411
+ value: 83.60373857254014
1412
+ - type: euclidean_pearson
1413
+ value: 83.08086082334839
1414
+ - type: euclidean_spearman
1415
+ value: 83.6036864776559
1416
+ - type: manhattan_pearson
1417
+ value: 83.2232267589246
1418
+ - type: manhattan_spearman
1419
+ value: 83.78923946962664
1420
+ - task:
1421
+ type: Reranking
1422
+ dataset:
1423
+ type: mteb/scidocs-reranking
1424
+ name: MTEB SciDocsRR
1425
+ config: default
1426
+ split: test
1427
+ revision: d3c5e1fc0b855ab6097bf1cda04dd73947d7caab
1428
+ metrics:
1429
+ - type: map
1430
+ value: 87.28566757174322
1431
+ - type: mrr
1432
+ value: 96.63827639317836
1433
+ - task:
1434
+ type: Retrieval
1435
+ dataset:
1436
+ type: mteb/scifact
1437
+ name: MTEB SciFact
1438
+ config: default
1439
+ split: test
1440
+ revision: 0228b52cf27578f30900b9e5271d331663a030d7
1441
+ metrics:
1442
+ - type: map_at_1
1443
+ value: 70.661
1444
+ - type: map_at_10
1445
+ value: 82.051
1446
+ - type: map_at_100
1447
+ value: 82.162
1448
+ - type: map_at_1000
1449
+ value: 82.167
1450
+ - type: map_at_20
1451
+ value: 82.122
1452
+ - type: map_at_3
1453
+ value: 79.919
1454
+ - type: map_at_5
1455
+ value: 81.368
1456
+ - type: mrr_at_1
1457
+ value: 74.33333333333333
1458
+ - type: mrr_at_10
1459
+ value: 82.98452380952381
1460
+ - type: mrr_at_100
1461
+ value: 83.09512420633841
1462
+ - type: mrr_at_1000
1463
+ value: 83.10026279387446
1464
+ - type: mrr_at_20
1465
+ value: 83.05460927960928
1466
+ - type: mrr_at_3
1467
+ value: 81.8888888888889
1468
+ - type: mrr_at_5
1469
+ value: 82.65555555555557
1470
+ - type: ndcg_at_1
1471
+ value: 74.333
1472
+ - type: ndcg_at_10
1473
+ value: 85.914
1474
+ - type: ndcg_at_100
1475
+ value: 86.473
1476
+ - type: ndcg_at_1000
1477
+ value: 86.602
1478
+ - type: ndcg_at_20
1479
+ value: 86.169
1480
+ - type: ndcg_at_3
1481
+ value: 83.047
1482
+ - type: ndcg_at_5
1483
+ value: 84.72
1484
+ - type: precision_at_1
1485
+ value: 74.333
1486
+ - type: precision_at_10
1487
+ value: 10.933
1488
+ - type: precision_at_100
1489
+ value: 1.1199999999999999
1490
+ - type: precision_at_1000
1491
+ value: 0.11299999999999999
1492
+ - type: precision_at_20
1493
+ value: 5.5169999999999995
1494
+ - type: precision_at_3
1495
+ value: 32.444
1496
+ - type: precision_at_5
1497
+ value: 20.8
1498
+ - type: recall_at_1
1499
+ value: 70.661
1500
+ - type: recall_at_10
1501
+ value: 96.333
1502
+ - type: recall_at_100
1503
+ value: 99.0
1504
+ - type: recall_at_1000
1505
+ value: 100.0
1506
+ - type: recall_at_20
1507
+ value: 97.333
1508
+ - type: recall_at_3
1509
+ value: 88.64999999999999
1510
+ - type: recall_at_5
1511
+ value: 93.089
1512
+ - task:
1513
+ type: PairClassification
1514
+ dataset:
1515
+ type: mteb/sprintduplicatequestions-pairclassification
1516
+ name: MTEB SprintDuplicateQuestions
1517
+ config: default
1518
+ split: test
1519
+ revision: d66bd1f72af766a5cc4b0ca5e00c162f89e8cc46
1520
+ metrics:
1521
+ - type: cos_sim_accuracy
1522
+ value: 99.89108910891089
1523
+ - type: cos_sim_ap
1524
+ value: 97.61815451002174
1525
+ - type: cos_sim_f1
1526
+ value: 94.51097804391219
1527
+ - type: cos_sim_precision
1528
+ value: 94.32270916334662
1529
+ - type: cos_sim_recall
1530
+ value: 94.69999999999999
1531
+ - type: dot_accuracy
1532
+ value: 99.89108910891089
1533
+ - type: dot_ap
1534
+ value: 97.61815451002175
1535
+ - type: dot_f1
1536
+ value: 94.51097804391219
1537
+ - type: dot_precision
1538
+ value: 94.32270916334662
1539
+ - type: dot_recall
1540
+ value: 94.69999999999999
1541
+ - type: euclidean_accuracy
1542
+ value: 99.89108910891089
1543
+ - type: euclidean_ap
1544
+ value: 97.61815534251431
1545
+ - type: euclidean_f1
1546
+ value: 94.51097804391219
1547
+ - type: euclidean_precision
1548
+ value: 94.32270916334662
1549
+ - type: euclidean_recall
1550
+ value: 94.69999999999999
1551
+ - type: manhattan_accuracy
1552
+ value: 99.8940594059406
1553
+ - type: manhattan_ap
1554
+ value: 97.66124472227202
1555
+ - type: manhattan_f1
1556
+ value: 94.65267366316841
1557
+ - type: manhattan_precision
1558
+ value: 94.60539460539461
1559
+ - type: manhattan_recall
1560
+ value: 94.69999999999999
1561
+ - type: max_accuracy
1562
+ value: 99.8940594059406
1563
+ - type: max_ap
1564
+ value: 97.66124472227202
1565
+ - type: max_f1
1566
+ value: 94.65267366316841
1567
+ - task:
1568
+ type: Clustering
1569
+ dataset:
1570
+ type: mteb/stackexchange-clustering
1571
+ name: MTEB StackExchangeClustering
1572
+ config: default
1573
+ split: test
1574
+ revision: 6cbc1f7b2bc0622f2e39d2c77fa502909748c259
1575
+ metrics:
1576
+ - type: v_measure
1577
+ value: 76.482776391195
1578
+ - task:
1579
+ type: Clustering
1580
+ dataset:
1581
+ type: mteb/stackexchange-clustering-p2p
1582
+ name: MTEB StackExchangeClusteringP2P
1583
+ config: default
1584
+ split: test
1585
+ revision: 815ca46b2622cec33ccafc3735d572c266efdb44
1586
+ metrics:
1587
+ - type: v_measure
1588
+ value: 48.29023235124473
1589
+ - task:
1590
+ type: Reranking
1591
+ dataset:
1592
+ type: mteb/stackoverflowdupquestions-reranking
1593
+ name: MTEB StackOverflowDupQuestions
1594
+ config: default
1595
+ split: test
1596
+ revision: e185fbe320c72810689fc5848eb6114e1ef5ec69
1597
+ metrics:
1598
+ - type: map
1599
+ value: 55.3190739691685
1600
+ - type: mrr
1601
+ value: 56.40441972243442
1602
+ - task:
1603
+ type: Summarization
1604
+ dataset:
1605
+ type: mteb/summeval
1606
+ name: MTEB SummEval
1607
+ config: default
1608
+ split: test
1609
+ revision: cda12ad7615edc362dbf25a00fdd61d3b1eaf93c
1610
+ metrics:
1611
+ - type: cos_sim_pearson
1612
+ value: 31.98570594378664
1613
+ - type: cos_sim_spearman
1614
+ value: 30.712965330802174
1615
+ - type: dot_pearson
1616
+ value: 31.98570540209124
1617
+ - type: dot_spearman
1618
+ value: 30.712965330802174
1619
+ - task:
1620
+ type: Retrieval
1621
+ dataset:
1622
+ type: mteb/trec-covid
1623
+ name: MTEB TRECCOVID
1624
+ config: default
1625
+ split: test
1626
+ revision: bb9466bac8153a0349341eb1b22e06409e78ef4e
1627
+ metrics:
1628
+ - type: map_at_1
1629
+ value: 0.25
1630
+ - type: map_at_10
1631
+ value: 2.2640000000000002
1632
+ - type: map_at_100
1633
+ value: 14.447
1634
+ - type: map_at_1000
1635
+ value: 35.452
1636
+ - type: map_at_20
1637
+ value: 4.163
1638
+ - type: map_at_3
1639
+ value: 0.715
1640
+ - type: map_at_5
1641
+ value: 1.1780000000000002
1642
+ - type: mrr_at_1
1643
+ value: 94.0
1644
+ - type: mrr_at_10
1645
+ value: 96.66666666666667
1646
+ - type: mrr_at_100
1647
+ value: 96.66666666666667
1648
+ - type: mrr_at_1000
1649
+ value: 96.66666666666667
1650
+ - type: mrr_at_20
1651
+ value: 96.66666666666667
1652
+ - type: mrr_at_3
1653
+ value: 96.66666666666667
1654
+ - type: mrr_at_5
1655
+ value: 96.66666666666667
1656
+ - type: ndcg_at_1
1657
+ value: 92.0
1658
+ - type: ndcg_at_10
1659
+ value: 87.26899999999999
1660
+ - type: ndcg_at_100
1661
+ value: 68.586
1662
+ - type: ndcg_at_1000
1663
+ value: 61.056999999999995
1664
+ - type: ndcg_at_20
1665
+ value: 83.452
1666
+ - type: ndcg_at_3
1667
+ value: 90.11200000000001
1668
+ - type: ndcg_at_5
1669
+ value: 89.103
1670
+ - type: precision_at_1
1671
+ value: 94.0
1672
+ - type: precision_at_10
1673
+ value: 91.2
1674
+ - type: precision_at_100
1675
+ value: 70.12
1676
+ - type: precision_at_1000
1677
+ value: 26.773999999999997
1678
+ - type: precision_at_20
1679
+ value: 87.3
1680
+ - type: precision_at_3
1681
+ value: 92.667
1682
+ - type: precision_at_5
1683
+ value: 92.4
1684
+ - type: recall_at_1
1685
+ value: 0.25
1686
+ - type: recall_at_10
1687
+ value: 2.3970000000000002
1688
+ - type: recall_at_100
1689
+ value: 17.233999999999998
1690
+ - type: recall_at_1000
1691
+ value: 57.879000000000005
1692
+ - type: recall_at_20
1693
+ value: 4.508
1694
+ - type: recall_at_3
1695
+ value: 0.734
1696
+ - type: recall_at_5
1697
+ value: 1.2269999999999999
1698
+ - task:
1699
+ type: Retrieval
1700
+ dataset:
1701
+ type: mteb/touche2020
1702
+ name: MTEB Touche2020
1703
+ config: default
1704
+ split: test
1705
+ revision: a34f9a33db75fa0cbb21bb5cfc3dae8dc8bec93f
1706
+ metrics:
1707
+ - type: map_at_1
1708
+ value: 2.806
1709
+ - type: map_at_10
1710
+ value: 11.369
1711
+ - type: map_at_100
1712
+ value: 17.791
1713
+ - type: map_at_1000
1714
+ value: 19.363
1715
+ - type: map_at_20
1716
+ value: 14.038999999999998
1717
+ - type: map_at_3
1718
+ value: 5.817
1719
+ - type: map_at_5
1720
+ value: 8.331
1721
+ - type: mrr_at_1
1722
+ value: 36.734693877551024
1723
+ - type: mrr_at_10
1724
+ value: 53.355199222546155
1725
+ - type: mrr_at_100
1726
+ value: 53.648197984932665
1727
+ - type: mrr_at_1000
1728
+ value: 53.648197984932665
1729
+ - type: mrr_at_20
1730
+ value: 53.500971817298336
1731
+ - type: mrr_at_3
1732
+ value: 48.63945578231292
1733
+ - type: mrr_at_5
1734
+ value: 51.29251700680272
1735
+ - type: ndcg_at_1
1736
+ value: 35.714
1737
+ - type: ndcg_at_10
1738
+ value: 28.18
1739
+ - type: ndcg_at_100
1740
+ value: 39.22
1741
+ - type: ndcg_at_1000
1742
+ value: 50.807
1743
+ - type: ndcg_at_20
1744
+ value: 28.979
1745
+ - type: ndcg_at_3
1746
+ value: 31.114000000000004
1747
+ - type: ndcg_at_5
1748
+ value: 29.687
1749
+ - type: precision_at_1
1750
+ value: 36.735
1751
+ - type: precision_at_10
1752
+ value: 24.898
1753
+ - type: precision_at_100
1754
+ value: 7.918
1755
+ - type: precision_at_1000
1756
+ value: 1.5779999999999998
1757
+ - type: precision_at_20
1758
+ value: 18.878
1759
+ - type: precision_at_3
1760
+ value: 31.293
1761
+ - type: precision_at_5
1762
+ value: 29.387999999999998
1763
+ - type: recall_at_1
1764
+ value: 2.806
1765
+ - type: recall_at_10
1766
+ value: 17.776
1767
+ - type: recall_at_100
1768
+ value: 49.41
1769
+ - type: recall_at_1000
1770
+ value: 84.97200000000001
1771
+ - type: recall_at_20
1772
+ value: 26.589000000000002
1773
+ - type: recall_at_3
1774
+ value: 6.866999999999999
1775
+ - type: recall_at_5
1776
+ value: 10.964
1777
+ - task:
1778
+ type: Classification
1779
+ dataset:
1780
+ type: mteb/toxic_conversations_50k
1781
+ name: MTEB ToxicConversationsClassification
1782
+ config: default
1783
+ split: test
1784
+ revision: edfaf9da55d3dd50d43143d90c1ac476895ae6de
1785
+ metrics:
1786
+ - type: accuracy
1787
+ value: 91.1376953125
1788
+ - type: ap
1789
+ value: 40.51219896084815
1790
+ - type: f1
1791
+ value: 77.5195445434559
1792
+ - task:
1793
+ type: Classification
1794
+ dataset:
1795
+ type: mteb/tweet_sentiment_extraction
1796
+ name: MTEB TweetSentimentExtractionClassification
1797
+ config: default
1798
+ split: test
1799
+ revision: d604517c81ca91fe16a244d1248fc021f9ecee7a
1800
+ metrics:
1801
+ - type: accuracy
1802
+ value: 79.69722693831352
1803
+ - type: f1
1804
+ value: 80.02969178591319
1805
+ - task:
1806
+ type: Clustering
1807
+ dataset:
1808
+ type: mteb/twentynewsgroups-clustering
1809
+ name: MTEB TwentyNewsgroupsClustering
1810
+ config: default
1811
+ split: test
1812
+ revision: 6125ec4e24fa026cec8a478383ee943acfbd5449
1813
+ metrics:
1814
+ - type: v_measure
1815
+ value: 66.42427742893598
1816
+ - task:
1817
+ type: PairClassification
1818
+ dataset:
1819
+ type: mteb/twittersemeval2015-pairclassification
1820
+ name: MTEB TwitterSemEval2015
1821
+ config: default
1822
+ split: test
1823
+ revision: 70970daeab8776df92f5ea462b6173c0b46fd2d1
1824
+ metrics:
1825
+ - type: cos_sim_accuracy
1826
+ value: 87.81069321094355
1827
+ - type: cos_sim_ap
1828
+ value: 78.57014017906349
1829
+ - type: cos_sim_f1
1830
+ value: 72.38883143743536
1831
+ - type: cos_sim_precision
1832
+ value: 70.95793208312215
1833
+ - type: cos_sim_recall
1834
+ value: 73.87862796833772
1835
+ - type: dot_accuracy
1836
+ value: 87.81069321094355
1837
+ - type: dot_ap
1838
+ value: 78.5701399541226
1839
+ - type: dot_f1
1840
+ value: 72.38883143743536
1841
+ - type: dot_precision
1842
+ value: 70.95793208312215
1843
+ - type: dot_recall
1844
+ value: 73.87862796833772
1845
+ - type: euclidean_accuracy
1846
+ value: 87.81069321094355
1847
+ - type: euclidean_ap
1848
+ value: 78.57015336777854
1849
+ - type: euclidean_f1
1850
+ value: 72.38883143743536
1851
+ - type: euclidean_precision
1852
+ value: 70.95793208312215
1853
+ - type: euclidean_recall
1854
+ value: 73.87862796833772
1855
+ - type: manhattan_accuracy
1856
+ value: 87.57227156225785
1857
+ - type: manhattan_ap
1858
+ value: 78.19109731614216
1859
+ - type: manhattan_f1
1860
+ value: 71.87819856704198
1861
+ - type: manhattan_precision
1862
+ value: 69.77148534525584
1863
+ - type: manhattan_recall
1864
+ value: 74.1160949868074
1865
+ - type: max_accuracy
1866
+ value: 87.81069321094355
1867
+ - type: max_ap
1868
+ value: 78.57015336777854
1869
+ - type: max_f1
1870
+ value: 72.38883143743536
1871
+ - task:
1872
+ type: PairClassification
1873
+ dataset:
1874
+ type: mteb/twitterurlcorpus-pairclassification
1875
+ name: MTEB TwitterURLCorpus
1876
+ config: default
1877
+ split: test
1878
+ revision: 8b6510b0b1fa4e4c4f879467980e9be563ec1cdf
1879
+ metrics:
1880
+ - type: cos_sim_accuracy
1881
+ value: 89.95032405790352
1882
+ - type: cos_sim_ap
1883
+ value: 88.03104739249996
1884
+ - type: cos_sim_f1
1885
+ value: 80.34377190070451
1886
+ - type: cos_sim_precision
1887
+ value: 77.11534376548892
1888
+ - type: cos_sim_recall
1889
+ value: 83.85432707114259
1890
+ - type: dot_accuracy
1891
+ value: 89.95032405790352
1892
+ - type: dot_ap
1893
+ value: 88.03105328515932
1894
+ - type: dot_f1
1895
+ value: 80.34377190070451
1896
+ - type: dot_precision
1897
+ value: 77.11534376548892
1898
+ - type: dot_recall
1899
+ value: 83.85432707114259
1900
+ - type: euclidean_accuracy
1901
+ value: 89.95032405790352
1902
+ - type: euclidean_ap
1903
+ value: 88.03105084564575
1904
+ - type: euclidean_f1
1905
+ value: 80.34377190070451
1906
+ - type: euclidean_precision
1907
+ value: 77.11534376548892
1908
+ - type: euclidean_recall
1909
+ value: 83.85432707114259
1910
+ - type: manhattan_accuracy
1911
+ value: 89.88046726433035
1912
+ - type: manhattan_ap
1913
+ value: 88.01484191858279
1914
+ - type: manhattan_f1
1915
+ value: 80.34005593993817
1916
+ - type: manhattan_precision
1917
+ value: 76.95290468133108
1918
+ - type: manhattan_recall
1919
+ value: 84.03911302740991
1920
+ - type: max_accuracy
1921
+ value: 89.95032405790352
1922
+ - type: max_ap
1923
+ value: 88.03105328515932
1924
+ - type: max_f1
1925
+ value: 80.34377190070451
1926
+ language:
1927
+ - en
1928
+ license: cc-by-nc-4.0
1929
+ ---
1930
+ # Salesforce/SFR-Embedding-2_R (Quantized)
1931
+
1932
+ ## Description
1933
+ This model is a quantized version of the original model [`Salesforce/SFR-Embedding-2_R`](https://huggingface.co/Salesforce/SFR-Embedding-2_R).
1934
+
1935
+ It's quantized using the BitsAndBytes library to 4-bit using the [bnb-my-repo](https://huggingface.co/spaces/bnb-community/bnb-my-repo) space.
1936
+
1937
+ ## Quantization Details
1938
+ - **Quantization Type**: int4
1939
+ - **bnb_4bit_quant_type**: nf4
1940
+ - **bnb_4bit_use_double_quant**: True
1941
+ - **bnb_4bit_compute_dtype**: bfloat16
1942
+ - **bnb_4bit_quant_storage**: uint8
1943
+
1944
+
1945
+
1946
+ # 📄 Original Model Information
1947
+
1948
+
1949
+
1950
+ <h1 align="center">Salesforce/SFR-Embedding-2_R</h1>
1951
+
1952
+ **SFR-Embedding by Salesforce Research.**
1953
+
1954
+ The model is for **research purposes only**.
1955
+
1956
+ More technical details will be updated later. Meanwhile, please refer to our previous work [SFR-Embedding](https://www.salesforce.com/blog/sfr-embedding/) for details.
1957
+
1958
+ ### Ethical Considerations
1959
+ This release is for research purposes only in support of an academic paper. Our models, datasets, and code are not specifically designed or evaluated for all downstream purposes. We strongly recommend users evaluate and address potential concerns related to accuracy, safety, and fairness before deploying this model. We encourage users to consider the common limitations of AI, comply with applicable laws, and leverage best practices when selecting use cases, particularly for high-risk scenarios where errors or misuse could significantly impact people’s lives, rights, or safety. For further guidance on use cases, refer to our [AUP](https://www.salesforce.com/content/dam/web/en_us/www/documents/legal/Agreements/policies/ExternalFacing_Services_Policy.pdf) and [AI AUP](https://www.salesforce.com/content/dam/web/en_us/www/documents/legal/Agreements/policies/ai-acceptable-use-policy.pdf).
1960
+
1961
+
1962
+ SFR-Embedding Team (∗indicates equal contributors, † indicates co-leaders).
1963
+ * Rui Meng*
1964
+ * Ye Liu*
1965
+ * Tong Niu
1966
+ * Shafiq Rayhan Joty
1967
+ * Caiming Xiong †
1968
+ * Yingbo Zhou †
1969
+ * Semih Yavuz †
1970
+
1971
+ ### Citation
1972
+ ```bibtex
1973
+ @misc{SFR-embedding-2,
1974
+ title={SFR-Embedding-2: Advanced Text Embedding with Multi-stage Training},
1975
+ author={Rui Meng*, Ye Liu*, Shafiq Rayhan Joty, Caiming Xiong, Yingbo Zhou, Semih Yavuz},
1976
+ year={2024},
1977
+ url={https://huggingface.co/Salesforce/SFR-Embedding-2_R}
1978
+ }
1979
+ ```
1980
+
1981
+
1982
+ ## How to run
1983
+
1984
+ #### Transformers
1985
+ The models can be used as follows:
1986
+ ```python
1987
+ import torch
1988
+ import torch.nn.functional as F
1989
+ from torch import Tensor
1990
+ from transformers import AutoTokenizer, AutoModel
1991
+
1992
+ def last_token_pool(last_hidden_states: Tensor,
1993
+ attention_mask: Tensor) -> Tensor:
1994
+ left_padding = (attention_mask[:, -1].sum() == attention_mask.shape[0])
1995
+ if left_padding:
1996
+ return last_hidden_states[:, -1]
1997
+ else:
1998
+ sequence_lengths = attention_mask.sum(dim=1) - 1
1999
+ batch_size = last_hidden_states.shape[0]
2000
+ return last_hidden_states[torch.arange(batch_size, device=last_hidden_states.device), sequence_lengths]
2001
+
2002
+ def get_detailed_instruct(task_description: str, query: str) -> str:
2003
+ return f'Instruct: {task_description}\nQuery: {query}'
2004
+
2005
+ # Each query must come with a one-sentence instruction that describes the task
2006
+ task = 'Given a web search query, retrieve relevant passages that answer the query'
2007
+ queries = [
2008
+ get_detailed_instruct(task, 'How to bake a chocolate cake'),
2009
+ get_detailed_instruct(task, 'Symptoms of the flu')
2010
+ ]
2011
+ # No need to add instruction for retrieval documents
2012
+ passages = [
2013
+ "To bake a delicious chocolate cake, you'll need the following ingredients: all-purpose flour, sugar, cocoa powder, baking powder, baking soda, salt, eggs, milk, vegetable oil, and vanilla extract. Start by preheating your oven to 350°F (175°C). In a mixing bowl, combine the dry ingredients (flour, sugar, cocoa powder, baking powder, baking soda, and salt). In a separate bowl, whisk together the wet ingredients (eggs, milk, vegetable oil, and vanilla extract). Gradually add the wet mixture to the dry ingredients, stirring until well combined. Pour the batter into a greased cake pan and bake for 30-35 minutes. Let it cool before frosting with your favorite chocolate frosting. Enjoy your homemade chocolate cake!",
2014
+ "The flu, or influenza, is an illness caused by influenza viruses. Common symptoms of the flu include a high fever, chills, cough, sore throat, runny or stuffy nose, body aches, headache, fatigue, and sometimes nausea and vomiting. These symptoms can come on suddenly and are usually more severe than the common cold. It's important to get plenty of rest, stay hydrated, and consult a healthcare professional if you suspect you have the flu. In some cases, antiviral medications can help alleviate symptoms and reduce the duration of the illness."
2015
+ ]
2016
+
2017
+ # load model and tokenizer
2018
+ tokenizer = AutoTokenizer.from_pretrained('Salesforce/SFR-Embedding-2_R')
2019
+ model = AutoModel.from_pretrained('Salesforce/SFR-Embedding-2_R')
2020
+
2021
+ # get the embeddings
2022
+ max_length = 4096
2023
+ input_texts = queries + passages
2024
+ batch_dict = tokenizer(input_texts, max_length=max_length, padding=True, truncation=True, return_tensors="pt")
2025
+ outputs = model(**batch_dict)
2026
+ embeddings = last_token_pool(outputs.last_hidden_state, batch_dict['attention_mask'])
2027
+
2028
+ # normalize embeddings
2029
+ embeddings = F.normalize(embeddings, p=2, dim=1)
2030
+ scores = (embeddings[:2] @ embeddings[2:].T) * 100
2031
+ print(scores.tolist())
2032
+ # [[40.132083892822266, 25.032529830932617], [15.006855010986328, 39.93733215332031]]
2033
+ ```
2034
+
2035
+ ### Sentence Transformers
2036
+ ```python
2037
+ from sentence_transformers import SentenceTransformer
2038
+
2039
+ model = SentenceTransformer("Salesforce/SFR-Embedding-2_R")
2040
+
2041
+ def get_detailed_instruct(task_description: str, query: str) -> str:
2042
+ return f'Instruct: {task_description}\nQuery: {query}'
2043
+
2044
+ # Each query must come with a one-sentence instruction that describes the task
2045
+ task = 'Given a web search query, retrieve relevant passages that answer the query'
2046
+ queries = [
2047
+ get_detailed_instruct(task, 'How to bake a chocolate cake'),
2048
+ get_detailed_instruct(task, 'Symptoms of the flu')
2049
+ ]
2050
+ # No need to add instruction for retrieval documents
2051
+ passages = [
2052
+ "To bake a delicious chocolate cake, you'll need the following ingredients: all-purpose flour, sugar, cocoa powder, baking powder, baking soda, salt, eggs, milk, vegetable oil, and vanilla extract. Start by preheating your oven to 350°F (175°C). In a mixing bowl, combine the dry ingredients (flour, sugar, cocoa powder, baking powder, baking soda, and salt). In a separate bowl, whisk together the wet ingredients (eggs, milk, vegetable oil, and vanilla extract). Gradually add the wet mixture to the dry ingredients, stirring until well combined. Pour the batter into a greased cake pan and bake for 30-35 minutes. Let it cool before frosting with your favorite chocolate frosting. Enjoy your homemade chocolate cake!",
2053
+ "The flu, or influenza, is an illness caused by influenza viruses. Common symptoms of the flu include a high fever, chills, cough, sore throat, runny or stuffy nose, body aches, headache, fatigue, and sometimes nausea and vomiting. These symptoms can come on suddenly and are usually more severe than the common cold. It's important to get plenty of rest, stay hydrated, and consult a healthcare professional if you suspect you have the flu. In some cases, antiviral medications can help alleviate symptoms and reduce the duration of the illness."
2054
+ ]
2055
+
2056
+ embeddings = model.encode(queries + passages)
2057
+ scores = model.similarity(embeddings[:2], embeddings[2:]) * 100
2058
+ print(scores.tolist())
2059
+ # [[40.13203811645508, 25.032546997070312], [15.00684642791748, 39.937339782714844]]
2060
+ ```
2061
+
2062
+
2063
+
2064
+
2065
+
config.json ADDED
@@ -0,0 +1,43 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_name_or_path": "Salesforce/SFR-Embedding-2_R",
3
+ "architectures": [
4
+ "MistralModel"
5
+ ],
6
+ "attention_dropout": 0.0,
7
+ "bos_token_id": 1,
8
+ "eos_token_id": 2,
9
+ "head_dim": 128,
10
+ "hidden_act": "silu",
11
+ "hidden_size": 4096,
12
+ "initializer_range": 0.02,
13
+ "intermediate_size": 14336,
14
+ "max_position_embeddings": 32768,
15
+ "model_type": "mistral",
16
+ "num_attention_heads": 32,
17
+ "num_hidden_layers": 32,
18
+ "num_key_value_heads": 8,
19
+ "pad_token_id": 2,
20
+ "quantization_config": {
21
+ "_load_in_4bit": true,
22
+ "_load_in_8bit": false,
23
+ "bnb_4bit_compute_dtype": "bfloat16",
24
+ "bnb_4bit_quant_storage": "uint8",
25
+ "bnb_4bit_quant_type": "nf4",
26
+ "bnb_4bit_use_double_quant": true,
27
+ "llm_int8_enable_fp32_cpu_offload": false,
28
+ "llm_int8_has_fp16_weight": false,
29
+ "llm_int8_skip_modules": null,
30
+ "llm_int8_threshold": 6.0,
31
+ "load_in_4bit": true,
32
+ "load_in_8bit": false,
33
+ "quant_method": "bitsandbytes"
34
+ },
35
+ "rms_norm_eps": 1e-05,
36
+ "rope_theta": 10000.0,
37
+ "sliding_window": 4096,
38
+ "tie_word_embeddings": false,
39
+ "torch_dtype": "float32",
40
+ "transformers_version": "4.49.0",
41
+ "use_cache": false,
42
+ "vocab_size": 32000
43
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d39efefc6adbd5adc86cbd9f1c7ed3f2fbcd67420b93de65ca2a55be0d9fe957
3
+ size 4126214972
special_tokens_map.json ADDED
@@ -0,0 +1,35 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "additional_special_tokens": [
3
+ "<unk>",
4
+ "<s>",
5
+ "</s>"
6
+ ],
7
+ "bos_token": {
8
+ "content": "<s>",
9
+ "lstrip": false,
10
+ "normalized": false,
11
+ "rstrip": false,
12
+ "single_word": false
13
+ },
14
+ "eos_token": {
15
+ "content": "</s>",
16
+ "lstrip": false,
17
+ "normalized": false,
18
+ "rstrip": false,
19
+ "single_word": false
20
+ },
21
+ "pad_token": {
22
+ "content": "</s>",
23
+ "lstrip": false,
24
+ "normalized": false,
25
+ "rstrip": false,
26
+ "single_word": false
27
+ },
28
+ "unk_token": {
29
+ "content": "<unk>",
30
+ "lstrip": false,
31
+ "normalized": false,
32
+ "rstrip": false,
33
+ "single_word": false
34
+ }
35
+ }
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
@@ -0,0 +1,55 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "add_bos_token": true,
3
+ "add_eos_token": true,
4
+ "add_prefix_space": null,
5
+ "added_tokens_decoder": {
6
+ "0": {
7
+ "content": "<unk>",
8
+ "lstrip": false,
9
+ "normalized": false,
10
+ "rstrip": false,
11
+ "single_word": false,
12
+ "special": true
13
+ },
14
+ "1": {
15
+ "content": "<s>",
16
+ "lstrip": false,
17
+ "normalized": false,
18
+ "rstrip": false,
19
+ "single_word": false,
20
+ "special": true
21
+ },
22
+ "2": {
23
+ "content": "</s>",
24
+ "lstrip": false,
25
+ "normalized": false,
26
+ "rstrip": false,
27
+ "single_word": false,
28
+ "special": true
29
+ }
30
+ },
31
+ "additional_special_tokens": [
32
+ "<unk>",
33
+ "<s>",
34
+ "</s>"
35
+ ],
36
+ "bos_token": "<s>",
37
+ "clean_up_tokenization_spaces": false,
38
+ "eos_token": "</s>",
39
+ "extra_special_tokens": {},
40
+ "legacy": true,
41
+ "max_length": 512,
42
+ "model_max_length": 1000000000000000019884624838656,
43
+ "pad_to_multiple_of": null,
44
+ "pad_token": "</s>",
45
+ "pad_token_type_id": 0,
46
+ "padding_side": "right",
47
+ "sp_model_kwargs": {},
48
+ "spaces_between_special_tokens": false,
49
+ "stride": 0,
50
+ "tokenizer_class": "LlamaTokenizer",
51
+ "truncation_side": "right",
52
+ "truncation_strategy": "longest_first",
53
+ "unk_token": "<unk>",
54
+ "use_default_system_prompt": false
55
+ }