radoslavralev commited on
Commit
4ad2652
·
verified ·
1 Parent(s): f786998

Add new SentenceTransformer model

Browse files
1_Pooling/config.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
- "word_embedding_dimension": 384,
3
- "pooling_mode_cls_token": false,
4
- "pooling_mode_mean_tokens": true,
5
  "pooling_mode_max_tokens": false,
6
  "pooling_mode_mean_sqrt_len_tokens": false,
7
  "pooling_mode_weightedmean_tokens": false,
 
1
  {
2
+ "word_embedding_dimension": 768,
3
+ "pooling_mode_cls_token": true,
4
+ "pooling_mode_mean_tokens": false,
5
  "pooling_mode_max_tokens": false,
6
  "pooling_mode_mean_sqrt_len_tokens": false,
7
  "pooling_mode_weightedmean_tokens": false,
README.md CHANGED
@@ -7,101 +7,134 @@ tags:
7
  - generated_from_trainer
8
  - dataset_size:90000
9
  - loss:MultipleNegativesRankingLoss
10
- base_model: thenlper/gte-small
11
  widget:
12
- - source_sentence: what is the maximum i can contribute to a traditional ira
13
  sentences:
14
- - With Roth IRAs, there are no age restrictions for contributions. Investors age
15
- 50 and older can contribute $5,500 for 2015, plus a catch-up contribution of $1,000
16
- for a total maximum possible IRA contribution of $6,500.
17
- - Classically, squamous epithelia are found lining surfaces utilizing simple passive
18
- diffusion such as the alveolar epithelium in the lungs. Specialized squamous epithelia
19
- also form the lining of cavities such as the blood vessels (endothelium) and pericardium
20
- (mesothelium) and the major cavities found within the body.lassically, squamous
21
- epithelia are found lining surfaces utilizing simple passive diffusion such as
22
- the alveolar epithelium in the lungs. Specialized squamous epithelia also form
23
- the lining of cavities such as the blood vessels (endothelium) and pericardium
24
- (mesothelium) and the major cavities found within the body.
25
- - What is a Roth IRA? How is a Roth IRA different from a regular IRA? What are the
26
- advantages of the Roth version? Who can contribute to a Roth IRA? When can I take
27
- money out of a Roth? When do I have to withdraw money from a Roth? Which is better
28
- for me, a Roth or traditional IRA?
29
- - source_sentence: what is diameter
 
 
 
 
 
30
  sentences:
31
- - The diameter of a circle is the length of the line through the center and touching
32
- two points on its edge. In the figure above, drag the orange dots around and see
33
- that the diameter never changes. Sometimes the word 'diameter' is used to refer
34
- to the line itself.
35
- - "If you know the radius of the circle, double it to get the diameter. The radius\
36
- \ is the distance from the center of the circle to its edge. For example, if the\
37
- \ radius of the circle is 4 cm, then the diameter of the circle is 4 cm x 2, or\
38
- \ 8 cm. 2. If you know the circumference of the circle, divide it by Ï\x80 to\
39
- \ get the diameter. Ï\x80 is equal to approximately 3.14 but you should use your\
40
- \ calculator to get the most accurate results."
41
- - By Tony Griffitts. A denitrator is a biological filter that removes nitrate (NO
42
- 3) from the aquarium. A denitrator filter uses anaerobic bacteria to brake down
43
- nitrate into nitrogen gas (N 2), which escapes into the atmosphere, the result
44
- is nitrate free effluent.hen you first set up the filter run the aquarium water
45
- through it at a swift rate for about 2 week. This will give bacteria time to colonize
46
- the filter. After 2 weeks cut down the filter flow rate to about a drop or two
47
- a second. At this time, start to add about 5ml of bacteria food a day to the filter.
48
- - source_sentence: how do ovaries produce hormones
 
 
 
 
49
  sentences:
50
- - 'The hormones from the brain control the levels of estrogen and progesterone released
51
- by the female reproductive system, leading to the events of the ovarian cycle:
52
- 1 A follicle starts to grow and begins producing the hormone estrogen.'
53
- - "Hey, itâ\x80\x99s like riding a bike! You have the GPX file on your computer,\
54
- \ and you just need to move or transfer it over to the Garmin device. To do this,\
55
- \ follow these steps: Connect the Garmin to the computer with a USB cable. Check\
56
- \ that you can â\x80\x9Cseeâ\x80\x9D the device, plus its memory card (if thereâ\x80\
57
- \x99s one installed).In Windows, the best way to do this is to double-click the\
58
- \ My Computer icon on your desktop.opy your GPX file into the NewFiles folder.\
59
- \ To do this, just drag and drop the file from your computer to the NewFiles window.\
60
- \ Or copy and paste it, whichever is easier and quicker for you. I know some people\
61
- \ can get a bit stuck on this part, and itâ\x80\x99s often overlooked."
62
- - But women also have testosterone. The ovaries produce both testosterone and estrogen.
63
- Relatively small quantities of testosterone are released into your bloodstream
64
- by the ovaries and adrenal glands. In addition to being produced by the ovaries,
65
- estrogen is also produced by the body's fat tissue. These sex hormones are involved
66
- in the growth, maintenance, and repair of reproductive tissues. But that's not
67
- all. They also influence other body tissues and bone mass.
68
- - source_sentence: weather in floyds knobs indiana
69
  sentences:
70
- - "Weekly Weather Report for 47119, Floyds Knobs, Indiana. Looking at the weather\
71
- \ in 47119, Floyds Knobs, Indiana over the next 7 days, the maximum temperature\
72
- \ will be 11â\x84\x83 (or 52â\x84\x89) on Friday 9 th February at around 2 pm.\
73
- \ In the same week the minimum temperature will be -8â\x84\x83 (or 18â\x84\x89\
74
- ) on Thursday 8 th February at around 8 am."
75
- - Central States Weather News - from the National Weather Service. Indiana StateInformation
76
- - compiled by NOAA. Indiana State Police - Weather Related Road Conditions (seasonal)
77
- National Weather Service - Local | Indiana | Hoosier Weather | IN Data. NOAA Weather
78
- Radio - NOAA station in Richmond, Indiana is 162.500, (KHB52, formerly WXJ46).
79
- - Lisinopril is a drug of the angiotensin-converting enzyme (ACE) inhibitor class
80
- used primarily in treatment of high blood pressure, heart failure, and after heart
81
- attacks. It is also used for preventing kidney and eye complications in people
82
- with diabetes. Its indications, contraindications, and side effects are as those
83
- for all ACE inhibitors. Lisinopril was the third ACE inhibitor (after captopril
84
- and enalapril) and was introduced into therapy in the early 1990s.
85
- - source_sentence: what congressional district is cambridge, ohio in
 
 
 
 
 
 
 
 
 
 
 
 
 
 
86
  sentences:
87
- - Ohio (OH) - 43725. 1 As of the 2010 census, zip code 43725 is located in Congressional
88
- District 6, OH. 2 Approximately 33.9% of 43725's population lives in a low income
89
- household, or a household with an annual income of less than $25,000.This is a
90
- low percentage of low income households for Cambridge, but a high percentage for
91
- Guernsey County.
92
- - The Freedom Riders, who were recruited by the Congress of Racial Equality (CORE),
93
- a U.S. civil rights group, departed from Washington, D.C., and attempted to integrate
94
- facilities at bus terminals along the way into the Deep South.he 1961 Freedom
95
- Rides sought to test a 1960 decision by the Supreme Court in Boynton v. Virginia
96
- that segregation of interstate transportation facilities, including bus terminals,
97
- was unconstitutional as well.
98
- - Pennsylvania's 3rd congressional district has been represented by Republican Mike
99
- Kelly since January 2011. He ran unopposed in the Republican primary. Missa Eaton,
100
- Sharon resident and president of Democrat Women of Mercer County, ran unopposed
101
- in the Democratic primary.emocrats Mark Critz, who has represented Pennsylvania's
102
- 12th congressional district since 2010; and Jason Altmire, who has represented
103
- Pennsylvania's 4th congressional district since 2007, both sought re-election
104
- in the new 12th district.
 
 
 
 
 
 
 
 
 
 
 
 
 
105
  pipeline_tag: sentence-similarity
106
  library_name: sentence-transformers
107
  metrics:
@@ -121,7 +154,7 @@ metrics:
121
  - cosine_mrr@10
122
  - cosine_map@100
123
  model-index:
124
- - name: SentenceTransformer based on thenlper/gte-small
125
  results:
126
  - task:
127
  type: information-retrieval
@@ -131,49 +164,49 @@ model-index:
131
  type: NanoMSMARCO
132
  metrics:
133
  - type: cosine_accuracy@1
134
- value: 0.3
135
  name: Cosine Accuracy@1
136
  - type: cosine_accuracy@3
137
- value: 0.5
138
  name: Cosine Accuracy@3
139
  - type: cosine_accuracy@5
140
- value: 0.54
141
  name: Cosine Accuracy@5
142
  - type: cosine_accuracy@10
143
- value: 0.64
144
  name: Cosine Accuracy@10
145
  - type: cosine_precision@1
146
- value: 0.3
147
  name: Cosine Precision@1
148
  - type: cosine_precision@3
149
- value: 0.16666666666666663
150
  name: Cosine Precision@3
151
  - type: cosine_precision@5
152
- value: 0.10800000000000001
153
  name: Cosine Precision@5
154
  - type: cosine_precision@10
155
- value: 0.064
156
  name: Cosine Precision@10
157
  - type: cosine_recall@1
158
- value: 0.3
159
  name: Cosine Recall@1
160
  - type: cosine_recall@3
161
- value: 0.5
162
  name: Cosine Recall@3
163
  - type: cosine_recall@5
164
- value: 0.54
165
  name: Cosine Recall@5
166
  - type: cosine_recall@10
167
- value: 0.64
168
  name: Cosine Recall@10
169
  - type: cosine_ndcg@10
170
- value: 0.4652069013901737
171
  name: Cosine Ndcg@10
172
  - type: cosine_mrr@10
173
- value: 0.40979365079365077
174
  name: Cosine Mrr@10
175
  - type: cosine_map@100
176
- value: 0.4230821832193898
177
  name: Cosine Map@100
178
  - task:
179
  type: information-retrieval
@@ -183,49 +216,49 @@ model-index:
183
  type: NanoNQ
184
  metrics:
185
  - type: cosine_accuracy@1
186
- value: 0.2
187
  name: Cosine Accuracy@1
188
  - type: cosine_accuracy@3
189
- value: 0.38
190
  name: Cosine Accuracy@3
191
  - type: cosine_accuracy@5
192
- value: 0.42
193
  name: Cosine Accuracy@5
194
  - type: cosine_accuracy@10
195
- value: 0.5
196
  name: Cosine Accuracy@10
197
  - type: cosine_precision@1
198
- value: 0.2
199
  name: Cosine Precision@1
200
  - type: cosine_precision@3
201
- value: 0.12666666666666665
202
  name: Cosine Precision@3
203
  - type: cosine_precision@5
204
- value: 0.084
205
  name: Cosine Precision@5
206
  - type: cosine_precision@10
207
- value: 0.052000000000000005
208
  name: Cosine Precision@10
209
  - type: cosine_recall@1
210
- value: 0.2
211
  name: Cosine Recall@1
212
  - type: cosine_recall@3
213
- value: 0.36
214
  name: Cosine Recall@3
215
  - type: cosine_recall@5
216
- value: 0.39
217
  name: Cosine Recall@5
218
  - type: cosine_recall@10
219
- value: 0.47
220
  name: Cosine Recall@10
221
  - type: cosine_ndcg@10
222
- value: 0.33923532538418527
223
  name: Cosine Ndcg@10
224
  - type: cosine_mrr@10
225
- value: 0.301547619047619
226
  name: Cosine Mrr@10
227
  - type: cosine_map@100
228
- value: 0.3085529476310609
229
  name: Cosine Map@100
230
  - task:
231
  type: nano-beir
@@ -235,63 +268,63 @@ model-index:
235
  type: NanoBEIR_mean
236
  metrics:
237
  - type: cosine_accuracy@1
238
- value: 0.25
239
  name: Cosine Accuracy@1
240
  - type: cosine_accuracy@3
241
- value: 0.44
242
  name: Cosine Accuracy@3
243
  - type: cosine_accuracy@5
244
- value: 0.48
245
  name: Cosine Accuracy@5
246
  - type: cosine_accuracy@10
247
- value: 0.5700000000000001
248
  name: Cosine Accuracy@10
249
  - type: cosine_precision@1
250
- value: 0.25
251
  name: Cosine Precision@1
252
  - type: cosine_precision@3
253
- value: 0.14666666666666664
254
  name: Cosine Precision@3
255
  - type: cosine_precision@5
256
- value: 0.096
257
  name: Cosine Precision@5
258
  - type: cosine_precision@10
259
- value: 0.058
260
  name: Cosine Precision@10
261
  - type: cosine_recall@1
262
- value: 0.25
263
  name: Cosine Recall@1
264
  - type: cosine_recall@3
265
- value: 0.43
266
  name: Cosine Recall@3
267
  - type: cosine_recall@5
268
- value: 0.465
269
  name: Cosine Recall@5
270
  - type: cosine_recall@10
271
- value: 0.5549999999999999
272
  name: Cosine Recall@10
273
  - type: cosine_ndcg@10
274
- value: 0.4022211133871795
275
  name: Cosine Ndcg@10
276
  - type: cosine_mrr@10
277
- value: 0.3556706349206349
278
  name: Cosine Mrr@10
279
  - type: cosine_map@100
280
- value: 0.36581756542522537
281
  name: Cosine Map@100
282
  ---
283
 
284
- # SentenceTransformer based on thenlper/gte-small
285
 
286
- This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [thenlper/gte-small](https://huggingface.co/thenlper/gte-small). It maps sentences & paragraphs to a 384-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
287
 
288
  ## Model Details
289
 
290
  ### Model Description
291
  - **Model Type:** Sentence Transformer
292
- - **Base model:** [thenlper/gte-small](https://huggingface.co/thenlper/gte-small) <!-- at revision 17e1f347d17fe144873b1201da91788898c639cd -->
293
  - **Maximum Sequence Length:** 128 tokens
294
- - **Output Dimensionality:** 384 dimensions
295
  - **Similarity Function:** Cosine Similarity
296
  <!-- - **Training Dataset:** Unknown -->
297
  <!-- - **Language:** Unknown -->
@@ -307,9 +340,8 @@ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [t
307
 
308
  ```
309
  SentenceTransformer(
310
- (0): Transformer({'max_seq_length': 128, 'do_lower_case': False, 'architecture': 'BertModel'})
311
- (1): Pooling({'word_embedding_dimension': 384, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
312
- (2): Normalize()
313
  )
314
  ```
315
 
@@ -331,20 +363,20 @@ from sentence_transformers import SentenceTransformer
331
  model = SentenceTransformer("redis/model-a-baseline")
332
  # Run inference
333
  sentences = [
334
- 'what congressional district is cambridge, ohio in',
335
- "Ohio (OH) - 43725. 1 As of the 2010 census, zip code 43725 is located in Congressional District 6, OH. 2 Approximately 33.9% of 43725's population lives in a low income household, or a household with an annual income of less than $25,000.This is a low percentage of low income households for Cambridge, but a high percentage for Guernsey County.",
336
- "Pennsylvania's 3rd congressional district has been represented by Republican Mike Kelly since January 2011. He ran unopposed in the Republican primary. Missa Eaton, Sharon resident and president of Democrat Women of Mercer County, ran unopposed in the Democratic primary.emocrats Mark Critz, who has represented Pennsylvania's 12th congressional district since 2010; and Jason Altmire, who has represented Pennsylvania's 4th congressional district since 2007, both sought re-election in the new 12th district.",
337
  ]
338
  embeddings = model.encode(sentences)
339
  print(embeddings.shape)
340
- # [3, 384]
341
 
342
  # Get the similarity scores for the embeddings
343
  similarities = model.similarity(embeddings, embeddings)
344
  print(similarities)
345
- # tensor([[1.0001, 0.3689, 0.9104],
346
- # [0.3689, 1.0000, 0.1788],
347
- # [0.9104, 0.1788, 1.0000]])
348
  ```
349
 
350
  <!--
@@ -382,21 +414,21 @@ You can finetune this model on your own dataset.
382
 
383
  | Metric | NanoMSMARCO | NanoNQ |
384
  |:--------------------|:------------|:-----------|
385
- | cosine_accuracy@1 | 0.3 | 0.2 |
386
- | cosine_accuracy@3 | 0.5 | 0.38 |
387
- | cosine_accuracy@5 | 0.54 | 0.42 |
388
- | cosine_accuracy@10 | 0.64 | 0.5 |
389
- | cosine_precision@1 | 0.3 | 0.2 |
390
- | cosine_precision@3 | 0.1667 | 0.1267 |
391
- | cosine_precision@5 | 0.108 | 0.084 |
392
- | cosine_precision@10 | 0.064 | 0.052 |
393
- | cosine_recall@1 | 0.3 | 0.2 |
394
- | cosine_recall@3 | 0.5 | 0.36 |
395
- | cosine_recall@5 | 0.54 | 0.39 |
396
- | cosine_recall@10 | 0.64 | 0.47 |
397
- | **cosine_ndcg@10** | **0.4652** | **0.3392** |
398
- | cosine_mrr@10 | 0.4098 | 0.3015 |
399
- | cosine_map@100 | 0.4231 | 0.3086 |
400
 
401
  #### Nano BEIR
402
 
@@ -412,23 +444,23 @@ You can finetune this model on your own dataset.
412
  }
413
  ```
414
 
415
- | Metric | Value |
416
- |:--------------------|:-----------|
417
- | cosine_accuracy@1 | 0.25 |
418
- | cosine_accuracy@3 | 0.44 |
419
- | cosine_accuracy@5 | 0.48 |
420
- | cosine_accuracy@10 | 0.57 |
421
- | cosine_precision@1 | 0.25 |
422
- | cosine_precision@3 | 0.1467 |
423
- | cosine_precision@5 | 0.096 |
424
- | cosine_precision@10 | 0.058 |
425
- | cosine_recall@1 | 0.25 |
426
- | cosine_recall@3 | 0.43 |
427
- | cosine_recall@5 | 0.465 |
428
- | cosine_recall@10 | 0.555 |
429
- | **cosine_ndcg@10** | **0.4022** |
430
- | cosine_mrr@10 | 0.3557 |
431
- | cosine_map@100 | 0.3658 |
432
 
433
  <!--
434
  ## Bias, Risks and Limitations
@@ -451,16 +483,16 @@ You can finetune this model on your own dataset.
451
  * Size: 90,000 training samples
452
  * Columns: <code>anchor</code>, <code>positive</code>, and <code>negative</code>
453
  * Approximate statistics based on the first 1000 samples:
454
- | | anchor | positive | negative |
455
- |:--------|:---------------------------------------------------------------------------------|:------------------------------------------------------------------------------------|:------------------------------------------------------------------------------------|
456
- | type | string | string | string |
457
- | details | <ul><li>min: 4 tokens</li><li>mean: 9.18 tokens</li><li>max: 43 tokens</li></ul> | <ul><li>min: 19 tokens</li><li>mean: 78.75 tokens</li><li>max: 128 tokens</li></ul> | <ul><li>min: 17 tokens</li><li>mean: 77.97 tokens</li><li>max: 128 tokens</li></ul> |
458
  * Samples:
459
- | anchor | positive | negative |
460
- |:--------------------------------------------------|:----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
461
- | <code>pomp definition</code> | <code>Pomp is a ceremonial display, such as you'd find at the Independence Day parade in your town, where brass bands and men and women in full military dress march to patriotic songs, while citizens wave flags and cheer.</code> | <code>Pom-poms are shaken by cheerleaders, Pom or Dance teams and sports fans during spectator sports. Small decorative pom-poms may be attached to clothing; these are sometimes called toories or bobbles. Pom-pom is derived from the French word pompon, which refers to a small decorative ball made of fabric or feathers. pair of cheerleading pom-poms. A pom-pom – also spelled pom-pon, pompom or pompon – is a loose, fluffy, decorative ball or tuft of fibrous material. Pom-poms may come in many colors, sizes, and varieties and are made from a wide array of materials, including wool, cotton, paper, plastic, and occasionally feathers.</code> |
462
- | <code>what is the definition of recompense</code> | <code>verb. To recompense is to pay someone back or make amends to someone for some loss. An example of recompense is when a shoplifter gives money to the person from whom he stole.</code> | <code>Split and merge into it. Answered by The Community. Making the world better, one answer at a time. Recuse is a legal term used when a person disqualifies oneself (as a judge) in a legal case due to a potential prejudice or partiality. Example: The judge recused himself from that case, citing a possible conflict of interest. Excuse is to release a person from an obligation or duty. Example: The gentleman is excused from jury duty as his serving would cause a hardship for his family.</code> |
463
- | <code>kashubian language pronunciation</code> | <code>Kashubian is a member of the West Slavic group of Slavic languages with about 200,000 speakers and used as an everyday language by about 53,000 people.ashubian (kaszebsczi kaszëbsczi). Jazek jãzëk kashubian is a member Of The west slavic Group of slavic languages 200,000 about 200000 speakers and used as an everyday language 53,000 about. 53000 people</code> | <code>[ syll. ko-to-ko, kot-oko ] The baby girl name Kotoko is pronounced as KAHT OW Kow †. Kotoko's origin and use are both in the Japanese language.Kotoko is a form of the Japanese name Koto. Kotoko is irregularly used as a baby girl name.aby names that sound like Kotoko include Kadea, Kadeejah, Kadeesha, Kadeija, Kadeja, Kadesha, Kadeshia, Kadesia, Kadessa, Kadiesha, Kadija (African, Arabic, English, and Swahili), Kadijah (African, Arabic, and English), Kadisha, Kadya, Kadyja, Kadysha, Kaitaka, Kathia, Kathya, and Katica (Czech, Hungarian, and Slavic).</code> |
464
  * Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
465
  ```json
466
  {
@@ -477,16 +509,16 @@ You can finetune this model on your own dataset.
477
  * Size: 10,000 evaluation samples
478
  * Columns: <code>anchor</code>, <code>positive</code>, and <code>negative</code>
479
  * Approximate statistics based on the first 1000 samples:
480
- | | anchor | positive | negative |
481
- |:--------|:---------------------------------------------------------------------------------|:------------------------------------------------------------------------------------|:------------------------------------------------------------------------------------|
482
- | type | string | string | string |
483
- | details | <ul><li>min: 4 tokens</li><li>mean: 9.18 tokens</li><li>max: 25 tokens</li></ul> | <ul><li>min: 16 tokens</li><li>mean: 78.83 tokens</li><li>max: 128 tokens</li></ul> | <ul><li>min: 18 tokens</li><li>mean: 76.86 tokens</li><li>max: 128 tokens</li></ul> |
484
  * Samples:
485
- | anchor | positive | negative |
486
- |:----------------------------------------------------|:--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
487
- | <code>what is the pantanal</code> | <code>The Pantanal is a region in South America lying mostly in Western Brazil but extending into Bolivia as well. It is considered one of the world's largest and most diverse freshwater wetland ecosystems. The Pantanal is also one of Brazil's major tourist draws, for its wildlife.he Pantanal is a region in South America lying mostly in Western Brazil but extending into Bolivia as well. It is considered one of the world's largest and most diverse freshwater wetland ecosystems. The Pantanal is also one of Brazil's major tourist draws, for its wildlife.</code> | <code>Lantana is a large plant genus with about 150 species of flowering plants, which are perennial and native to the West Indies, according to the University of Florida. Lantanas are grown for their attractive clusters of small, multicolored or single-colored flowers and for their medicinal uses.</code> |
488
- | <code>radon mitigation cost</code> | <code>1 Ground water wells can also be tested for radon, then, if needed, can get a water radon mitigation system installed. 2 Installing a water radon mitigation system runs from $1,000-$4,500, and maintenance runs $0 to $150 annually. 3 To find out about radon content in a city water system, call the local water provider. It's wise to retest a home's radon level every year or two after a mitigation system is installed. 2 Most radon mitigation systems include a fan, which will need to be replaced about every 5 years. 3 Expect to pay $250-$300 to have this necessary maintenance done.</code> | <code>You have tested your home for radon and confirmed that you have elevated radon levels — 4 picocuries per liter (pCi/L) or higher. The EPA recommends that you take action to reduce your home's radon levels if your radon test result is 4 pCi/L or higher.High radon levels can be reduced through mitigation. CLICK HERE to order a test kit. 1 Select a licensed or certified radon mitigation contractor to reduce the radon levels.2 Have mitigation contractor determine appropriate radon reduction method.igh radon levels can be reduced through mitigation. CLICK HERE to order a test kit. 1 Select a licensed or certified radon mitigation contractor to reduce the radon levels. 2 Have mitigation contractor determine appropriate radon reduction method.</code> |
489
- | <code>how many calories is an einstein bagel</code> | <code>1 Calories In Einstein Bagel Strawberry Balsamic Vinaigrette. 144 calories, 14g fat, 5g carbs, 0g protein, 1g fiber. Calories In Einstein Bagel Strawberry Chicken Salad. 397 calories, 10g fat, 20g carbs, 56g protein, 4g fiber.</code> | <code>There are 110 calories in a 1 bagel serving of Thomas' Bagel Thins - 100% Whole Wheat. Calorie breakdown: 7% fat, 74% carbs, 19% protein.</code> |
490
  * Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
491
  ```json
492
  {
@@ -502,9 +534,9 @@ You can finetune this model on your own dataset.
502
  - `eval_strategy`: steps
503
  - `per_device_train_batch_size`: 128
504
  - `per_device_eval_batch_size`: 128
505
- - `learning_rate`: 8e-05
506
- - `weight_decay`: 0.005
507
- - `max_steps`: 3375
508
  - `warmup_ratio`: 0.1
509
  - `fp16`: True
510
  - `dataloader_drop_last`: True
@@ -531,14 +563,14 @@ You can finetune this model on your own dataset.
531
  - `gradient_accumulation_steps`: 1
532
  - `eval_accumulation_steps`: None
533
  - `torch_empty_cache_steps`: None
534
- - `learning_rate`: 8e-05
535
- - `weight_decay`: 0.005
536
  - `adam_beta1`: 0.9
537
  - `adam_beta2`: 0.999
538
  - `adam_epsilon`: 1e-08
539
  - `max_grad_norm`: 1.0
540
  - `num_train_epochs`: 3.0
541
- - `max_steps`: 3375
542
  - `lr_scheduler_type`: linear
543
  - `lr_scheduler_kwargs`: {}
544
  - `warmup_ratio`: 0.1
@@ -645,20 +677,9 @@ You can finetune this model on your own dataset.
645
  ### Training Logs
646
  | Epoch | Step | Training Loss | Validation Loss | NanoMSMARCO_cosine_ndcg@10 | NanoNQ_cosine_ndcg@10 | NanoBEIR_mean_cosine_ndcg@10 |
647
  |:------:|:----:|:-------------:|:---------------:|:--------------------------:|:---------------------:|:----------------------------:|
648
- | 0 | 0 | - | 4.9500 | 0.6259 | 0.6583 | 0.6421 |
649
- | 0.3556 | 250 | 3.7709 | 3.0872 | 0.5333 | 0.4370 | 0.4851 |
650
- | 0.7112 | 500 | 3.1718 | 3.0498 | 0.4838 | 0.3837 | 0.4337 |
651
- | 1.0669 | 750 | 3.1254 | 3.0347 | 0.5118 | 0.3702 | 0.4410 |
652
- | 1.4225 | 1000 | 3.0691 | 3.0262 | 0.5028 | 0.3431 | 0.4230 |
653
- | 1.7781 | 1250 | 3.0635 | 3.0187 | 0.4694 | 0.3645 | 0.4170 |
654
- | 2.1337 | 1500 | 3.0361 | 3.0196 | 0.4736 | 0.3628 | 0.4182 |
655
- | 2.4893 | 1750 | 3.0015 | 3.0172 | 0.4869 | 0.3784 | 0.4327 |
656
- | 2.8450 | 2000 | 2.9995 | 3.0137 | 0.5018 | 0.3437 | 0.4228 |
657
- | 3.2006 | 2250 | 2.9779 | 3.0132 | 0.4725 | 0.3569 | 0.4147 |
658
- | 3.5562 | 2500 | 2.9653 | 3.0130 | 0.4705 | 0.3565 | 0.4135 |
659
- | 3.9118 | 2750 | 2.9611 | 3.0128 | 0.4725 | 0.3318 | 0.4022 |
660
- | 4.2674 | 3000 | 2.9451 | 3.0165 | 0.4660 | 0.3472 | 0.4066 |
661
- | 4.6230 | 3250 | 2.9383 | 3.0151 | 0.4652 | 0.3392 | 0.4022 |
662
 
663
 
664
  ### Framework Versions
 
7
  - generated_from_trainer
8
  - dataset_size:90000
9
  - loss:MultipleNegativesRankingLoss
10
+ base_model: Alibaba-NLP/gte-modernbert-base
11
  widget:
12
+ - source_sentence: who is the publisher of the norton anthology american literature
13
  sentences:
14
+ - W. W. Norton & Company W. W. Norton & Company is an American publishing company
15
+ based in New York City. It has been owned wholly by its employees since the early
16
+ 1960s. The company is known for its "Norton Anthologies" (particularly The Norton
17
+ Anthology of English Literature) and its texts in the Norton Critical Editions
18
+ series, the latter of which are frequently assigned in university literature courses.
19
+ - New Orleans La Nouvelle-Orléans (New Orleans) was founded in Spring of 1718 (7
20
+ May has become the traditional date to mark the anniversary, but the actual day
21
+ is unknown[25]) by the French Mississippi Company, under the direction of Jean-Baptiste
22
+ Le Moyne de Bienville, on land inhabited by the Chitimacha. It was named for Philippe
23
+ II, Duke of Orléans, who was Regent of the Kingdom of France at the time. His
24
+ title came from the French city of Orléans.
25
+ - I Really Like You The music video was directed by Peter Glanz. Jepsen filmed part
26
+ of the song's music video on 16 February 2015, in front of the Mondrian Hotel
27
+ in Manhattan alongside Tom Hanks, Justin Bieber and a troupe of dancers. Also
28
+ making cameo appearances in the video are Rudy Mancuso and Andrew B. Bachelor
29
+ (A.K.A. King Bach), well-known users of the short-form video sharing application
30
+ Vine. The video was released on 6 March 2015.[15] CBC Music's Nicolle Weeks described
31
+ it as "a more affable version" of the music video for The Verve's "Bitter Sweet
32
+ Symphony" (1997).[16] The music video has been rated as one of 10 Best Music Videos
33
+ of 2015 (So Far) by the readers of Billboard.[17]
34
+ - source_sentence: how many members on the house of representatives
35
  sentences:
36
+ - 'United States House of Representatives The composition and powers of the House
37
+ are established by Article One of the United States Constitution. The House is
38
+ composed of Representatives who sit in congressional districts that are allocated
39
+ to each of the 50 states on a basis of population as measured by the U.S. Census,
40
+ with each district entitled to one representative. Since its inception in 1789,
41
+ all Representatives have been directly elected. The total number of voting representatives
42
+ is fixed by law at 435.[1] As of the 2010 Census, the largest delegation is that
43
+ of California, with fifty-three representatives. Seven states have the smallest
44
+ delegation possible, a single representative: Alaska, Delaware, Montana, North
45
+ Dakota, South Dakota, Vermont, and Wyoming.[2]'
46
+ - Ain't No Mountain High Enough "Ain't No Mountain High Enough" is an R&B/soul song
47
+ written by Nickolas Ashford & Valerie Simpson in 1966 for the Tamla label, a division
48
+ of Motown. The composition was first successful as a 1967 hit single recorded
49
+ by Marvin Gaye and Tammi Terrell, becoming a hit again in 1970 when recorded by
50
+ former Supremes frontwoman Diana Ross. The song became Ross' first solo number-one
51
+ hit on the Billboard Hot 100 chart and was nominated for a Grammy Award.
52
+ - Synthetic element In chemistry, a synthetic element is a chemical element that
53
+ does not occur naturally on Earth, and can only be created artificially. So far,
54
+ 24 synthetic elements have been created (those with atomic numbers 95–118).
55
+ All are unstable, decaying with half-lives ranging from 15.6 million years to
56
+ a few hundred microseconds.
57
+ - source_sentence: what is the meaning of mbbs and md
58
  sentences:
59
+ - Adductor longus muscle Its main actions is to adduct and laterally rotate the
60
+ thigh; it can also produce some degree of flexion/anteversion.[1]
61
+ - Category 6 cable When used for 10/100/1000BASE-T, the maximum allowed length of
62
+ a Cat 6 cable is up to 100 meters (328 ft). This consists of 90 meters (295 ft)
63
+ of solid "horizontal" cabling between the patch panel and the wall jack, plus
64
+ 5 meters (16 ft) of stranded patch cable between each jack and the attached device.[7]
65
+ For 10GBASE-T, an unshielded Cat 6 cable should not exceed 55 meters.[8]
66
+ - Doctor of Medicine Historically, Australian medical schools have followed the
67
+ British tradition by conferring the degrees of Bachelor of Medicine and Bachelor
68
+ of Surgery (MBBS) to its graduates whilst reserving the title of Doctor of Medicine
69
+ (MD) for their research training degree, analogous to the PhD, or for their honorary
70
+ doctorates. Although the majority of Australian MBBS degrees have been graduate
71
+ programs since the 1990s, under the previous Australian Qualifications Framework
72
+ (AQF) they remained categorized as Level 7 Bachelor's degrees together with other
73
+ undergraduate programs.
74
+ - source_sentence: what holds the bone ends of an amphiarthrodial joint together
 
 
 
75
  sentences:
76
+ - Pubic symphysis The pubic symphysis is a nonsynovial amphiarthrodial joint. The
77
+ name comes from the Greek word "symphysis", meaning "growing together". The width
78
+ of the pubic symphysis at the front is 3–5 mm greater than its width at the back.
79
+ This joint is connected by fibrocartilage and may contain a fluid-filled cavity;
80
+ the center is avascular, possibly due to the nature of the compressive forces
81
+ passing through this joint, which may lead to harmful vascular disease.[2] The
82
+ ends of both pubic bones are covered by a thin layer of hyaline cartilage attached
83
+ to the fibrocartilage. The fibrocartilaginous disk is reinforced by a series of
84
+ ligaments. These ligaments cling to the fibrocartilaginous disk to the point that
85
+ fibers intermix with it.
86
+ - 'John 3:16 In Exodus 4:22, the Israelites as a people are called "my firstborn
87
+ son" by God using the singular form. In John, the focus shifts to the person of
88
+ Jesus as representative of that title. The verse is part of the New Testament
89
+ narrative in the third chapter of John in the discussion at Jerusalem between
90
+ Jesus and Nicodemus, who is called a "ruler of the Jews". (v.1) After speaking
91
+ of the necessity of a man being born again before he could "see the kingdom of
92
+ God", (v.3) Jesus spoke also of "heavenly things" (v.11-13) and of salvation (v.14-17)
93
+ and the condemnation (v.18,19) of those that do not believe in Jesus. "14 And
94
+ as Moses lifted up the serpent in the wilderness, even so must the Son of man
95
+ be lifted up: 15 That whosoever believeth in him should not perish, but have eternal
96
+ life." (John 3:14-15) Note that verse 15 is nearly identical to the latter part
97
+ of John 3:16.'
98
+ - Tony Hadley Anthony Patrick Hadley (born 2 June 1960) is an English singer-songwriter,
99
+ occasional stage actor and radio presenter. He rose to fame in the 1980s as the
100
+ lead singer of the new wave band Spandau Ballet before launching a solo career
101
+ following the group's split in 1990. Hadley is recognisable for his suave image,[1]
102
+ as well as his powerful blue-eyed soul voice, which has been described by AllMusic
103
+ as a "dramatic warble".[2] He has also been described as a "top crooner" by the
104
+ BBC.[3]
105
+ - source_sentence: what are the 5 liberties of the first amendment
106
  sentences:
107
+ - First Amendment to the United States Constitution The First Amendment (Amendment
108
+ I) to the United States Constitution prohibits the making of any law respecting
109
+ an establishment of religion, ensuring that there is no prohibition on the free
110
+ exercise of religion, abridging the freedom of speech, infringing on the freedom
111
+ of the press, interfering with the right to peaceably assemble, or prohibiting
112
+ the petitioning for a governmental redress of grievances. It was adopted on December
113
+ 15, 1791, as one of the ten amendments that constitute the Bill of Rights.
114
+ - Poor People's Campaign The SCLC announced the campaign on December 4, 1967. King
115
+ delivered a speech which identified "a kind of social insanity which could lead
116
+ to national ruin."[23] In January 1968, the SCLC created and distributed an "Economic
117
+ Fact Sheet" with statistics explaining why the campaign was necessary.[24] King
118
+ avoided providing specific details about the campaign and attempted to redirect
119
+ media attention to the values at stake.[25] The Poor People’s Campaign held firm
120
+ to the movement’s commitment to non-violence. “We are custodians of the philosophy
121
+ of non-violence,” said King at a press conference. “And it has worked”.[9] King
122
+ originally wanted the Poor People's Campaign to start in Quitman County, Mississippi
123
+ because of the intense and visible economic disparity there.[26]
124
+ - 'Jake and the Never Land Pirates Jake and the Never Land Pirates (also known as
125
+ Captain Jake and the Never Land Pirates in the fourth season and associated merchandise[1])
126
+ is an Annie Award-winning musical and interactive American children''s animated
127
+ television series shown on Disney Junior. It is based on Disney''s Peter Pan franchise,
128
+ which in turn is based on the famous book and play by British author J. M. Barrie.
129
+ It is the first Disney Junior original show following the switch from Playhouse
130
+ Disney. It stars Sean Ryan Fox from Henry Danger, Megan Richie, Jadon Sand, David
131
+ Arquette, Corey Burton, Jeff Bennett, Loren Hoskins and Dee Bradley Baker. The
132
+ title character Jake was previously voiced by Colin Ford, and then later by Cameron
133
+ Boyce, while Izzy was voiced for the first three seasons by Madison Pettis and
134
+ Cubby was voiced by Jonathan Morgan Heit. The series is created by Disney veteran
135
+ Bobs Gannaway, whose works include another Disney Junior series, Mickey Mouse
136
+ Clubhouse, and films such as Secret of the Wings, The Pirate Fairy and Planes:
137
+ Fire & Rescue. The last episode aired on November 6, 2016.'
138
  pipeline_tag: sentence-similarity
139
  library_name: sentence-transformers
140
  metrics:
 
154
  - cosine_mrr@10
155
  - cosine_map@100
156
  model-index:
157
+ - name: SentenceTransformer based on Alibaba-NLP/gte-modernbert-base
158
  results:
159
  - task:
160
  type: information-retrieval
 
164
  type: NanoMSMARCO
165
  metrics:
166
  - type: cosine_accuracy@1
167
+ value: 0.26
168
  name: Cosine Accuracy@1
169
  - type: cosine_accuracy@3
170
+ value: 0.48
171
  name: Cosine Accuracy@3
172
  - type: cosine_accuracy@5
173
+ value: 0.6
174
  name: Cosine Accuracy@5
175
  - type: cosine_accuracy@10
176
+ value: 0.68
177
  name: Cosine Accuracy@10
178
  - type: cosine_precision@1
179
+ value: 0.26
180
  name: Cosine Precision@1
181
  - type: cosine_precision@3
182
+ value: 0.15999999999999998
183
  name: Cosine Precision@3
184
  - type: cosine_precision@5
185
+ value: 0.12000000000000002
186
  name: Cosine Precision@5
187
  - type: cosine_precision@10
188
+ value: 0.068
189
  name: Cosine Precision@10
190
  - type: cosine_recall@1
191
+ value: 0.26
192
  name: Cosine Recall@1
193
  - type: cosine_recall@3
194
+ value: 0.48
195
  name: Cosine Recall@3
196
  - type: cosine_recall@5
197
+ value: 0.6
198
  name: Cosine Recall@5
199
  - type: cosine_recall@10
200
+ value: 0.68
201
  name: Cosine Recall@10
202
  - type: cosine_ndcg@10
203
+ value: 0.45896424557362947
204
  name: Cosine Ndcg@10
205
  - type: cosine_mrr@10
206
+ value: 0.38885714285714285
207
  name: Cosine Mrr@10
208
  - type: cosine_map@100
209
+ value: 0.39926372736834176
210
  name: Cosine Map@100
211
  - task:
212
  type: information-retrieval
 
216
  type: NanoNQ
217
  metrics:
218
  - type: cosine_accuracy@1
219
+ value: 0.36
220
  name: Cosine Accuracy@1
221
  - type: cosine_accuracy@3
222
+ value: 0.58
223
  name: Cosine Accuracy@3
224
  - type: cosine_accuracy@5
225
+ value: 0.64
226
  name: Cosine Accuracy@5
227
  - type: cosine_accuracy@10
228
+ value: 0.8
229
  name: Cosine Accuracy@10
230
  - type: cosine_precision@1
231
+ value: 0.36
232
  name: Cosine Precision@1
233
  - type: cosine_precision@3
234
+ value: 0.19333333333333333
235
  name: Cosine Precision@3
236
  - type: cosine_precision@5
237
+ value: 0.12800000000000003
238
  name: Cosine Precision@5
239
  - type: cosine_precision@10
240
+ value: 0.08
241
  name: Cosine Precision@10
242
  - type: cosine_recall@1
243
+ value: 0.36
244
  name: Cosine Recall@1
245
  - type: cosine_recall@3
246
+ value: 0.57
247
  name: Cosine Recall@3
248
  - type: cosine_recall@5
249
+ value: 0.62
250
  name: Cosine Recall@5
251
  - type: cosine_recall@10
252
+ value: 0.75
253
  name: Cosine Recall@10
254
  - type: cosine_ndcg@10
255
+ value: 0.5491170117720099
256
  name: Cosine Ndcg@10
257
  - type: cosine_mrr@10
258
+ value: 0.49174603174603176
259
  name: Cosine Mrr@10
260
  - type: cosine_map@100
261
+ value: 0.4918572150858902
262
  name: Cosine Map@100
263
  - task:
264
  type: nano-beir
 
268
  type: NanoBEIR_mean
269
  metrics:
270
  - type: cosine_accuracy@1
271
+ value: 0.31
272
  name: Cosine Accuracy@1
273
  - type: cosine_accuracy@3
274
+ value: 0.53
275
  name: Cosine Accuracy@3
276
  - type: cosine_accuracy@5
277
+ value: 0.62
278
  name: Cosine Accuracy@5
279
  - type: cosine_accuracy@10
280
+ value: 0.74
281
  name: Cosine Accuracy@10
282
  - type: cosine_precision@1
283
+ value: 0.31
284
  name: Cosine Precision@1
285
  - type: cosine_precision@3
286
+ value: 0.17666666666666664
287
  name: Cosine Precision@3
288
  - type: cosine_precision@5
289
+ value: 0.12400000000000003
290
  name: Cosine Precision@5
291
  - type: cosine_precision@10
292
+ value: 0.07400000000000001
293
  name: Cosine Precision@10
294
  - type: cosine_recall@1
295
+ value: 0.31
296
  name: Cosine Recall@1
297
  - type: cosine_recall@3
298
+ value: 0.5249999999999999
299
  name: Cosine Recall@3
300
  - type: cosine_recall@5
301
+ value: 0.61
302
  name: Cosine Recall@5
303
  - type: cosine_recall@10
304
+ value: 0.7150000000000001
305
  name: Cosine Recall@10
306
  - type: cosine_ndcg@10
307
+ value: 0.5040406286728196
308
  name: Cosine Ndcg@10
309
  - type: cosine_mrr@10
310
+ value: 0.4403015873015873
311
  name: Cosine Mrr@10
312
  - type: cosine_map@100
313
+ value: 0.44556047122711595
314
  name: Cosine Map@100
315
  ---
316
 
317
+ # SentenceTransformer based on Alibaba-NLP/gte-modernbert-base
318
 
319
+ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [Alibaba-NLP/gte-modernbert-base](https://huggingface.co/Alibaba-NLP/gte-modernbert-base). It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
320
 
321
  ## Model Details
322
 
323
  ### Model Description
324
  - **Model Type:** Sentence Transformer
325
+ - **Base model:** [Alibaba-NLP/gte-modernbert-base](https://huggingface.co/Alibaba-NLP/gte-modernbert-base) <!-- at revision e7f32e3c00f91d699e8c43b53106206bcc72bb22 -->
326
  - **Maximum Sequence Length:** 128 tokens
327
+ - **Output Dimensionality:** 768 dimensions
328
  - **Similarity Function:** Cosine Similarity
329
  <!-- - **Training Dataset:** Unknown -->
330
  <!-- - **Language:** Unknown -->
 
340
 
341
  ```
342
  SentenceTransformer(
343
+ (0): Transformer({'max_seq_length': 128, 'do_lower_case': False, 'architecture': 'ModernBertModel'})
344
+ (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
 
345
  )
346
  ```
347
 
 
363
  model = SentenceTransformer("redis/model-a-baseline")
364
  # Run inference
365
  sentences = [
366
+ 'what are the 5 liberties of the first amendment',
367
+ 'First Amendment to the United States Constitution The First Amendment (Amendment I) to the United States Constitution prohibits the making of any law respecting an establishment of religion, ensuring that there is no prohibition on the free exercise of religion, abridging the freedom of speech, infringing on the freedom of the press, interfering with the right to peaceably assemble, or prohibiting the petitioning for a governmental redress of grievances. It was adopted on December 15, 1791, as one of the ten amendments that constitute the Bill of Rights.',
368
+ "Jake and the Never Land Pirates Jake and the Never Land Pirates (also known as Captain Jake and the Never Land Pirates in the fourth season and associated merchandise[1]) is an Annie Award-winning musical and interactive American children's animated television series shown on Disney Junior. It is based on Disney's Peter Pan franchise, which in turn is based on the famous book and play by British author J. M. Barrie. It is the first Disney Junior original show following the switch from Playhouse Disney. It stars Sean Ryan Fox from Henry Danger, Megan Richie, Jadon Sand, David Arquette, Corey Burton, Jeff Bennett, Loren Hoskins and Dee Bradley Baker. The title character Jake was previously voiced by Colin Ford, and then later by Cameron Boyce, while Izzy was voiced for the first three seasons by Madison Pettis and Cubby was voiced by Jonathan Morgan Heit. The series is created by Disney veteran Bobs Gannaway, whose works include another Disney Junior series, Mickey Mouse Clubhouse, and films such as Secret of the Wings, The Pirate Fairy and Planes: Fire & Rescue. The last episode aired on November 6, 2016.",
369
  ]
370
  embeddings = model.encode(sentences)
371
  print(embeddings.shape)
372
+ # [3, 768]
373
 
374
  # Get the similarity scores for the embeddings
375
  similarities = model.similarity(embeddings, embeddings)
376
  print(similarities)
377
+ # tensor([[ 1.0000, 0.9877, -0.0962],
378
+ # [ 0.9877, 1.0000, -0.0887],
379
+ # [-0.0962, -0.0887, 1.0000]])
380
  ```
381
 
382
  <!--
 
414
 
415
  | Metric | NanoMSMARCO | NanoNQ |
416
  |:--------------------|:------------|:-----------|
417
+ | cosine_accuracy@1 | 0.26 | 0.36 |
418
+ | cosine_accuracy@3 | 0.48 | 0.58 |
419
+ | cosine_accuracy@5 | 0.6 | 0.64 |
420
+ | cosine_accuracy@10 | 0.68 | 0.8 |
421
+ | cosine_precision@1 | 0.26 | 0.36 |
422
+ | cosine_precision@3 | 0.16 | 0.1933 |
423
+ | cosine_precision@5 | 0.12 | 0.128 |
424
+ | cosine_precision@10 | 0.068 | 0.08 |
425
+ | cosine_recall@1 | 0.26 | 0.36 |
426
+ | cosine_recall@3 | 0.48 | 0.57 |
427
+ | cosine_recall@5 | 0.6 | 0.62 |
428
+ | cosine_recall@10 | 0.68 | 0.75 |
429
+ | **cosine_ndcg@10** | **0.459** | **0.5491** |
430
+ | cosine_mrr@10 | 0.3889 | 0.4917 |
431
+ | cosine_map@100 | 0.3993 | 0.4919 |
432
 
433
  #### Nano BEIR
434
 
 
444
  }
445
  ```
446
 
447
+ | Metric | Value |
448
+ |:--------------------|:----------|
449
+ | cosine_accuracy@1 | 0.31 |
450
+ | cosine_accuracy@3 | 0.53 |
451
+ | cosine_accuracy@5 | 0.62 |
452
+ | cosine_accuracy@10 | 0.74 |
453
+ | cosine_precision@1 | 0.31 |
454
+ | cosine_precision@3 | 0.1767 |
455
+ | cosine_precision@5 | 0.124 |
456
+ | cosine_precision@10 | 0.074 |
457
+ | cosine_recall@1 | 0.31 |
458
+ | cosine_recall@3 | 0.525 |
459
+ | cosine_recall@5 | 0.61 |
460
+ | cosine_recall@10 | 0.715 |
461
+ | **cosine_ndcg@10** | **0.504** |
462
+ | cosine_mrr@10 | 0.4403 |
463
+ | cosine_map@100 | 0.4456 |
464
 
465
  <!--
466
  ## Bias, Risks and Limitations
 
483
  * Size: 90,000 training samples
484
  * Columns: <code>anchor</code>, <code>positive</code>, and <code>negative</code>
485
  * Approximate statistics based on the first 1000 samples:
486
+ | | anchor | positive | negative |
487
+ |:--------|:-----------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
488
+ | type | string | string | string |
489
+ | details | <ul><li>min: 10 tokens</li><li>mean: 12.57 tokens</li><li>max: 28 tokens</li></ul> | <ul><li>min: 19 tokens</li><li>mean: 107.04 tokens</li><li>max: 128 tokens</li></ul> | <ul><li>min: 15 tokens</li><li>mean: 105.42 tokens</li><li>max: 128 tokens</li></ul> |
490
  * Samples:
491
+ | anchor | positive | negative |
492
+ |:----------------------------------------------------------------|:--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
493
+ | <code>who played in the movie throw momma from the train</code> | <code>Anne Ramsey Angelina (Anne) Ramsey (March 27, 1929[1] – August 11, 1988) was an American stage, television, and film actress. She was best known for portraying Mama Fratelli in The Goonies (1985) and Mrs. Lift, mother of Danny DeVito's protagonist, in Throw Momma from the Train (1987). The latter film saw Ramsey nominated for a Golden Globe Award and the Academy Award for Best Supporting Actress.</code> | <code>Aye Mere Watan Ke Logo "Aye Mere Watan Ke Logo" (Hindi: मेरे वतन के लोगों; "O' people of my country") is a Hindi patriotic song written by Kavi Pradeep, composed by C. Ramchandra, and performed by Lata Mangeshkar. The song commemorates Indian soldiers who died during the Sino-Indian War in 1962. The song was first performed live by Mangeshkar on 27 January 1963 at the National Stadium in New Delhi in the presence of President Sarvepalli Radhakrishnan and Prime Minister Jawaharlal Nehru, on account of Republic Day (26 January) 1963, which was just two months after the end of the war.</code> |
494
+ | <code>when was the wall in san diego built</code> | <code>Mexico–United States barrier In September 2017, the U.S. government announced the start of construction of eight prototype barriers made from concrete and other materials.[51][52] On June 3, 2018 the San Diego section of the US border wall construction began. [53]</code> | <code>The Dance of Dragons At the Wall, Jon Snow (Kit Harington) retreats from Hardhome defeated, accompanied by the surviving wildlings, much to the chagrin of some of the Night's Watch. In the North, Stannis Baratheon (Stephen Dillane) reluctantly allows Melisandre (Carice van Houten) to sacrifice his daughter Shireen (Kerry Ingram) after Ramsay Bolton (Iwan Rheon) sabotages his resources, resulting to his army's damaged morale. In Braavos, Arya Stark (Maisie Williams), detours from her mission given by Jaqen H'ghar (Tom Wlaschiha) to reconnoiter Meryn Trant (Ian Beattie) instead. In Dorne, Jaime Lannister (Nikolaj Coster-Waldau) secures Myrcella Baratheon's (Nell Tiger Free) release from Doran Martell's (Alexander Siddig) court against an indignant Ellaria Sand (Indira Varma). In Meereen, the Sons of the Harpy attack the stadium of Daznak's Pit in an attempt to assassinate Daenerys Targaryen (Emilia Clarke), who is rescued by Jorah Mormont (Iain Glen) and her firstborn dragon, Drogon. Lea...</code> |
495
+ | <code>urology is the study of diseases of the</code> | <code>Urology Urology (from Greek οὖρον ouron "urine" and -λο��ία -logia "study of"), also known as genitourinary surgery, is the branch of medicine that focuses on surgical and medical diseases of the male and female urinary-tract system and the male reproductive organs. Organs under the domain of urology include the kidneys, adrenal glands, ureters, urinary bladder, urethra, and the male reproductive organs (testes, epididymis, vas deferens, seminal vesicles, prostate, and penis).</code> | <code>The Jackie Gleason Show By far the most memorable and popular of Gleason's characters was blowhard Brooklyn bus driver Ralph Kramden, featured originally in a series of Cavalcade skits known as "The Honeymooners", with Pert Kelton as his wife Alice, and Art Carney as his upstairs neighbor Ed Norton. These were so popular that in 1955 Gleason suspended the variety format and filmed The Honeymooners as a regular half-hour sitcom (television's first spin-off), co-starring Carney, Audrey Meadows (who had replaced the blacklisted Kelton after the earlier move to CBS), and Joyce Randolph. Finishing 19th in the ratings, these 39 episodes were subsequently rerun constantly in syndication, often five nights a week, with the cycle repeating every two months for decades. They are probably the most familiar body of work from 1950s television with the exception of I Love Lucy starring Lucille Ball and Desi Arnaz.</code> |
496
  * Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
497
  ```json
498
  {
 
509
  * Size: 10,000 evaluation samples
510
  * Columns: <code>anchor</code>, <code>positive</code>, and <code>negative</code>
511
  * Approximate statistics based on the first 1000 samples:
512
+ | | anchor | positive | negative |
513
+ |:--------|:----------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
514
+ | type | string | string | string |
515
+ | details | <ul><li>min: 9 tokens</li><li>mean: 12.46 tokens</li><li>max: 25 tokens</li></ul> | <ul><li>min: 16 tokens</li><li>mean: 106.89 tokens</li><li>max: 128 tokens</li></ul> | <ul><li>min: 11 tokens</li><li>mean: 106.57 tokens</li><li>max: 128 tokens</li></ul> |
516
  * Samples:
517
+ | anchor | positive | negative |
518
+ |:-------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
519
+ | <code>what is the political system that the united states follows in regards to elections</code> | <code>Primary election The United States is one of few countries to select candidates through popular vote in a primary election system; most countries rely on party leaders to vote candidates, as was previously the case in the U.S.[9] In modern politics, primary elections have been described as a significant vehicle for taking decision-making from political insiders to the voters, though this is disputed by select political science research.[10] The selection of candidates for federal, state, and local general elections takes place in primary elections organized by the public administration for the general voting public to participate in for the purpose of nominating the respective parties' official candidates; state voters start the electoral process for governors and legislators through the primary process, as well as for many local officials from city councilors to county commissioners.[11] The candidate who moves from the primary to be successful in the general election takes public off...</code> | <code>Bob Gaudio Robert John "Bob" Gaudio (born November 17, 1942) is an American singer, songwriter, musician, and record producer, and the keyboardist/backing vocalist for The Four Seasons.</code> |
520
+ | <code>what caused the unusual landscape at the valley of fire</code> | <code>Valley of Fire State Park Complex uplifting and faulting of the region, followed by extensive erosion, have created the present landscape. The rough floor and jagged walls of the park contain brilliant formations of eroded sandstone and sand dunes more than 150 million years old. Other important rock formations include limestones, shales, and conglomerates.[4]</code> | <code>Fundamental Constitutions of Carolina Because the Fundamental Constitutions were drafted during John Locke's service to one of Province of Carolina proprietors, Anthony Ashley Cooper, it is widely alleged that Locke had a major role in the making of the Constitutions. In the view of historian David Armitage[5] and political scientist Vicki Hsueh, the Constitutions were co-authored by Locke and his patron Cooper, known also as 1st Earl of Shaftesbury.[6] However the document was a legal document written for and signed and sealed by the eight Lord proprietors to whom Charles II had granted the colony; Locke was only a paid secretary. He wrote it much as a lawyer writes a will.[4]</code> |
521
+ | <code>according to the guinness world records what author has the most published works</code> | <code>Guinness World Records The book itself holds a world record, as the best-selling copyrighted book of all time. As of the 2017 edition, it is now in its 62nd year of publication, published in 100 countries and 23 languages. The international franchise has extended beyond print to include television series and museums. The popularity of the franchise has resulted in Guinness World Records becoming the primary international authority on the cataloguing and verification of a huge number of world records; the organisation employs official record adjudicators authorised to verify the authenticity of the setting and breaking of records.[2]</code> | <code>The Big Chill (film) The Big Chill is a 1983 American comedy-drama film directed by Lawrence Kasdan, starring Tom Berenger, Glenn Close, Jeff Goldblum, William Hurt, Kevin Kline, Mary Kay Place, Meg Tilly, and JoBeth Williams. The plot focuses on a group of baby boomers who attended the University of Michigan, reuniting after 15 years when their friend Alex commits suicide. Kevin Costner was cast as Alex, but all scenes showing his face were cut. It was filmed in Beaufort, South Carolina.[2]</code> |
522
  * Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
523
  ```json
524
  {
 
534
  - `eval_strategy`: steps
535
  - `per_device_train_batch_size`: 128
536
  - `per_device_eval_batch_size`: 128
537
+ - `learning_rate`: 4e-05
538
+ - `weight_decay`: 0.01
539
+ - `max_steps`: 703
540
  - `warmup_ratio`: 0.1
541
  - `fp16`: True
542
  - `dataloader_drop_last`: True
 
563
  - `gradient_accumulation_steps`: 1
564
  - `eval_accumulation_steps`: None
565
  - `torch_empty_cache_steps`: None
566
+ - `learning_rate`: 4e-05
567
+ - `weight_decay`: 0.01
568
  - `adam_beta1`: 0.9
569
  - `adam_beta2`: 0.999
570
  - `adam_epsilon`: 1e-08
571
  - `max_grad_norm`: 1.0
572
  - `num_train_epochs`: 3.0
573
+ - `max_steps`: 703
574
  - `lr_scheduler_type`: linear
575
  - `lr_scheduler_kwargs`: {}
576
  - `warmup_ratio`: 0.1
 
677
  ### Training Logs
678
  | Epoch | Step | Training Loss | Validation Loss | NanoMSMARCO_cosine_ndcg@10 | NanoNQ_cosine_ndcg@10 | NanoBEIR_mean_cosine_ndcg@10 |
679
  |:------:|:----:|:-------------:|:---------------:|:--------------------------:|:---------------------:|:----------------------------:|
680
+ | 0 | 0 | - | 4.4513 | 0.6530 | 0.6552 | 0.6541 |
681
+ | 0.3556 | 250 | 3.1939 | 2.9908 | 0.4651 | 0.5551 | 0.5101 |
682
+ | 0.7112 | 500 | 2.9769 | 2.9599 | 0.4590 | 0.5491 | 0.5040 |
 
 
 
 
 
 
 
 
 
 
 
683
 
684
 
685
  ### Framework Versions
config_sentence_transformers.json CHANGED
@@ -1,5 +1,4 @@
1
  {
2
- "model_type": "SentenceTransformer",
3
  "__version__": {
4
  "sentence_transformers": "5.2.0",
5
  "transformers": "4.57.3",
@@ -10,5 +9,6 @@
10
  "document": ""
11
  },
12
  "default_prompt_name": null,
13
- "similarity_fn_name": "cosine"
 
14
  }
 
1
  {
 
2
  "__version__": {
3
  "sentence_transformers": "5.2.0",
4
  "transformers": "4.57.3",
 
9
  "document": ""
10
  },
11
  "default_prompt_name": null,
12
+ "similarity_fn_name": "cosine",
13
+ "model_type": "SentenceTransformer"
14
  }
modules.json CHANGED
@@ -10,11 +10,5 @@
10
  "name": "1",
11
  "path": "1_Pooling",
12
  "type": "sentence_transformers.models.Pooling"
13
- },
14
- {
15
- "idx": 2,
16
- "name": "2",
17
- "path": "2_Normalize",
18
- "type": "sentence_transformers.models.Normalize"
19
  }
20
  ]
 
10
  "name": "1",
11
  "path": "1_Pooling",
12
  "type": "sentence_transformers.models.Pooling"
 
 
 
 
 
 
13
  }
14
  ]