sucharush commited on
Commit
6a5ad8b
·
verified ·
1 Parent(s): 157a763

Add new SentenceTransformer model

Browse files
1_Pooling/config.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "word_embedding_dimension": 384,
3
+ "pooling_mode_cls_token": true,
4
+ "pooling_mode_mean_tokens": false,
5
+ "pooling_mode_max_tokens": false,
6
+ "pooling_mode_mean_sqrt_len_tokens": false,
7
+ "pooling_mode_weightedmean_tokens": false,
8
+ "pooling_mode_lasttoken": false,
9
+ "include_prompt": true
10
+ }
README.md ADDED
@@ -0,0 +1,974 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - sentence-transformers
4
+ - sentence-similarity
5
+ - feature-extraction
6
+ - generated_from_trainer
7
+ - dataset_size:98112
8
+ - loss:MultipleNegativesRankingLoss
9
+ base_model: BAAI/bge-small-en-v1.5
10
+ widget:
11
+ - source_sentence: 'Represent this question for retrieving relevant documents: I''m
12
+ curious about strategies businesses can use to ensure data privacy and security.'
13
+ sentences:
14
+ - 'The Vicsek fractal is a self-replicating fractal, where each square is replaced
15
+ by a pattern of five smaller squares in each iteration. The number of line segments
16
+ in each iteration can be calculated using the formula:
17
+
18
+
19
+ Number of line segments = 4 * (5^(n-1))
20
+
21
+
22
+ where n is the iteration number.
23
+
24
+
25
+ For the fifth iteration (n=5):
26
+
27
+
28
+ Number of line segments = 4 * (5^(5-1))
29
+
30
+ Number of line segments = 4 * (5^4)
31
+
32
+ Number of line segments = 4 * (625)
33
+
34
+ Number of line segments = 2500
35
+
36
+
37
+ So, there are 2500 line segments needed to draw the fifth iteration of the Vicsek
38
+ fractal.'
39
+ - '**Advanced Techniques and Strategies for Data Privacy and Security:**
40
+
41
+
42
+ 1. **Data encryption:** Encrypting data both at rest and in transit ensures it
43
+ remains protected even if accessed by unauthorized individuals.
44
+
45
+
46
+ 2. **Regular security audits:** Conducting regular security audits helps businesses
47
+ identify and address vulnerabilities in their systems and processes.
48
+
49
+
50
+ 3. **Data masking:** Data masking techniques such as tokenization and pseudonymization
51
+ protect sensitive data by replacing it with non-identifiable values.
52
+
53
+
54
+ 4. **Access control management:** Implementing robust access control measures,
55
+ such as role-based access control (RBAC) and multi-factor authentication (MFA),
56
+ restricts access to data to authorized personnel only.
57
+
58
+
59
+ 5. **Data minimization:** Businesses should only collect and store data that is
60
+ absolutely necessary for their operations to reduce the risk of data breaches.
61
+
62
+
63
+ **Practical Examples of Data Privacy and Security:**
64
+
65
+
66
+ - **Healthcare:** Hospitals and medical facilities use data encryption to protect
67
+ patient health records, ensuring compliance with HIPAA regulations.
68
+
69
+
70
+ - **Financial Services:** Banks and financial institutions implement MFA and access
71
+ control measures to safeguard customer financial data.
72
+
73
+
74
+ - **Retail:** E-commerce companies use data masking techniques to protect sensitive
75
+ customer information, such as credit card numbers.
76
+
77
+
78
+ **Interview Questions on Data Privacy and Security:**
79
+
80
+
81
+ - Describe the key principles of data privacy and security.
82
+
83
+
84
+ - Explain the different methods used for data encryption and their strengths and
85
+ weaknesses.
86
+
87
+
88
+ - How can organizations implement effective access control mechanisms to protect
89
+ data?
90
+
91
+
92
+ - What are the best practices for conducting security audits to ensure data privacy?
93
+
94
+
95
+ - Discuss the ethical and legal implications of data privacy and security breaches.'
96
+ - 'First, let''s write the system of linear equations as an augmented matrix:
97
+
98
+
99
+ [ 1 2 -1 | 5]
100
+
101
+ [ 2 -3 4 | 7]
102
+
103
+ [-6 7 -5 | -1]
104
+
105
+
106
+ Now, we''ll perform forward elimination to convert the matrix into an upper triangular
107
+ matrix.
108
+
109
+
110
+ Step 1: Eliminate x from the second and third rows.
111
+
112
+
113
+ To eliminate x from the second row, we''ll subtract 2 times the first row from
114
+ the second row:
115
+
116
+
117
+ [ 1 2 -1 | 5]
118
+
119
+ [ 0 -7 6 | -3]
120
+
121
+ [-6 7 -5 | -1]
122
+
123
+
124
+ To eliminate x from the third row, we''ll add 6 times the first row to the third
125
+ row:
126
+
127
+
128
+ [ 1 2 -1 | 5]
129
+
130
+ [ 0 -7 6 | -3]
131
+
132
+ [ 0 5 -1 | 29]
133
+
134
+
135
+ Step 2: Eliminate y from the third row.
136
+
137
+
138
+ To eliminate y from the third row, we''ll add (5/7) times the second row to the
139
+ third row:
140
+
141
+
142
+ [ 1 2 -1 | 5]
143
+
144
+ [ 0 -7 6 | -3]
145
+
146
+ [ 0 0 1 | 4]
147
+
148
+
149
+ Now, we have an upper triangular matrix, and we can perform back substitution
150
+ to find the values of x, y, and z.
151
+
152
+
153
+ Step 3: Back substitution
154
+
155
+
156
+ From the third row, we have z = 4.
157
+
158
+
159
+ Now, we''ll substitute z into the second row to find y:
160
+
161
+
162
+ -7y + 6(4) = -3
163
+
164
+ -7y + 24 = -3
165
+
166
+ -7y = -27
167
+
168
+ y = 27/7
169
+
170
+
171
+ Finally, we''ll substitute y and z into the first row to find x:
172
+
173
+
174
+ x + 2(27/7) - 4 = 5
175
+
176
+ x + 54/7 - 4 = 5
177
+
178
+ x = 5 - 54/7 + 4
179
+
180
+ x = (35 - 54 + 28)/7
181
+
182
+ x = 9/7
183
+
184
+
185
+ So, the solution to the system of linear equations is:
186
+
187
+
188
+ x = 9/7
189
+
190
+ y = 27/7
191
+
192
+ z = 4'
193
+ - source_sentence: "Represent this question for retrieving relevant documents: Solve\
194
+ \ the following system of linear equations with complex coefficients: \n\n2x +\
195
+ \ 5y + 3iz = 8i \n\nx - 3y + 2z = 1 \n\n-4x + 2y + 5iz = -5"
196
+ sentences:
197
+ - To determine whether tympanic membrane (TM) electrodes induce behavioral pure-tone
198
+ threshold shifts. Pure-tone thresholds (250 to 8000 Hz) were measured twice in
199
+ test (n = 18) and control (n = 10) groups. TM electrodes were placed between first
200
+ and second threshold measurements in the test group, whereas the control group
201
+ did not receive electrodes. Pure-tone threshold shifts were compared between groups.
202
+ The effect of TM electrode contact location on threshold shifts was evaluated
203
+ in the test group. TM electrodes significantly increased average low-frequency
204
+ thresholds, 7.5 dB at 250 Hz and 4.2 dB at 500 Hz, and shifts were as large as
205
+ 25 dB in individual ears. Also, threshold shifts did not appear to vary at any
206
+ frequency with TM electrode contact location.
207
+ - 'To solve the given system of linear equations with complex coefficients, we can
208
+ use the Gaussian elimination method. The given system of equations is:
209
+
210
+
211
+ (1) 2x + 5y + 3iz = 8i
212
+
213
+ (2) x - 3y + 2z = 1
214
+
215
+ (3) -4x + 2y + 5iz = -5
216
+
217
+
218
+ First, we will write the augmented matrix for this system:
219
+
220
+
221
+ | 2 5 3i | 8i |
222
+
223
+ | 1 -3 2 | 1 |
224
+
225
+ | -4 2 5i | -5 |
226
+
227
+
228
+ Next, we will perform row operations to get the matrix in row-echelon form. We
229
+ will start by making the first element of the second row 0. To do this, we can
230
+ subtract half of the first row from the second row:
231
+
232
+
233
+ | 2 5 3i | 8i |
234
+
235
+ | 0 -5.5 -1i | -3i |
236
+
237
+ | -4 2 5i | -5 |
238
+
239
+
240
+ Now, we will make the first element of the third row 0. To do this, we can add
241
+ twice the first row to the third row:
242
+
243
+
244
+ | 2 5 3i | 8i |
245
+
246
+ | 0 -5.5 -1i | -3i |
247
+
248
+ | 0 12 11i | 11i |
249
+
250
+
251
+ Next, we will make the second element of the third row 0. To do this, we can add
252
+ (12/5.5) times the second row to the third row:
253
+
254
+
255
+ | 2 5 3i | 8i |
256
+
257
+ | 0 -5.5 -1i | -3i |
258
+
259
+ | 0 0 10.8i| 6i |
260
+
261
+
262
+ Now, we have the matrix in row-echelon form. We can now solve for the variables
263
+ using back-substitution.
264
+
265
+
266
+ From the third row, we have:
267
+
268
+
269
+ 10.8i * z = 6i
270
+
271
+
272
+ Dividing both sides by 10.8i, we get:
273
+
274
+
275
+ z = 6i / 10.8i = 6/10.8 = 1/1.8 = 5/9
276
+
277
+
278
+ Now, we can substitute z back into the second row to find y:
279
+
280
+
281
+ -5.5y - 1i(5/9) = -3i
282
+
283
+
284
+ Multiplying both sides by -1, we get:
285
+
286
+
287
+ 5.5y + (5i/9) = 3i
288
+
289
+
290
+ Subtracting 5i/9 from both sides, we get:
291
+
292
+
293
+ 5.5y = 3i - 5i/9 = (22i - 5i) / 9 = 17i/9
294
+
295
+
296
+ Dividing both sides by 5.5, we get:
297
+
298
+
299
+ y = (17i/9) / 5.5 = 17i / 49.5 = 17i / (99/2) = 34i / 99
300
+
301
+
302
+ Finally, we can substitute y and z back into the first row to find x:
303
+
304
+
305
+ 2x + 5(34i/99) + 3i(5/9) = 8i
306
+
307
+
308
+ Multiplying both sides by 99, we get:
309
+
310
+
311
+ 198x + 5(34i) + 3i(55) = 792i
312
+
313
+
314
+ 198x + 170i + 165i = 792i
315
+
316
+
317
+ 198x = 792i - 335i = 457i
318
+
319
+
320
+ Dividing both sides by 198, we get:
321
+
322
+
323
+ x = 457i / 198
324
+
325
+
326
+ So, the solution to the given system of linear equations is:
327
+
328
+
329
+ x = 457i / 198
330
+
331
+ y = 34i / 99
332
+
333
+ z = 5/9'
334
+ - Remodelling of the asthmatic airway includes increased deposition of proteoglycan
335
+ (PG) molecules. One of the stimuli driving airway remodelling may be excessive
336
+ mechanical stimulation. We hypothesized that fibroblasts from asthmatic patients
337
+ would respond to excessive mechanical strain with up-regulation of message for
338
+ PGs. We obtained fibroblasts from asthmatic patients (AF) and normal volunteers
339
+ (NF) using endobronchial biopsy. Cells were maintained in culture until the fifth
340
+ passage and then grown on a flexible collagen-coated membrane. Using the Flexercell
341
+ device, cells were then subjected to cyclic stretch at 30% amplitude at 1 Hz for
342
+ 24 h. Control cells were unstrained. Total RNA was extracted from the cell layer
343
+ and quantitative RT-PCR performed for decorin, lumican and versican mRNA. In unstrained
344
+ cells, the expression of decorin mRNA was greater in AF than NF. With strain,
345
+ NF showed increased expression of versican mRNA and AF showed increased expression
346
+ of versican and decorin mRNA. The relative increase in versican mRNA expression
347
+ with strain was greater in AF than NF.
348
+ - source_sentence: 'Represent this question for retrieving relevant documents: What
349
+ is the total arc length of the Lévy C curve after iterating 8 times if the original
350
+ line segment had a length of 1 unit?'
351
+ sentences:
352
+ - "Pose estimation is indeed a fascinating area in computer vision, but it's not\
353
+ \ entirely a walk in the park. Estimating the pose of a human or object involves\
354
+ \ a combination of complex mathematical techniques and algorithms. Let's delve\
355
+ \ deeper into some key aspects of pose estimation:\n\n1). **3D vs 2D Pose Estimation**:\
356
+ \ \n - 3D Pose Estimation aims to determine the 3-dimensional pose of a subject,\
357
+ \ providing depth information along with the 2D coordinates. This requires specialized\
358
+ \ techniques like stereo cameras or depth sensors to capture the 3D structure\
359
+ \ of the scene.\n - In comparison, 2D Pose Estimation focuses on estimating the\
360
+ \ 2D pose of a subject within a single image or video frame, providing information\
361
+ \ about joint locations in the image plane.\n\n2). **Model-based Pose Estimation**:\
362
+ \ \n - This approach leverages predefined models of human (or object) skeletons\
363
+ \ with known joint connections. The model is then fitted to the input image or\
364
+ \ video data to estimate the pose of the subject. \n - A prominent example of\
365
+ \ Model-based Pose Estimation is the popular OpenPose library, which utilizes\
366
+ \ a part-based model to estimate human poses.\n\n3). **Model-free Pose Estimation**:\
367
+ \ \n - Contrary to model-based methods, model-free approaches do not rely on predefined\
368
+ \ models. Instead, they directly learn to estimate the pose from raw image or\
369
+ \ video data. \n - One such technique is the Convolutional Pose Machine (CPM)\
370
+ \ which uses convolutional neural networks to predict heatmaps for body joints,\
371
+ \ which are then refined to obtain the final pose estimation.\n\n4). **Case Study:\
372
+ \ Human Pose Estimation in Sports Analysis**: \n - Pose estimation plays a crucial\
373
+ \ role in sports analysis, enabling the quantification of player movements and\
374
+ \ kinematics. \n - For instance, in soccer, pose estimation techniques can be\
375
+ \ employed to track player positions, analyze their running patterns, and evaluate\
376
+ \ their performance during matches.\n\n5). **Comparative Analysis with Similar\
377
+ \ Concepts**: \n - Object Detection: While both pose estimation and object detection\
378
+ \ involve locating and identifying objects in images or videos, pose estimation\
379
+ \ specifically focuses on determining the pose or configuration of the object,\
380
+ \ while object detection primarily aims to identify and localize the object's\
381
+ \ presence.\n - Motion Capture: Pose estimation is closely related to motion capture,\
382
+ \ which involves tracking and recording the movements of human subjects. Motion\
383
+ \ capture systems typically employ specialized sensors or cameras to capture highly\
384
+ \ accurate 3D pose data, whereas pose estimation algorithms typically rely on\
385
+ \ computer vision techniques to infer poses from 2D or 3D image or video data.\n\
386
+ \n6). **Common Misconceptions and Clarifications**: \n - Pose estimation is not\
387
+ \ limited to humans: It can also be used to estimate the pose of objects, animals,\
388
+ \ and even vehicles.\n - Pose estimation is distinct from facial expression recognition:\
389
+ \ While both involve analyzing images or videos of people, pose estimation focuses\
390
+ \ on body posture and joint locations, whereas facial expression recognition aims\
391
+ \ to identify and interpret facial expressions."
392
+ - "The Lévy C curve is a self-replicating fractal that is created by iteratively\
393
+ \ replacing a straight line segment with two segments, each of which is 1/sqrt(2)\
394
+ \ times the length of the original segment, and joined at a right angle. \n\n\
395
+ After each iteration, the total arc length of the curve increases by a factor\
396
+ \ of 2/sqrt(2), which is equal to sqrt(2). \n\nIf the original line segment has\
397
+ \ a length of 1 unit, then after 8 iterations, the total arc length of the Lévy\
398
+ \ C curve will be:\n\nArc length = Original length * (sqrt(2))^n\nArc length =\
399
+ \ 1 * (sqrt(2))^8\nArc length = 1 * 2^4\nArc length = 1 * 16\nArc length = 16\
400
+ \ units\n\nSo, the total arc length of the Lévy C curve after iterating 8 times\
401
+ \ is 16 units."
402
+ - 'If the dictator keeps X points for themselves, the receiver will get the remaining
403
+ points, which can be calculated as:
404
+
405
+
406
+ Y = 10 - X
407
+
408
+
409
+ To find the fractional amount of the total points the receiver received, we can
410
+ create a fraction with Y as the numerator and the total points (10) as the denominator:
411
+
412
+
413
+ Fraction = Y/10 = (10 - X)/10
414
+
415
+
416
+ So, the receiver gets a fractional amount of (10 - X)/10 of the total points.'
417
+ - source_sentence: 'Represent this question for retrieving relevant documents: Detailed
418
+ Elaboration on Dimensionality Reduction and Industry Application'
419
+ sentences:
420
+ - '**Dimensionality Reduction: A Comprehensive Overview**
421
+
422
+
423
+ Dimensionality reduction is a fundamental concept in machine learning and data
424
+ analysis. It involves transforming high-dimensional data into a lower-dimensional
425
+ representation while preserving the most important information. Dimensionality
426
+ reduction techniques have a wide range of applications in various industries,
427
+ such as:
428
+
429
+
430
+ * **Feature engineering:** reducing the number of features in a dataset to improve
431
+ the efficiency of machine learning algorithms.
432
+
433
+ * **Visualization:** enabling the visualization of high-dimensional data by projecting
434
+ it onto a lower-dimensional subspace.
435
+
436
+ * **Data compression:** reducing the storage and transmission costs of large datasets.
437
+
438
+
439
+ **Specific Industry Applications:**
440
+
441
+
442
+ * **Computer vision:** Extracting meaningful features from images and videos for
443
+ object recognition, image segmentation, and facial recognition.
444
+
445
+ * **Natural language processing:** Reducing the dimensionality of text data for
446
+ text classification, document summarization, and machine translation.
447
+
448
+ * **Bioinformatics:** Analyzing gene expression data and identifying biomarkers
449
+ for disease diagnosis and drug discovery.
450
+
451
+ * **Financial modeling:** Identifying patterns and trends in financial data for
452
+ risk assessment, portfolio optimization, and fraud detection.
453
+
454
+ * **Recommendation systems:** Generating personalized recommendations for products,
455
+ movies, or music based on user preferences.
456
+
457
+
458
+ To further enhance your understanding, I can provide detailed explanations of
459
+ specific techniques, industry case studies, or address any specific questions
460
+ you may have.'
461
+ - Hepatocellular carcinoma is one of the most common malignancies worldwide. The
462
+ only curative treatment is surgery. As hepatocellular carcinoma is often associated
463
+ with liver cirrhosis, patients are at risk for postoperative liver failure. In
464
+ the recent years, platelets are thought to play an important role in liver regeneration.The
465
+ aim of this study was to discover the relevance of postoperative platelet counts
466
+ after liver resection for hepatocellular carcinoma. Data of 68 patients who underwent
467
+ liver resection for hepatocellular carcinoma between July 2007 and July 2012 in
468
+ a single centre were analysed. Postoperative morbidity and mortality were evaluated
469
+ in regard to postoperative platelet counts. Comparative analysis between patients
470
+ with platelet counts ≤100 2x109/ l and >100 x109/ l at d1 was performed in regard
471
+ to postoperative outcome. Within this cohort, 43 patients (63%) suffered from
472
+ histologically proven liver cirrhosis. Postoperative mortality was statistically
473
+ significant associated with postoperative reduced platelet counts. Comparative
474
+ analysis showed significantly elevated postoperative bilirubin levels and lower
475
+ prothrombin time in patients with platelet counts ≤ 100 1x109/ l at d1.
476
+ - "Let G be a group of order 25. Since 25 = 5^2 and 5 is prime, by the Sylow theorems,\
477
+ \ the number of 5-Sylow subgroups in G, denoted by n_5, satisfies:\n\n1. n_5 divides\
478
+ \ 25/5 = 5, and\n2. n_5 ≡ 1 (mod 5).\n\nFrom these conditions, we have that n_5\
479
+ \ can only be 1 or 5. \n\nCase 1: n_5 = 1\nIn this case, there is only one 5-Sylow\
480
+ \ subgroup, say H, in G. By the Sylow theorems, H is a normal subgroup of G. Since\
481
+ \ the order of H is 5, which is prime, H is cyclic, i.e., H ≅ C_5 (the cyclic\
482
+ \ group of order 5). \n\nNow, let g be an element of G that is not in H. Since\
483
+ \ H is normal in G, the set {gh : h ∈ H} is also a subgroup of G. Let K = {gh\
484
+ \ : h ∈ H}. Note that the order of K is also 5, as there is a one-to-one correspondence\
485
+ \ between the elements of H and K. Thus, K is also a cyclic group of order 5,\
486
+ \ i.e., K ≅ C_5.\n\nSince the orders of H and K are both 5, their intersection\
487
+ \ is trivial, i.e., H ∩ K = {e}, where e is the identity element of G. Moreover,\
488
+ \ since the order of G is 25, any element of G can be written as a product of\
489
+ \ elements from H and K. Therefore, G is the internal direct product of H and\
490
+ \ K, i.e., G ≅ H × K ≅ C_5 × C_5.\n\nCase 2: n_5 = 5\nIn this case, there are\
491
+ \ five 5-Sylow subgroups in G. Let H be one of these subgroups. Since the order\
492
+ \ of H is 5, which is prime, H is cyclic, i.e., H ≅ C_5.\n\nNow, consider the\
493
+ \ action of G on the set of 5-Sylow subgroups by conjugation. This action gives\
494
+ \ rise to a homomorphism φ: G → S_5, where S_5 is the symmetric group on 5 elements.\
495
+ \ The kernel of φ, say N, is a normal subgroup of G. Since the action is nontrivial,\
496
+ \ N is a proper subgroup of G, and thus, the order of N is either 1 or 5. If the\
497
+ \ order of N is 1, then G is isomorphic to a subgroup of S_5, which is a contradiction\
498
+ \ since the order of G is 25 and there is no subgroup of S_5 with order 25. Therefore,\
499
+ \ the order of N must be 5.\n\nSince the order of N is 5, N is a cyclic group\
500
+ \ of order 5, i.e., N ≅ C_5. Moreover, N is a normal subgroup of G. Let g be an\
501
+ \ element of G that is not in N. Then, the set {gn : n ∈ N} is also a subgroup\
502
+ \ of G. Let K = {gn : n ∈ N}. Note that the order of K is also 5, as there is\
503
+ \ a one-to-one correspondence between the elements of N and K. Thus, K is also\
504
+ \ a cyclic group of order 5, i.e., K ≅ C_5.\n\nSince the orders of N and K are\
505
+ \ both 5, their intersection is trivial, i.e., N ∩ K = {e}, where e is the identity\
506
+ \ element of G. Moreover, since the order of G is 25, any element of G can be\
507
+ \ written as a product of elements from N and K. Therefore, G is the internal\
508
+ \ direct product of N and K, i.e., G ≅ N × K ≅ C_5 × C_5.\n\nIn conclusion, a\
509
+ \ group of order 25 is either cyclic or isomorphic to the direct product of two\
510
+ \ cyclic groups of order 5."
511
+ - source_sentence: 'Represent this question for retrieving relevant documents: Does
512
+ low 25-Hydroxyvitamin D Level be Associated with Peripheral Arterial Disease in
513
+ Type 2 Diabetes Patients?'
514
+ sentences:
515
+ - 'Patients with type 2 diabetes have an increased risk of atherosclerosis and vascular
516
+ disease. Vitamin D deficiency is associated with vascular disease and is prevalent
517
+ in diabetes patients. We undertook this study to determine the association between
518
+ 25-hydroxyvitamin D (25[OH]D) levels and prevalence of peripheral arterial disease
519
+ (PAD) in type 2 diabetes patients. A total of 1028 type 2 diabetes patients were
520
+ recruited at Nanjing Medical University Affiliated Nanjing Hospital from November
521
+ 2011 to October 2013. PAD was defined as an ankle-brachial index (ABI) < 0.9.
522
+ Cardiovascular risk factors (blood pressure, HbA1c, lipid profile), comorbidities,
523
+ carotid intima-media thickness (IMT) and 25(OH)D were assessed. Overall prevalence
524
+ of PAD and of decreased 25(OH)D (<30 ng/mL) were 20.1% (207/1028) and 54.6% (561/1028),
525
+ respectively. PAD prevalence was higher in participants with decreased (23.9%)
526
+ than in those with normal (15.6%) 25(OH)D (≥30 ng/mL, p <0.01). Decreased 25(OH)D
527
+ was associated with increased risk of PAD (odds ratio [OR], 1.69, 95% CI: 1.17-2.44,
528
+ p <0.001) and PAD was significantly more likely to occur in participants ≥65 years
529
+ of age (OR, 2.56, 95% CI: 1.51 -4.48, vs. 1.21, 95% CI: 0.80-1.83, p-interaction = 0.027).
530
+ After adjusting for known cardiovascular risk factors and potential confounding
531
+ variables, the association of decreased 25(OH)D and PAD remained significant in
532
+ patients <65 years of age (OR, 1.55; 95% CI: 1.14-2.12, p = 0.006).'
533
+ - No study has been performed to compare the impacts of migraine and major depressive
534
+ episode (MDE) on depression, anxiety and somatic symptoms, and health-related
535
+ quality of life (HRQoL) among psychiatric outpatients. The aim of this study was
536
+ to investigate the above issue. This study enrolled consecutive psychiatric outpatients
537
+ with mood and/or anxiety disorders who undertook a first visit to a medical center.
538
+ Migraine was diagnosed according to the International Classification of Headache
539
+ Disorders, 2nd edition. Three psychometric scales and the Short-Form 36 were administered.
540
+ General linear models were used to estimate the difference in scores contributed
541
+ by either migraine or MDE. Multiple linear regressions were employed to compare
542
+ the variance of these scores explained by migraine or MDE. Among 214 enrolled
543
+ participants, 35.0% had migraine. Bipolar II disorder patients (70.0%) had the
544
+ highest percentage of migraine, followed by major depressive disorder (49.1%)
545
+ and only anxiety disorder (24.5%). Patients with migraine had worse depression,
546
+ anxiety, and somatic symptoms and lower SF-36 scores than those without. The estimated
547
+ differences in the scores of physical functioning, bodily pain, and somatic symptoms
548
+ contributed by migraine were not lower than those contributed by MDE. The regression
549
+ model demonstrated the variance explained by migraine was significantly greater
550
+ than that explained by MDE in physical and pain symptoms.
551
+ - 'Based on the information provided, we only know the number of patients who died
552
+ within the first year after the surgery. To determine the probability of a patient
553
+ surviving at least two years, we would need additional information about the number
554
+ of patients who died in the second year or survived beyond that.
555
+
556
+
557
+ Without this information, it is not possible to calculate the probability of a
558
+ patient surviving at least two years after the surgery.'
559
+ pipeline_tag: sentence-similarity
560
+ library_name: sentence-transformers
561
+ metrics:
562
+ - cosine_accuracy@1
563
+ - cosine_accuracy@3
564
+ - cosine_accuracy@5
565
+ - cosine_accuracy@10
566
+ - cosine_precision@1
567
+ - cosine_precision@3
568
+ - cosine_precision@5
569
+ - cosine_recall@1
570
+ - cosine_recall@3
571
+ - cosine_recall@5
572
+ - cosine_ndcg@10
573
+ - cosine_mrr@10
574
+ - cosine_map@100
575
+ model-index:
576
+ - name: SentenceTransformer based on BAAI/bge-small-en-v1.5
577
+ results:
578
+ - task:
579
+ type: logging
580
+ name: Logging
581
+ dataset:
582
+ name: ir eval
583
+ type: ir-eval
584
+ metrics:
585
+ - type: cosine_accuracy@1
586
+ value: 0.9241493167018252
587
+ name: Cosine Accuracy@1
588
+ - type: cosine_accuracy@3
589
+ value: 0.9788131706869669
590
+ name: Cosine Accuracy@3
591
+ - type: cosine_accuracy@5
592
+ value: 0.9906447766669724
593
+ name: Cosine Accuracy@5
594
+ - type: cosine_accuracy@10
595
+ value: 0.9965147207190681
596
+ name: Cosine Accuracy@10
597
+ - type: cosine_precision@1
598
+ value: 0.9241493167018252
599
+ name: Cosine Precision@1
600
+ - type: cosine_precision@3
601
+ value: 0.3262710568956556
602
+ name: Cosine Precision@3
603
+ - type: cosine_precision@5
604
+ value: 0.1981289553333945
605
+ name: Cosine Precision@5
606
+ - type: cosine_recall@1
607
+ value: 0.9241493167018252
608
+ name: Cosine Recall@1
609
+ - type: cosine_recall@3
610
+ value: 0.9788131706869669
611
+ name: Cosine Recall@3
612
+ - type: cosine_recall@5
613
+ value: 0.9906447766669724
614
+ name: Cosine Recall@5
615
+ - type: cosine_ndcg@10
616
+ value: 0.9634519649573985
617
+ name: Cosine Ndcg@10
618
+ - type: cosine_mrr@10
619
+ value: 0.9524509418552345
620
+ name: Cosine Mrr@10
621
+ - type: cosine_map@100
622
+ value: 0.9526115405885596
623
+ name: Cosine Map@100
624
+ ---
625
+
626
+ # SentenceTransformer based on BAAI/bge-small-en-v1.5
627
+
628
+ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [BAAI/bge-small-en-v1.5](https://huggingface.co/BAAI/bge-small-en-v1.5). It maps sentences & paragraphs to a 384-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
629
+
630
+ ## Model Details
631
+
632
+ ### Model Description
633
+ - **Model Type:** Sentence Transformer
634
+ - **Base model:** [BAAI/bge-small-en-v1.5](https://huggingface.co/BAAI/bge-small-en-v1.5) <!-- at revision 5c38ec7c405ec4b44b94cc5a9bb96e735b38267a -->
635
+ - **Maximum Sequence Length:** 512 tokens
636
+ - **Output Dimensionality:** 384 dimensions
637
+ - **Similarity Function:** Cosine Similarity
638
+ <!-- - **Training Dataset:** Unknown -->
639
+ <!-- - **Language:** Unknown -->
640
+ <!-- - **License:** Unknown -->
641
+
642
+ ### Model Sources
643
+
644
+ - **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
645
+ - **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
646
+ - **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
647
+
648
+ ### Full Model Architecture
649
+
650
+ ```
651
+ SentenceTransformer(
652
+ (0): Transformer({'max_seq_length': 512, 'do_lower_case': True}) with Transformer model: BertModel
653
+ (1): Pooling({'word_embedding_dimension': 384, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
654
+ (2): Normalize()
655
+ )
656
+ ```
657
+
658
+ ## Usage
659
+
660
+ ### Direct Usage (Sentence Transformers)
661
+
662
+ First install the Sentence Transformers library:
663
+
664
+ ```bash
665
+ pip install -U sentence-transformers
666
+ ```
667
+
668
+ Then you can load this model and run inference.
669
+ ```python
670
+ from sentence_transformers import SentenceTransformer
671
+
672
+ # Download from the 🤗 Hub
673
+ model = SentenceTransformer("sucharush/bge_MNR")
674
+ # Run inference
675
+ sentences = [
676
+ 'Represent this question for retrieving relevant documents: Does low 25-Hydroxyvitamin D Level be Associated with Peripheral Arterial Disease in Type 2 Diabetes Patients?',
677
+ 'Patients with type 2 diabetes have an increased risk of atherosclerosis and vascular disease. Vitamin D deficiency is associated with vascular disease and is prevalent in diabetes patients. We undertook this study to determine the association between 25-hydroxyvitamin D (25[OH]D) levels and prevalence of peripheral arterial disease (PAD) in type 2 diabetes patients. A total of 1028 type 2 diabetes patients were recruited at Nanjing Medical University Affiliated Nanjing Hospital from November 2011 to October 2013. PAD was defined as an ankle-brachial index (ABI)\xa0<\xa00.9. Cardiovascular risk factors (blood pressure, HbA1c, lipid profile), comorbidities, carotid intima-media thickness (IMT) and 25(OH)D were assessed. Overall prevalence of PAD and of decreased 25(OH)D (<30\xa0ng/mL) were 20.1% (207/1028) and 54.6% (561/1028), respectively. PAD prevalence was higher in participants with decreased (23.9%) than in those with normal (15.6%) 25(OH)D (≥30\xa0ng/mL, p\xa0<0.01). Decreased 25(OH)D was associated with increased risk of PAD (odds ratio [OR], 1.69, 95% CI: 1.17-2.44, p\xa0<0.001) and PAD was significantly more likely to occur in participants ≥65\xa0years of age (OR, 2.56, 95% CI: 1.51 -4.48, vs. 1.21, 95% CI: 0.80-1.83, p-interaction\xa0=\xa00.027). After adjusting for known cardiovascular risk factors and potential confounding variables, the association of decreased 25(OH)D and PAD remained significant in patients <65\xa0years of age (OR, 1.55; 95% CI: 1.14-2.12, p\xa0=\xa00.006).',
678
+ 'Based on the information provided, we only know the number of patients who died within the first year after the surgery. To determine the probability of a patient surviving at least two years, we would need additional information about the number of patients who died in the second year or survived beyond that.\n\nWithout this information, it is not possible to calculate the probability of a patient surviving at least two years after the surgery.',
679
+ ]
680
+ embeddings = model.encode(sentences)
681
+ print(embeddings.shape)
682
+ # [3, 384]
683
+
684
+ # Get the similarity scores for the embeddings
685
+ similarities = model.similarity(embeddings, embeddings)
686
+ print(similarities.shape)
687
+ # [3, 3]
688
+ ```
689
+
690
+ <!--
691
+ ### Direct Usage (Transformers)
692
+
693
+ <details><summary>Click to see the direct usage in Transformers</summary>
694
+
695
+ </details>
696
+ -->
697
+
698
+ <!--
699
+ ### Downstream Usage (Sentence Transformers)
700
+
701
+ You can finetune this model on your own dataset.
702
+
703
+ <details><summary>Click to expand</summary>
704
+
705
+ </details>
706
+ -->
707
+
708
+ <!--
709
+ ### Out-of-Scope Use
710
+
711
+ *List how the model may foreseeably be misused and address what users ought not to do with the model.*
712
+ -->
713
+
714
+ ## Evaluation
715
+
716
+ ### Metrics
717
+
718
+ #### Logging
719
+
720
+ * Dataset: `ir-eval`
721
+ * Evaluated with <code>__main__.LoggingEvaluator</code>
722
+
723
+ | Metric | Value |
724
+ |:-------------------|:-----------|
725
+ | cosine_accuracy@1 | 0.9241 |
726
+ | cosine_accuracy@3 | 0.9788 |
727
+ | cosine_accuracy@5 | 0.9906 |
728
+ | cosine_accuracy@10 | 0.9965 |
729
+ | cosine_precision@1 | 0.9241 |
730
+ | cosine_precision@3 | 0.3263 |
731
+ | cosine_precision@5 | 0.1981 |
732
+ | cosine_recall@1 | 0.9241 |
733
+ | cosine_recall@3 | 0.9788 |
734
+ | cosine_recall@5 | 0.9906 |
735
+ | **cosine_ndcg@10** | **0.9635** |
736
+ | cosine_mrr@10 | 0.9525 |
737
+ | cosine_map@100 | 0.9526 |
738
+
739
+ <!--
740
+ ## Bias, Risks and Limitations
741
+
742
+ *What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
743
+ -->
744
+
745
+ <!--
746
+ ### Recommendations
747
+
748
+ *What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
749
+ -->
750
+
751
+ ## Training Details
752
+
753
+ ### Training Dataset
754
+
755
+ #### Unnamed Dataset
756
+
757
+ * Size: 98,112 training samples
758
+ * Columns: <code>sentence_0</code> and <code>sentence_1</code>
759
+ * Approximate statistics based on the first 1000 samples:
760
+ | | sentence_0 | sentence_1 |
761
+ |:--------|:------------------------------------------------------------------------------------|:------------------------------------------------------------------------------------|
762
+ | type | string | string |
763
+ | details | <ul><li>min: 18 tokens</li><li>mean: 55.27 tokens</li><li>max: 512 tokens</li></ul> | <ul><li>min: 9 tokens</li><li>mean: 317.52 tokens</li><li>max: 512 tokens</li></ul> |
764
+ * Samples:
765
+ | sentence_0 | sentence_1 |
766
+ |:-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
767
+ | <code>Represent this question for retrieving relevant documents: Are elevated levels of pro-inflammatory oxylipins in older subjects normalized by flaxseed consumption?</code> | <code>Oxylipins, including eicosanoids, are highly bioactive molecules endogenously produced from polyunsaturated fatty acids. Oxylipins play a key role in chronic disease progression. It is possible, but unknown, if oxylipin concentrations change with the consumption of functional foods or differ with subject age. Therefore, in a parallel comparator trial, 20 healthy individuals were recruited into a younger (19-28years) or older (45-64years) age group (n=10/group). Participants ingested one muffin/day containing 30g of milled flaxseed (6g alpha-linolenic acid) for 4weeks. Plasma oxylipins were isolated through solid phase extraction, analyzed with HPLC-MS/MS targeted lipidomics, and quantified with the stable isotope dilution method. At baseline, the older group exhibited 13 oxylipins ≥2-fold the concentration of the younger group. Specifically, pro-inflammatory oxylipins 5-hydroxyeicosatetraenoic acid, 9,10,13-trihydroxyoctadecenoic acid, and 9,12,13-trihydroxyoctadecenoic acid were signi...</code> |
768
+ | <code>Represent this question for retrieving relevant documents: Find the isometries of the metric $ds^2 = dx^2 + dy^2$ over the rectangle $R=[0,a] \times [0,b]$, subject to the additional condition that any isometry $f$ maps $(0,0)$ to $(x_0, y_0)$. Find $x_0$ and $y_0$ such that the isometry $f$ is given by $f(x,y) = (x_0 + x, y_0 - y)$.</code> | <code>An isometry is a transformation that preserves the distance between points. In this case, we are looking for transformations that preserve the metric $ds^2 = dx^2 + dy^2$. Let's consider the transformation $f(x,y) = (x_0 + x, y_0 - y)$ and find the conditions on $x_0$ and $y_0$ for it to be an isometry.<br><br>First, let's compute the differential of the transformation:<br><br>$$df = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} \begin{pmatrix} dx \\ dy \end{pmatrix} = \begin{pmatrix} dx \\ -dy \end{pmatrix}$$<br><br>Now, let's compute the metric under this transformation:<br><br>$$ds'^2 = (dx')^2 + (dy')^2 = dx^2 + (-dy)^2 = dx^2 + dy^2$$<br><br>Since $ds'^2 = ds^2$, the transformation $f(x,y) = (x_0 + x, y_0 - y)$ is an isometry.<br><br>Now, let's find the conditions on $x_0$ and $y_0$ such that the isometry maps $(0,0)$ to $(x_0, y_0)$. Applying the transformation to $(0,0)$, we get:<br><br>$$f(0,0) = (x_0 + 0, y_0 - 0) = (x_0, y_0)$$<br><br>Since the transformation maps $(0,0)$ to $(x_0, y_0)$, there are no additional conditions...</code> |
769
+ | <code>Represent this question for retrieving relevant documents: Do two di-leucine motifs regulate trafficking and function of mouse ASIC2a?</code> | <code>Acid-sensing ion channels (ASICs) are proton-gated cation channels that mediate acid-induced responses in neurons. ASICs are important for mechanosensation, learning and memory, fear, pain, and neuronal injury. ASIC2a is widely expressed in the nervous system and modulates ASIC channel trafficking and activity in both central and peripheral systems. Here, to better understand mechanisms regulating ASIC2a, we searched for potential protein motifs that regulate ASIC2a trafficking.</code> |
770
+ * Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
771
+ ```json
772
+ {
773
+ "scale": 20.0,
774
+ "similarity_fct": "cos_sim"
775
+ }
776
+ ```
777
+
778
+ ### Training Hyperparameters
779
+ #### Non-Default Hyperparameters
780
+
781
+ - `eval_strategy`: steps
782
+ - `per_device_train_batch_size`: 32
783
+ - `per_device_eval_batch_size`: 32
784
+ - `num_train_epochs`: 1
785
+ - `batch_sampler`: no_duplicates
786
+ - `multi_dataset_batch_sampler`: round_robin
787
+
788
+ #### All Hyperparameters
789
+ <details><summary>Click to expand</summary>
790
+
791
+ - `overwrite_output_dir`: False
792
+ - `do_predict`: False
793
+ - `eval_strategy`: steps
794
+ - `prediction_loss_only`: True
795
+ - `per_device_train_batch_size`: 32
796
+ - `per_device_eval_batch_size`: 32
797
+ - `per_gpu_train_batch_size`: None
798
+ - `per_gpu_eval_batch_size`: None
799
+ - `gradient_accumulation_steps`: 1
800
+ - `eval_accumulation_steps`: None
801
+ - `torch_empty_cache_steps`: None
802
+ - `learning_rate`: 5e-05
803
+ - `weight_decay`: 0.0
804
+ - `adam_beta1`: 0.9
805
+ - `adam_beta2`: 0.999
806
+ - `adam_epsilon`: 1e-08
807
+ - `max_grad_norm`: 1
808
+ - `num_train_epochs`: 1
809
+ - `max_steps`: -1
810
+ - `lr_scheduler_type`: linear
811
+ - `lr_scheduler_kwargs`: {}
812
+ - `warmup_ratio`: 0.0
813
+ - `warmup_steps`: 0
814
+ - `log_level`: passive
815
+ - `log_level_replica`: warning
816
+ - `log_on_each_node`: True
817
+ - `logging_nan_inf_filter`: True
818
+ - `save_safetensors`: True
819
+ - `save_on_each_node`: False
820
+ - `save_only_model`: False
821
+ - `restore_callback_states_from_checkpoint`: False
822
+ - `no_cuda`: False
823
+ - `use_cpu`: False
824
+ - `use_mps_device`: False
825
+ - `seed`: 42
826
+ - `data_seed`: None
827
+ - `jit_mode_eval`: False
828
+ - `use_ipex`: False
829
+ - `bf16`: False
830
+ - `fp16`: False
831
+ - `fp16_opt_level`: O1
832
+ - `half_precision_backend`: auto
833
+ - `bf16_full_eval`: False
834
+ - `fp16_full_eval`: False
835
+ - `tf32`: None
836
+ - `local_rank`: 0
837
+ - `ddp_backend`: None
838
+ - `tpu_num_cores`: None
839
+ - `tpu_metrics_debug`: False
840
+ - `debug`: []
841
+ - `dataloader_drop_last`: False
842
+ - `dataloader_num_workers`: 0
843
+ - `dataloader_prefetch_factor`: None
844
+ - `past_index`: -1
845
+ - `disable_tqdm`: False
846
+ - `remove_unused_columns`: True
847
+ - `label_names`: None
848
+ - `load_best_model_at_end`: False
849
+ - `ignore_data_skip`: False
850
+ - `fsdp`: []
851
+ - `fsdp_min_num_params`: 0
852
+ - `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
853
+ - `tp_size`: 0
854
+ - `fsdp_transformer_layer_cls_to_wrap`: None
855
+ - `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
856
+ - `deepspeed`: None
857
+ - `label_smoothing_factor`: 0.0
858
+ - `optim`: adamw_torch
859
+ - `optim_args`: None
860
+ - `adafactor`: False
861
+ - `group_by_length`: False
862
+ - `length_column_name`: length
863
+ - `ddp_find_unused_parameters`: None
864
+ - `ddp_bucket_cap_mb`: None
865
+ - `ddp_broadcast_buffers`: False
866
+ - `dataloader_pin_memory`: True
867
+ - `dataloader_persistent_workers`: False
868
+ - `skip_memory_metrics`: True
869
+ - `use_legacy_prediction_loop`: False
870
+ - `push_to_hub`: False
871
+ - `resume_from_checkpoint`: None
872
+ - `hub_model_id`: None
873
+ - `hub_strategy`: every_save
874
+ - `hub_private_repo`: None
875
+ - `hub_always_push`: False
876
+ - `gradient_checkpointing`: False
877
+ - `gradient_checkpointing_kwargs`: None
878
+ - `include_inputs_for_metrics`: False
879
+ - `include_for_metrics`: []
880
+ - `eval_do_concat_batches`: True
881
+ - `fp16_backend`: auto
882
+ - `push_to_hub_model_id`: None
883
+ - `push_to_hub_organization`: None
884
+ - `mp_parameters`:
885
+ - `auto_find_batch_size`: False
886
+ - `full_determinism`: False
887
+ - `torchdynamo`: None
888
+ - `ray_scope`: last
889
+ - `ddp_timeout`: 1800
890
+ - `torch_compile`: False
891
+ - `torch_compile_backend`: None
892
+ - `torch_compile_mode`: None
893
+ - `include_tokens_per_second`: False
894
+ - `include_num_input_tokens_seen`: False
895
+ - `neftune_noise_alpha`: None
896
+ - `optim_target_modules`: None
897
+ - `batch_eval_metrics`: False
898
+ - `eval_on_start`: False
899
+ - `use_liger_kernel`: False
900
+ - `eval_use_gather_object`: False
901
+ - `average_tokens_across_devices`: False
902
+ - `prompts`: None
903
+ - `batch_sampler`: no_duplicates
904
+ - `multi_dataset_batch_sampler`: round_robin
905
+
906
+ </details>
907
+
908
+ ### Training Logs
909
+ | Epoch | Step | Training Loss | ir-eval_cosine_ndcg@10 |
910
+ |:------:|:----:|:-------------:|:----------------------:|
911
+ | 0.1631 | 500 | 0.021 | 0.9523 |
912
+ | 0.3262 | 1000 | 0.0069 | 0.9600 |
913
+ | 0.4892 | 1500 | 0.0051 | 0.9593 |
914
+ | 0.6523 | 2000 | 0.0055 | 0.9605 |
915
+ | 0.8154 | 2500 | 0.0053 | 0.9638 |
916
+ | 0.9785 | 3000 | 0.0056 | 0.9634 |
917
+ | 1.0 | 3066 | - | 0.9635 |
918
+
919
+
920
+ ### Framework Versions
921
+ - Python: 3.12.8
922
+ - Sentence Transformers: 3.4.1
923
+ - Transformers: 4.51.3
924
+ - PyTorch: 2.5.1+cu124
925
+ - Accelerate: 1.3.0
926
+ - Datasets: 3.2.0
927
+ - Tokenizers: 0.21.0
928
+
929
+ ## Citation
930
+
931
+ ### BibTeX
932
+
933
+ #### Sentence Transformers
934
+ ```bibtex
935
+ @inproceedings{reimers-2019-sentence-bert,
936
+ title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
937
+ author = "Reimers, Nils and Gurevych, Iryna",
938
+ booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
939
+ month = "11",
940
+ year = "2019",
941
+ publisher = "Association for Computational Linguistics",
942
+ url = "https://arxiv.org/abs/1908.10084",
943
+ }
944
+ ```
945
+
946
+ #### MultipleNegativesRankingLoss
947
+ ```bibtex
948
+ @misc{henderson2017efficient,
949
+ title={Efficient Natural Language Response Suggestion for Smart Reply},
950
+ author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
951
+ year={2017},
952
+ eprint={1705.00652},
953
+ archivePrefix={arXiv},
954
+ primaryClass={cs.CL}
955
+ }
956
+ ```
957
+
958
+ <!--
959
+ ## Glossary
960
+
961
+ *Clearly define terms in order to be accessible across audiences.*
962
+ -->
963
+
964
+ <!--
965
+ ## Model Card Authors
966
+
967
+ *Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
968
+ -->
969
+
970
+ <!--
971
+ ## Model Card Contact
972
+
973
+ *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
974
+ -->
config.json ADDED
@@ -0,0 +1,30 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "architectures": [
3
+ "BertModel"
4
+ ],
5
+ "attention_probs_dropout_prob": 0.1,
6
+ "classifier_dropout": null,
7
+ "hidden_act": "gelu",
8
+ "hidden_dropout_prob": 0.1,
9
+ "hidden_size": 384,
10
+ "id2label": {
11
+ "0": "LABEL_0"
12
+ },
13
+ "initializer_range": 0.02,
14
+ "intermediate_size": 1536,
15
+ "label2id": {
16
+ "LABEL_0": 0
17
+ },
18
+ "layer_norm_eps": 1e-12,
19
+ "max_position_embeddings": 512,
20
+ "model_type": "bert",
21
+ "num_attention_heads": 12,
22
+ "num_hidden_layers": 12,
23
+ "pad_token_id": 0,
24
+ "position_embedding_type": "absolute",
25
+ "torch_dtype": "float16",
26
+ "transformers_version": "4.51.3",
27
+ "type_vocab_size": 2,
28
+ "use_cache": true,
29
+ "vocab_size": 30522
30
+ }
config_sentence_transformers.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "__version__": {
3
+ "sentence_transformers": "3.4.1",
4
+ "transformers": "4.51.3",
5
+ "pytorch": "2.5.1+cu124"
6
+ },
7
+ "prompts": {},
8
+ "default_prompt_name": null,
9
+ "similarity_fn_name": "cosine"
10
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:63c73d8a3c53aa912fd35d5d5b652a7cdb117cb32a82a995198b8a460588f915
3
+ size 66741984
modules.json ADDED
@@ -0,0 +1,20 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [
2
+ {
3
+ "idx": 0,
4
+ "name": "0",
5
+ "path": "",
6
+ "type": "sentence_transformers.models.Transformer"
7
+ },
8
+ {
9
+ "idx": 1,
10
+ "name": "1",
11
+ "path": "1_Pooling",
12
+ "type": "sentence_transformers.models.Pooling"
13
+ },
14
+ {
15
+ "idx": 2,
16
+ "name": "2",
17
+ "path": "2_Normalize",
18
+ "type": "sentence_transformers.models.Normalize"
19
+ }
20
+ ]
sentence_bert_config.json ADDED
@@ -0,0 +1,4 @@
 
 
 
 
 
1
+ {
2
+ "max_seq_length": 512,
3
+ "do_lower_case": true
4
+ }
special_tokens_map.json ADDED
@@ -0,0 +1,37 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "cls_token": {
3
+ "content": "[CLS]",
4
+ "lstrip": false,
5
+ "normalized": false,
6
+ "rstrip": false,
7
+ "single_word": false
8
+ },
9
+ "mask_token": {
10
+ "content": "[MASK]",
11
+ "lstrip": false,
12
+ "normalized": false,
13
+ "rstrip": false,
14
+ "single_word": false
15
+ },
16
+ "pad_token": {
17
+ "content": "[PAD]",
18
+ "lstrip": false,
19
+ "normalized": false,
20
+ "rstrip": false,
21
+ "single_word": false
22
+ },
23
+ "sep_token": {
24
+ "content": "[SEP]",
25
+ "lstrip": false,
26
+ "normalized": false,
27
+ "rstrip": false,
28
+ "single_word": false
29
+ },
30
+ "unk_token": {
31
+ "content": "[UNK]",
32
+ "lstrip": false,
33
+ "normalized": false,
34
+ "rstrip": false,
35
+ "single_word": false
36
+ }
37
+ }
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
@@ -0,0 +1,58 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "added_tokens_decoder": {
3
+ "0": {
4
+ "content": "[PAD]",
5
+ "lstrip": false,
6
+ "normalized": false,
7
+ "rstrip": false,
8
+ "single_word": false,
9
+ "special": true
10
+ },
11
+ "100": {
12
+ "content": "[UNK]",
13
+ "lstrip": false,
14
+ "normalized": false,
15
+ "rstrip": false,
16
+ "single_word": false,
17
+ "special": true
18
+ },
19
+ "101": {
20
+ "content": "[CLS]",
21
+ "lstrip": false,
22
+ "normalized": false,
23
+ "rstrip": false,
24
+ "single_word": false,
25
+ "special": true
26
+ },
27
+ "102": {
28
+ "content": "[SEP]",
29
+ "lstrip": false,
30
+ "normalized": false,
31
+ "rstrip": false,
32
+ "single_word": false,
33
+ "special": true
34
+ },
35
+ "103": {
36
+ "content": "[MASK]",
37
+ "lstrip": false,
38
+ "normalized": false,
39
+ "rstrip": false,
40
+ "single_word": false,
41
+ "special": true
42
+ }
43
+ },
44
+ "clean_up_tokenization_spaces": true,
45
+ "cls_token": "[CLS]",
46
+ "do_basic_tokenize": true,
47
+ "do_lower_case": true,
48
+ "extra_special_tokens": {},
49
+ "mask_token": "[MASK]",
50
+ "model_max_length": 512,
51
+ "never_split": null,
52
+ "pad_token": "[PAD]",
53
+ "sep_token": "[SEP]",
54
+ "strip_accents": null,
55
+ "tokenize_chinese_chars": true,
56
+ "tokenizer_class": "BertTokenizer",
57
+ "unk_token": "[UNK]"
58
+ }
vocab.txt ADDED
The diff for this file is too large to render. See raw diff