Add new SentenceTransformer model

Browse files

Files changed (11) hide show

1_Pooling/config.json +10 -0
README.md +974 -0
config.json +30 -0
config_sentence_transformers.json +10 -0
model.safetensors +3 -0
modules.json +20 -0
sentence_bert_config.json +4 -0
special_tokens_map.json +37 -0
tokenizer.json +0 -0
tokenizer_config.json +58 -0
vocab.txt +0 -0

1_Pooling/config.json ADDED Viewed

	@@ -0,0 +1,10 @@

+{
+  "word_embedding_dimension": 384,
+  "pooling_mode_cls_token": true,
+  "pooling_mode_mean_tokens": false,
+  "pooling_mode_max_tokens": false,
+  "pooling_mode_mean_sqrt_len_tokens": false,
+  "pooling_mode_weightedmean_tokens": false,
+  "pooling_mode_lasttoken": false,
+  "include_prompt": true
+}

README.md ADDED Viewed

	@@ -0,0 +1,974 @@

+---
+tags:
+- sentence-transformers
+- sentence-similarity
+- feature-extraction
+- generated_from_trainer
+- dataset_size:98112
+- loss:MultipleNegativesRankingLoss
+base_model: BAAI/bge-small-en-v1.5
+widget:
+- source_sentence: 'Represent this question for retrieving relevant documents: I''m
+    curious about strategies businesses can use to ensure data privacy and security.'
+  sentences:
+  - 'The Vicsek fractal is a self-replicating fractal, where each square is replaced
+    by a pattern of five smaller squares in each iteration. The number of line segments
+    in each iteration can be calculated using the formula:
+    Number of line segments = 4 * (5^(n-1))
+    where n is the iteration number.
+    For the fifth iteration (n=5):
+    Number of line segments = 4 * (5^(5-1))
+    Number of line segments = 4 * (5^4)
+    Number of line segments = 4 * (625)
+    Number of line segments = 2500
+    So, there are 2500 line segments needed to draw the fifth iteration of the Vicsek
+    fractal.'
+  - '**Advanced Techniques and Strategies for Data Privacy and Security:**
+    1. **Data encryption:** Encrypting data both at rest and in transit ensures it
+    remains protected even if accessed by unauthorized individuals.
+    2. **Regular security audits:** Conducting regular security audits helps businesses
+    identify and address vulnerabilities in their systems and processes.
+    3. **Data masking:** Data masking techniques such as tokenization and pseudonymization
+    protect sensitive data by replacing it with non-identifiable values.
+    4. **Access control management:** Implementing robust access control measures,
+    such as role-based access control (RBAC) and multi-factor authentication (MFA),
+    restricts access to data to authorized personnel only.
+    5. **Data minimization:** Businesses should only collect and store data that is
+    absolutely necessary for their operations to reduce the risk of data breaches.
+    **Practical Examples of Data Privacy and Security:**
+    - **Healthcare:** Hospitals and medical facilities use data encryption to protect
+    patient health records, ensuring compliance with HIPAA regulations.
+    - **Financial Services:** Banks and financial institutions implement MFA and access
+    control measures to safeguard customer financial data.
+    - **Retail:** E-commerce companies use data masking techniques to protect sensitive
+    customer information, such as credit card numbers.
+    **Interview Questions on Data Privacy and Security:**
+    - Describe the key principles of data privacy and security.
+    - Explain the different methods used for data encryption and their strengths and
+    weaknesses.
+    - How can organizations implement effective access control mechanisms to protect
+    data?
+    - What are the best practices for conducting security audits to ensure data privacy?
+    - Discuss the ethical and legal implications of data privacy and security breaches.'
+  - 'First, let''s write the system of linear equations as an augmented matrix:
+    [ 1  2 -1 |  5]
+    [ 2 -3  4 |  7]
+    [-6  7 -5 | -1]
+    Now, we''ll perform forward elimination to convert the matrix into an upper triangular
+    matrix.
+    Step 1: Eliminate x from the second and third rows.
+    To eliminate x from the second row, we''ll subtract 2 times the first row from
+    the second row:
+    [ 1  2 -1 |  5]
+    [ 0 -7  6 | -3]
+    [-6  7 -5 | -1]
+    To eliminate x from the third row, we''ll add 6 times the first row to the third
+    row:
+    [ 1  2 -1 |  5]
+    [ 0 -7  6 | -3]
+    [ 0  5 -1 | 29]
+    Step 2: Eliminate y from the third row.
+    To eliminate y from the third row, we''ll add (5/7) times the second row to the
+    third row:
+    [ 1  2 -1 |  5]
+    [ 0 -7  6 | -3]
+    [ 0  0  1 |  4]
+    Now, we have an upper triangular matrix, and we can perform back substitution
+    to find the values of x, y, and z.
+    Step 3: Back substitution
+    From the third row, we have z = 4.
+    Now, we''ll substitute z into the second row to find y:
+    -7y + 6(4) = -3
+    -7y + 24 = -3
+    -7y = -27
+    y = 27/7
+    Finally, we''ll substitute y and z into the first row to find x:
+    x + 2(27/7) - 4 = 5
+    x + 54/7 - 4 = 5
+    x = 5 - 54/7 + 4
+    x = (35 - 54 + 28)/7
+    x = 9/7
+    So, the solution to the system of linear equations is:
+    x = 9/7
+    y = 27/7
+    z = 4'
+- source_sentence: "Represent this question for retrieving relevant documents: Solve\
+    \ the following system of linear equations with complex coefficients: \n\n2x +\
+    \ 5y + 3iz = 8i \n\nx - 3y + 2z = 1 \n\n-4x + 2y + 5iz = -5"
+  sentences:
+  - To determine whether tympanic membrane (TM) electrodes induce behavioral pure-tone
+    threshold shifts. Pure-tone thresholds (250 to 8000 Hz) were measured twice in
+    test (n = 18) and control (n = 10) groups. TM electrodes were placed between first
+    and second threshold measurements in the test group, whereas the control group
+    did not receive electrodes. Pure-tone threshold shifts were compared between groups.
+    The effect of TM electrode contact location on threshold shifts was evaluated
+    in the test group. TM electrodes significantly increased average low-frequency
+    thresholds, 7.5 dB at 250 Hz and 4.2 dB at 500 Hz, and shifts were as large as
+    25 dB in individual ears. Also, threshold shifts did not appear to vary at any
+    frequency with TM electrode contact location.
+  - 'To solve the given system of linear equations with complex coefficients, we can
+    use the Gaussian elimination method. The given system of equations is:
+    (1) 2x + 5y + 3iz = 8i
+    (2) x - 3y + 2z = 1
+    (3) -4x + 2y + 5iz = -5
+    First, we will write the augmented matrix for this system:
+    |  2   5   3i |  8i |
+    |  1  -3   2  |  1  |
+    | -4   2   5i | -5  |
+    Next, we will perform row operations to get the matrix in row-echelon form. We
+    will start by making the first element of the second row 0. To do this, we can
+    subtract half of the first row from the second row:
+    |  2   5   3i |  8i |
+    |  0  -5.5 -1i | -3i |
+    | -4   2   5i | -5  |
+    Now, we will make the first element of the third row 0. To do this, we can add
+    twice the first row to the third row:
+    |  2   5   3i |  8i  |
+    |  0  -5.5 -1i | -3i  |
+    |  0   12  11i | 11i  |
+    Next, we will make the second element of the third row 0. To do this, we can add
+    (12/5.5) times the second row to the third row:
+    |  2   5    3i  |  8i  |
+    |  0  -5.5 -1i  | -3i  |
+    |  0   0   10.8i|  6i  |
+    Now, we have the matrix in row-echelon form. We can now solve for the variables
+    using back-substitution.
+    From the third row, we have:
+    10.8i * z = 6i
+    Dividing both sides by 10.8i, we get:
+    z = 6i / 10.8i = 6/10.8 = 1/1.8 = 5/9
+    Now, we can substitute z back into the second row to find y:
+    -5.5y - 1i(5/9) = -3i
+    Multiplying both sides by -1, we get:
+    5.5y + (5i/9) = 3i
+    Subtracting 5i/9 from both sides, we get:
+    5.5y = 3i - 5i/9 = (22i - 5i) / 9 = 17i/9
+    Dividing both sides by 5.5, we get:
+    y = (17i/9) / 5.5 = 17i / 49.5 = 17i / (99/2) = 34i / 99
+    Finally, we can substitute y and z back into the first row to find x:
+    2x + 5(34i/99) + 3i(5/9) = 8i
+    Multiplying both sides by 99, we get:
+    198x + 5(34i) + 3i(55) = 792i
+    198x + 170i + 165i = 792i
+    198x = 792i - 335i = 457i
+    Dividing both sides by 198, we get:
+    x = 457i / 198
+    So, the solution to the given system of linear equations is:
+    x = 457i / 198
+    y = 34i / 99
+    z = 5/9'
+  - Remodelling of the asthmatic airway includes increased deposition of proteoglycan
+    (PG) molecules. One of the stimuli driving airway remodelling may be excessive
+    mechanical stimulation. We hypothesized that fibroblasts from asthmatic patients
+    would respond to excessive mechanical strain with up-regulation of message for
+    PGs. We obtained fibroblasts from asthmatic patients (AF) and normal volunteers
+    (NF) using endobronchial biopsy. Cells were maintained in culture until the fifth
+    passage and then grown on a flexible collagen-coated membrane. Using the Flexercell
+    device, cells were then subjected to cyclic stretch at 30% amplitude at 1 Hz for
+    24 h. Control cells were unstrained. Total RNA was extracted from the cell layer
+    and quantitative RT-PCR performed for decorin, lumican and versican mRNA. In unstrained
+    cells, the expression of decorin mRNA was greater in AF than NF. With strain,
+    NF showed increased expression of versican mRNA and AF showed increased expression
+    of versican and decorin mRNA. The relative increase in versican mRNA expression
+    with strain was greater in AF than NF.
+- source_sentence: 'Represent this question for retrieving relevant documents: What
+    is the total arc length of the Lévy C curve after iterating 8 times if the original
+    line segment had a length of 1 unit?'
+  sentences:
+  - "Pose estimation is indeed a fascinating area in computer vision, but it's not\
+    \ entirely a walk in the park. Estimating the pose of a human or object involves\
+    \ a combination of complex mathematical techniques and algorithms. Let's delve\
+    \ deeper into some key aspects of pose estimation:\n\n1). **3D vs 2D Pose Estimation**:\
+    \ \n - 3D Pose Estimation aims to determine the 3-dimensional pose of a subject,\
+    \ providing depth information along with the 2D coordinates. This requires specialized\
+    \ techniques like stereo cameras or depth sensors to capture the 3D structure\
+    \ of the scene.\n - In comparison, 2D Pose Estimation focuses on estimating the\
+    \ 2D pose of a subject within a single image or video frame, providing information\
+    \ about joint locations in the image plane.\n\n2). **Model-based Pose Estimation**:\
+    \ \n - This approach leverages predefined models of human (or object) skeletons\
+    \ with known joint connections. The model is then fitted to the input image or\
+    \ video data to estimate the pose of the subject.  \n - A prominent example of\
+    \ Model-based Pose Estimation is the popular OpenPose library, which utilizes\
+    \ a part-based model to estimate human poses.\n\n3). **Model-free Pose Estimation**:\
+    \ \n - Contrary to model-based methods, model-free approaches do not rely on predefined\
+    \ models. Instead, they directly learn to estimate the pose from raw image or\
+    \ video data. \n - One such technique is the Convolutional Pose Machine (CPM)\
+    \ which uses convolutional neural networks to predict heatmaps for body joints,\
+    \ which are then refined to obtain the final pose estimation.\n\n4). **Case Study:\
+    \ Human Pose Estimation in Sports Analysis**: \n - Pose estimation plays a crucial\
+    \ role in sports analysis, enabling the quantification of player movements and\
+    \ kinematics. \n - For instance, in soccer, pose estimation techniques can be\
+    \ employed to track player positions, analyze their running patterns, and evaluate\
+    \ their performance during matches.\n\n5). **Comparative Analysis with Similar\
+    \ Concepts**: \n - Object Detection: While both pose estimation and object detection\
+    \ involve locating and identifying objects in images or videos, pose estimation\
+    \ specifically focuses on determining the pose or configuration of the object,\
+    \ while object detection primarily aims to identify and localize the object's\
+    \ presence.\n - Motion Capture: Pose estimation is closely related to motion capture,\
+    \ which involves tracking and recording the movements of human subjects. Motion\
+    \ capture systems typically employ specialized sensors or cameras to capture highly\
+    \ accurate 3D pose data, whereas pose estimation algorithms typically rely on\
+    \ computer vision techniques to infer poses from 2D or 3D image or video data.\n\
+    \n6). **Common Misconceptions and Clarifications**: \n - Pose estimation is not\
+    \ limited to humans: It can also be used to estimate the pose of objects, animals,\
+    \ and even vehicles.\n - Pose estimation is distinct from facial expression recognition:\
+    \ While both involve analyzing images or videos of people, pose estimation focuses\
+    \ on body posture and joint locations, whereas facial expression recognition aims\
+    \ to identify and interpret facial expressions."
+  - "The Lévy C curve is a self-replicating fractal that is created by iteratively\
+    \ replacing a straight line segment with two segments, each of which is 1/sqrt(2)\
+    \ times the length of the original segment, and joined at a right angle. \n\n\
+    After each iteration, the total arc length of the curve increases by a factor\
+    \ of 2/sqrt(2), which is equal to sqrt(2). \n\nIf the original line segment has\
+    \ a length of 1 unit, then after 8 iterations, the total arc length of the Lévy\
+    \ C curve will be:\n\nArc length = Original length * (sqrt(2))^n\nArc length =\
+    \ 1 * (sqrt(2))^8\nArc length = 1 * 2^4\nArc length = 1 * 16\nArc length = 16\
+    \ units\n\nSo, the total arc length of the Lévy C curve after iterating 8 times\
+    \ is 16 units."
+  - 'If the dictator keeps X points for themselves, the receiver will get the remaining
+    points, which can be calculated as:
+    Y = 10 - X
+    To find the fractional amount of the total points the receiver received, we can
+    create a fraction with Y as the numerator and the total points (10) as the denominator:
+    Fraction = Y/10 = (10 - X)/10
+    So, the receiver gets a fractional amount of (10 - X)/10 of the total points.'
+- source_sentence: 'Represent this question for retrieving relevant documents: Detailed
+    Elaboration on Dimensionality Reduction and Industry Application'
+  sentences:
+  - '**Dimensionality Reduction: A Comprehensive Overview**
+    Dimensionality reduction is a fundamental concept in machine learning and data
+    analysis. It involves transforming high-dimensional data into a lower-dimensional
+    representation while preserving the most important information. Dimensionality
+    reduction techniques have a wide range of applications in various industries,
+    such as:
+    * **Feature engineering:** reducing the number of features in a dataset to improve
+    the efficiency of machine learning algorithms.
+    * **Visualization:** enabling the visualization of high-dimensional data by projecting
+    it onto a lower-dimensional subspace.
+    * **Data compression:** reducing the storage and transmission costs of large datasets.
+    **Specific Industry Applications:**
+    * **Computer vision:** Extracting meaningful features from images and videos for
+    object recognition, image segmentation, and facial recognition.
+    * **Natural language processing:** Reducing the dimensionality of text data for
+    text classification, document summarization, and machine translation.
+    * **Bioinformatics:** Analyzing gene expression data and identifying biomarkers
+    for disease diagnosis and drug discovery.
+    * **Financial modeling:** Identifying patterns and trends in financial data for
+    risk assessment, portfolio optimization, and fraud detection.
+    * **Recommendation systems:** Generating personalized recommendations for products,
+    movies, or music based on user preferences.
+    To further enhance your understanding, I can provide detailed explanations of
+    specific techniques, industry case studies, or address any specific questions
+    you may have.'
+  - Hepatocellular carcinoma is one of the most common malignancies worldwide. The
+    only curative treatment is surgery. As hepatocellular carcinoma is often associated
+    with liver cirrhosis, patients are at risk for postoperative liver failure. In
+    the recent years, platelets are thought to play an important role in liver regeneration.The
+    aim of this study was to discover the relevance of postoperative platelet counts
+    after liver resection for hepatocellular carcinoma. Data of 68 patients who underwent
+    liver resection for hepatocellular carcinoma between July 2007 and July 2012 in
+    a single centre were analysed. Postoperative morbidity and mortality were evaluated
+    in regard to postoperative platelet counts. Comparative analysis between patients
+    with platelet counts ≤100 2x109/ l and >100 x109/ l at d1 was performed in regard
+    to postoperative outcome. Within this cohort, 43 patients (63%) suffered from
+    histologically proven liver cirrhosis. Postoperative mortality was statistically
+    significant associated with postoperative reduced platelet counts. Comparative
+    analysis showed significantly elevated postoperative bilirubin levels and lower
+    prothrombin time in patients with platelet counts ≤ 100 1x109/ l at d1.
+  - "Let G be a group of order 25. Since 25 = 5^2 and 5 is prime, by the Sylow theorems,\
+    \ the number of 5-Sylow subgroups in G, denoted by n_5, satisfies:\n\n1. n_5 divides\
+    \ 25/5 = 5, and\n2. n_5 ≡ 1 (mod 5).\n\nFrom these conditions, we have that n_5\
+    \ can only be 1 or 5. \n\nCase 1: n_5 = 1\nIn this case, there is only one 5-Sylow\
+    \ subgroup, say H, in G. By the Sylow theorems, H is a normal subgroup of G. Since\
+    \ the order of H is 5, which is prime, H is cyclic, i.e., H ≅ C_5 (the cyclic\
+    \ group of order 5). \n\nNow, let g be an element of G that is not in H. Since\
+    \ H is normal in G, the set {gh : h ∈ H} is also a subgroup of G. Let K = {gh\
+    \ : h ∈ H}. Note that the order of K is also 5, as there is a one-to-one correspondence\
+    \ between the elements of H and K. Thus, K is also a cyclic group of order 5,\
+    \ i.e., K ≅ C_5.\n\nSince the orders of H and K are both 5, their intersection\
+    \ is trivial, i.e., H ∩ K = {e}, where e is the identity element of G. Moreover,\
+    \ since the order of G is 25, any element of G can be written as a product of\
+    \ elements from H and K. Therefore, G is the internal direct product of H and\
+    \ K, i.e., G ≅ H × K ≅ C_5 × C_5.\n\nCase 2: n_5 = 5\nIn this case, there are\
+    \ five 5-Sylow subgroups in G. Let H be one of these subgroups. Since the order\
+    \ of H is 5, which is prime, H is cyclic, i.e., H ≅ C_5.\n\nNow, consider the\
+    \ action of G on the set of 5-Sylow subgroups by conjugation. This action gives\
+    \ rise to a homomorphism φ: G → S_5, where S_5 is the symmetric group on 5 elements.\
+    \ The kernel of φ, say N, is a normal subgroup of G. Since the action is nontrivial,\
+    \ N is a proper subgroup of G, and thus, the order of N is either 1 or 5. If the\
+    \ order of N is 1, then G is isomorphic to a subgroup of S_5, which is a contradiction\
+    \ since the order of G is 25 and there is no subgroup of S_5 with order 25. Therefore,\
+    \ the order of N must be 5.\n\nSince the order of N is 5, N is a cyclic group\
+    \ of order 5, i.e., N ≅ C_5. Moreover, N is a normal subgroup of G. Let g be an\
+    \ element of G that is not in N. Then, the set {gn : n ∈ N} is also a subgroup\
+    \ of G. Let K = {gn : n ∈ N}. Note that the order of K is also 5, as there is\
+    \ a one-to-one correspondence between the elements of N and K. Thus, K is also\
+    \ a cyclic group of order 5, i.e., K ≅ C_5.\n\nSince the orders of N and K are\
+    \ both 5, their intersection is trivial, i.e., N ∩ K = {e}, where e is the identity\
+    \ element of G. Moreover, since the order of G is 25, any element of G can be\
+    \ written as a product of elements from N and K. Therefore, G is the internal\
+    \ direct product of N and K, i.e., G ≅ N × K ≅ C_5 × C_5.\n\nIn conclusion, a\
+    \ group of order 25 is either cyclic or isomorphic to the direct product of two\
+    \ cyclic groups of order 5."
+- source_sentence: 'Represent this question for retrieving relevant documents: Does
+    low 25-Hydroxyvitamin D Level be Associated with Peripheral Arterial Disease in
+    Type 2 Diabetes Patients?'
+  sentences:
+  - 'Patients with type 2 diabetes have an increased risk of atherosclerosis and vascular
+    disease. Vitamin D deficiency is associated with vascular disease and is prevalent
+    in diabetes patients. We undertook this study to determine the association between
+    25-hydroxyvitamin D (25[OH]D) levels and prevalence of peripheral arterial disease
+    (PAD) in type 2 diabetes patients. A total of 1028 type 2 diabetes patients were
+    recruited at Nanjing Medical University Affiliated Nanjing Hospital from November
+    2011 to October 2013. PAD was defined as an ankle-brachial index (ABI) < 0.9.
+    Cardiovascular risk factors (blood pressure, HbA1c, lipid profile), comorbidities,
+    carotid intima-media thickness (IMT) and 25(OH)D were assessed. Overall prevalence
+    of PAD and of decreased 25(OH)D (<30 ng/mL) were 20.1% (207/1028) and 54.6% (561/1028),
+    respectively. PAD prevalence was higher in participants with decreased (23.9%)
+    than in those with normal (15.6%) 25(OH)D (≥30 ng/mL, p <0.01). Decreased 25(OH)D
+    was associated with increased risk of PAD (odds ratio [OR], 1.69, 95% CI: 1.17-2.44,
+    p <0.001) and PAD was significantly more likely to occur in participants ≥65 years
+    of age (OR, 2.56, 95% CI: 1.51 -4.48, vs. 1.21, 95% CI: 0.80-1.83, p-interaction = 0.027).
+    After adjusting for known cardiovascular risk factors and potential confounding
+    variables, the association of decreased 25(OH)D and PAD remained significant in
+    patients <65 years of age (OR, 1.55; 95% CI: 1.14-2.12, p = 0.006).'
+  - No study has been performed to compare the impacts of migraine and major depressive
+    episode (MDE) on depression, anxiety and somatic symptoms, and health-related
+    quality of life (HRQoL) among psychiatric outpatients. The aim of this study was
+    to investigate the above issue. This study enrolled consecutive psychiatric outpatients
+    with mood and/or anxiety disorders who undertook a first visit to a medical center.
+    Migraine was diagnosed according to the International Classification of Headache
+    Disorders, 2nd edition. Three psychometric scales and the Short-Form 36 were administered.
+    General linear models were used to estimate the difference in scores contributed
+    by either migraine or MDE. Multiple linear regressions were employed to compare
+    the variance of these scores explained by migraine or MDE. Among 214 enrolled
+    participants, 35.0% had migraine. Bipolar II disorder patients (70.0%) had the
+    highest percentage of migraine, followed by major depressive disorder (49.1%)
+    and only anxiety disorder (24.5%). Patients with migraine had worse depression,
+    anxiety, and somatic symptoms and lower SF-36 scores than those without. The estimated
+    differences in the scores of physical functioning, bodily pain, and somatic symptoms
+    contributed by migraine were not lower than those contributed by MDE. The regression
+    model demonstrated the variance explained by migraine was significantly greater
+    than that explained by MDE in physical and pain symptoms.
+  - 'Based on the information provided, we only know the number of patients who died
+    within the first year after the surgery. To determine the probability of a patient
+    surviving at least two years, we would need additional information about the number
+    of patients who died in the second year or survived beyond that.
+    Without this information, it is not possible to calculate the probability of a
+    patient surviving at least two years after the surgery.'
+pipeline_tag: sentence-similarity
+library_name: sentence-transformers
+metrics:
+- cosine_accuracy@1
+- cosine_accuracy@3
+- cosine_accuracy@5
+- cosine_accuracy@10
+- cosine_precision@1
+- cosine_precision@3
+- cosine_precision@5
+- cosine_recall@1
+- cosine_recall@3
+- cosine_recall@5
+- cosine_ndcg@10
+- cosine_mrr@10
+- cosine_map@100
+model-index:
+- name: SentenceTransformer based on BAAI/bge-small-en-v1.5
+  results:
+  - task:
+      type: logging
+      name: Logging
+    dataset:
+      name: ir eval
+      type: ir-eval
+    metrics:
+    - type: cosine_accuracy@1
+      value: 0.9241493167018252
+      name: Cosine Accuracy@1
+    - type: cosine_accuracy@3
+      value: 0.9788131706869669
+      name: Cosine Accuracy@3
+    - type: cosine_accuracy@5
+      value: 0.9906447766669724
+      name: Cosine Accuracy@5
+    - type: cosine_accuracy@10
+      value: 0.9965147207190681
+      name: Cosine Accuracy@10
+    - type: cosine_precision@1
+      value: 0.9241493167018252
+      name: Cosine Precision@1
+    - type: cosine_precision@3
+      value: 0.3262710568956556
+      name: Cosine Precision@3
+    - type: cosine_precision@5
+      value: 0.1981289553333945
+      name: Cosine Precision@5
+    - type: cosine_recall@1
+      value: 0.9241493167018252
+      name: Cosine Recall@1
+    - type: cosine_recall@3
+      value: 0.9788131706869669
+      name: Cosine Recall@3
+    - type: cosine_recall@5
+      value: 0.9906447766669724
+      name: Cosine Recall@5
+    - type: cosine_ndcg@10
+      value: 0.9634519649573985
+      name: Cosine Ndcg@10
+    - type: cosine_mrr@10
+      value: 0.9524509418552345
+      name: Cosine Mrr@10
+    - type: cosine_map@100
+      value: 0.9526115405885596
+      name: Cosine Map@100
+---
+# SentenceTransformer based on BAAI/bge-small-en-v1.5
+This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [BAAI/bge-small-en-v1.5](https://huggingface.co/BAAI/bge-small-en-v1.5). It maps sentences & paragraphs to a 384-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
+## Model Details
+### Model Description
+- **Model Type:** Sentence Transformer
+- **Base model:** [BAAI/bge-small-en-v1.5](https://huggingface.co/BAAI/bge-small-en-v1.5) <!-- at revision 5c38ec7c405ec4b44b94cc5a9bb96e735b38267a -->
+- **Maximum Sequence Length:** 512 tokens
+- **Output Dimensionality:** 384 dimensions
+- **Similarity Function:** Cosine Similarity
+<!-- - **Training Dataset:** Unknown -->
+<!-- - **Language:** Unknown -->
+<!-- - **License:** Unknown -->
+### Model Sources
+- **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
+- **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
+- **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
+### Full Model Architecture
+```
+SentenceTransformer(
+  (0): Transformer({'max_seq_length': 512, 'do_lower_case': True}) with Transformer model: BertModel
+  (1): Pooling({'word_embedding_dimension': 384, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
+  (2): Normalize()
+)
+```
+## Usage
+### Direct Usage (Sentence Transformers)
+First install the Sentence Transformers library:
+```bash
+pip install -U sentence-transformers
+```
+Then you can load this model and run inference.
+```python
+from sentence_transformers import SentenceTransformer
+# Download from the 🤗 Hub
+model = SentenceTransformer("sucharush/bge_MNR")
+# Run inference
+sentences = [
+    'Represent this question for retrieving relevant documents: Does low 25-Hydroxyvitamin D Level be Associated with Peripheral Arterial Disease in Type 2 Diabetes Patients?',
+    'Patients with type 2 diabetes have an increased risk of atherosclerosis and vascular disease. Vitamin D deficiency is associated with vascular disease and is prevalent in diabetes patients. We undertook this study to determine the association between 25-hydroxyvitamin D (25[OH]D) levels and prevalence of peripheral arterial disease (PAD) in type 2 diabetes patients. A total of 1028 type 2 diabetes patients were recruited at Nanjing Medical University Affiliated Nanjing Hospital from November 2011 to October 2013. PAD was defined as an ankle-brachial index (ABI)\xa0<\xa00.9. Cardiovascular risk factors (blood pressure, HbA1c, lipid profile), comorbidities, carotid intima-media thickness (IMT) and 25(OH)D were assessed. Overall prevalence of PAD and of decreased 25(OH)D (<30\xa0ng/mL) were 20.1% (207/1028) and 54.6% (561/1028), respectively. PAD prevalence was higher in participants with decreased (23.9%) than in those with normal (15.6%) 25(OH)D (≥30\xa0ng/mL, p\xa0<0.01). Decreased 25(OH)D was associated with increased risk of PAD (odds ratio [OR], 1.69, 95% CI: 1.17-2.44, p\xa0<0.001) and PAD was significantly more likely to occur in participants ≥65\xa0years of age (OR, 2.56, 95% CI: 1.51 -4.48, vs. 1.21, 95% CI: 0.80-1.83, p-interaction\xa0=\xa00.027). After adjusting for known cardiovascular risk factors and potential confounding variables, the association of decreased 25(OH)D and PAD remained significant in patients <65\xa0years of age (OR, 1.55; 95% CI: 1.14-2.12, p\xa0=\xa00.006).',
+    'Based on the information provided, we only know the number of patients who died within the first year after the surgery. To determine the probability of a patient surviving at least two years, we would need additional information about the number of patients who died in the second year or survived beyond that.\n\nWithout this information, it is not possible to calculate the probability of a patient surviving at least two years after the surgery.',
+]
+embeddings = model.encode(sentences)
+print(embeddings.shape)
+# [3, 384]
+# Get the similarity scores for the embeddings
+similarities = model.similarity(embeddings, embeddings)
+print(similarities.shape)
+# [3, 3]
+```
+<!--
+### Direct Usage (Transformers)
+<details><summary>Click to see the direct usage in Transformers</summary>
+</details>
+-->
+<!--
+### Downstream Usage (Sentence Transformers)
+You can finetune this model on your own dataset.
+<details><summary>Click to expand</summary>
+</details>
+-->
+<!--
+### Out-of-Scope Use
+*List how the model may foreseeably be misused and address what users ought not to do with the model.*
+-->
+## Evaluation
+### Metrics
+#### Logging
+* Dataset: `ir-eval`
+* Evaluated with <code>__main__.LoggingEvaluator</code>
+| Metric             | Value      |
+|:-------------------|:-----------|
+| cosine_accuracy@1  | 0.9241     |
+| cosine_accuracy@3  | 0.9788     |
+| cosine_accuracy@5  | 0.9906     |
+| cosine_accuracy@10 | 0.9965     |
+| cosine_precision@1 | 0.9241     |
+| cosine_precision@3 | 0.3263     |
+| cosine_precision@5 | 0.1981     |
+| cosine_recall@1    | 0.9241     |
+| cosine_recall@3    | 0.9788     |
+| cosine_recall@5    | 0.9906     |
+| **cosine_ndcg@10** | **0.9635** |
+| cosine_mrr@10      | 0.9525     |
+| cosine_map@100     | 0.9526     |
+<!--
+## Bias, Risks and Limitations
+*What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
+-->
+<!--
+### Recommendations
+*What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
+-->
+## Training Details
+### Training Dataset
+#### Unnamed Dataset
+* Size: 98,112 training samples
+* Columns: <code>sentence_0</code> and <code>sentence_1</code>
+* Approximate statistics based on the first 1000 samples:
+  |         | sentence_0                                                                          | sentence_1                                                                          |
+  |:--------|:------------------------------------------------------------------------------------|:------------------------------------------------------------------------------------|
+  | type    | string                                                                              | string                                                                              |
+  | details | <ul><li>min: 18 tokens</li><li>mean: 55.27 tokens</li><li>max: 512 tokens</li></ul> | <ul><li>min: 9 tokens</li><li>mean: 317.52 tokens</li><li>max: 512 tokens</li></ul> |
+* Samples:
+  | sentence_0                                                                                                                                                                                                                                                                                                                                                   | sentence_1                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |
+  |:-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
+  | <code>Represent this question for retrieving relevant documents: Are elevated levels of pro-inflammatory oxylipins in older subjects normalized by flaxseed consumption?</code>                                                                                                                                                                              | <code>Oxylipins, including eicosanoids, are highly bioactive molecules endogenously produced from polyunsaturated fatty acids. Oxylipins play a key role in chronic disease progression. It is possible, but unknown, if oxylipin concentrations change with the consumption of functional foods or differ with subject age. Therefore, in a parallel comparator trial, 20 healthy individuals were recruited into a younger (19-28years) or older (45-64years) age group (n=10/group). Participants ingested one muffin/day containing 30g of milled flaxseed (6g alpha-linolenic acid) for 4weeks. Plasma oxylipins were isolated through solid phase extraction, analyzed with HPLC-MS/MS targeted lipidomics, and quantified with the stable isotope dilution method. At baseline, the older group exhibited 13 oxylipins ≥2-fold the concentration of the younger group. Specifically, pro-inflammatory oxylipins 5-hydroxyeicosatetraenoic acid, 9,10,13-trihydroxyoctadecenoic acid, and 9,12,13-trihydroxyoctadecenoic acid were signi...</code>                                                 |
+  | <code>Represent this question for retrieving relevant documents: Find the isometries of the metric $ds^2 = dx^2 + dy^2$ over the rectangle $R=[0,a] \times [0,b]$, subject to the additional condition that any isometry $f$ maps $(0,0)$ to $(x_0, y_0)$. Find $x_0$ and $y_0$ such that the isometry $f$ is given by $f(x,y) = (x_0 + x, y_0 - y)$.</code> | <code>An isometry is a transformation that preserves the distance between points. In this case, we are looking for transformations that preserve the metric $ds^2 = dx^2 + dy^2$. Let's consider the transformation $f(x,y) = (x_0 + x, y_0 - y)$ and find the conditions on $x_0$ and $y_0$ for it to be an isometry.<br><br>First, let's compute the differential of the transformation:<br><br>$$df = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} \begin{pmatrix} dx \\ dy \end{pmatrix} = \begin{pmatrix} dx \\ -dy \end{pmatrix}$$<br><br>Now, let's compute the metric under this transformation:<br><br>$$ds'^2 = (dx')^2 + (dy')^2 = dx^2 + (-dy)^2 = dx^2 + dy^2$$<br><br>Since $ds'^2 = ds^2$, the transformation $f(x,y) = (x_0 + x, y_0 - y)$ is an isometry.<br><br>Now, let's find the conditions on $x_0$ and $y_0$ such that the isometry maps $(0,0)$ to $(x_0, y_0)$. Applying the transformation to $(0,0)$, we get:<br><br>$$f(0,0) = (x_0 + 0, y_0 - 0) = (x_0, y_0)$$<br><br>Since the transformation maps $(0,0)$ to $(x_0, y_0)$, there are no additional conditions...</code> |
+  | <code>Represent this question for retrieving relevant documents: Do two di-leucine motifs regulate trafficking and function of mouse ASIC2a?</code>                                                                                                                                                                                                          | <code>Acid-sensing ion channels (ASICs) are proton-gated cation channels that mediate acid-induced responses in neurons. ASICs are important for mechanosensation, learning and memory, fear, pain, and neuronal injury. ASIC2a is widely expressed in the nervous system and modulates ASIC channel trafficking and activity in both central and peripheral systems. Here, to better understand mechanisms regulating ASIC2a, we searched for potential protein motifs that regulate ASIC2a trafficking.</code>                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                         |
+* Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
+  ```json
+  {
+      "scale": 20.0,
+      "similarity_fct": "cos_sim"
+  }
+  ```
+### Training Hyperparameters
+#### Non-Default Hyperparameters
+- `eval_strategy`: steps
+- `per_device_train_batch_size`: 32
+- `per_device_eval_batch_size`: 32
+- `num_train_epochs`: 1
+- `batch_sampler`: no_duplicates
+- `multi_dataset_batch_sampler`: round_robin
+#### All Hyperparameters
+<details><summary>Click to expand</summary>
+- `overwrite_output_dir`: False
+- `do_predict`: False
+- `eval_strategy`: steps
+- `prediction_loss_only`: True
+- `per_device_train_batch_size`: 32
+- `per_device_eval_batch_size`: 32
+- `per_gpu_train_batch_size`: None
+- `per_gpu_eval_batch_size`: None
+- `gradient_accumulation_steps`: 1
+- `eval_accumulation_steps`: None
+- `torch_empty_cache_steps`: None
+- `learning_rate`: 5e-05
+- `weight_decay`: 0.0
+- `adam_beta1`: 0.9
+- `adam_beta2`: 0.999
+- `adam_epsilon`: 1e-08
+- `max_grad_norm`: 1
+- `num_train_epochs`: 1
+- `max_steps`: -1
+- `lr_scheduler_type`: linear
+- `lr_scheduler_kwargs`: {}
+- `warmup_ratio`: 0.0
+- `warmup_steps`: 0
+- `log_level`: passive
+- `log_level_replica`: warning
+- `log_on_each_node`: True
+- `logging_nan_inf_filter`: True
+- `save_safetensors`: True
+- `save_on_each_node`: False
+- `save_only_model`: False
+- `restore_callback_states_from_checkpoint`: False
+- `no_cuda`: False
+- `use_cpu`: False
+- `use_mps_device`: False
+- `seed`: 42
+- `data_seed`: None
+- `jit_mode_eval`: False
+- `use_ipex`: False
+- `bf16`: False
+- `fp16`: False
+- `fp16_opt_level`: O1
+- `half_precision_backend`: auto
+- `bf16_full_eval`: False
+- `fp16_full_eval`: False
+- `tf32`: None
+- `local_rank`: 0
+- `ddp_backend`: None
+- `tpu_num_cores`: None
+- `tpu_metrics_debug`: False
+- `debug`: []
+- `dataloader_drop_last`: False
+- `dataloader_num_workers`: 0
+- `dataloader_prefetch_factor`: None
+- `past_index`: -1
+- `disable_tqdm`: False
+- `remove_unused_columns`: True
+- `label_names`: None
+- `load_best_model_at_end`: False
+- `ignore_data_skip`: False
+- `fsdp`: []
+- `fsdp_min_num_params`: 0
+- `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
+- `tp_size`: 0
+- `fsdp_transformer_layer_cls_to_wrap`: None
+- `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
+- `deepspeed`: None
+- `label_smoothing_factor`: 0.0
+- `optim`: adamw_torch
+- `optim_args`: None
+- `adafactor`: False
+- `group_by_length`: False
+- `length_column_name`: length
+- `ddp_find_unused_parameters`: None
+- `ddp_bucket_cap_mb`: None
+- `ddp_broadcast_buffers`: False
+- `dataloader_pin_memory`: True
+- `dataloader_persistent_workers`: False
+- `skip_memory_metrics`: True
+- `use_legacy_prediction_loop`: False
+- `push_to_hub`: False
+- `resume_from_checkpoint`: None
+- `hub_model_id`: None
+- `hub_strategy`: every_save
+- `hub_private_repo`: None
+- `hub_always_push`: False
+- `gradient_checkpointing`: False
+- `gradient_checkpointing_kwargs`: None
+- `include_inputs_for_metrics`: False
+- `include_for_metrics`: []
+- `eval_do_concat_batches`: True
+- `fp16_backend`: auto
+- `push_to_hub_model_id`: None
+- `push_to_hub_organization`: None
+- `mp_parameters`:
+- `auto_find_batch_size`: False
+- `full_determinism`: False
+- `torchdynamo`: None
+- `ray_scope`: last
+- `ddp_timeout`: 1800
+- `torch_compile`: False
+- `torch_compile_backend`: None
+- `torch_compile_mode`: None
+- `include_tokens_per_second`: False
+- `include_num_input_tokens_seen`: False
+- `neftune_noise_alpha`: None
+- `optim_target_modules`: None
+- `batch_eval_metrics`: False
+- `eval_on_start`: False
+- `use_liger_kernel`: False
+- `eval_use_gather_object`: False
+- `average_tokens_across_devices`: False
+- `prompts`: None
+- `batch_sampler`: no_duplicates
+- `multi_dataset_batch_sampler`: round_robin
+</details>
+### Training Logs
+| Epoch  | Step | Training Loss | ir-eval_cosine_ndcg@10 |
+|:------:|:----:|:-------------:|:----------------------:|
+| 0.1631 | 500  | 0.021         | 0.9523                 |
+| 0.3262 | 1000 | 0.0069        | 0.9600                 |
+| 0.4892 | 1500 | 0.0051        | 0.9593                 |
+| 0.6523 | 2000 | 0.0055        | 0.9605                 |
+| 0.8154 | 2500 | 0.0053        | 0.9638                 |
+| 0.9785 | 3000 | 0.0056        | 0.9634                 |
+| 1.0    | 3066 | -             | 0.9635                 |
+### Framework Versions
+- Python: 3.12.8
+- Sentence Transformers: 3.4.1
+- Transformers: 4.51.3
+- PyTorch: 2.5.1+cu124
+- Accelerate: 1.3.0
+- Datasets: 3.2.0
+- Tokenizers: 0.21.0
+## Citation
+### BibTeX
+#### Sentence Transformers
+```bibtex
+@inproceedings{reimers-2019-sentence-bert,
+    title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
+    author = "Reimers, Nils and Gurevych, Iryna",
+    booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
+    month = "11",
+    year = "2019",
+    publisher = "Association for Computational Linguistics",
+    url = "https://arxiv.org/abs/1908.10084",
+}
+```
+#### MultipleNegativesRankingLoss
+```bibtex
+@misc{henderson2017efficient,
+    title={Efficient Natural Language Response Suggestion for Smart Reply},
+    author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
+    year={2017},
+    eprint={1705.00652},
+    archivePrefix={arXiv},
+    primaryClass={cs.CL}
+}
+```
+<!--
+## Glossary
+*Clearly define terms in order to be accessible across audiences.*
+-->
+<!--
+## Model Card Authors
+*Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
+-->
+<!--
+## Model Card Contact
+*Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
+-->

config.json ADDED Viewed

	@@ -0,0 +1,30 @@

+{
+  "architectures": [
+    "BertModel"
+  ],
+  "attention_probs_dropout_prob": 0.1,
+  "classifier_dropout": null,
+  "hidden_act": "gelu",
+  "hidden_dropout_prob": 0.1,
+  "hidden_size": 384,
+  "id2label": {
+    "0": "LABEL_0"
+  },
+  "initializer_range": 0.02,
+  "intermediate_size": 1536,
+  "label2id": {
+    "LABEL_0": 0
+  },
+  "layer_norm_eps": 1e-12,
+  "max_position_embeddings": 512,
+  "model_type": "bert",
+  "num_attention_heads": 12,
+  "num_hidden_layers": 12,
+  "pad_token_id": 0,
+  "position_embedding_type": "absolute",
+  "torch_dtype": "float16",
+  "transformers_version": "4.51.3",
+  "type_vocab_size": 2,
+  "use_cache": true,
+  "vocab_size": 30522
+}

config_sentence_transformers.json ADDED Viewed

	@@ -0,0 +1,10 @@

+{
+  "__version__": {
+    "sentence_transformers": "3.4.1",
+    "transformers": "4.51.3",
+    "pytorch": "2.5.1+cu124"
+  },
+  "prompts": {},
+  "default_prompt_name": null,
+  "similarity_fn_name": "cosine"
+}

model.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:63c73d8a3c53aa912fd35d5d5b652a7cdb117cb32a82a995198b8a460588f915
+size 66741984

modules.json ADDED Viewed

	@@ -0,0 +1,20 @@

+[
+  {
+    "idx": 0,
+    "name": "0",
+    "path": "",
+    "type": "sentence_transformers.models.Transformer"
+  },
+  {
+    "idx": 1,
+    "name": "1",
+    "path": "1_Pooling",
+    "type": "sentence_transformers.models.Pooling"
+  },
+  {
+    "idx": 2,
+    "name": "2",
+    "path": "2_Normalize",
+    "type": "sentence_transformers.models.Normalize"
+  }
+]

sentence_bert_config.json ADDED Viewed

	@@ -0,0 +1,4 @@

+{
+  "max_seq_length": 512,
+  "do_lower_case": true
+}

special_tokens_map.json ADDED Viewed

	@@ -0,0 +1,37 @@

+{
+  "cls_token": {
+    "content": "[CLS]",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  },
+  "mask_token": {
+    "content": "[MASK]",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  },
+  "pad_token": {
+    "content": "[PAD]",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  },
+  "sep_token": {
+    "content": "[SEP]",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  },
+  "unk_token": {
+    "content": "[UNK]",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  }
+}

tokenizer.json ADDED Viewed

The diff for this file is too large to render. See raw diff

tokenizer_config.json ADDED Viewed

	@@ -0,0 +1,58 @@

+{
+  "added_tokens_decoder": {
+    "0": {
+      "content": "[PAD]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "100": {
+      "content": "[UNK]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "101": {
+      "content": "[CLS]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "102": {
+      "content": "[SEP]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "103": {
+      "content": "[MASK]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    }
+  },
+  "clean_up_tokenization_spaces": true,
+  "cls_token": "[CLS]",
+  "do_basic_tokenize": true,
+  "do_lower_case": true,
+  "extra_special_tokens": {},
+  "mask_token": "[MASK]",
+  "model_max_length": 512,
+  "never_split": null,
+  "pad_token": "[PAD]",
+  "sep_token": "[SEP]",
+  "strip_accents": null,
+  "tokenize_chinese_chars": true,
+  "tokenizer_class": "BertTokenizer",
+  "unk_token": "[UNK]"
+}

vocab.txt ADDED Viewed

The diff for this file is too large to render. See raw diff