IoannisKat1's picture
Add finetuned model
03e322c verified
metadata
language:
  - en
license: apache-2.0
tags:
  - sentence-transformers
  - sentence-similarity
  - feature-extraction
  - dense
  - generated_from_trainer
  - dataset_size:391
  - loss:MatryoshkaLoss
  - loss:MultipleNegativesRankingLoss
base_model: nomic-ai/modernbert-embed-base
widget:
  - source_sentence: What does 'personal data breach' entail?
    sentences:
      - >-
        1.Processing of personal data revealing racial or ethnic origin,
        political opinions, religious or philosophical beliefs, or trade union
        membership, and the processing of genetic data, biometric data for the
        purpose of uniquely identifying a natural person, data concerning health
        or data concerning a natural person's sex life or sexual orientation
        shall be prohibited.

        2.Paragraph 1 shall not apply if one of the following applies: (a)  the
        data subject has given explicit consent to the processing of those
        personal data for one or more specified purposes, except where Union or
        Member State law provide that the prohibition referred to in paragraph 1
        may not be lifted by the data subject; (b)  processing is necessary for
        the purposes of carrying out the obligations and exercising specific
        rights of the controller or of the data subject in the field of
        employment and social security and social protection law in so far as it
        is authorised by Union or Member State law or a collective agreement
        pursuant to Member State law providing for appropriate safeguards for
        the fundamental rights and the interests of the data subject; (c) 
        processing is necessary to protect the vital interests of the data
        subject or of another natural person where the data subject is
        physically or legally incapable of giving consent; (d)  processing is
        carried out in the course of its legitimate activities with appropriate
        safeguards by a foundation, association or any other not-for-profit body
        with a political, philosophical, religious or trade union aim and on
        condition that the processing relates solely to the members or to former
        members of the body or to persons who have regular contact with it in
        connection with its purposes and that the personal data are not
        disclosed outside that body without the consent of the data subjects;
        (e)  processing relates to personal data which are manifestly made
        public by the data subject; (f)  processing is necessary for the
        establishment, exercise or defence of legal claims or whenever courts
        are acting in their judicial capacity; (g)  processing is necessary for
        reasons of substantial public interest, on the basis of Union or Member
        State law which shall be proportionate to the aim pursued, respect the
        essence of the right to data protection and provide for suitable and
        specific measures to safeguard the fundamental rights and the interests
        of the data subject; (h)  processing is necessary for the purposes of
        preventive or occupational medicine, for the assessment of the working
        capacity of the employee, medical diagnosis, the provision of health or
        social care or treatment or the management of health or social care
        systems and services on the basis of Union or Member State law or
        pursuant to contract with a health professional and subject to the
        conditions and safeguards referred to in paragraph 3; (i)  processing is
        necessary for reasons of public interest in the area of public health,
        such as protecting against serious cross-border threats to health or
        ensuring high standards of quality and safety of health care and of
        medicinal products or medical devices, on the basis of Union or Member
        State law which provides for suitable and specific measures to safeguard
        the rights and freedoms of the data subject, in particular professional
        secrecy; 4.5.2016 L 119/38   (j)  processing is necessary for archiving
        purposes in the public interest, scientific or historical research
        purposes or statistical purposes in accordance with Article 89(1) based
        on Union or Member State law which shall be proportionate to the aim
        pursued, respect the essence of the right to data protection and provide
        for suitable and specific measures to safeguard the fundamental rights
        and the interests of the data subject.

        3.Personal data referred to in paragraph 1 may be processed for the
        purposes referred to in point (h) of paragraph 2 when those data are
        processed by or under the responsibility of a professional subject to
        the obligation of professional secrecy under Union or Member State law
        or rules established by national competent bodies or by another person
        also subject to an obligation of secrecy under Union or Member State law
        or rules established by national competent bodies.

        4.Member States may maintain or introduce further conditions, including
        limitations, with regard to the processing of genetic data, biometric
        data or data concerning health.
      - >-
        1) 'personal data' means any information relating to an identified or
        identifiable natural person ('data subject'); an identifiable natural
        person is one who can be identified, directly or indirectly, in
        particular by reference to an identifier such as a name, an
        identification number, location data, an online identifier or to one or
        more factors specific to the physical, physiological, genetic, mental,
        economic, cultural or social identity of that natural person;

        (2) ‘processing’ means any operation or set of operations which is
        performed on personal data or on sets of personal data, whether or not
        by automated means, such as collection, recording, organisation,
        structuring, storage, adaptation or alteration, retrieval, consultation,
        use, disclosure by transmission, dissemination or otherwise making
        available, alignment or combination, restriction, erasure or
        destruction;

        (3) ‘restriction of processing’ means the marking of stored personal
        data with the aim of limiting their processing in the future;

        (4) ‘profiling’ means any form of automated processing of personal data
        consisting of the use of personal data to evaluate certain personal
        aspects relating to a natural person, in particular to analyse or
        predict aspects concerning that natural person's performance at work,
        economic situation, health, personal preferences, interests,
        reliability, behaviour, location or movements;

        (5) ‘pseudonymisation’ means the processing of personal data in such a
        manner that the personal data can no longer be attributed to a specific
        data subject without the use of additional information, provided that
        such additional information is kept separately and is subject to
        technical and organisational measures to ensure that the personal data
        are not attributed to an identified or identifiable natural person;

        (6) ‘filing system’ means any structured set of personal data which are
        accessible according to specific criteria, whether centralised,
        decentralised or dispersed on a functional or geographical basis;

        (7) ‘controller’ means the natural or legal person, public authority,
        agency or other body which, alone or jointly with others, determines the
        purposes and means of the processing of personal data; where the
        purposes and means of such processing are determined by Union or Member
        State law, the controller or the specific criteria for its nomination
        may be provided for by Union or Member State law;

        (8) ‘processor’ means a natural or legal person, public authority,
        agency or other body which processes personal data on behalf of the
        controller;

        (9) ‘recipient’ means a natural or legal person, public authority,
        agency or another body, to which the personal data are disclosed,
        whether a third party or not. However, public authorities which may
        receive personal data in the framework of a particular inquiry in
        accordance with Union or Member State law shall not be regarded as
        recipients; the processing of those data by those public authorities
        shall be in compliance with the applicable data protection rules
        according to the purposes of the processing;

        (10) ‘third party’ means a natural or legal person, public authority,
        agency or body other than the data subject, controller, processor and
        persons who, under the direct authority of the controller or processor,
        are authorised to process personal data;

        (11) ‘consent’ of the data subject means any freely given, specific,
        informed and unambiguous indication of the data subject's wishes by
        which he or she, by a statement or by a clear affirmative action,
        signifies agreement to the processing of personal data relating to him
        or her;

        (12) ‘personal data breach’ means a breach of security leading to the
        accidental or unlawful destruction, loss, alteration, unauthorised
        disclosure of, or access to, personal data transmitted, stored or
        otherwise processed;

        (13) ‘genetic data’ means personal data relating to the inherited or
        acquired genetic characteristics of a natural person which give unique
        information about the physiology or the health of that natural person
        and which result, in particular, from an analysis of a biological sample
        from the natural person in question;

        (14) ‘biometric data’ means personal data resulting from specific
        technical processing relating to the physical, physiological or
        behavioural characteristics of a natural person, which allow or confirm
        the unique identification of that natural person, such as facial images
        or dactyloscopic data;

        (15) ‘data concerning health’ means personal data related to the
        physical or mental health of a natural person, including the provision
        of health care services, which reveal information about his or her
        health status;

        (16) ‘main establishment’ means: (a) as regards a controller with
        establishments in more than one Member State, the place of its central
        administration in the Union, unless the decisions on the purposes and
        means of the processing of personal data are taken in another
        establishment of the controller in the Union and the latter
        establishment has the power to have such decisions implemented, in which
        case the establishment having taken such decisions is to be considered
        to be the main establishment; (b) as regards a processor with
        establishments in more than one Member State, the place of its central
        administration in the Union, or, if the processor has no central
        administration in the Union, the establishment of the processor in the
        Union where the main processing activities in the context of the
        activities of an establishment of the processor take place to the extent
        that the processor is subject to specific obligations under this
        Regulation;

        (17) ‘representative’ means a natural or legal person established in the
        Union who, designated by the controller or processor in writing pursuant
        to Article 27, represents the controller or processor with regard to
        their respective obligations under this Regulation;

        (18) ‘enterprise’ means a natural or legal person engaged in an economic
        activity, irrespective of its legal form, including partnerships or
        associations regularly engaged in an economic activity;

        (19) ‘group of undertakings’ means a controlling undertaking and its
        controlled undertakings;

        (20) ‘binding corporate rules’ means personal data protection policies
        which are adhered to by a controller or processor established on the
        territory of a Member State for transfers or a set of transfers of
        personal data to a controller or processor in one or more third
        countries within a group of undertakings, or group of enterprises
        engaged in a joint economic activity;

        (21) ‘supervisory authority’ means an independent public authority which
        is established by a Member State pursuant to Article 51;

        (22) ‘supervisory authority concerned’ means a supervisory authority
        which is concerned by the processing of personal data because: (a) the
        controller or processor is established on the territory of the Member
        State of that supervisory authority; (b) data subjects residing in the
        Member State of that supervisory authority are substantially affected or
        likely to be substantially affected by the processing; or (c) a
        complaint has been lodged with that supervisory authority;

        (23) ‘cross-border processing’ means either: (a) processing of personal
        data which takes place in the context of the activities of
        establishments in more than one Member State of a controller or
        processor in the Union where the controller or processor is established
        in more than one Member State; or (b) processing of personal data which
        takes place in the context of the activities of a single establishment
        of a controller or processor in the Union but which substantially
        affects or is likely to substantially affect data subjects in more than
        one Member State.

        (24) ‘relevant and reasoned objection’ means an objection to a draft
        decision as to whether there is an infringement of this Regulation, or
        whether envisaged action in relation to the controller or processor
        complies with this Regulation, which clearly demonstrates the
        significance of the risks posed by the draft decision as regards the
        fundamental rights and freedoms of data subjects and, where applicable,
        the free flow of personal data within the Union;

        (25) ‘information society service’ means a service as defined in point
        (b) of Article 1(1) of Directive (EU) 2015/1535 of the European
        Parliament and of the Council (1);

        (26) ‘international organisation’ means an organisation and its
        subordinate bodies governed by public international law, or any other
        body which is set up by, or on the basis of, an agreement between two or
        more countries.
      - >-
        Any processing of personal data should be lawful and fair. It should be
        transparent to natural persons that personal data concerning them are
        collected, used, consulted or otherwise processed and to what extent the
        personal data are or will be processed. The principle of transparency
        requires that any information and communication relating to the
        processing of those personal data be easily accessible and easy to
        understand, and that clear and plain language be used. That principle
        concerns, in particular, information to the data subjects on the
        identity of the controller and the purposes of the processing and
        further information to ensure fair and transparent processing in respect
        of the natural persons concerned and their right to obtain confirmation
        and communication of personal data concerning them which are being
        processed. Natural persons should be made aware of risks, rules,
        safeguards and rights in relation to the processing of personal data and
        how to exercise their rights in relation to such processing. In
        particular, the specific purposes for which personal data are processed
        should be explicit and legitimate and determined at the time of the
        collection of the personal data. The personal data should be adequate,
        relevant and limited to what is necessary for the purposes for which
        they are processed. This requires, in particular, ensuring that the
        period for which the personal data are stored is limited to a strict
        minimum. Personal data should be processed only if the purpose of the
        processing could not reasonably be fulfilled by other means. In order to
        ensure that the personal data are not kept longer than necessary, time
        limits should be established by the controller for erasure or for a
        periodic review. Every reasonable step should be taken to ensure that
        personal data which are inaccurate are rectified or deleted. Personal
        data should be processed in a manner that ensures appropriate security
        and confidentiality of the personal data, including for preventing
        unauthorised access to or use of personal data and the equipment used
        for the processing.
  - source_sentence: >-
      In what situations could providing information to the data subject be
      considered impossible or involve a disproportionate effort?
    sentences:
      - >-
        1.The controller shall consult the supervisory authority prior to
        processing where a data protection impact assessment under Article 35
        indicates that the processing would result in a high risk in the absence
        of measures taken by the controller to mitigate the risk.

        2.Where the supervisory authority is of the opinion that the intended
        processing referred to in paragraph 1 would infringe this Regulation, in
        particular where the controller has insufficiently identified or
        mitigated the risk, the supervisory authority shall, within period of up
        to eight weeks of receipt of the request for consultation, provide
        written advice to the controller and, where applicable to the processor,
        and may use any of its powers referred to in Article 58. That period may
        be extended by six weeks, taking into account the complexity of the
        intended processing. The supervisory authority shall inform the
        controller and, where applicable, the processor, of any such extension
        within one month of receipt of the request for consultation together
        with the reasons for the delay. Those periods may be suspended until the
        supervisory authority has obtained information it has requested for the
        purposes of the consultation.

        3.When consulting the supervisory authority pursuant to paragraph 1, the
        controller shall provide the supervisory authority with: (a)  where
        applicable, the respective responsibilities of the controller, joint
        controllers and processors involved in the processing, in particular for
        processing within a group of undertakings; (b)  the purposes and means
        of the intended processing; (c)  the measures and safeguards provided to
        protect the rights and freedoms of data subjects pursuant to this
        Regulation; (d)  where applicable, the contact details of the data
        protection officer; 4.5.2016 L 119/54   (e)  the data protection impact
        assessment provided for in Article 35; and (f)  any other information
        requested by the supervisory authority.

        4.Member States shall consult the supervisory authority during the
        preparation of a proposal for a legislative measure to be adopted by a
        national parliament, or of a regulatory measure based on such a
        legislative measure, which relates to processing.

        5.Notwithstanding paragraph 1, Member State law may require controllers
        to consult with, and obtain prior authorisation from, the supervisory
        authority in relation to processing by a controller for the performance
        of a task carried out by the controller in the public interest,
        including processing in relation to social protection and public health
      - >-
        1.The Member States, the supervisory authorities, the Board and the
        Commission shall encourage, in particular at Union level, the
        establishment of data protection certification mechanisms and of data
        protection seals and marks, for the purpose of demonstrating compliance
        with this Regulation of processing operations by controllers and
        processors. The specific needs of micro, small and medium-sized
        enterprises shall be taken into account. 4.5.2016 L 119/58  

        2.In addition to adherence by controllers or processors subject to this
        Regulation, data protection certification mechanisms, seals or marks
        approved pursuant to paragraph 5 of this Article may be established for
        the purpose of demonstrating the existence of appropriate safeguards
        provided by controllers or processors that are not subject to this
        Regulation pursuant to Article 3 within the framework of personal data
        transfers to third countries or international organisations under the
        terms referred to in point (f) of Article 46(2). Such controllers or
        processors shall make binding and enforceable commitments, via
        contractual or other legally binding instruments, to apply those
        appropriate safeguards, including with regard to the rights of data
        subjects.

        3.The certification shall be voluntary and available via a process that
        is transparent.

        4.A certification pursuant to this Article does not reduce the
        responsibility of the controller or the processor for compliance with
        this Regulation and is without prejudice to the tasks and powers of the
        supervisory authorities which are competent pursuant to Article 55 or 56

        5.A certification pursuant to this Article shall be issued by the
        certification bodies referred to in Article 43 or by the competent
        supervisory authority, on the basis of criteria approved by that
        competent supervisory authority pursuant to Article 58(3) or by the
        Board pursuant to Article 63. Where the criteria are approved by the
        Board, this may result in a common certification, the European Data
        Protection Seal.

        6.The controller or processor which submits its processing to the
        certification mechanism shall provide the certification body referred to
        in Article 43, or where applicable, the competent supervisory authority,
        with all information and access to its processing activities which are
        necessary to conduct the certification procedure.

        7.Certification shall be issued to a controller or processor for a
        maximum period of three years and may be renewed, under the same
        conditions, provided that the relevant requirements continue to be met.
        Certification shall be withdrawn, as applicable, by the certification
        bodies referred to in Article 43 or by the competent supervisory
        authority where the requirements for the certification are not or are no
        longer met.

        8.The Board shall collate all certification mechanisms and data
        protection seals and marks in a register and shall make them publicly
        available by any appropriate means.
      - >-
        However, it is not necessary to impose the obligation to provide
        information where the data subject already possesses the information,
        where the recording or disclosure of the personal data is expressly laid
        down by law or where the provision of information to the data subject
        proves to be impossible or would involve a disproportionate effort. The
        latter could in particular be the case where processing is carried out
        for archiving purposes in the public interest, scientific or historical
        research purposes or statistical purposes. In that regard, the number of
        data subjects, the age of the data and any appropriate safeguards
        adopted should be taken into consideration.
  - source_sentence: >-
      What is the data subject provided with prior to further processing of
      personal data?
    sentences:
      - >-
        1.Where personal data relating to a data subject are collected from the
        data subject, the controller shall, at the time when personal data are
        obtained, provide the data subject with all of the following
        information: (a)  the identity and the contact details of the controller
        and, where applicable, of the controller's representative; (b)  the
        contact details of the data protection officer, where applicable; (c) 
        the purposes of the processing for which the personal data are intended
        as well as the legal basis for the processing; 4.5.2016 L 119/40   (d) 
        where the processing is based on point (f) of Article 6(1), the
        legitimate interests pursued by the controller or by a third party; (e) 
        the recipients or categories of recipients of the personal data, if any;
        (f)  where applicable, the fact that the controller intends to transfer
        personal data to a third country or international organisation and the
        existence or absence of an adequacy decision by the Commission, or in
        the case of transfers referred to in Article 46 or 47, or the second
        subparagraph of Article 49(1), reference to the appropriate or suitable
        safeguards and the means by which to obtain a copy of them or where they
        have been made available.

        2.In addition to the information referred to in paragraph 1, the
        controller shall, at the time when personal data are obtained, provide
        the data subject with the following further information necessary to
        ensure fair and transparent processing: (a)  the period for which the
        personal data will be stored, or if that is not possible, the criteria
        used to determine that period; (b)  the existence of the right to
        request from the controller access to and rectification or erasure of
        personal data or restriction of processing concerning the data subject
        or to object to processing as well as the right to data portability;
        (c)  where the processing is based on point (a) of Article 6(1) or point
        (a) of Article 9(2), the existence of the right to withdraw consent at
        any time, without affecting the lawfulness of processing based on
        consent before its withdrawal; (d)  the right to lodge a complaint with
        a supervisory authority; (e)  whether the provision of personal data is
        a statutory or contractual requirement, or a requirement necessary to
        enter into a contract, as well as whether the data subject is obliged to
        provide the personal data and of the possible consequences of failure to
        provide such data; (f)  the existence of automated decision-making,
        including profiling, referred to in Article 22(1) and (4) and, at least
        in those cases, meaningful information about the logic involved, as well
        as the significance and the envisaged consequences of such processing
        for the data subject.

        3.Where the controller intends to further process the personal data for
        a purpose other than that for which the personal data were collected,
        the controller shall provide the data subject prior to that further
        processing with information on that other purpose and with any relevant
        further information as referred to in paragraph 2

        4.Paragraphs 1, 2 and 3 shall not apply where and insofar as the data
        subject already has the information.
      - >-
        This Regulation respects and does not prejudice the status under
        existing constitutional law of churches and religious associations or
        communities in the Member States, as recognised in Article 17 TFEU.
      - >-
        1) 'personal data' means any information relating to an identified or
        identifiable natural person ('data subject'); an identifiable natural
        person is one who can be identified, directly or indirectly, in
        particular by reference to an identifier such as a name, an
        identification number, location data, an online identifier or to one or
        more factors specific to the physical, physiological, genetic, mental,
        economic, cultural or social identity of that natural person;

        (2) ‘processing’ means any operation or set of operations which is
        performed on personal data or on sets of personal data, whether or not
        by automated means, such as collection, recording, organisation,
        structuring, storage, adaptation or alteration, retrieval, consultation,
        use, disclosure by transmission, dissemination or otherwise making
        available, alignment or combination, restriction, erasure or
        destruction;

        (3) ‘restriction of processing’ means the marking of stored personal
        data with the aim of limiting their processing in the future;

        (4) ‘profiling’ means any form of automated processing of personal data
        consisting of the use of personal data to evaluate certain personal
        aspects relating to a natural person, in particular to analyse or
        predict aspects concerning that natural person's performance at work,
        economic situation, health, personal preferences, interests,
        reliability, behaviour, location or movements;

        (5) ‘pseudonymisation’ means the processing of personal data in such a
        manner that the personal data can no longer be attributed to a specific
        data subject without the use of additional information, provided that
        such additional information is kept separately and is subject to
        technical and organisational measures to ensure that the personal data
        are not attributed to an identified or identifiable natural person;

        (6) ‘filing system’ means any structured set of personal data which are
        accessible according to specific criteria, whether centralised,
        decentralised or dispersed on a functional or geographical basis;

        (7) ‘controller’ means the natural or legal person, public authority,
        agency or other body which, alone or jointly with others, determines the
        purposes and means of the processing of personal data; where the
        purposes and means of such processing are determined by Union or Member
        State law, the controller or the specific criteria for its nomination
        may be provided for by Union or Member State law;

        (8) ‘processor’ means a natural or legal person, public authority,
        agency or other body which processes personal data on behalf of the
        controller;

        (9) ‘recipient’ means a natural or legal person, public authority,
        agency or another body, to which the personal data are disclosed,
        whether a third party or not. However, public authorities which may
        receive personal data in the framework of a particular inquiry in
        accordance with Union or Member State law shall not be regarded as
        recipients; the processing of those data by those public authorities
        shall be in compliance with the applicable data protection rules
        according to the purposes of the processing;

        (10) ‘third party’ means a natural or legal person, public authority,
        agency or body other than the data subject, controller, processor and
        persons who, under the direct authority of the controller or processor,
        are authorised to process personal data;

        (11) ‘consent’ of the data subject means any freely given, specific,
        informed and unambiguous indication of the data subject's wishes by
        which he or she, by a statement or by a clear affirmative action,
        signifies agreement to the processing of personal data relating to him
        or her;

        (12) ‘personal data breach’ means a breach of security leading to the
        accidental or unlawful destruction, loss, alteration, unauthorised
        disclosure of, or access to, personal data transmitted, stored or
        otherwise processed;

        (13) ‘genetic data’ means personal data relating to the inherited or
        acquired genetic characteristics of a natural person which give unique
        information about the physiology or the health of that natural person
        and which result, in particular, from an analysis of a biological sample
        from the natural person in question;

        (14) ‘biometric data’ means personal data resulting from specific
        technical processing relating to the physical, physiological or
        behavioural characteristics of a natural person, which allow or confirm
        the unique identification of that natural person, such as facial images
        or dactyloscopic data;

        (15) ‘data concerning health’ means personal data related to the
        physical or mental health of a natural person, including the provision
        of health care services, which reveal information about his or her
        health status;

        (16) ‘main establishment’ means: (a) as regards a controller with
        establishments in more than one Member State, the place of its central
        administration in the Union, unless the decisions on the purposes and
        means of the processing of personal data are taken in another
        establishment of the controller in the Union and the latter
        establishment has the power to have such decisions implemented, in which
        case the establishment having taken such decisions is to be considered
        to be the main establishment; (b) as regards a processor with
        establishments in more than one Member State, the place of its central
        administration in the Union, or, if the processor has no central
        administration in the Union, the establishment of the processor in the
        Union where the main processing activities in the context of the
        activities of an establishment of the processor take place to the extent
        that the processor is subject to specific obligations under this
        Regulation;

        (17) ‘representative’ means a natural or legal person established in the
        Union who, designated by the controller or processor in writing pursuant
        to Article 27, represents the controller or processor with regard to
        their respective obligations under this Regulation;

        (18) ‘enterprise’ means a natural or legal person engaged in an economic
        activity, irrespective of its legal form, including partnerships or
        associations regularly engaged in an economic activity;

        (19) ‘group of undertakings’ means a controlling undertaking and its
        controlled undertakings;

        (20) ‘binding corporate rules’ means personal data protection policies
        which are adhered to by a controller or processor established on the
        territory of a Member State for transfers or a set of transfers of
        personal data to a controller or processor in one or more third
        countries within a group of undertakings, or group of enterprises
        engaged in a joint economic activity;

        (21) ‘supervisory authority’ means an independent public authority which
        is established by a Member State pursuant to Article 51;

        (22) ‘supervisory authority concerned’ means a supervisory authority
        which is concerned by the processing of personal data because: (a) the
        controller or processor is established on the territory of the Member
        State of that supervisory authority; (b) data subjects residing in the
        Member State of that supervisory authority are substantially affected or
        likely to be substantially affected by the processing; or (c) a
        complaint has been lodged with that supervisory authority;

        (23) ‘cross-border processing’ means either: (a) processing of personal
        data which takes place in the context of the activities of
        establishments in more than one Member State of a controller or
        processor in the Union where the controller or processor is established
        in more than one Member State; or (b) processing of personal data which
        takes place in the context of the activities of a single establishment
        of a controller or processor in the Union but which substantially
        affects or is likely to substantially affect data subjects in more than
        one Member State.

        (24) ‘relevant and reasoned objection’ means an objection to a draft
        decision as to whether there is an infringement of this Regulation, or
        whether envisaged action in relation to the controller or processor
        complies with this Regulation, which clearly demonstrates the
        significance of the risks posed by the draft decision as regards the
        fundamental rights and freedoms of data subjects and, where applicable,
        the free flow of personal data within the Union;

        (25) ‘information society service’ means a service as defined in point
        (b) of Article 1(1) of Directive (EU) 2015/1535 of the European
        Parliament and of the Council (1);

        (26) ‘international organisation’ means an organisation and its
        subordinate bodies governed by public international law, or any other
        body which is set up by, or on the basis of, an agreement between two or
        more countries.
  - source_sentence: >-
      What type of data may be processed for purposes related to point (h) of
      paragraph 2?
    sentences:
      - >-
        1.Processing of personal data revealing racial or ethnic origin,
        political opinions, religious or philosophical beliefs, or trade union
        membership, and the processing of genetic data, biometric data for the
        purpose of uniquely identifying a natural person, data concerning health
        or data concerning a natural person's sex life or sexual orientation
        shall be prohibited.

        2.Paragraph 1 shall not apply if one of the following applies: (a)  the
        data subject has given explicit consent to the processing of those
        personal data for one or more specified purposes, except where Union or
        Member State law provide that the prohibition referred to in paragraph 1
        may not be lifted by the data subject; (b)  processing is necessary for
        the purposes of carrying out the obligations and exercising specific
        rights of the controller or of the data subject in the field of
        employment and social security and social protection law in so far as it
        is authorised by Union or Member State law or a collective agreement
        pursuant to Member State law providing for appropriate safeguards for
        the fundamental rights and the interests of the data subject; (c) 
        processing is necessary to protect the vital interests of the data
        subject or of another natural person where the data subject is
        physically or legally incapable of giving consent; (d)  processing is
        carried out in the course of its legitimate activities with appropriate
        safeguards by a foundation, association or any other not-for-profit body
        with a political, philosophical, religious or trade union aim and on
        condition that the processing relates solely to the members or to former
        members of the body or to persons who have regular contact with it in
        connection with its purposes and that the personal data are not
        disclosed outside that body without the consent of the data subjects;
        (e)  processing relates to personal data which are manifestly made
        public by the data subject; (f)  processing is necessary for the
        establishment, exercise or defence of legal claims or whenever courts
        are acting in their judicial capacity; (g)  processing is necessary for
        reasons of substantial public interest, on the basis of Union or Member
        State law which shall be proportionate to the aim pursued, respect the
        essence of the right to data protection and provide for suitable and
        specific measures to safeguard the fundamental rights and the interests
        of the data subject; (h)  processing is necessary for the purposes of
        preventive or occupational medicine, for the assessment of the working
        capacity of the employee, medical diagnosis, the provision of health or
        social care or treatment or the management of health or social care
        systems and services on the basis of Union or Member State law or
        pursuant to contract with a health professional and subject to the
        conditions and safeguards referred to in paragraph 3; (i)  processing is
        necessary for reasons of public interest in the area of public health,
        such as protecting against serious cross-border threats to health or
        ensuring high standards of quality and safety of health care and of
        medicinal products or medical devices, on the basis of Union or Member
        State law which provides for suitable and specific measures to safeguard
        the rights and freedoms of the data subject, in particular professional
        secrecy; 4.5.2016 L 119/38   (j)  processing is necessary for archiving
        purposes in the public interest, scientific or historical research
        purposes or statistical purposes in accordance with Article 89(1) based
        on Union or Member State law which shall be proportionate to the aim
        pursued, respect the essence of the right to data protection and provide
        for suitable and specific measures to safeguard the fundamental rights
        and the interests of the data subject.

        3.Personal data referred to in paragraph 1 may be processed for the
        purposes referred to in point (h) of paragraph 2 when those data are
        processed by or under the responsibility of a professional subject to
        the obligation of professional secrecy under Union or Member State law
        or rules established by national competent bodies or by another person
        also subject to an obligation of secrecy under Union or Member State law
        or rules established by national competent bodies.

        4.Member States may maintain or introduce further conditions, including
        limitations, with regard to the processing of genetic data, biometric
        data or data concerning health.
      - >-
        1.The data protection officer shall have at least the following tasks:
        (a)  to inform and advise the controller or the processor and the
        employees who carry out processing of their obligations pursuant to this
        Regulation and to other Union or Member State data protection
        provisions; (b)  to monitor compliance with this Regulation, with other
        Union or Member State data protection provisions and with the policies
        of the controller or processor in relation to the protection of personal
        data, including the assignment of responsibilities, awareness-raising
        and training of staff involved in processing operations, and the related
        audits; (c)  to provide advice where requested as regards the data
        protection impact assessment and monitor its performance pursuant to
        Article 35; (d)  to cooperate with the supervisory authority; (e)  to
        act as the contact point for the supervisory authority on issues
        relating to processing, including the prior consultation referred to in
        Article 36, and to consult, where appropriate, with regard to any other
        matter.

        2.The data protection officer shall in the performance of his or her
        tasks have due regard to the risk associated with processing operations,
        taking into account the nature, scope, context and purposes of
        processing. Section 5 Codes of conduct and certification
      - >-
        Processing should be lawful where it is necessary in the context of a
        contract or the intention to enter into a contract.
  - source_sentence: >-
      What may impede authorities in the discharge of their responsibilities
      under Union law?
    sentences:
      - >-
        1.The controller and the processor shall designate a data protection
        officer in any case where: (a)  the processing is carried out by a
        public authority or body, except for courts acting in their judicial
        capacity; (b)  the core activities of the controller or the processor
        consist of processing operations which, by virtue of their nature, their
        scope and/or their purposes, require regular and systematic monitoring
        of data subjects on a large scale; or (c)  the core activities of the
        controller or the processor consist of processing on a large scale of
        special categories of data pursuant to Article 9 and personal data
        relating to criminal convictions and offences referred to in Article 10

        2.A group of undertakings may appoint a single data protection officer
        provided that a data protection officer is easily accessible from each
        establishment.

        3.Where the controller or the processor is a public authority or body, a
        single data protection officer may be designated for several such
        authorities or bodies, taking account of their organisational structure
        and size.

        4.In cases other than those referred to in paragraph 1, the controller
        or processor or associations and other bodies representing categories of
        controllers or processors may or, where required by Union or Member
        State law shall, designate a data protection officer. The data
        protection officer may act for such associations and other bodies
        representing controllers or processors.

        5.The data protection officer shall be designated on the basis of
        professional qualities and, in particular, expert knowledge of data
        protection law and practices and the ability to fulfil the tasks
        referred to in Article 39

        6.The data protection officer may be a staff member of the controller or
        processor, or fulfil the tasks on the basis of a service contract.

        7.The controller or the processor shall publish the contact details of
        the data protection officer and communicate them to the supervisory
        authority.
      - >-
        This Regulation is without prejudice to international agreements
        concluded between the Union and third countries regulating the transfer
        of personal data including appropriate safeguards for the data subjects.
        Member States may conclude international agreements which involve the
        transfer of personal data to third countries or international
        organisations, as far as such agreements do not affect this Regulation
        or any other provisions of Union law and include an appropriate level of
        protection for the fundamental rights of the data subjects.
      - >-
        The objectives and principles of Directive 95/46/EC remain sound, but it
        has not prevented fragmentation in the implementation of data protection
        across the Union, legal uncertainty or a widespread public perception
        that there are significant risks to the protection of natural persons,
        in particular with regard to online activity. Differences in the level
        of protection of the rights and freedoms of natural persons, in
        particular the right to the protection of personal data, with regard to
        the processing of personal data in the Member States may prevent the
        free flow of personal data throughout the Union. Those differences may
        therefore constitute an obstacle to the pursuit of economic activities
        at the level of the Union, distort competition and impede authorities in
        the discharge of their responsibilities under Union law. Such a
        difference in levels of protection is due to the existence of
        differences in the implementation and application of Directive 95/46/EC.
pipeline_tag: sentence-similarity
library_name: sentence-transformers
metrics:
  - cosine_accuracy@1
  - cosine_accuracy@3
  - cosine_accuracy@5
  - cosine_accuracy@10
  - cosine_precision@1
  - cosine_precision@3
  - cosine_precision@5
  - cosine_precision@10
  - cosine_recall@1
  - cosine_recall@3
  - cosine_recall@5
  - cosine_recall@10
  - cosine_ndcg@10
  - cosine_mrr@10
  - cosine_map@100
model-index:
  - name: modernbert-embed-base
    results:
      - task:
          type: information-retrieval
          name: Information Retrieval
        dataset:
          name: dim 768
          type: dim_768
        metrics:
          - type: cosine_accuracy@1
            value: 0.4058898847631242
            name: Cosine Accuracy@1
          - type: cosine_accuracy@3
            value: 0.41037131882202305
            name: Cosine Accuracy@3
          - type: cosine_accuracy@5
            value: 0.4385403329065301
            name: Cosine Accuracy@5
          - type: cosine_accuracy@10
            value: 0.471190781049936
            name: Cosine Accuracy@10
          - type: cosine_precision@1
            value: 0.4058898847631242
            name: Cosine Precision@1
          - type: cosine_precision@3
            value: 0.4050362782757149
            name: Cosine Precision@3
          - type: cosine_precision@5
            value: 0.39705505761843796
            name: Cosine Precision@5
          - type: cosine_precision@10
            value: 0.36651728553137003
            name: Cosine Precision@10
          - type: cosine_recall@1
            value: 0.04172967581938629
            name: Cosine Recall@1
          - type: cosine_recall@3
            value: 0.12212076683897896
            name: Cosine Recall@3
          - type: cosine_recall@5
            value: 0.18584066050972378
            name: Cosine Recall@5
          - type: cosine_recall@10
            value: 0.2836218270585116
            name: Cosine Recall@10
          - type: cosine_ndcg@10
            value: 0.4292262848394862
            name: Cosine Ndcg@10
          - type: cosine_mrr@10
            value: 0.4170203036400217
            name: Cosine Mrr@10
          - type: cosine_map@100
            value: 0.48482154237960223
            name: Cosine Map@100
      - task:
          type: information-retrieval
          name: Information Retrieval
        dataset:
          name: dim 512
          type: dim_512
        metrics:
          - type: cosine_accuracy@1
            value: 0.39820742637644047
            name: Cosine Accuracy@1
          - type: cosine_accuracy@3
            value: 0.4026888604353393
            name: Cosine Accuracy@3
          - type: cosine_accuracy@5
            value: 0.42701664532650446
            name: Cosine Accuracy@5
          - type: cosine_accuracy@10
            value: 0.45902688860435337
            name: Cosine Accuracy@10
          - type: cosine_precision@1
            value: 0.39820742637644047
            name: Cosine Precision@1
          - type: cosine_precision@3
            value: 0.39820742637644047
            name: Cosine Precision@3
          - type: cosine_precision@5
            value: 0.3892445582586428
            name: Cosine Precision@5
          - type: cosine_precision@10
            value: 0.356978233034571
            name: Cosine Precision@10
          - type: cosine_recall@1
            value: 0.04102662618120145
            name: Cosine Recall@1
          - type: cosine_recall@3
            value: 0.12062294908153026
            name: Cosine Recall@3
          - type: cosine_recall@5
            value: 0.18402636375152
            name: Cosine Recall@5
          - type: cosine_recall@10
            value: 0.27956498455762785
            name: Cosine Recall@10
          - type: cosine_ndcg@10
            value: 0.41985375125260577
            name: Cosine Ndcg@10
          - type: cosine_mrr@10
            value: 0.4087001808832792
            name: Cosine Mrr@10
          - type: cosine_map@100
            value: 0.4748374115934728
            name: Cosine Map@100
      - task:
          type: information-retrieval
          name: Information Retrieval
        dataset:
          name: dim 256
          type: dim_256
        metrics:
          - type: cosine_accuracy@1
            value: 0.38092189500640206
            name: Cosine Accuracy@1
          - type: cosine_accuracy@3
            value: 0.38412291933418696
            name: Cosine Accuracy@3
          - type: cosine_accuracy@5
            value: 0.40973111395646605
            name: Cosine Accuracy@5
          - type: cosine_accuracy@10
            value: 0.44366197183098594
            name: Cosine Accuracy@10
          - type: cosine_precision@1
            value: 0.38092189500640206
            name: Cosine Precision@1
          - type: cosine_precision@3
            value: 0.3800682885189927
            name: Cosine Precision@3
          - type: cosine_precision@5
            value: 0.3714468629961588
            name: Cosine Precision@5
          - type: cosine_precision@10
            value: 0.3419974391805378
            name: Cosine Precision@10
          - type: cosine_recall@1
            value: 0.03938517779616356
            name: Cosine Recall@1
          - type: cosine_recall@3
            value: 0.115945325123842
            name: Cosine Recall@3
          - type: cosine_recall@5
            value: 0.1763856331416056
            name: Cosine Recall@5
          - type: cosine_recall@10
            value: 0.2686379160273794
            name: Cosine Recall@10
          - type: cosine_ndcg@10
            value: 0.4022720775585408
            name: Cosine Ndcg@10
          - type: cosine_mrr@10
            value: 0.3912121720220308
            name: Cosine Mrr@10
          - type: cosine_map@100
            value: 0.4594474328308739
            name: Cosine Map@100
      - task:
          type: information-retrieval
          name: Information Retrieval
        dataset:
          name: dim 128
          type: dim_128
        metrics:
          - type: cosine_accuracy@1
            value: 0.3495518565941101
            name: Cosine Accuracy@1
          - type: cosine_accuracy@3
            value: 0.353393085787452
            name: Cosine Accuracy@3
          - type: cosine_accuracy@5
            value: 0.38028169014084506
            name: Cosine Accuracy@5
          - type: cosine_accuracy@10
            value: 0.4154929577464789
            name: Cosine Accuracy@10
          - type: cosine_precision@1
            value: 0.3495518565941101
            name: Cosine Precision@1
          - type: cosine_precision@3
            value: 0.34848484848484845
            name: Cosine Precision@3
          - type: cosine_precision@5
            value: 0.339820742637644
            name: Cosine Precision@5
          - type: cosine_precision@10
            value: 0.312291933418694
            name: Cosine Precision@10
          - type: cosine_recall@1
            value: 0.037856544549247154
            name: Cosine Recall@1
          - type: cosine_recall@3
            value: 0.11129608559954554
            name: Cosine Recall@3
          - type: cosine_recall@5
            value: 0.1684035717787531
            name: Cosine Recall@5
          - type: cosine_recall@10
            value: 0.25324669198316696
            name: Cosine Recall@10
          - type: cosine_ndcg@10
            value: 0.37104719123202995
            name: Cosine Ndcg@10
          - type: cosine_mrr@10
            value: 0.3604691278987048
            name: Cosine Mrr@10
          - type: cosine_map@100
            value: 0.4276170602366832
            name: Cosine Map@100
      - task:
          type: information-retrieval
          name: Information Retrieval
        dataset:
          name: dim 64
          type: dim_64
        metrics:
          - type: cosine_accuracy@1
            value: 0.3002560819462228
            name: Cosine Accuracy@1
          - type: cosine_accuracy@3
            value: 0.3072983354673495
            name: Cosine Accuracy@3
          - type: cosine_accuracy@5
            value: 0.33034571062740076
            name: Cosine Accuracy@5
          - type: cosine_accuracy@10
            value: 0.3649167733674776
            name: Cosine Accuracy@10
          - type: cosine_precision@1
            value: 0.3002560819462228
            name: Cosine Precision@1
          - type: cosine_precision@3
            value: 0.30110968843363206
            name: Cosine Precision@3
          - type: cosine_precision@5
            value: 0.29475032010243274
            name: Cosine Precision@5
          - type: cosine_precision@10
            value: 0.2714468629961588
            name: Cosine Precision@10
          - type: cosine_recall@1
            value: 0.03258312564919841
            name: Cosine Recall@1
          - type: cosine_recall@3
            value: 0.09635373620336293
            name: Cosine Recall@3
          - type: cosine_recall@5
            value: 0.14603365016280198
            name: Cosine Recall@5
          - type: cosine_recall@10
            value: 0.21983024392840253
            name: Cosine Recall@10
          - type: cosine_ndcg@10
            value: 0.32194373763795797
            name: Cosine Ndcg@10
          - type: cosine_mrr@10
            value: 0.31124626547161705
            name: Cosine Mrr@10
          - type: cosine_map@100
            value: 0.37592384285873587
            name: Cosine Map@100

modernbert-embed-base

This is a sentence-transformers model finetuned from nomic-ai/modernbert-embed-base. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.

Model Details

Model Description

  • Model Type: Sentence Transformer
  • Base model: nomic-ai/modernbert-embed-base
  • Maximum Sequence Length: 8192 tokens
  • Output Dimensionality: 768 dimensions
  • Similarity Function: Cosine Similarity
  • Language: en
  • License: apache-2.0

Model Sources

Full Model Architecture

SentenceTransformer(
  (0): Transformer({'max_seq_length': 8192, 'do_lower_case': False, 'architecture': 'ModernBertModel'})
  (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
  (2): Normalize()
)

Usage

Direct Usage (Sentence Transformers)

First install the Sentence Transformers library:

pip install -U sentence-transformers

Then you can load this model and run inference.

from sentence_transformers import SentenceTransformer

# Download from the 🤗 Hub
model = SentenceTransformer("sentence_transformers_model_id")
# Run inference
sentences = [
    'What may impede authorities in the discharge of their responsibilities under Union law?',
    'The objectives and principles of Directive 95/46/EC remain sound, but it has not prevented fragmentation in the implementation of data protection across the Union, legal uncertainty or a widespread public perception that there are significant risks to the protection of natural persons, in particular with regard to online activity. Differences in the level of protection of the rights and freedoms of natural persons, in particular the right to the protection of personal data, with regard to the processing of personal data in the Member States may prevent the free flow of personal data throughout the Union. Those differences may therefore constitute an obstacle to the pursuit of economic activities at the level of the Union, distort competition and impede authorities in the discharge of their responsibilities under Union law. Such a difference in levels of protection is due to the existence of differences in the implementation and application of Directive 95/46/EC.',
    'This Regulation is without prejudice to international agreements concluded between the Union and third countries regulating the transfer of personal data including appropriate safeguards for the data subjects. Member States may conclude international agreements which involve the transfer of personal data to third countries or international organisations, as far as such agreements do not affect this Regulation or any other provisions of Union law and include an appropriate level of protection for the fundamental rights of the data subjects.',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 768]

# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities)
# tensor([[1.0000, 0.4715, 0.1018],
#         [0.4715, 1.0000, 0.2730],
#         [0.1018, 0.2730, 1.0000]])

Evaluation

Metrics

Information Retrieval

Metric Value
cosine_accuracy@1 0.4059
cosine_accuracy@3 0.4104
cosine_accuracy@5 0.4385
cosine_accuracy@10 0.4712
cosine_precision@1 0.4059
cosine_precision@3 0.405
cosine_precision@5 0.3971
cosine_precision@10 0.3665
cosine_recall@1 0.0417
cosine_recall@3 0.1221
cosine_recall@5 0.1858
cosine_recall@10 0.2836
cosine_ndcg@10 0.4292
cosine_mrr@10 0.417
cosine_map@100 0.4848

Information Retrieval

Metric Value
cosine_accuracy@1 0.3982
cosine_accuracy@3 0.4027
cosine_accuracy@5 0.427
cosine_accuracy@10 0.459
cosine_precision@1 0.3982
cosine_precision@3 0.3982
cosine_precision@5 0.3892
cosine_precision@10 0.357
cosine_recall@1 0.041
cosine_recall@3 0.1206
cosine_recall@5 0.184
cosine_recall@10 0.2796
cosine_ndcg@10 0.4199
cosine_mrr@10 0.4087
cosine_map@100 0.4748

Information Retrieval

Metric Value
cosine_accuracy@1 0.3809
cosine_accuracy@3 0.3841
cosine_accuracy@5 0.4097
cosine_accuracy@10 0.4437
cosine_precision@1 0.3809
cosine_precision@3 0.3801
cosine_precision@5 0.3714
cosine_precision@10 0.342
cosine_recall@1 0.0394
cosine_recall@3 0.1159
cosine_recall@5 0.1764
cosine_recall@10 0.2686
cosine_ndcg@10 0.4023
cosine_mrr@10 0.3912
cosine_map@100 0.4594

Information Retrieval

Metric Value
cosine_accuracy@1 0.3496
cosine_accuracy@3 0.3534
cosine_accuracy@5 0.3803
cosine_accuracy@10 0.4155
cosine_precision@1 0.3496
cosine_precision@3 0.3485
cosine_precision@5 0.3398
cosine_precision@10 0.3123
cosine_recall@1 0.0379
cosine_recall@3 0.1113
cosine_recall@5 0.1684
cosine_recall@10 0.2532
cosine_ndcg@10 0.371
cosine_mrr@10 0.3605
cosine_map@100 0.4276

Information Retrieval

Metric Value
cosine_accuracy@1 0.3003
cosine_accuracy@3 0.3073
cosine_accuracy@5 0.3303
cosine_accuracy@10 0.3649
cosine_precision@1 0.3003
cosine_precision@3 0.3011
cosine_precision@5 0.2948
cosine_precision@10 0.2714
cosine_recall@1 0.0326
cosine_recall@3 0.0964
cosine_recall@5 0.146
cosine_recall@10 0.2198
cosine_ndcg@10 0.3219
cosine_mrr@10 0.3112
cosine_map@100 0.3759

Training Details

Training Dataset

Unnamed Dataset

  • Size: 391 training samples
  • Columns: anchor and positive
  • Approximate statistics based on the first 391 samples:
    anchor positive
    type string string
    details
    • min: 7 tokens
    • mean: 15.05 tokens
    • max: 30 tokens
    • min: 25 tokens
    • mean: 667.99 tokens
    • max: 2429 tokens
  • Samples:
    anchor positive
    On what date did the act occur? Court (Civil/Criminal): Civil
    Provisions: Directive 2015/366, Law 4537/2018
    Time of the act: 31.08.2022
    Outcome (not guilty, guilty): Partially accepts the claim.
    Reasoning: The Athens Peace Court ordered the bank to return the amount that was withdrawn from the plaintiffs' account and to pay additional compensation for the moral damage they suffered.
    Facts: The case concerns plaintiffs who fell victim to electronic fraud via phishing, resulting in the withdrawal of money from their bank account. The plaintiffs claimed that the bank did not take the necessary security measures to protect their accounts and sought compensation for the financial loss and moral damage they suffered. The court determined that the bank is responsible for the loss of the money, as it did not prove that the transactions were authorized by the plaintiffs. Furthermore, the court recognized that the bank's refusal to return the funds constitutes an infringement of the plaintiffs' personal rights, as it...
    For what purposes can more specific rules be provided regarding the employment context? 1.Member States may, by law or by collective agreements, provide for more specific rules to ensure the protection of the rights and freedoms in respect of the processing of employees' personal data in the employment context, in particular for the purposes of the recruitment, the performance of the contract of employment, including discharge of obligations laid down by law or by collective agreements, management, planning and organisation of work, equality and diversity in the workplace, health and safety at work, protection of employer's or customer's property and for the purposes of the exercise and enjoyment, on an individual or collective basis, of rights and benefits related to employment, and for the purpose of the termination of the employment relationship.
    2.Those rules shall include suitable and specific measures to safeguard the data subject's human dignity, legitimate interests and fundamental rights, with particular regard to the transparency of processing, the transfer of p...
    On which date were transactions detailed in the provided text conducted? Court (Civil/Criminal): Civil

    Provisions:

    Time of commission of the act:

    Outcome (not guilty, guilty):

    Rationale:

    Facts:
    The plaintiff holds credit card number ............ with the defendant banking corporation. Based on the application for alternative networks dated 19/7/2015 with number ......... submitted at a branch of the defendant, he was granted access to the electronic banking service (e-banking) to conduct banking transactions (debit, credit, updates, payments) remotely. On 30/11/2020, the plaintiff fell victim to electronic fraud through the "phishing" method, whereby an unknown perpetrator managed to withdraw a total amount of €3,121.75 from the aforementioned credit card. Specifically, the plaintiff received an email at 1:35 PM on 29/11/2020 from sender ...... with address ........, informing him that due to an impending system change, he needed to verify the mobile phone number linked to the credit card, urging him to complete the verification...
  • Loss: MatryoshkaLoss with these parameters:
    {
        "loss": "MultipleNegativesRankingLoss",
        "matryoshka_dims": [
            768,
            512,
            256,
            128,
            64
        ],
        "matryoshka_weights": [
            1,
            1,
            1,
            1,
            1
        ],
        "n_dims_per_step": -1
    }
    

Training Hyperparameters

Non-Default Hyperparameters

  • eval_strategy: epoch
  • per_device_train_batch_size: 2
  • per_device_eval_batch_size: 2
  • gradient_accumulation_steps: 2
  • learning_rate: 2e-05
  • num_train_epochs: 20
  • lr_scheduler_type: cosine
  • warmup_ratio: 0.1
  • bf16: True
  • load_best_model_at_end: True
  • optim: adamw_torch_fused
  • batch_sampler: no_duplicates

All Hyperparameters

Click to expand
  • overwrite_output_dir: False
  • do_predict: False
  • eval_strategy: epoch
  • prediction_loss_only: True
  • per_device_train_batch_size: 2
  • per_device_eval_batch_size: 2
  • per_gpu_train_batch_size: None
  • per_gpu_eval_batch_size: None
  • gradient_accumulation_steps: 2
  • eval_accumulation_steps: None
  • torch_empty_cache_steps: None
  • learning_rate: 2e-05
  • weight_decay: 0.0
  • adam_beta1: 0.9
  • adam_beta2: 0.999
  • adam_epsilon: 1e-08
  • max_grad_norm: 1.0
  • num_train_epochs: 20
  • max_steps: -1
  • lr_scheduler_type: cosine
  • lr_scheduler_kwargs: {}
  • warmup_ratio: 0.1
  • warmup_steps: 0
  • log_level: passive
  • log_level_replica: warning
  • log_on_each_node: True
  • logging_nan_inf_filter: True
  • save_safetensors: True
  • save_on_each_node: False
  • save_only_model: False
  • restore_callback_states_from_checkpoint: False
  • no_cuda: False
  • use_cpu: False
  • use_mps_device: False
  • seed: 42
  • data_seed: None
  • jit_mode_eval: False
  • use_ipex: False
  • bf16: True
  • fp16: False
  • fp16_opt_level: O1
  • half_precision_backend: auto
  • bf16_full_eval: False
  • fp16_full_eval: False
  • tf32: None
  • local_rank: 0
  • ddp_backend: None
  • tpu_num_cores: None
  • tpu_metrics_debug: False
  • debug: []
  • dataloader_drop_last: False
  • dataloader_num_workers: 0
  • dataloader_prefetch_factor: None
  • past_index: -1
  • disable_tqdm: False
  • remove_unused_columns: True
  • label_names: None
  • load_best_model_at_end: True
  • ignore_data_skip: False
  • fsdp: []
  • fsdp_min_num_params: 0
  • fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
  • tp_size: 0
  • fsdp_transformer_layer_cls_to_wrap: None
  • accelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
  • deepspeed: None
  • label_smoothing_factor: 0.0
  • optim: adamw_torch_fused
  • optim_args: None
  • adafactor: False
  • group_by_length: False
  • length_column_name: length
  • ddp_find_unused_parameters: None
  • ddp_bucket_cap_mb: None
  • ddp_broadcast_buffers: False
  • dataloader_pin_memory: True
  • dataloader_persistent_workers: False
  • skip_memory_metrics: True
  • use_legacy_prediction_loop: False
  • push_to_hub: False
  • resume_from_checkpoint: None
  • hub_model_id: None
  • hub_strategy: every_save
  • hub_private_repo: None
  • hub_always_push: False
  • gradient_checkpointing: False
  • gradient_checkpointing_kwargs: None
  • include_inputs_for_metrics: False
  • include_for_metrics: []
  • eval_do_concat_batches: True
  • fp16_backend: auto
  • push_to_hub_model_id: None
  • push_to_hub_organization: None
  • mp_parameters:
  • auto_find_batch_size: False
  • full_determinism: False
  • torchdynamo: None
  • ray_scope: last
  • ddp_timeout: 1800
  • torch_compile: False
  • torch_compile_backend: None
  • torch_compile_mode: None
  • include_tokens_per_second: False
  • include_num_input_tokens_seen: False
  • neftune_noise_alpha: None
  • optim_target_modules: None
  • batch_eval_metrics: False
  • eval_on_start: False
  • use_liger_kernel: False
  • eval_use_gather_object: False
  • average_tokens_across_devices: False
  • prompts: None
  • batch_sampler: no_duplicates
  • multi_dataset_batch_sampler: proportional
  • router_mapping: {}
  • learning_rate_mapping: {}

Training Logs

Click to expand
Epoch Step Training Loss dim_768_cosine_ndcg@10 dim_512_cosine_ndcg@10 dim_256_cosine_ndcg@10 dim_128_cosine_ndcg@10 dim_64_cosine_ndcg@10
0.0102 1 4.0461 - - - - -
0.0204 2 7.4174 - - - - -
0.0306 3 4.0528 - - - - -
0.0408 4 2.6554 - - - - -
0.0510 5 0.5018 - - - - -
0.0612 6 0.7805 - - - - -
0.0714 7 2.9274 - - - - -
0.0816 8 4.5888 - - - - -
0.0918 9 2.5851 - - - - -
0.1020 10 0.4261 - - - - -
0.1122 11 0.6066 - - - - -
0.1224 12 1.5421 - - - - -
0.1327 13 0.5044 - - - - -
0.1429 14 1.6806 - - - - -
0.1531 15 1.8214 - - - - -
0.1633 16 2.6111 - - - - -
0.1735 17 8.3034 - - - - -
0.1837 18 0.5837 - - - - -
0.1939 19 2.4009 - - - - -
0.2041 20 0.8685 - - - - -
0.2143 21 3.1922 - - - - -
0.2245 22 4.7617 - - - - -
0.2347 23 1.962 - - - - -
0.2449 24 7.5857 - - - - -
0.2551 25 0.1287 - - - - -
0.2653 26 3.0167 - - - - -
0.2755 27 3.8032 - - - - -
0.2857 28 3.8445 - - - - -
0.2959 29 1.6414 - - - - -
0.3061 30 6.3828 - - - - -
0.3163 31 3.1969 - - - - -
0.3265 32 0.7605 - - - - -
0.3367 33 5.0711 - - - - -
0.3469 34 2.6523 - - - - -
0.3571 35 0.4005 - - - - -
0.3673 36 1.757 - - - - -
0.3776 37 3.1397 - - - - -
0.3878 38 3.6261 - - - - -
0.3980 39 2.7427 - - - - -
0.4082 40 0.4561 - - - - -
0.4184 41 0.0331 - - - - -
0.4286 42 5.1981 - - - - -
0.4388 43 0.5115 - - - - -
0.4490 44 1.119 - - - - -
0.4592 45 1.8869 - - - - -
0.4694 46 2.7846 - - - - -
0.4796 47 2.4171 - - - - -
0.4898 48 2.6935 - - - - -
0.5 49 1.0925 - - - - -
0.5102 50 2.0241 - - - - -
0.5204 51 7.4609 - - - - -
0.5306 52 3.2983 - - - - -
0.5408 53 3.8886 - - - - -
0.5510 54 0.5936 - - - - -
0.5612 55 0.8204 - - - - -
0.5714 56 0.1836 - - - - -
0.5816 57 0.4946 - - - - -
0.5918 58 0.2755 - - - - -
0.6020 59 0.1641 - - - - -
0.6122 60 1.2537 - - - - -
0.6224 61 4.3895 - - - - -
0.6327 62 3.2041 - - - - -
0.6429 63 3.2087 - - - - -
0.6531 64 8.0364 - - - - -
0.6633 65 0.7748 - - - - -
0.6735 66 4.7505 - - - - -
0.6837 67 2.2919 - - - - -
0.6939 68 0.6432 - - - - -
0.7041 69 0.97 - - - - -
0.7143 70 4.787 - - - - -
0.7245 71 2.6329 - - - - -
0.7347 72 1.2897 - - - - -
0.7449 73 2.2093 - - - - -
0.7551 74 1.7263 - - - - -
0.7653 75 0.9284 - - - - -
0.7755 76 0.2508 - - - - -
0.7857 77 0.0072 - - - - -
0.7959 78 0.1753 - - - - -
0.8061 79 1.2562 - - - - -
0.8163 80 0.1105 - - - - -
0.8265 81 4.0241 - - - - -
0.8367 82 1.655 - - - - -
0.8469 83 0.0406 - - - - -
0.8571 84 0.0033 - - - - -
0.8673 85 3.2183 - - - - -
0.8776 86 0.1812 - - - - -
0.8878 87 0.2222 - - - - -
0.8980 88 0.6726 - - - - -
0.9082 89 3.5891 - - - - -
0.9184 90 0.3833 - - - - -
0.9286 91 0.0257 - - - - -
0.9388 92 4.635 - - - - -
0.9490 93 2.1625 - - - - -
0.9592 94 0.3742 - - - - -
0.9694 95 0.1946 - - - - -
0.9796 96 0.2705 - - - - -
0.9898 97 12.4745 - - - - -
1.0 98 1.718 0.4242 0.4158 0.3964 0.3525 0.3108
1.0102 99 5.4827 - - - - -
1.0204 100 7.4285 - - - - -
1.0306 101 2.6083 - - - - -
1.0408 102 0.2821 - - - - -
1.0510 103 0.2032 - - - - -
1.0612 104 0.2603 - - - - -
1.0714 105 0.0869 - - - - -
1.0816 106 0.0194 - - - - -
1.0918 107 0.0118 - - - - -
1.1020 108 3.5743 - - - - -
1.1122 109 0.5869 - - - - -
1.1224 110 0.0305 - - - - -
1.1327 111 0.4096 - - - - -
1.1429 112 2.2927 - - - - -
1.1531 113 1.5007 - - - - -
1.1633 114 1.2148 - - - - -
1.1735 115 0.0026 - - - - -
1.1837 116 0.4087 - - - - -
1.1939 117 0.0577 - - - - -
1.2041 118 5.2828 - - - - -
1.2143 119 0.5063 - - - - -
1.2245 120 0.0159 - - - - -
1.2347 121 0.0006 - - - - -
1.2449 122 0.0429 - - - - -
1.2551 123 1.1297 - - - - -
1.2653 124 0.9201 - - - - -
1.2755 125 0.0284 - - - - -
1.2857 126 1.9473 - - - - -
1.2959 127 0.022 - - - - -
1.3061 128 0.0054 - - - - -
1.3163 129 0.1004 - - - - -
1.3265 130 0.0276 - - - - -
1.3367 131 2.3906 - - - - -
1.3469 132 0.0375 - - - - -
1.3571 133 4.9546 - - - - -
1.3673 134 0.1619 - - - - -
1.3776 135 0.0087 - - - - -
1.3878 136 0.3457 - - - - -
1.3980 137 0.0816 - - - - -
1.4082 138 1.1452 - - - - -
1.4184 139 0.5385 - - - - -
1.4286 140 0.1222 - - - - -
1.4388 141 0.3915 - - - - -
1.4490 142 3.0359 - - - - -
1.4592 143 0.2768 - - - - -
1.4694 144 0.6184 - - - - -
1.4796 145 2.7128 - - - - -
1.4898 146 0.2769 - - - - -
1.5 147 0.0037 - - - - -
1.5102 148 1.0417 - - - - -
1.5204 149 1.4451 - - - - -
1.5306 150 6.425 - - - - -
1.5408 151 0.3295 - - - - -
1.5510 152 0.0203 - - - - -
1.5612 153 0.0204 - - - - -
1.5714 154 0.0023 - - - - -
1.5816 155 0.1413 - - - - -
1.5918 156 1.0637 - - - - -
1.6020 157 0.1995 - - - - -
1.6122 158 0.0941 - - - - -
1.6224 159 3.9788 - - - - -
1.6327 160 0.5844 - - - - -
1.6429 161 3.5071 - - - - -
1.6531 162 7.8894 - - - - -
1.6633 163 3.4079 - - - - -
1.6735 164 7.5755 - - - - -
1.6837 165 0.7972 - - - - -
1.6939 166 0.0106 - - - - -
1.7041 167 0.5323 - - - - -
1.7143 168 0.0157 - - - - -
1.7245 169 1.2181 - - - - -
1.7347 170 0.0096 - - - - -
1.7449 171 0.0152 - - - - -
1.7551 172 0.068 - - - - -
1.7653 173 0.0014 - - - - -
1.7755 174 0.0034 - - - - -
1.7857 175 0.0006 - - - - -
1.7959 176 0.4503 - - - - -
1.8061 177 4.1669 - - - - -
1.8163 178 0.6081 - - - - -
1.8265 179 2.4056 - - - - -
1.8367 180 0.5261 - - - - -
1.8469 181 0.2616 - - - - -
1.8571 182 0.2859 - - - - -
1.8673 183 6.4765 - - - - -
1.8776 184 0.0109 - - - - -
1.8878 185 0.0034 - - - - -
1.8980 186 0.1816 - - - - -
1.9082 187 0.039 - - - - -
1.9184 188 0.0239 - - - - -
1.9286 189 2.548 - - - - -
1.9388 190 1.4144 - - - - -
1.9490 191 0.0047 - - - - -
1.9592 192 0.0127 - - - - -
1.9694 193 2.928 - - - - -
1.9796 194 0.0012 - - - - -
1.9898 195 0.1156 - - - - -
2.0 196 0.0001 0.4222 0.4141 0.3954 0.3532 0.3121
2.0102 197 0.768 - - - - -
2.0204 198 0.0073 - - - - -
2.0306 199 1.6622 - - - - -
2.0408 200 0.0003 - - - - -
2.0510 201 0.0398 - - - - -
2.0612 202 0.0001 - - - - -
2.0714 203 0.3767 - - - - -
2.0816 204 0.4468 - - - - -
2.0918 205 0.1021 - - - - -
2.1020 206 1.5802 - - - - -
2.1122 207 0.1798 - - - - -
2.1224 208 0.0015 - - - - -
2.1327 209 0.0055 - - - - -
2.1429 210 0.6201 - - - - -
2.1531 211 1.263 - - - - -
2.1633 212 0.0194 - - - - -
2.1735 213 0.0005 - - - - -
2.1837 214 10.7772 - - - - -
2.1939 215 1.4789 - - - - -
2.2041 216 0.3912 - - - - -
2.2143 217 0.2786 - - - - -
2.2245 218 0.6376 - - - - -
2.2347 219 0.0059 - - - - -
2.2449 220 1.3822 - - - - -
2.2551 221 1.2364 - - - - -
2.2653 222 2.8296 - - - - -
2.2755 223 0.47 - - - - -
2.2857 224 1.2266 - - - - -
2.2959 225 0.0115 - - - - -
2.3061 226 0.017 - - - - -
2.3163 227 0.0165 - - - - -
2.3265 228 0.0807 - - - - -
2.3367 229 0.3864 - - - - -
2.3469 230 0.2179 - - - - -
2.3571 231 9.596 - - - - -
2.3673 232 3.8921 - - - - -
2.3776 233 0.0677 - - - - -
2.3878 234 0.0184 - - - - -
2.3980 235 0.1947 - - - - -
2.4082 236 0.5775 - - - - -
2.4184 237 0.1769 - - - - -
2.4286 238 0.0112 - - - - -
2.4388 239 9.3438 - - - - -
2.4490 240 0.092 - - - - -
2.4592 241 0.8527 - - - - -
2.4694 242 0.1134 - - - - -
2.4796 243 0.0002 - - - - -
2.4898 244 0.0092 - - - - -
2.5 245 0.002 - - - - -
2.5102 246 9.4742 - - - - -
2.5204 247 8.5164 - - - - -
2.5306 248 2.4357 - - - - -
2.5408 249 1.1891 - - - - -
2.5510 250 4.1178 - - - - -
2.5612 251 0.001 - - - - -
2.5714 252 0.1828 - - - - -
2.5816 253 4.9505 - - - - -
2.5918 254 0.8772 - - - - -
2.6020 255 0.054 - - - - -
2.6122 256 1.2223 - - - - -
2.6224 257 0.5202 - - - - -
2.6327 258 0.002 - - - - -
2.6429 259 0.0017 - - - - -
2.6531 260 0.0026 - - - - -
2.6633 261 0.4856 - - - - -
2.6735 262 0.0067 - - - - -
2.6837 263 1.2193 - - - - -
2.6939 264 2.4912 - - - - -
2.7041 265 0.0031 - - - - -
2.7143 266 0.5973 - - - - -
2.7245 267 0.0007 - - - - -
2.7347 268 1.3781 - - - - -
2.7449 269 0.0083 - - - - -
2.7551 270 0.0001 - - - - -
2.7653 271 0.2631 - - - - -
2.7755 272 0.0525 - - - - -
2.7857 273 0.0008 - - - - -
2.7959 274 0.0738 - - - - -
2.8061 275 0.0019 - - - - -
2.8163 276 0.0008 - - - - -
2.8265 277 0.4261 - - - - -
2.8367 278 0.0072 - - - - -
2.8469 279 1.9606 - - - - -
2.8571 280 0.0348 - - - - -
2.8673 281 0.1742 - - - - -
2.8776 282 0.0018 - - - - -
2.8878 283 0.3129 - - - - -
2.8980 284 0.3552 - - - - -
2.9082 285 1.901 - - - - -
2.9184 286 0.1566 - - - - -
2.9286 287 0.0247 - - - - -
2.9388 288 0.0009 - - - - -
2.9490 289 0.0001 - - - - -
2.9592 290 0.0004 - - - - -
2.9694 291 0.0262 - - - - -
2.9796 292 0.0334 - - - - -
2.9898 293 0.0146 - - - - -
3.0 294 0.0044 0.4480 0.4383 0.4153 0.3779 0.3315
3.0102 295 0.2686 - - - - -
3.0204 296 0.0008 - - - - -
3.0306 297 0.0106 - - - - -
3.0408 298 0.0551 - - - - -
3.0510 299 1.2816 - - - - -
3.0612 300 0.002 - - - - -
3.0714 301 0.0406 - - - - -
3.0816 302 0.0081 - - - - -
3.0918 303 0.0064 - - - - -
3.1020 304 0.0061 - - - - -
3.1122 305 0.4775 - - - - -
3.1224 306 0.3185 - - - - -
3.1327 307 0.0105 - - - - -
3.1429 308 0.0001 - - - - -
3.1531 309 10.5217 - - - - -
3.1633 310 0.0041 - - - - -
3.1735 311 0.1077 - - - - -
3.1837 312 0.0984 - - - - -
3.1939 313 0.0279 - - - - -
3.2041 314 0.0009 - - - - -
3.2143 315 0.1379 - - - - -
3.2245 316 0.0 - - - - -
3.2347 317 0.0003 - - - - -
3.2449 318 0.0852 - - - - -
3.2551 319 0.0015 - - - - -
3.2653 320 0.0011 - - - - -
3.2755 321 0.0006 - - - - -
3.2857 322 1.2658 - - - - -
3.2959 323 0.0457 - - - - -
3.3061 324 0.0111 - - - - -
3.3163 325 1.0571 - - - - -
3.3265 326 0.0001 - - - - -
3.3367 327 0.0014 - - - - -
3.3469 328 0.3352 - - - - -
3.3571 329 1.3782 - - - - -
3.3673 330 0.008 - - - - -
3.3776 331 0.0007 - - - - -
3.3878 332 0.0018 - - - - -
3.3980 333 0.1579 - - - - -
3.4082 334 0.3014 - - - - -
3.4184 335 0.0626 - - - - -
3.4286 336 0.0074 - - - - -
3.4388 337 0.002 - - - - -
3.4490 338 0.0047 - - - - -
3.4592 339 0.0601 - - - - -
3.4694 340 0.0119 - - - - -
3.4796 341 0.0003 - - - - -
3.4898 342 0.0319 - - - - -
3.5 343 0.024 - - - - -
3.5102 344 0.0034 - - - - -
3.5204 345 0.1909 - - - - -
3.5306 346 0.08 - - - - -
3.5408 347 0.0003 - - - - -
3.5510 348 0.0396 - - - - -
3.5612 349 0.0127 - - - - -
3.5714 350 0.0146 - - - - -
3.5816 351 0.0916 - - - - -
3.5918 352 0.075 - - - - -
3.6020 353 0.0012 - - - - -
3.6122 354 0.4742 - - - - -
3.6224 355 0.0002 - - - - -
3.6327 356 0.0332 - - - - -
3.6429 357 0.1531 - - - - -
3.6531 358 0.0094 - - - - -
3.6633 359 0.0141 - - - - -
3.6735 360 0.005 - - - - -
3.6837 361 0.0292 - - - - -
3.6939 362 0.0856 - - - - -
3.7041 363 0.5175 - - - - -
3.7143 364 0.7858 - - - - -
3.7245 365 0.0228 - - - - -
3.7347 366 0.0007 - - - - -
3.7449 367 0.1121 - - - - -
3.7551 368 0.0003 - - - - -
3.7653 369 0.1813 - - - - -
3.7755 370 0.0109 - - - - -
3.7857 371 0.0042 - - - - -
3.7959 372 0.0002 - - - - -
3.8061 373 0.0645 - - - - -
3.8163 374 0.0001 - - - - -
3.8265 375 0.0007 - - - - -
3.8367 376 0.0001 - - - - -
3.8469 377 0.0004 - - - - -
3.8571 378 0.0008 - - - - -
3.8673 379 0.0635 - - - - -
3.8776 380 0.0009 - - - - -
3.8878 381 0.9885 - - - - -
3.8980 382 0.0363 - - - - -
3.9082 383 0.144 - - - - -
3.9184 384 1.6117 - - - - -
3.9286 385 0.6172 - - - - -
3.9388 386 0.0111 - - - - -
3.9490 387 0.0106 - - - - -
3.9592 388 0.0252 - - - - -
3.9694 389 0.0249 - - - - -
3.9796 390 0.0537 - - - - -
3.9898 391 0.0229 - - - - -
4.0 392 0.0001 0.4438 0.4303 0.4204 0.3813 0.3314
4.0102 393 0.2346 - - - - -
4.0204 394 0.0079 - - - - -
4.0306 395 0.0058 - - - - -
4.0408 396 0.0035 - - - - -
4.0510 397 0.0002 - - - - -
4.0612 398 0.028 - - - - -
4.0714 399 0.0001 - - - - -
4.0816 400 0.0003 - - - - -
4.0918 401 0.0121 - - - - -
4.1020 402 0.1073 - - - - -
4.1122 403 0.0012 - - - - -
4.1224 404 0.0003 - - - - -
4.1327 405 0.0025 - - - - -
4.1429 406 0.0097 - - - - -
4.1531 407 0.0127 - - - - -
4.1633 408 0.0001 - - - - -
4.1735 409 0.007 - - - - -
4.1837 410 0.0154 - - - - -
4.1939 411 0.0002 - - - - -
4.2041 412 0.0207 - - - - -
4.2143 413 0.0682 - - - - -
4.2245 414 0.1168 - - - - -
4.2347 415 0.0019 - - - - -
4.2449 416 1.7119 - - - - -
4.2551 417 0.0001 - - - - -
4.2653 418 0.0004 - - - - -
4.2755 419 3.5151 - - - - -
4.2857 420 7.6674 - - - - -
4.2959 421 2.1193 - - - - -
4.3061 422 1.1982 - - - - -
4.3163 423 0.0018 - - - - -
4.3265 424 0.0008 - - - - -
4.3367 425 0.0581 - - - - -
4.3469 426 0.0319 - - - - -
4.3571 427 0.0041 - - - - -
4.3673 428 0.0 - - - - -
4.3776 429 0.0001 - - - - -
4.3878 430 0.0005 - - - - -
4.3980 431 0.0002 - - - - -
4.4082 432 0.0012 - - - - -
4.4184 433 0.0395 - - - - -
4.4286 434 0.001 - - - - -
4.4388 435 0.0006 - - - - -
4.4490 436 0.0262 - - - - -
4.4592 437 4.1211 - - - - -
4.4694 438 0.0119 - - - - -
4.4796 439 0.0006 - - - - -
4.4898 440 0.0865 - - - - -
4.5 441 0.0007 - - - - -
4.5102 442 0.0011 - - - - -
4.5204 443 0.0804 - - - - -
4.5306 444 0.0596 - - - - -
4.5408 445 0.0006 - - - - -
4.5510 446 0.0019 - - - - -
4.5612 447 0.5596 - - - - -
4.5714 448 0.0018 - - - - -
4.5816 449 0.0379 - - - - -
4.5918 450 0.0076 - - - - -
4.6020 451 0.0012 - - - - -
4.6122 452 0.0006 - - - - -
4.6224 453 0.6476 - - - - -
4.6327 454 0.0 - - - - -
4.6429 455 0.0214 - - - - -
4.6531 456 0.0005 - - - - -
4.6633 457 4.8527 - - - - -
4.6735 458 0.4774 - - - - -
4.6837 459 0.0003 - - - - -
4.6939 460 0.0001 - - - - -
4.7041 461 0.0075 - - - - -
4.7143 462 0.0001 - - - - -
4.7245 463 7.4959 - - - - -
4.7347 464 0.0 - - - - -
4.7449 465 2.1102 - - - - -
4.7551 466 0.0027 - - - - -
4.7653 467 0.0035 - - - - -
4.7755 468 0.574 - - - - -
4.7857 469 0.0191 - - - - -
4.7959 470 0.0214 - - - - -
4.8061 471 0.0016 - - - - -
4.8163 472 0.0003 - - - - -
4.8265 473 0.0003 - - - - -
4.8367 474 0.0038 - - - - -
4.8469 475 0.0 - - - - -
4.8571 476 0.4292 - - - - -
4.8673 477 0.0009 - - - - -
4.8776 478 0.041 - - - - -
4.8878 479 0.0909 - - - - -
4.8980 480 0.0024 - - - - -
4.9082 481 0.0001 - - - - -
4.9184 482 0.3607 - - - - -
4.9286 483 0.994 - - - - -
4.9388 484 0.0186 - - - - -
4.9490 485 0.206 - - - - -
4.9592 486 0.0008 - - - - -
4.9694 487 0.0006 - - - - -
4.9796 488 0.2176 - - - - -
4.9898 489 0.2219 - - - - -
5.0 490 0.0112 0.4349 0.4291 0.4190 0.3825 0.3352
5.0102 491 0.0005 - - - - -
5.0204 492 0.0016 - - - - -
5.0306 493 0.0091 - - - - -
5.0408 494 0.0467 - - - - -
5.0510 495 0.0229 - - - - -
5.0612 496 0.0 - - - - -
5.0714 497 0.0014 - - - - -
5.0816 498 0.0045 - - - - -
5.0918 499 0.0002 - - - - -
5.1020 500 0.105 - - - - -
5.1122 501 0.0 - - - - -
5.1224 502 0.0063 - - - - -
5.1327 503 0.0242 - - - - -
5.1429 504 0.0 - - - - -
5.1531 505 0.0033 - - - - -
5.1633 506 0.0004 - - - - -
5.1735 507 0.0014 - - - - -
5.1837 508 0.0027 - - - - -
5.1939 509 2.3163 - - - - -
5.2041 510 0.5547 - - - - -
5.2143 511 0.0802 - - - - -
5.2245 512 0.0011 - - - - -
5.2347 513 0.0001 - - - - -
5.2449 514 0.0109 - - - - -
5.2551 515 0.0044 - - - - -
5.2653 516 0.0036 - - - - -
5.2755 517 0.0018 - - - - -
5.2857 518 0.0073 - - - - -
5.2959 519 0.0025 - - - - -
5.3061 520 0.0001 - - - - -
5.3163 521 0.0031 - - - - -
5.3265 522 0.1512 - - - - -
5.3367 523 0.0001 - - - - -
5.3469 524 0.0169 - - - - -
5.3571 525 0.0021 - - - - -
5.3673 526 0.0088 - - - - -
5.3776 527 0.0003 - - - - -
5.3878 528 0.0308 - - - - -
5.3980 529 0.0 - - - - -
5.4082 530 0.3433 - - - - -
5.4184 531 0.0003 - - - - -
5.4286 532 0.0036 - - - - -
5.4388 533 0.0008 - - - - -
5.4490 534 0.0056 - - - - -
5.4592 535 0.0028 - - - - -
5.4694 536 0.0009 - - - - -
5.4796 537 0.0015 - - - - -
5.4898 538 0.0023 - - - - -
5.5 539 0.0007 - - - - -
5.5102 540 0.0001 - - - - -
5.5204 541 0.0231 - - - - -
5.5306 542 0.1314 - - - - -
5.5408 543 4.2928 - - - - -
5.5510 544 0.0168 - - - - -
5.5612 545 0.0002 - - - - -
5.5714 546 0.0003 - - - - -
5.5816 547 0.0051 - - - - -
5.5918 548 0.0001 - - - - -
5.6020 549 0.003 - - - - -
5.6122 550 0.0037 - - - - -
5.6224 551 0.0047 - - - - -
5.6327 552 0.0042 - - - - -
5.6429 553 0.0011 - - - - -
5.6531 554 0.0007 - - - - -
5.6633 555 0.0036 - - - - -
5.6735 556 0.0572 - - - - -
5.6837 557 0.4782 - - - - -
5.6939 558 0.0033 - - - - -
5.7041 559 0.0453 - - - - -
5.7143 560 0.0006 - - - - -
5.7245 561 0.0003 - - - - -
5.7347 562 0.0018 - - - - -
5.7449 563 0.0589 - - - - -
5.7551 564 0.0001 - - - - -
5.7653 565 0.0013 - - - - -
5.7755 566 0.0001 - - - - -
5.7857 567 0.0011 - - - - -
5.7959 568 0.0019 - - - - -
5.8061 569 0.0055 - - - - -
5.8163 570 0.6808 - - - - -
5.8265 571 0.0007 - - - - -
5.8367 572 0.0008 - - - - -
5.8469 573 0.7029 - - - - -
5.8571 574 0.003 - - - - -
5.8673 575 0.0008 - - - - -
5.8776 576 0.0001 - - - - -
5.8878 577 3.5868 - - - - -
5.8980 578 0.0019 - - - - -
5.9082 579 0.0023 - - - - -
5.9184 580 0.0625 - - - - -
5.9286 581 0.1886 - - - - -
5.9388 582 0.0253 - - - - -
5.9490 583 0.6732 - - - - -
5.9592 584 0.0001 - - - - -
5.9694 585 0.239 - - - - -
5.9796 586 5.5812 - - - - -
5.9898 587 0.0129 - - - - -
6.0 588 0.0002 0.4292 0.4199 0.4023 0.3710 0.3219

Framework Versions

  • Python: 3.12.11
  • Sentence Transformers: 5.1.0
  • Transformers: 4.51.3
  • PyTorch: 2.8.0+cu126
  • Accelerate: 1.10.1
  • Datasets: 4.0.0
  • Tokenizers: 0.21.4

Citation

BibTeX

Sentence Transformers

@inproceedings{reimers-2019-sentence-bert,
    title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
    author = "Reimers, Nils and Gurevych, Iryna",
    booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
    month = "11",
    year = "2019",
    publisher = "Association for Computational Linguistics",
    url = "https://arxiv.org/abs/1908.10084",
}

MatryoshkaLoss

@misc{kusupati2024matryoshka,
    title={Matryoshka Representation Learning},
    author={Aditya Kusupati and Gantavya Bhatt and Aniket Rege and Matthew Wallingford and Aditya Sinha and Vivek Ramanujan and William Howard-Snyder and Kaifeng Chen and Sham Kakade and Prateek Jain and Ali Farhadi},
    year={2024},
    eprint={2205.13147},
    archivePrefix={arXiv},
    primaryClass={cs.LG}
}

MultipleNegativesRankingLoss

@misc{henderson2017efficient,
    title={Efficient Natural Language Response Suggestion for Smart Reply},
    author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
    year={2017},
    eprint={1705.00652},
    archivePrefix={arXiv},
    primaryClass={cs.CL}
}