multilingual-e5-large
This is a sentence-transformers model finetuned from intfloat/multilingual-e5-large. It maps sentences & paragraphs to a 1024-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
Model Details
Model Description
- Model Type: Sentence Transformer
- Base model: intfloat/multilingual-e5-large
- Maximum Sequence Length: 512 tokens
- Output Dimensionality: 1024 dimensions
- Similarity Function: Cosine Similarity
- Language: en
- License: apache-2.0
Model Sources
Full Model Architecture
SentenceTransformer(
(0): Transformer({'max_seq_length': 512, 'do_lower_case': False, 'architecture': 'XLMRobertaModel'})
(1): Pooling({'word_embedding_dimension': 1024, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
(2): Normalize()
)
Usage
Direct Usage (Sentence Transformers)
First install the Sentence Transformers library:
pip install -U sentence-transformers
Then you can load this model and run inference.
from sentence_transformers import SentenceTransformer
model = SentenceTransformer("sentence_transformers_model_id")
sentences = [
"What are the defendant's claims described as?",
'**Court (Civil/Criminal): Civil** \n**Provisions:** \n**Time of commission of the act:** \n**Outcome (not guilty, guilty):** \n**Reasoning:** Partially accepts the lawsuit. \n**Facts:** The plaintiff, who works as a lawyer, maintains a savings account with the defendant banking corporation under account number GR.............. Pursuant to a contract dated June 11, 2010, established in Thessaloniki between the defendant and the plaintiff, the plaintiff was granted access to the electronic banking system (e-banking) to conduct banking transactions remotely. On October 10, 2020, the plaintiff fell victim to electronic fraud through the "phishing" method, whereby an unknown perpetrator managed to extract and transfer €3,000.00 from the plaintiff’s account to another account of the same bank. Specifically, on that day at 6:51 a.m., the plaintiff received an email from the sender ".........", with the address ..........., informing him that his debit card had been suspended and that online payments and cash withdrawals could not be made until the issue was resolved. The email urged him to confirm his details within the next 72 hours by following a link titled "card activation." \nThe plaintiff read the above email on his mobile phone around 8:00 a.m., and believing it came from the defendant, he followed the instructions and accessed a website that was identical (a clone) to that of the defendant. On this page, he was asked to enter his login credentials to connect to the service, which he did, and he was subsequently asked to input his debit card details for the alleged activation, which he also provided. Then, to complete the process, a number was sent to his mobile phone at 8:07 a.m. from the sender ........, which he entered, and two minutes later he received a message from the same sender in English stating that the quick access code had been activated on his mobile. A few minutes later, at 8:18 a.m., he received an email from the defendant informing him of the transfer of €3,000.00 from his account to account number GR ........... held at the same bank, with the beneficiary\'s details being .......... As soon as the plaintiff read this, he immediately called the defendant\'s call center and canceled his debit card, the access codes for the service ......., and locked the application .......... At the same time, he verbally submitted a request to dispute and cancel the contested transaction, and in a subsequent phone call, he also canceled his credit card. On the same day, he also sent an email to the defendant informing them in writing of the above and requesting the cancellation of the transaction and the return of the amount of €3,000.00 to his account, as this transfer was not made by him but by an unknown perpetrator through electronic fraud and was not approved by him. It should also be noted that the plaintiff, as the sole beneficiary according to the aforementioned contract for using the defendant\'s Internet Banking service, never received any update via SMS or the VIBER application from the bank regarding the transaction details before its completion, nor did he receive a one-time code (OTP) to approve the contested transaction. He subsequently filed a complaint against unknown persons at the Cyber Crime Division for the crime of fraud. The defendant sent an email to the plaintiff on October 16, 2020, informing him that his request had been forwarded to the appropriate department of the bank for investigation, stating that the bank would never send him an email or SMS asking him to enter his personal data and that as of October 7, 2020, there was a notice posted for its customers regarding malicious attempts to steal personal data in the "Our News" section on ....... A month after the disputed incident, on November 10, 2020, an amount of €2,296.82 was transferred to the plaintiff\'s account from the account to which the fraudulent credit had been made. The plaintiff immediately sent an email to the defendant asking to be informed whether this transfer was a return of part of the amount that had been illegally withdrawn from his account and requested the return of the remaining amount of €703.18. In its response dated January 13, 2021, the defendant confirmed that the aforementioned amount indeed came from the account to which the fraudulent credit had been made, following a freeze of that account initiated by the defendant during the investigation of the incident, but refused to return the remaining amount, claiming it bore no responsibility for the leak of the personal codes to third parties, according to the terms of the service contract established between them. \nFrom the entirety of the evidence presented to the court, there is no indication of the authenticity of the contested transaction, as the plaintiff did not give his consent for the execution of the transfer of the amount of €3,000.00, especially in light of the provision in Article 72 paragraph 2 of Law 4537/2018 stating that the mere use of the Internet Banking service by the plaintiff does not necessarily constitute sufficient evidence that the payer approved the payment action. Specifically, it was proven that the contested transaction was not carried out following a strong identification of the plaintiff – the sole beneficiary of the account – and his approval, as the latter may have entered his personal codes on the counterfeit website; however, he was never informed, before the completion of the contested transaction, of the amount that would be transferred from his account to a third-party account, nor did he receive on his mobile phone, either via SMS or through the VIBER application or any other means, the one-time code - extra PIN for its completion, which he was required to enter to approve the contested transaction (payment action) and thus complete his identification, a fact that was not countered by any evidence from the defendant. Furthermore, it is noted that the defendant\'s claims that it bears no responsibility under the terms of the banking services contract, whereby it is not liable for any damage to its customer in cases of unauthorized use of their personal access codes to the Internet Banking service, are to be rejected as fundamentally unfounded. This is because the aforementioned contractual terms are invalid according to the provision of Article 103 of Law 4537/2018, as they contradict the provisions of Articles 71, 73, and 92 of the same Law, which provide for the provider\'s universal liability and its exemption only for unusual and unforeseen circumstances that are beyond the control of the party invoking them and whose consequences could not have been avoided despite all efforts to the contrary; these provisions establish mandatory law in favor of users, as according to Article 103 of Law 4537/2018, payment service providers are prohibited from deviating from the provisions to the detriment of payment service users, unless the possibility of deviation is explicitly provided and they can decide to offer only more favorable terms to payment service users; the aforementioned contractual terms do not constitute more favorable terms but rather disadvantageous terms for the payment service user. In this case, however, the defendant did not prove the authenticity of the transaction and its approval by the plaintiff and did not invoke, nor did any unusual and unforeseen circumstances beyond its control, the consequences of which could not have been avoided despite all efforts to the contrary, come to light. Therefore, the contested transaction transferring the amount of €3,000.00 is considered, in the absence of demonstrable consent from the plaintiff, unapproved according to the provisions of Article 64 of Law 4537/2018, and the defendant\'s contrary claims are rejected, especially since the plaintiff proceeded, according to Article 71 paragraph 1 of Law 4537/2018, without undue delay to notify the defendant regarding the contested unapproved payment action. Consequently, the defendant is liable for compensating the plaintiff for the positive damage he suffered under Article 73 of Law 4537/2018 and is obliged to pay him the requested amount of €703.18, while the plaintiff’s fault in the occurrence of this damage cannot be established, as he entered his personal details in an online environment that was a faithful imitation of that of the defendant, as evidenced by the comparison of the screenshots of the fake website and the real website provided by the plaintiff, a fact that he could not have known while being fully convinced that he was transacting with the defendant. Furthermore, the defendant’s liability to compensate the plaintiff is based on the provision of Article 8 of Law 2251/1994, which applies in this case, as the plaintiff\'s damage resulted from inadequate fulfillment of its obligations in the context of providing its services, but also on the provision of Article 914 of the Civil Code in the sense of omission on its part of unlawfully and culpably imposed actions. In this case, given that during the relevant period there had been a multitude of similar incidents of fraud against the defendant\'s customers, the latter, as a service provider to the consumer public and bearing transactional obligations of care and security towards them, displayed gross negligence regarding the security provided for electronic transaction services, which was compromised by the fraudulent theft of funds, as it did not comply with all required high-security measures for executing the contested transaction, failing to implement the strict customer identification verification process and to check the authenticity of the account to which the funds were sent, thus not assuming the suspicious nature of the transaction, did not adopt comprehensive and improved protective measures to fully protect its customers against malicious attacks and online fraud and to prevent the infiltration of unauthorized third parties, nor did it fulfill its obligations to inform, accurately inform, and warn its consumers - customers, as it failed to adequately inform them of attempts to steal their personal data through the sending of informative emails or SMS, while merely posting in a section rather than on a central banner (as it later did) does not constitute adequate information such that it meets the requirement of protecting its customers and the increased safeguarding of their interests. Although the plaintiff acted promptly and informed the defendant on the same day about the contested incident, the defendant did not act as promptly regarding the investigation of the incident and the freezing of the account that held the fraudulent credit to prevent the plaintiff\'s loss, but only returned part of the funds to the plaintiff a month later. This behavior, beyond being culpable due to gross negligence, was also unlawful, as it would have been illegal even without the contractual relationship, as contrary to the provisions of Law 4537/2018 and Law 2251/1994, regarding the lack of security of the services that the consumer is legitimately entitled to expect, as well as the building of trust that is essential in banking transactions, elements that it was obligated to provide within the sphere of the services offered, and contrary to the principles of good faith and commercial ethics, as crystallized in the provision of Article 288 of the Civil Code, as well as the general duty imposed by Article 914 of the Civil Code not to cause harm to another culpably. This resulted not only in positive damage to the plaintiff but also in causing him moral harm consisting of his mental distress and the disruption, agitation, and sorrow he experienced, for which he must be awarded financial compensation. Taking into account all the general circumstances of the case, the extent of the plaintiff\'s damage, the severity of the defendant\'s fault, the mental distress suffered by the plaintiff, the insecurity he felt regarding his deposits, the sorrow he experienced, and the stress caused by his financial loss, which occurred during the pandemic period when his earnings from his professional activity had significantly decreased, as well as the financial and social situation of the parties, it is the court\'s opinion that he should be granted, as financial compensation for his moral harm, an amount of €250.00, which is deemed reasonable and fair. Therefore, the total monetary amount that the plaintiff is entitled to for his positive damage and financial compensation for the moral harm suffered amounts to a total of (€703.18 + €250.00) = €953.18.',
"1.Without prejudice to other tasks set out under this Regulation, each supervisory authority shall on its territory: (a) monitor and enforce the application of this Regulation; (b) promote public awareness and understanding of the risks, rules, safeguards and rights in relation to processing. Activities addressed specifically to children shall receive specific attention; (c) advise, in accordance with Member State law, the national parliament, the government, and other institutions and bodies on legislative and administrative measures relating to the protection of natural persons' rights and freedoms with regard to processing; (d) promote the awareness of controllers and processors of their obligations under this Regulation; (e) upon request, provide information to any data subject concerning the exercise of their rights under this Regulation and, if appropriate, cooperate with the supervisory authorities in other Member States to that end; (f) handle complaints lodged by a data subject, or by a body, organisation or association in accordance with Article 80, and investigate, to the extent appropriate, the subject matter of the complaint and inform the complainant of the progress and the outcome of the investigation within a reasonable period, in particular if further investigation or coordination with another supervisory authority is necessary; (g) cooperate with, including sharing information and provide mutual assistance to, other supervisory authorities with a view to ensuring the consistency of application and enforcement of this Regulation; (h) conduct investigations on the application of this Regulation, including on the basis of information received from another supervisory authority or other public authority; (i) monitor relevant developments, insofar as they have an impact on the protection of personal data, in particular the development of information and communication technologies and commercial practices; (j) adopt standard contractual clauses referred to in Article 28(8) and in point (d) of Article 46(2); (k) establish and maintain a list in relation to the requirement for data protection impact assessment pursuant to Article 35(4); (l) give advice on the processing operations referred to in Article 36(2); (m) encourage the drawing up of codes of conduct pursuant to Article 40(1) and provide an opinion and approve such codes of conduct which provide sufficient safeguards, pursuant to Article 40(5); (n) encourage the establishment of data protection certification mechanisms and of data protection seals and marks pursuant to Article 42(1), and approve the criteria of certification pursuant to Article 42(5); (o) where applicable, carry out a periodic review of certifications issued in accordance with Article 42(7); 4.5.2016 L 119/68 (p) draft and publish the criteria for accreditation of a body for monitoring codes of conduct pursuant to Article 41 and of a certification body pursuant to Article 43; (q) conduct the accreditation of a body for monitoring codes of conduct pursuant to Article 41 and of a certification body pursuant to Article 43; (r) authorise contractual clauses and provisions referred to in Article 46(3); (s) approve binding corporate rules pursuant to Article 47; (t) contribute to the activities of the Board; (u) keep internal records of infringements of this Regulation and of measures taken in accordance with Article 58(2); and (v) fulfil any other tasks related to the protection of personal data.\n2.Each supervisory authority shall facilitate the submission of complaints referred to in point (f) of paragraph 1 by measures such as a complaint submission form which can also be completed electronically, without excluding other means of communication.\n3.The performance of the tasks of each supervisory authority shall be free of charge for the data subject and, where applicable, for the data protection officer.\n4.Where requests are manifestly unfounded or excessive, in particular because of their repetitive character, the supervisory authority may charge a reasonable fee based on administrative costs, or refuse to act on the request. The supervisory authority shall bear the burden of demonstrating the manifestly unfounded or excessive character of the request.",
]
embeddings = model.encode(sentences)
print(embeddings.shape)
similarities = model.similarity(embeddings, embeddings)
print(similarities)
Evaluation
Metrics
Information Retrieval
| Metric |
Value |
| cosine_accuracy@1 |
0.4268 |
| cosine_accuracy@3 |
0.4646 |
| cosine_accuracy@5 |
0.4924 |
| cosine_accuracy@10 |
0.5581 |
| cosine_precision@1 |
0.4268 |
| cosine_precision@3 |
0.4125 |
| cosine_precision@5 |
0.3747 |
| cosine_precision@10 |
0.3141 |
| cosine_recall@1 |
0.0983 |
| cosine_recall@3 |
0.2524 |
| cosine_recall@5 |
0.3276 |
| cosine_recall@10 |
0.4365 |
| cosine_ndcg@10 |
0.4842 |
| cosine_mrr@10 |
0.4552 |
| cosine_map@100 |
0.546 |
Information Retrieval
| Metric |
Value |
| cosine_accuracy@1 |
0.4343 |
| cosine_accuracy@3 |
0.4747 |
| cosine_accuracy@5 |
0.5101 |
| cosine_accuracy@10 |
0.5606 |
| cosine_precision@1 |
0.4343 |
| cosine_precision@3 |
0.4192 |
| cosine_precision@5 |
0.3838 |
| cosine_precision@10 |
0.3225 |
| cosine_recall@1 |
0.1006 |
| cosine_recall@3 |
0.2564 |
| cosine_recall@5 |
0.3338 |
| cosine_recall@10 |
0.4402 |
| cosine_ndcg@10 |
0.4935 |
| cosine_mrr@10 |
0.4636 |
| cosine_map@100 |
0.5565 |
Information Retrieval
| Metric |
Value |
| cosine_accuracy@1 |
0.4293 |
| cosine_accuracy@3 |
0.4672 |
| cosine_accuracy@5 |
0.4975 |
| cosine_accuracy@10 |
0.5556 |
| cosine_precision@1 |
0.4293 |
| cosine_precision@3 |
0.4125 |
| cosine_precision@5 |
0.3773 |
| cosine_precision@10 |
0.3232 |
| cosine_recall@1 |
0.0971 |
| cosine_recall@3 |
0.2454 |
| cosine_recall@5 |
0.3179 |
| cosine_recall@10 |
0.4288 |
| cosine_ndcg@10 |
0.4864 |
| cosine_mrr@10 |
0.4565 |
| cosine_map@100 |
0.5433 |
Information Retrieval
| Metric |
Value |
| cosine_accuracy@1 |
0.4091 |
| cosine_accuracy@3 |
0.4343 |
| cosine_accuracy@5 |
0.4672 |
| cosine_accuracy@10 |
0.5429 |
| cosine_precision@1 |
0.4091 |
| cosine_precision@3 |
0.3914 |
| cosine_precision@5 |
0.3571 |
| cosine_precision@10 |
0.3101 |
| cosine_recall@1 |
0.0907 |
| cosine_recall@3 |
0.2304 |
| cosine_recall@5 |
0.2996 |
| cosine_recall@10 |
0.4118 |
| cosine_ndcg@10 |
0.465 |
| cosine_mrr@10 |
0.436 |
| cosine_map@100 |
0.5261 |
Information Retrieval
| Metric |
Value |
| cosine_accuracy@1 |
0.4293 |
| cosine_accuracy@3 |
0.452 |
| cosine_accuracy@5 |
0.4848 |
| cosine_accuracy@10 |
0.5354 |
| cosine_precision@1 |
0.4293 |
| cosine_precision@3 |
0.4074 |
| cosine_precision@5 |
0.3747 |
| cosine_precision@10 |
0.3215 |
| cosine_recall@1 |
0.0938 |
| cosine_recall@3 |
0.2337 |
| cosine_recall@5 |
0.303 |
| cosine_recall@10 |
0.4065 |
| cosine_ndcg@10 |
0.4754 |
| cosine_mrr@10 |
0.451 |
| cosine_map@100 |
0.5323 |
Information Retrieval
| Metric |
Value |
| cosine_accuracy@1 |
0.346 |
| cosine_accuracy@3 |
0.3788 |
| cosine_accuracy@5 |
0.4217 |
| cosine_accuracy@10 |
0.4798 |
| cosine_precision@1 |
0.346 |
| cosine_precision@3 |
0.335 |
| cosine_precision@5 |
0.3162 |
| cosine_precision@10 |
0.2816 |
| cosine_recall@1 |
0.0734 |
| cosine_recall@3 |
0.1885 |
| cosine_recall@5 |
0.2555 |
| cosine_recall@10 |
0.3659 |
| cosine_ndcg@10 |
0.406 |
| cosine_mrr@10 |
0.3744 |
| cosine_map@100 |
0.4621 |
Training Details
Training Dataset
Unnamed Dataset
- Size: 1,580 training samples
- Columns:
anchor and positive
- Approximate statistics based on the first 1000 samples:
|
anchor |
positive |
| type |
string |
string |
| details |
- min: 7 tokens
- mean: 17.19 tokens
- max: 37 tokens
|
- min: 27 tokens
- mean: 378.1 tokens
- max: 512 tokens
|
- Samples:
| anchor |
positive |
What date is mentioned in the text? |
1.The controller and the processor shall ensure that the data protection officer is involved, properly and in a timely manner, in all issues which relate to the protection of personal data. 4.5.2016 L 119/55 2.The controller and processor shall support the data protection officer in performing the tasks referred to in |
Under what condition is the culpable character of the action raised in regards to computer software infringement? |
Any person who, in contravention of the provisions of this law or of the provisions of lawfully ratified multilateral international conventions on the protection of copyright, unlawfully makes a fixation of a work or of copies, reproduces them directly or indirectly, temporarily or permanently in any form, in whole or in part, translates, adapts, alters or transforms them, or distributes them to the public by sale or other means, or possesses with the intent of distributing them, rents, performs in public, broadcasts by radio or television or any other means, communicates to the public works or copies by any means, imports copies of a work illegally produced abroad without the consent of the author and, in general, exploits works, reproductions or copies being the object of copyright or acts against the moral right of the author to decide freely on the publication and the presentation of his work to the public without additions or deletions, shall be liable to imprisonment of no less t... |
Under what circumstances does the Board issue an opinion? |
1.The Board shall issue an opinion where a competent supervisory authority intends to adopt any of the measures below. To that end, the competent supervisory authority shall communicate the draft decision to the Board, when it: (a) aims to adopt a list of the processing operations subject to the requirement for a data protection impact assessment pursuant to Article 35(4); (b) concerns a matter pursuant to Article 40(7) whether a draft code of conduct or an amendment or extension to a code of conduct complies with this Regulation; 4.5.2016 L 119/73 (c) aims to approve the criteria for accreditation of a body pursuant to Article 41(3) or a certification body pursuant to Article 43(3); (d) aims to determine standard data protection clauses referred to in point (d) of Article 46(2) and in Article 28(8); (e) aims to authorise contractual clauses referred to in point (a) of Article 46(3); or (f) aims to approve binding corporate rules within the meaning of Article 47 2.Any superviso... |
- Loss:
MatryoshkaLoss with these parameters:{
"loss": "MultipleNegativesRankingLoss",
"matryoshka_dims": [
1024,
768,
512,
256,
128,
64
],
"matryoshka_weights": [
1,
1,
1,
1,
1,
1
],
"n_dims_per_step": -1
}
Training Hyperparameters
Non-Default Hyperparameters
eval_strategy: epoch
gradient_accumulation_steps: 4
learning_rate: 3e-05
num_train_epochs: 20
lr_scheduler_type: cosine
warmup_ratio: 0.1
bf16: True
load_best_model_at_end: True
optim: adamw_torch_fused
batch_sampler: no_duplicates
All Hyperparameters
Click to expand
overwrite_output_dir: False
do_predict: False
eval_strategy: epoch
prediction_loss_only: True
per_device_train_batch_size: 8
per_device_eval_batch_size: 8
per_gpu_train_batch_size: None
per_gpu_eval_batch_size: None
gradient_accumulation_steps: 4
eval_accumulation_steps: None
torch_empty_cache_steps: None
learning_rate: 3e-05
weight_decay: 0.0
adam_beta1: 0.9
adam_beta2: 0.999
adam_epsilon: 1e-08
max_grad_norm: 1.0
num_train_epochs: 20
max_steps: -1
lr_scheduler_type: cosine
lr_scheduler_kwargs: {}
warmup_ratio: 0.1
warmup_steps: 0
log_level: passive
log_level_replica: warning
log_on_each_node: True
logging_nan_inf_filter: True
save_safetensors: True
save_on_each_node: False
save_only_model: False
restore_callback_states_from_checkpoint: False
no_cuda: False
use_cpu: False
use_mps_device: False
seed: 42
data_seed: None
jit_mode_eval: False
use_ipex: False
bf16: True
fp16: False
fp16_opt_level: O1
half_precision_backend: auto
bf16_full_eval: False
fp16_full_eval: False
tf32: None
local_rank: 0
ddp_backend: None
tpu_num_cores: None
tpu_metrics_debug: False
debug: []
dataloader_drop_last: False
dataloader_num_workers: 0
dataloader_prefetch_factor: None
past_index: -1
disable_tqdm: False
remove_unused_columns: True
label_names: None
load_best_model_at_end: True
ignore_data_skip: False
fsdp: []
fsdp_min_num_params: 0
fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
tp_size: 0
fsdp_transformer_layer_cls_to_wrap: None
accelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
deepspeed: None
label_smoothing_factor: 0.0
optim: adamw_torch_fused
optim_args: None
adafactor: False
group_by_length: False
length_column_name: length
ddp_find_unused_parameters: None
ddp_bucket_cap_mb: None
ddp_broadcast_buffers: False
dataloader_pin_memory: True
dataloader_persistent_workers: False
skip_memory_metrics: True
use_legacy_prediction_loop: False
push_to_hub: False
resume_from_checkpoint: None
hub_model_id: None
hub_strategy: every_save
hub_private_repo: None
hub_always_push: False
gradient_checkpointing: False
gradient_checkpointing_kwargs: None
include_inputs_for_metrics: False
include_for_metrics: []
eval_do_concat_batches: True
fp16_backend: auto
push_to_hub_model_id: None
push_to_hub_organization: None
mp_parameters:
auto_find_batch_size: False
full_determinism: False
torchdynamo: None
ray_scope: last
ddp_timeout: 1800
torch_compile: False
torch_compile_backend: None
torch_compile_mode: None
include_tokens_per_second: False
include_num_input_tokens_seen: False
neftune_noise_alpha: None
optim_target_modules: None
batch_eval_metrics: False
eval_on_start: False
use_liger_kernel: False
eval_use_gather_object: False
average_tokens_across_devices: False
prompts: None
batch_sampler: no_duplicates
multi_dataset_batch_sampler: proportional
router_mapping: {}
learning_rate_mapping: {}
Training Logs
| Epoch |
Step |
Training Loss |
dim_1024_cosine_ndcg@10 |
dim_768_cosine_ndcg@10 |
dim_512_cosine_ndcg@10 |
dim_256_cosine_ndcg@10 |
dim_128_cosine_ndcg@10 |
dim_64_cosine_ndcg@10 |
| -1 |
-1 |
- |
0.4392 |
0.4292 |
0.4042 |
0.3789 |
0.2817 |
0.2030 |
| 0.2020 |
10 |
35.1992 |
- |
- |
- |
- |
- |
- |
| 0.4040 |
20 |
34.4091 |
- |
- |
- |
- |
- |
- |
| 0.6061 |
30 |
32.5592 |
- |
- |
- |
- |
- |
- |
| 0.8081 |
40 |
29.6977 |
- |
- |
- |
- |
- |
- |
| 1.0 |
50 |
26.32 |
0.4430 |
0.4484 |
0.4304 |
0.3995 |
0.3534 |
0.2942 |
| 1.2020 |
60 |
21.8779 |
- |
- |
- |
- |
- |
- |
| 1.4040 |
70 |
24.8794 |
- |
- |
- |
- |
- |
- |
| 1.6061 |
80 |
19.4179 |
- |
- |
- |
- |
- |
- |
| 1.8081 |
90 |
17.9807 |
- |
- |
- |
- |
- |
- |
| 2.0 |
100 |
16.9844 |
0.4632 |
0.4737 |
0.4369 |
0.4113 |
0.3995 |
0.3424 |
| 2.2020 |
110 |
15.8444 |
- |
- |
- |
- |
- |
- |
| 2.4040 |
120 |
13.7469 |
- |
- |
- |
- |
- |
- |
| 2.6061 |
130 |
13.8095 |
- |
- |
- |
- |
- |
- |
| 2.8081 |
140 |
12.2849 |
- |
- |
- |
- |
- |
- |
| 3.0 |
150 |
11.2766 |
0.4656 |
0.4623 |
0.4506 |
0.4313 |
0.4182 |
0.3665 |
| 3.2020 |
160 |
10.7749 |
- |
- |
- |
- |
- |
- |
| 3.4040 |
170 |
9.6846 |
- |
- |
- |
- |
- |
- |
| 3.6061 |
180 |
10.8248 |
- |
- |
- |
- |
- |
- |
| 3.8081 |
190 |
10.4771 |
- |
- |
- |
- |
- |
- |
| 4.0 |
200 |
11.4718 |
0.4836 |
0.4855 |
0.4826 |
0.4651 |
0.4384 |
0.3908 |
| 4.2020 |
210 |
9.7309 |
- |
- |
- |
- |
- |
- |
| 4.4040 |
220 |
8.4888 |
- |
- |
- |
- |
- |
- |
| 4.6061 |
230 |
8.3962 |
- |
- |
- |
- |
- |
- |
| 4.8081 |
240 |
8.339 |
- |
- |
- |
- |
- |
- |
| 5.0 |
250 |
9.5316 |
0.4754 |
0.4897 |
0.4746 |
0.4630 |
0.4509 |
0.4025 |
| 5.2020 |
260 |
7.867 |
- |
- |
- |
- |
- |
- |
| 5.4040 |
270 |
7.3297 |
- |
- |
- |
- |
- |
- |
| 5.6061 |
280 |
7.2536 |
- |
- |
- |
- |
- |
- |
| 5.8081 |
290 |
9.9241 |
- |
- |
- |
- |
- |
- |
| 6.0 |
300 |
8.4595 |
0.4897 |
0.4878 |
0.4686 |
0.4595 |
0.4586 |
0.3938 |
| 6.2020 |
310 |
7.4616 |
- |
- |
- |
- |
- |
- |
| 6.4040 |
320 |
7.3904 |
- |
- |
- |
- |
- |
- |
| 6.6061 |
330 |
7.4447 |
- |
- |
- |
- |
- |
- |
| 6.8081 |
340 |
6.7114 |
- |
- |
- |
- |
- |
- |
| 7.0 |
350 |
6.2379 |
0.4842 |
0.4935 |
0.4864 |
0.465 |
0.4754 |
0.406 |
| 7.2020 |
360 |
6.467 |
- |
- |
- |
- |
- |
- |
| 7.4040 |
370 |
6.9809 |
- |
- |
- |
- |
- |
- |
| 7.6061 |
380 |
6.348 |
- |
- |
- |
- |
- |
- |
| 7.8081 |
390 |
5.3424 |
- |
- |
- |
- |
- |
- |
| 8.0 |
400 |
6.6571 |
0.4966 |
0.4946 |
0.4791 |
0.4724 |
0.4675 |
0.3936 |
| 8.2020 |
410 |
7.0065 |
- |
- |
- |
- |
- |
- |
| 8.4040 |
420 |
6.5904 |
- |
- |
- |
- |
- |
- |
| 8.6061 |
430 |
5.5579 |
- |
- |
- |
- |
- |
- |
| 8.8081 |
440 |
4.7633 |
- |
- |
- |
- |
- |
- |
| 9.0 |
450 |
3.9055 |
0.4855 |
0.5003 |
0.4845 |
0.4846 |
0.4703 |
0.4108 |
| -1 |
-1 |
- |
0.4842 |
0.4935 |
0.4864 |
0.4650 |
0.4754 |
0.4060 |
- The bold row denotes the saved checkpoint.
Framework Versions
- Python: 3.12.11
- Sentence Transformers: 5.1.0
- Transformers: 4.51.3
- PyTorch: 2.8.0+cu126
- Accelerate: 1.10.1
- Datasets: 4.0.0
- Tokenizers: 0.21.4
Citation
BibTeX
Sentence Transformers
@inproceedings{reimers-2019-sentence-bert,
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
author = "Reimers, Nils and Gurevych, Iryna",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
month = "11",
year = "2019",
publisher = "Association for Computational Linguistics",
url = "https://arxiv.org/abs/1908.10084",
}
MatryoshkaLoss
@misc{kusupati2024matryoshka,
title={Matryoshka Representation Learning},
author={Aditya Kusupati and Gantavya Bhatt and Aniket Rege and Matthew Wallingford and Aditya Sinha and Vivek Ramanujan and William Howard-Snyder and Kaifeng Chen and Sham Kakade and Prateek Jain and Ali Farhadi},
year={2024},
eprint={2205.13147},
archivePrefix={arXiv},
primaryClass={cs.LG}
}
MultipleNegativesRankingLoss
@misc{henderson2017efficient,
title={Efficient Natural Language Response Suggestion for Smart Reply},
author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
year={2017},
eprint={1705.00652},
archivePrefix={arXiv},
primaryClass={cs.CL}
}