multilingual-e5-large
This is a sentence-transformers model finetuned from intfloat/multilingual-e5-large. It maps sentences & paragraphs to a 1024-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
Model Details
Model Description
- Model Type: Sentence Transformer
- Base model: intfloat/multilingual-e5-large
- Maximum Sequence Length: 512 tokens
- Output Dimensionality: 1024 dimensions
- Similarity Function: Cosine Similarity
- Language: en
- License: apache-2.0
Model Sources
Full Model Architecture
SentenceTransformer(
(0): Transformer({'max_seq_length': 512, 'do_lower_case': False, 'architecture': 'XLMRobertaModel'})
(1): Pooling({'word_embedding_dimension': 1024, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
(2): Normalize()
)
Usage
Direct Usage (Sentence Transformers)
First install the Sentence Transformers library:
pip install -U sentence-transformers
Then you can load this model and run inference.
from sentence_transformers import SentenceTransformer
model = SentenceTransformer("IoannisKat1/intfloat-multilingual-e5-large-new2")
sentences = [
'What can the contract be based on, besides individual contracts, in part according to this excerpt?',
"1.Where processing is to be carried out on behalf of a controller, the controller shall use only processors providing sufficient guarantees to implement appropriate technical and organisational measures in such a manner that processing will meet the requirements of this Regulation and ensure the protection of the rights of the data subject.\n2.The processor shall not engage another processor without prior specific or general written authorisation of the controller. In the case of general written authorisation, the processor shall inform the controller of any intended changes concerning the addition or replacement of other processors, thereby giving the controller the opportunity to object to such changes.\n3.Processing by a processor shall be governed by a contract or other legal act under Union or Member State law, that is binding on the processor with regard to the controller and that sets out the subject-matter and duration of the processing, the nature and purpose of the processing, the type of personal data and categories of data subjects and the obligations and rights of the controller. That contract or other legal act shall stipulate, in particular, that the processor: (a) processes the personal data only on documented instructions from the controller, including with regard to transfers of personal data to a third country or an international organisation, unless required to do so by Union or Member State law to which the processor is subject; in such a case, the processor shall inform the controller of that legal requirement before processing, unless that law prohibits such information on important grounds of public interest; (b) ensures that persons authorised to process the personal data have committed themselves to confidentiality or are under an appropriate statutory obligation of confidentiality; (c) takes all measures required pursuant to Article 32; (d) respects the conditions referred to in paragraphs 2 and 4 for engaging another processor; (e) taking into account the nature of the processing, assists the controller by appropriate technical and organisational measures, insofar as this is possible, for the fulfilment of the controller's obligation to respond to requests for exercising the data subject's rights laid down in Chapter III; (f) assists the controller in ensuring compliance with the obligations pursuant to Articles 32 to 36 taking into account the nature of processing and the information available to the processor; (g) at the choice of the controller, deletes or returns all the personal data to the controller after the end of the provision of services relating to processing, and deletes existing copies unless Union or Member State law requires storage of the personal data; (h) makes available to the controller all information necessary to demonstrate compliance with the obligations laid down in this Article and allow for and contribute to audits, including inspections, conducted by the controller or another auditor mandated by the controller. 4.5.2016 L 119/49 With regard to point (h) of the first subparagraph, the processor shall immediately inform the controller if, in its opinion, an instruction infringes this Regulation or other Union or Member State data protection provisions.\n4.Where a processor engages another processor for carrying out specific processing activities on behalf of the controller, the same data protection obligations as set out in the contract or other legal act between the controller and the processor as referred to in paragraph 3 shall be imposed on that other processor by way of a contract or other legal act under Union or Member State law, in particular providing sufficient guarantees to implement appropriate technical and organisational measures in such a manner that the processing will meet the requirements of this Regulation. Where that other processor fails to fulfil its data protection obligations, the initial processor shall remain fully liable to the controller for the performance of that other processor's obligations.\n5.Adherence of a processor to an approved code of conduct as referred to in Article 40 or an approved certification mechanism as referred to in Article 42 may be used as an element by which to demonstrate sufficient guarantees as referred to in paragraphs 1 and 4 of this Article.\n6.Without prejudice to an individual contract between the controller and the processor, the contract or the other legal act referred to in paragraphs 3 and 4 of this Article may be based, in whole or in part, on standard contractual clauses referred to in paragraphs 7 and 8 of this Article, including when they are part of a certification granted to the controller or processor pursuant to Articles 42 and 43\n7.The Commission may lay down standard contractual clauses for the matters referred to in paragraph 3 and 4 of this Article and in accordance with the examination procedure referred to in Article 93(2).\n8.A supervisory authority may adopt standard contractual clauses for the matters referred to in paragraph 3 and 4 of this Article and in accordance with the consistency mechanism referred to in Article 63\n9.The contract or the other legal act referred to in paragraphs 3 and 4 shall be in writing, including in electronic form.\n10.Without prejudice to Articles 82, 83 and 84, if a processor infringes this Regulation by determining the purposes and means of processing, the processor shall be considered to be a controller in respect of that processing.",
'1.The Board shall draw up an annual report regarding the protection of natural persons with regard to processing in the Union and, where relevant, in third countries and international organisations. The report shall be made public and be transmitted to the European Parliament, to the Council and to the Commission.\n2.The annual report shall include a review of the practical application of the guidelines, recommendations and best practices referred to in point (l) of Article 70(1) as well as of the binding decisions referred to in Article 65.',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
similarities = model.similarity(embeddings, embeddings)
print(similarities)
Evaluation
Metrics
Information Retrieval
| Metric |
Value |
| cosine_accuracy@1 |
0.3965 |
| cosine_accuracy@3 |
0.4419 |
| cosine_accuracy@5 |
0.4773 |
| cosine_accuracy@10 |
0.5303 |
| cosine_precision@1 |
0.3965 |
| cosine_precision@3 |
0.3864 |
| cosine_precision@5 |
0.3682 |
| cosine_precision@10 |
0.329 |
| cosine_recall@1 |
0.083 |
| cosine_recall@3 |
0.2081 |
| cosine_recall@5 |
0.2836 |
| cosine_recall@10 |
0.3941 |
| cosine_ndcg@10 |
0.4599 |
| cosine_mrr@10 |
0.4271 |
| cosine_map@100 |
0.5169 |
Information Retrieval
| Metric |
Value |
| cosine_accuracy@1 |
0.3939 |
| cosine_accuracy@3 |
0.4419 |
| cosine_accuracy@5 |
0.5025 |
| cosine_accuracy@10 |
0.5505 |
| cosine_precision@1 |
0.3939 |
| cosine_precision@3 |
0.3855 |
| cosine_precision@5 |
0.3763 |
| cosine_precision@10 |
0.3447 |
| cosine_recall@1 |
0.0817 |
| cosine_recall@3 |
0.2045 |
| cosine_recall@5 |
0.2833 |
| cosine_recall@10 |
0.4024 |
| cosine_ndcg@10 |
0.4717 |
| cosine_mrr@10 |
0.431 |
| cosine_map@100 |
0.5281 |
Information Retrieval
| Metric |
Value |
| cosine_accuracy@1 |
0.3788 |
| cosine_accuracy@3 |
0.4318 |
| cosine_accuracy@5 |
0.4848 |
| cosine_accuracy@10 |
0.5303 |
| cosine_precision@1 |
0.3788 |
| cosine_precision@3 |
0.3729 |
| cosine_precision@5 |
0.3606 |
| cosine_precision@10 |
0.326 |
| cosine_recall@1 |
0.0797 |
| cosine_recall@3 |
0.2026 |
| cosine_recall@5 |
0.2789 |
| cosine_recall@10 |
0.3943 |
| cosine_ndcg@10 |
0.4537 |
| cosine_mrr@10 |
0.4149 |
| cosine_map@100 |
0.5082 |
Information Retrieval
| Metric |
Value |
| cosine_accuracy@1 |
0.3712 |
| cosine_accuracy@3 |
0.399 |
| cosine_accuracy@5 |
0.4419 |
| cosine_accuracy@10 |
0.4949 |
| cosine_precision@1 |
0.3712 |
| cosine_precision@3 |
0.3552 |
| cosine_precision@5 |
0.3328 |
| cosine_precision@10 |
0.2995 |
| cosine_recall@1 |
0.0798 |
| cosine_recall@3 |
0.1976 |
| cosine_recall@5 |
0.26 |
| cosine_recall@10 |
0.3649 |
| cosine_ndcg@10 |
0.4262 |
| cosine_mrr@10 |
0.3972 |
| cosine_map@100 |
0.4892 |
Information Retrieval
| Metric |
Value |
| cosine_accuracy@1 |
0.3359 |
| cosine_accuracy@3 |
0.3763 |
| cosine_accuracy@5 |
0.4192 |
| cosine_accuracy@10 |
0.4798 |
| cosine_precision@1 |
0.3359 |
| cosine_precision@3 |
0.3266 |
| cosine_precision@5 |
0.3111 |
| cosine_precision@10 |
0.2803 |
| cosine_recall@1 |
0.0742 |
| cosine_recall@3 |
0.1853 |
| cosine_recall@5 |
0.2527 |
| cosine_recall@10 |
0.3572 |
| cosine_ndcg@10 |
0.4004 |
| cosine_mrr@10 |
0.3675 |
| cosine_map@100 |
0.4576 |
Information Retrieval
| Metric |
Value |
| cosine_accuracy@1 |
0.2778 |
| cosine_accuracy@3 |
0.3157 |
| cosine_accuracy@5 |
0.3434 |
| cosine_accuracy@10 |
0.3864 |
| cosine_precision@1 |
0.2778 |
| cosine_precision@3 |
0.2685 |
| cosine_precision@5 |
0.253 |
| cosine_precision@10 |
0.2222 |
| cosine_recall@1 |
0.0651 |
| cosine_recall@3 |
0.1629 |
| cosine_recall@5 |
0.2204 |
| cosine_recall@10 |
0.3079 |
| cosine_ndcg@10 |
0.3306 |
| cosine_mrr@10 |
0.303 |
| cosine_map@100 |
0.3986 |
Training Details
Training Dataset
Unnamed Dataset
- Size: 1,580 training samples
- Columns:
anchor and positive
- Approximate statistics based on the first 1000 samples:
|
anchor |
positive |
| type |
string |
string |
| details |
- min: 7 tokens
- mean: 17.21 tokens
- max: 43 tokens
|
- min: 27 tokens
- mean: 373.71 tokens
- max: 512 tokens
|
- Samples:
| anchor |
positive |
What measures should each supervisory authority take to facilitate the submission of complaints? |
Every data subject should have the right to lodge a complaint with a single supervisory authority, in particular in the Member State of his or her habitual residence, and the right to an effective judicial remedy in accordance 4.5.2016 L 119/25 Official Journal of the European Union EN with Article 47 of the Charter if the data subject considers that his or her rights under this Regulation are infringed or where the supervisory authority does not act on a complaint, partially or wholly rejects or dismisses a complaint or does not act where such action is necessary to protect the rights of the data subject. The investigation following a complaint should be carried out, subject to judicial review, to the extent that is appropriate in the specific case. The supervisory authority should inform the data subject of the progress and the outcome of the complaint within a reasonable period. If the case requires further investigation or coordination with another supervisory authority, intermed... |
What did the evidence not indicate? |
Court (Civil/Criminal): Civil Provisions: Time of commission of the act: Outcome (not guilty, guilty): Reasoning: Claim for compensation and monetary satisfaction due to moral damage against a mobile phone company and a credit institution within the framework of inadequate fulfillment of a payment services contract for "web banking." Appropriate actions for mobile phone companies in case of a request for a "sim" card replacement due to wear or loss. They must verify the customer's identity based on the personal details and identification information registered in their system but are not liable for any changes in the latter that were not timely communicated to them. Further security measures such as phone communication or sending an SMS to the mobile number holder are not required. Payment services under Law 4357/2018. Obligation of the payment service provider, such as banks, to inform the payer after receiving a relevant order for making a payment. The con... |
What was the amount transferred from her account? |
Court (Civil/Criminal): Criminal Provisions: Article 42 paragraphs 1, 2, 3, and 7 of Law 4557/2018 Time of commission of the act: Outcome (not guilty, guilty): Reasoning: Obligation of the payment service provider, such as banks, to inform their contracting customer after receiving a relevant order for a payment to be made on their behalf. Content of the above notification at the stage of receiving the payment order and during its execution. Terms of liability for the provider regarding compensation for non-execution, erroneous, or delayed execution of payment transactions. In particular, in the case of an unauthorized or erroneous payment, the user is required to notify the provider within a specified timeframe as soon as they become aware of the corresponding transaction. The provisions of Law 4357/2018 establish mandatory legal regulations in favor of users of payment services and cannot be contractually modified to their detriment, but only to their benefit. Defenses availa... |
- Loss:
MatryoshkaLoss with these parameters:{
"loss": "MultipleNegativesRankingLoss",
"matryoshka_dims": [
1024,
768,
512,
256,
128,
64
],
"matryoshka_weights": [
1,
1,
1,
1,
1,
1
],
"n_dims_per_step": -1
}
Training Hyperparameters
Non-Default Hyperparameters
eval_strategy: epoch
per_device_train_batch_size: 2
gradient_accumulation_steps: 8
learning_rate: 2e-05
num_train_epochs: 12
lr_scheduler_type: cosine
warmup_ratio: 0.1
bf16: True
tf32: True
load_best_model_at_end: True
optim: adamw_torch_fused
batch_sampler: no_duplicates
All Hyperparameters
Click to expand
overwrite_output_dir: False
do_predict: False
eval_strategy: epoch
prediction_loss_only: True
per_device_train_batch_size: 2
per_device_eval_batch_size: 8
per_gpu_train_batch_size: None
per_gpu_eval_batch_size: None
gradient_accumulation_steps: 8
eval_accumulation_steps: None
torch_empty_cache_steps: None
learning_rate: 2e-05
weight_decay: 0.0
adam_beta1: 0.9
adam_beta2: 0.999
adam_epsilon: 1e-08
max_grad_norm: 1.0
num_train_epochs: 12
max_steps: -1
lr_scheduler_type: cosine
lr_scheduler_kwargs: {}
warmup_ratio: 0.1
warmup_steps: 0
log_level: passive
log_level_replica: warning
log_on_each_node: True
logging_nan_inf_filter: True
save_safetensors: True
save_on_each_node: False
save_only_model: False
restore_callback_states_from_checkpoint: False
no_cuda: False
use_cpu: False
use_mps_device: False
seed: 42
data_seed: None
jit_mode_eval: False
use_ipex: False
bf16: True
fp16: False
fp16_opt_level: O1
half_precision_backend: auto
bf16_full_eval: False
fp16_full_eval: False
tf32: True
local_rank: 0
ddp_backend: None
tpu_num_cores: None
tpu_metrics_debug: False
debug: []
dataloader_drop_last: False
dataloader_num_workers: 0
dataloader_prefetch_factor: None
past_index: -1
disable_tqdm: False
remove_unused_columns: True
label_names: None
load_best_model_at_end: True
ignore_data_skip: False
fsdp: []
fsdp_min_num_params: 0
fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
tp_size: 0
fsdp_transformer_layer_cls_to_wrap: None
accelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
deepspeed: None
label_smoothing_factor: 0.0
optim: adamw_torch_fused
optim_args: None
adafactor: False
group_by_length: False
length_column_name: length
ddp_find_unused_parameters: None
ddp_bucket_cap_mb: None
ddp_broadcast_buffers: False
dataloader_pin_memory: True
dataloader_persistent_workers: False
skip_memory_metrics: True
use_legacy_prediction_loop: False
push_to_hub: False
resume_from_checkpoint: None
hub_model_id: None
hub_strategy: every_save
hub_private_repo: None
hub_always_push: False
gradient_checkpointing: False
gradient_checkpointing_kwargs: None
include_inputs_for_metrics: False
include_for_metrics: []
eval_do_concat_batches: True
fp16_backend: auto
push_to_hub_model_id: None
push_to_hub_organization: None
mp_parameters:
auto_find_batch_size: False
full_determinism: False
torchdynamo: None
ray_scope: last
ddp_timeout: 1800
torch_compile: False
torch_compile_backend: None
torch_compile_mode: None
include_tokens_per_second: False
include_num_input_tokens_seen: False
neftune_noise_alpha: None
optim_target_modules: None
batch_eval_metrics: False
eval_on_start: False
use_liger_kernel: False
eval_use_gather_object: False
average_tokens_across_devices: False
prompts: None
batch_sampler: no_duplicates
multi_dataset_batch_sampler: proportional
router_mapping: {}
learning_rate_mapping: {}
Training Logs
Click to expand
| Epoch |
Step |
Training Loss |
dim_1024_cosine_ndcg@10 |
dim_768_cosine_ndcg@10 |
dim_512_cosine_ndcg@10 |
dim_256_cosine_ndcg@10 |
dim_128_cosine_ndcg@10 |
dim_64_cosine_ndcg@10 |
| -1 |
-1 |
- |
0.4214 |
0.4041 |
0.3894 |
0.3254 |
0.2396 |
0.1752 |
| 0.1013 |
10 |
20.5156 |
- |
- |
- |
- |
- |
- |
| 0.2025 |
20 |
19.5068 |
- |
- |
- |
- |
- |
- |
| 0.3038 |
30 |
17.3704 |
- |
- |
- |
- |
- |
- |
| 0.4051 |
40 |
17.1827 |
- |
- |
- |
- |
- |
- |
| 0.5063 |
50 |
16.6068 |
- |
- |
- |
- |
- |
- |
| 0.6076 |
60 |
16.4217 |
- |
- |
- |
- |
- |
- |
| 0.7089 |
70 |
15.5364 |
- |
- |
- |
- |
- |
- |
| 0.8101 |
80 |
13.3384 |
- |
- |
- |
- |
- |
- |
| 0.9114 |
90 |
15.6398 |
- |
- |
- |
- |
- |
- |
| 0.9924 |
98 |
- |
0.4681 |
0.4793 |
0.4615 |
0.4166 |
0.3638 |
0.2744 |
| 1.0203 |
100 |
14.2832 |
- |
- |
- |
- |
- |
- |
| 1.1215 |
110 |
10.0518 |
- |
- |
- |
- |
- |
- |
| 1.2228 |
120 |
10.3808 |
- |
- |
- |
- |
- |
- |
| 1.3241 |
130 |
10.9265 |
- |
- |
- |
- |
- |
- |
| 1.4253 |
140 |
10.2787 |
- |
- |
- |
- |
- |
- |
| 1.5266 |
150 |
10.9999 |
- |
- |
- |
- |
- |
- |
| 1.6278 |
160 |
6.8139 |
- |
- |
- |
- |
- |
- |
| 1.7291 |
170 |
7.986 |
- |
- |
- |
- |
- |
- |
| 1.8304 |
180 |
9.2866 |
- |
- |
- |
- |
- |
- |
| 1.9316 |
190 |
9.2912 |
- |
- |
- |
- |
- |
- |
| 1.9924 |
196 |
- |
0.4772 |
0.4612 |
0.4645 |
0.3945 |
0.3636 |
0.2986 |
| 2.0405 |
200 |
9.9778 |
- |
- |
- |
- |
- |
- |
| 2.1418 |
210 |
7.8425 |
- |
- |
- |
- |
- |
- |
| 2.2430 |
220 |
7.7307 |
- |
- |
- |
- |
- |
- |
| 2.3443 |
230 |
6.6603 |
- |
- |
- |
- |
- |
- |
| 2.4456 |
240 |
5.8628 |
- |
- |
- |
- |
- |
- |
| 2.5468 |
250 |
7.5488 |
- |
- |
- |
- |
- |
- |
| 2.6481 |
260 |
8.5646 |
- |
- |
- |
- |
- |
- |
| 2.7494 |
270 |
7.7542 |
- |
- |
- |
- |
- |
- |
| 2.8506 |
280 |
6.046 |
- |
- |
- |
- |
- |
- |
| 2.9519 |
290 |
4.2612 |
- |
- |
- |
- |
- |
- |
| 2.9924 |
294 |
- |
0.4663 |
0.4403 |
0.4505 |
0.4067 |
0.3673 |
0.3267 |
| 3.0608 |
300 |
4.7943 |
- |
- |
- |
- |
- |
- |
| 3.1620 |
310 |
7.1236 |
- |
- |
- |
- |
- |
- |
| 3.2633 |
320 |
7.8359 |
- |
- |
- |
- |
- |
- |
| 3.3646 |
330 |
7.2883 |
- |
- |
- |
- |
- |
- |
| 3.4658 |
340 |
6.8383 |
- |
- |
- |
- |
- |
- |
| 3.5671 |
350 |
6.1145 |
- |
- |
- |
- |
- |
- |
| 3.6684 |
360 |
5.8697 |
- |
- |
- |
- |
- |
- |
| 3.7696 |
370 |
5.3551 |
- |
- |
- |
- |
- |
- |
| 3.8709 |
380 |
7.7562 |
- |
- |
- |
- |
- |
- |
| 3.9722 |
390 |
4.1286 |
- |
- |
- |
- |
- |
- |
| 3.9924 |
392 |
- |
0.5004 |
0.4837 |
0.4654 |
0.4095 |
0.3771 |
0.3238 |
| 4.0810 |
400 |
6.6456 |
- |
- |
- |
- |
- |
- |
| 4.1823 |
410 |
7.8539 |
- |
- |
- |
- |
- |
- |
| 4.2835 |
420 |
5.2917 |
- |
- |
- |
- |
- |
- |
| 4.3848 |
430 |
5.5573 |
- |
- |
- |
- |
- |
- |
| 4.4861 |
440 |
6.957 |
- |
- |
- |
- |
- |
- |
| 4.5873 |
450 |
6.3068 |
- |
- |
- |
- |
- |
- |
| 4.6886 |
460 |
6.0006 |
- |
- |
- |
- |
- |
- |
| 4.7899 |
470 |
6.1419 |
- |
- |
- |
- |
- |
- |
| 4.8911 |
480 |
5.0808 |
- |
- |
- |
- |
- |
- |
| 4.9924 |
490 |
6.0219 |
0.4752 |
0.4754 |
0.4581 |
0.4243 |
0.3931 |
0.3410 |
| 5.1013 |
500 |
3.7305 |
- |
- |
- |
- |
- |
- |
| 5.2025 |
510 |
4.827 |
- |
- |
- |
- |
- |
- |
| 5.3038 |
520 |
3.1179 |
- |
- |
- |
- |
- |
- |
| 5.4051 |
530 |
6.141 |
- |
- |
- |
- |
- |
- |
| 5.5063 |
540 |
6.3686 |
- |
- |
- |
- |
- |
- |
| 5.6076 |
550 |
4.9029 |
- |
- |
- |
- |
- |
- |
| 5.7089 |
560 |
3.6987 |
- |
- |
- |
- |
- |
- |
| 5.8101 |
570 |
5.5046 |
- |
- |
- |
- |
- |
- |
| 5.9114 |
580 |
5.0166 |
- |
- |
- |
- |
- |
- |
| 5.9924 |
588 |
- |
0.4737 |
0.4748 |
0.4567 |
0.4185 |
0.3890 |
0.3435 |
| 6.0203 |
590 |
3.9625 |
- |
- |
- |
- |
- |
- |
| 6.1215 |
600 |
6.7869 |
- |
- |
- |
- |
- |
- |
| 6.2228 |
610 |
3.6329 |
- |
- |
- |
- |
- |
- |
| 6.3241 |
620 |
6.2702 |
- |
- |
- |
- |
- |
- |
| 6.4253 |
630 |
3.3559 |
- |
- |
- |
- |
- |
- |
| 6.5266 |
640 |
4.0666 |
- |
- |
- |
- |
- |
- |
| 6.6278 |
650 |
3.5322 |
- |
- |
- |
- |
- |
- |
| 6.7291 |
660 |
4.8831 |
- |
- |
- |
- |
- |
- |
| 6.8304 |
670 |
6.6302 |
- |
- |
- |
- |
- |
- |
| 6.9316 |
680 |
5.7623 |
- |
- |
- |
- |
- |
- |
| 6.9924 |
686 |
- |
0.4687 |
0.4713 |
0.4520 |
0.4194 |
0.3950 |
0.3338 |
| 7.0405 |
690 |
5.5453 |
- |
- |
- |
- |
- |
- |
| 7.1418 |
700 |
2.8097 |
- |
- |
- |
- |
- |
- |
| 7.2430 |
710 |
3.5171 |
- |
- |
- |
- |
- |
- |
| 7.3443 |
720 |
3.5449 |
- |
- |
- |
- |
- |
- |
| 7.4456 |
730 |
4.6169 |
- |
- |
- |
- |
- |
- |
| 7.5468 |
740 |
3.567 |
- |
- |
- |
- |
- |
- |
| 7.6481 |
750 |
5.7251 |
- |
- |
- |
- |
- |
- |
| 7.7494 |
760 |
3.7201 |
- |
- |
- |
- |
- |
- |
| 7.8506 |
770 |
3.1051 |
- |
- |
- |
- |
- |
- |
| 7.9519 |
780 |
3.9642 |
- |
- |
- |
- |
- |
- |
| 7.9924 |
784 |
- |
0.4599 |
0.4717 |
0.4537 |
0.4262 |
0.4004 |
0.3306 |
| 8.0608 |
790 |
3.923 |
- |
- |
- |
- |
- |
- |
| 8.1620 |
800 |
3.52 |
- |
- |
- |
- |
- |
- |
| 8.2633 |
810 |
3.1567 |
- |
- |
- |
- |
- |
- |
| 8.3646 |
820 |
6.1725 |
- |
- |
- |
- |
- |
- |
| 8.4658 |
830 |
3.259 |
- |
- |
- |
- |
- |
- |
| 8.5671 |
840 |
6.6232 |
- |
- |
- |
- |
- |
- |
| 8.6684 |
850 |
3.7085 |
- |
- |
- |
- |
- |
- |
| 8.7696 |
860 |
4.0311 |
- |
- |
- |
- |
- |
- |
| 8.8709 |
870 |
7.2503 |
- |
- |
- |
- |
- |
- |
| 8.9722 |
880 |
2.2984 |
- |
- |
- |
- |
- |
- |
| 8.9924 |
882 |
- |
0.4632 |
0.4752 |
0.4550 |
0.4247 |
0.3953 |
0.3281 |
| 9.0810 |
890 |
4.519 |
- |
- |
- |
- |
- |
- |
| 9.1823 |
900 |
2.99 |
- |
- |
- |
- |
- |
- |
| 9.2835 |
910 |
5.3026 |
- |
- |
- |
- |
- |
- |
| 9.3848 |
920 |
3.8492 |
- |
- |
- |
- |
- |
- |
| 9.4861 |
930 |
1.9454 |
- |
- |
- |
- |
- |
- |
| 9.5873 |
940 |
3.538 |
- |
- |
- |
- |
- |
- |
| 9.6886 |
950 |
4.1874 |
- |
- |
- |
- |
- |
- |
| 9.7899 |
960 |
4.2356 |
- |
- |
- |
- |
- |
- |
| 9.8911 |
970 |
4.5356 |
- |
- |
- |
- |
- |
- |
| 9.9924 |
980 |
4.0243 |
0.4665 |
0.4681 |
0.4507 |
0.4236 |
0.3937 |
0.3288 |
| -1 |
-1 |
- |
0.4599 |
0.4717 |
0.4537 |
0.4262 |
0.4004 |
0.3306 |
- The bold row denotes the saved checkpoint.
Framework Versions
- Python: 3.12.11
- Sentence Transformers: 5.1.0
- Transformers: 4.51.3
- PyTorch: 2.8.0+cu126
- Accelerate: 1.10.1
- Datasets: 4.0.0
- Tokenizers: 0.21.4
Citation
BibTeX
Sentence Transformers
@inproceedings{reimers-2019-sentence-bert,
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
author = "Reimers, Nils and Gurevych, Iryna",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
month = "11",
year = "2019",
publisher = "Association for Computational Linguistics",
url = "https://arxiv.org/abs/1908.10084",
}
MatryoshkaLoss
@misc{kusupati2024matryoshka,
title={Matryoshka Representation Learning},
author={Aditya Kusupati and Gantavya Bhatt and Aniket Rege and Matthew Wallingford and Aditya Sinha and Vivek Ramanujan and William Howard-Snyder and Kaifeng Chen and Sham Kakade and Prateek Jain and Ali Farhadi},
year={2024},
eprint={2205.13147},
archivePrefix={arXiv},
primaryClass={cs.LG}
}
MultipleNegativesRankingLoss
@misc{henderson2017efficient,
title={Efficient Natural Language Response Suggestion for Smart Reply},
author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
year={2017},
eprint={1705.00652},
archivePrefix={arXiv},
primaryClass={cs.CL}
}