Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks
Paper • 1908.10084 • Published • 13
How to use sahithkumar7/final-mpnet-base-peft with sentence-transformers:
from sentence_transformers import SentenceTransformer
model = SentenceTransformer("sahithkumar7/final-mpnet-base-peft")
sentences = [
"What is the department of medicine located at?",
"Publisher’s Note: MDPI stays neutral\nwith regard to jurisdictional claims in\npublished maps and institutional afil-\n\niations.\n\nonon)\n\nCopyright: © 2021 by the author.\nLicensee MDPI, Basel, Switzerland.\nThis article is an open access article\ndistributed under the terms and\nconditions of the Creative Commons\nAttribution (CC BY) license (https://\ncreativecommons.org/licenses/by/\n4.0/).\n\nJoan and Sanford I. Weill Department of Medicine, Weill Cornell Medical College, 525 East 68th Street,\nRoom M-522, Box 130, New York, NY 10065, USA; str2020@med.cornell.edu or Stefan.Ryter@proterris.com",
"Results At the parameters used, the ultrasound did not directly affect bCSC proliferation, with no evident changes in\nmorphology. In contrast, the ultrasound protocol affected the migration and invasion ability of bCSCs, limiting their\ncapacity to advance while a major affection was detected on their angiogenic properties. LIPUS-treated bCSCs were\nunable to transform into aggressive metastatic cancer cells, by decreasing their migration and invasion capacity as\nwell as vessel formation. Finally, RNA-seq analysis revealed major changes in gene expression, with 676 differentially",
"Tesfaye, M. & Savoldo, B. Adoptive cell therapy in\ntreating pediatric solid tumors. Curr. Oncol. Rep. 20,\n73 (2018).\n\nMarofi, F. et al. CAR T cells in solid tumors: challenges\nand opportunities. Stem Cell Res. Ther. 12, 81 (2021).\nDeng, Q. et al. Characteristics of anti-CD19 CAR T cell\ninfusion products associated with efficacy and toxicity\n\nin patients with large B cell lymphomas. Nat. Med. 26,\n\n1878-1887 (2020).\nBoulch, M. A cross-talk between CAR T cell subsets\nand the tumor microenvironment is essential for\nsustained cytotoxic activity. Sci. Immunol. 6,\neabd4344 (2021)."
]
embeddings = model.encode(sentences)
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [4, 4]This is a sentence-transformers model finetuned from microsoft/mpnet-base on the json dataset. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
SentenceTransformer(
(0): Transformer({'max_seq_length': 512, 'do_lower_case': False, 'architecture': 'PeftModelForFeatureExtraction'})
(1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
)
First install the Sentence Transformers library:
pip install -U sentence-transformers
Then you can load this model and run inference.
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("sahithkumar7/final-mpnet-base-peft")
# Run inference
sentences = [
'What is the number of genes obtained from comparing control and LIPUS-stimulated samples?',
'Differentially expressed genes (DEGs) were obtained\nbetween control and LIPUS-stimulated samples using\nan adjusted P<0.05 and|log2FC| > 1 as cutoffs to define\nstatistically significant differential expression. 676 genes\nwere obtained from which 578 were upregulated when\nstimulated with LIPUS and 98 genes were subregulated\n(Supp. Figure 1). To further understand the functions\nand pathways associated with the differentially expressed\ngenes (DEG), Gene Ontology (GO) and Kyoto Encyclo-\npedia of Genes and Genomes (KEGG) analyses were con-\nducted using the DAVID database [37, 38].',
'independent studies have shown a raising trend in both cancer incidence [2] and a high-salt\ndietary lifestyle [7], there is no direct correlation between dietary salt intake and breast\ncancer. Interestingly, in the human body, certain organs such as the skin and lymph nodes\nhave a natural tendency to accumulate salt [8]. Although unknown, the pathophysiological\nsignificance of this selective accumulation of sodium in certain organs and solid tumors is\nan area of intense research.',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 768]
# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities)
# tensor([[1.0000, 0.9165, 0.8393],
# [0.9165, 1.0000, 0.9040],
# [0.8393, 0.9040, 1.0000]])
initial_test and final_testTripletEvaluator| Metric | initial_test | final_test |
|---|---|---|
| cosine_accuracy | 0.84 | 0.84 |
anchor, positive, and negative| anchor | positive | negative | |
|---|---|---|---|
| type | string | string | string |
| details |
|
|
|
| anchor | positive | negative |
|---|---|---|
What is the limitation of FBG-based sensors in tactile feedback? |
Furthermore, FBG-based 3-axis tactile sensors have been |
141]. Therefore, it is not known to what extent spared |
What are the advantages of strain elastography? |
frontiersin.org |
Publisher’s Note: MDPI stays neutral |
What is the material used for the substrate in a piezoelectric element? |
gain for biomedical applications. |
Histopatholo |
MultipleNegativesRankingLoss with these parameters:{
"scale": 20.0,
"similarity_fct": "cos_sim"
}
anchor, positive, and negative| anchor | positive | negative | |
|---|---|---|---|
| type | string | string | string |
| details |
|
|
|
| anchor | positive | negative |
|---|---|---|
What can differentiate into a very wide variety of tissues? |
lead to decreased rates of graft-versus-host disease. They |
metabolic regulation may affect the function of more than one organelle. Therefore, if the |
What are the two most common types of pluripotent stem cells? |
III]. AMNIOTIC CELLS AS A SOURCE FOR STEM |
Explanation: criterion 6 indicates a positive diagnosis only within the DC VI group |
What percentage of stem cells are present in bone marrow? |
ing 30% in some tissues.43-45 This is a significant difference |
migration of bCSCs. This finding raises the possibil- |
MultipleNegativesRankingLoss with these parameters:{
"scale": 20.0,
"similarity_fct": "cos_sim"
}
eval_strategy: stepsper_device_train_batch_size: 16per_device_eval_batch_size: 16warmup_ratio: 0.1fp16: Truebatch_sampler: no_duplicatesoverwrite_output_dir: Falsedo_predict: Falseeval_strategy: stepsprediction_loss_only: Trueper_device_train_batch_size: 16per_device_eval_batch_size: 16per_gpu_train_batch_size: Noneper_gpu_eval_batch_size: Nonegradient_accumulation_steps: 1eval_accumulation_steps: Nonetorch_empty_cache_steps: Nonelearning_rate: 5e-05weight_decay: 0.0adam_beta1: 0.9adam_beta2: 0.999adam_epsilon: 1e-08max_grad_norm: 1.0num_train_epochs: 3max_steps: -1lr_scheduler_type: linearlr_scheduler_kwargs: {}warmup_ratio: 0.1warmup_steps: 0log_level: passivelog_level_replica: warninglog_on_each_node: Truelogging_nan_inf_filter: Truesave_safetensors: Truesave_on_each_node: Falsesave_only_model: Falserestore_callback_states_from_checkpoint: Falseno_cuda: Falseuse_cpu: Falseuse_mps_device: Falseseed: 42data_seed: Nonejit_mode_eval: Falseuse_ipex: Falsebf16: Falsefp16: Truefp16_opt_level: O1half_precision_backend: autobf16_full_eval: Falsefp16_full_eval: Falsetf32: Nonelocal_rank: 0ddp_backend: Nonetpu_num_cores: Nonetpu_metrics_debug: Falsedebug: []dataloader_drop_last: Falsedataloader_num_workers: 0dataloader_prefetch_factor: Nonepast_index: -1disable_tqdm: Falseremove_unused_columns: Truelabel_names: Noneload_best_model_at_end: Falseignore_data_skip: Falsefsdp: []fsdp_min_num_params: 0fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}tp_size: 0fsdp_transformer_layer_cls_to_wrap: Noneaccelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}deepspeed: Nonelabel_smoothing_factor: 0.0optim: adamw_torchoptim_args: Noneadafactor: Falsegroup_by_length: Falselength_column_name: lengthddp_find_unused_parameters: Noneddp_bucket_cap_mb: Noneddp_broadcast_buffers: Falsedataloader_pin_memory: Truedataloader_persistent_workers: Falseskip_memory_metrics: Trueuse_legacy_prediction_loop: Falsepush_to_hub: Falseresume_from_checkpoint: Nonehub_model_id: Nonehub_strategy: every_savehub_private_repo: Nonehub_always_push: Falsegradient_checkpointing: Falsegradient_checkpointing_kwargs: Noneinclude_inputs_for_metrics: Falseinclude_for_metrics: []eval_do_concat_batches: Truefp16_backend: autopush_to_hub_model_id: Nonepush_to_hub_organization: Nonemp_parameters: auto_find_batch_size: Falsefull_determinism: Falsetorchdynamo: Noneray_scope: lastddp_timeout: 1800torch_compile: Falsetorch_compile_backend: Nonetorch_compile_mode: Noneinclude_tokens_per_second: Falseinclude_num_input_tokens_seen: Falseneftune_noise_alpha: Noneoptim_target_modules: Nonebatch_eval_metrics: Falseeval_on_start: Falseuse_liger_kernel: Falseeval_use_gather_object: Falseaverage_tokens_across_devices: Falseprompts: Nonebatch_sampler: no_duplicatesmulti_dataset_batch_sampler: proportionalrouter_mapping: {}learning_rate_mapping: {}| Epoch | Step | Training Loss | Validation Loss | initial_test_cosine_accuracy | final_test_cosine_accuracy |
|---|---|---|---|---|---|
| -1 | -1 | - | - | 0.7800 | - |
| 0.4 | 20 | 3.1259 | 3.0317 | 0.7800 | - |
| 0.8 | 40 | 3.0559 | 2.9474 | 0.7400 | - |
| 1.2 | 60 | 3.0016 | 2.8108 | 0.7800 | - |
| 1.6 | 80 | 2.8156 | 2.6489 | 0.7800 | - |
| 2.0 | 100 | 2.6108 | 2.4933 | 0.7800 | - |
| 2.4 | 120 | 2.5426 | 2.3866 | 0.8200 | - |
| 2.8 | 140 | 2.4371 | 2.3262 | 0.8400 | - |
| -1 | -1 | - | - | - | 0.8400 |
@inproceedings{reimers-2019-sentence-bert,
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
author = "Reimers, Nils and Gurevych, Iryna",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
month = "11",
year = "2019",
publisher = "Association for Computational Linguistics",
url = "https://arxiv.org/abs/1908.10084",
}
@misc{henderson2017efficient,
title={Efficient Natural Language Response Suggestion for Smart Reply},
author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
year={2017},
eprint={1705.00652},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
Base model
microsoft/mpnet-base