Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks
Paper • 1908.10084 • Published • 13
How to use Yash911/e5-finetuned with sentence-transformers:
from sentence_transformers import SentenceTransformer
model = SentenceTransformer("Yash911/e5-finetuned")
sentences = [
"AVS Video Editor AVS Video Editor is a video editing software published by Online Media Technologies Ltd. It is a part of AVS4YOU software suite which includes video, audio, image editing and conversion, disc editing and burning, document conversion and registry cleaner programs. It offers the opportunity to create and edit videos with a vast variety of video and audio effects, text and transitions; capture video from screen, web or DV cameras and VHS tape; record voice; create menus for discs, as well as to save them to plenty of video file formats, burn to discs or publish on Facebook, YouTube, Flickr, etc. 0",
"Adobe Premiere Rush is a video editing software that allows users to create and edit videos for social media platforms such as YouTube, Instagram, and Facebook. It is designed for quick and easy editing and can be accessed across multiple devices. The software provides basic editing tools like trimming, cutting, transitions, animations, and sound optimization. Its user-friendly interface makes it suitable for beginners and requires minimal experience with video editing.",
"H.245 is a protocol used for control signaling in videoconferencing systems. It is responsible for negotiating the capabilities and settings of the system during a call, such as video quality, audio codecs, and call setup. Understanding H.245 requires specialized knowledge of videoconferencing technologies and protocols.",
"Coeliac disease is a long-term autoimmune disorder, primarily affecting the small intestine, where individuals develop intolerance to gluten, present in foods such as wheat, rye and barley. Classic symptoms include gastrointestinal problems such as chronic diarrhoea, abdominal distention, malabsorption, loss of appetite, and among children failure to grow normally. This often begins between six months and two years of age. Non-classic symptoms are more common, especially in people older than two years. There may be mild or absent gastrointestinal symptoms, a wide number of symptoms involving any part of the body, or no obvious symptoms. Coeliac disease was first described in childhood; however, it may develop at any age. It is associated with other autoimmune diseases, such as Type 1 diabetes mellitus and Hashimoto's thyroiditis, among others."
]
embeddings = model.encode(sentences)
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [4, 4]This is a sentence-transformers model finetuned from Yash911/e5-finetuned. It maps sentences & paragraphs to a 1024-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
SentenceTransformer(
(0): Transformer({'max_seq_length': 512, 'do_lower_case': False, 'architecture': 'BertModel'})
(1): Pooling({'word_embedding_dimension': 1024, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
(2): Normalize()
)
First install the Sentence Transformers library:
pip install -U sentence-transformers
Then you can load this model and run inference.
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("Yash911/e5-finetuned")
# Run inference
sentences = [
'AWS User Pools AWS User Pools is a fully managed user directory service that allows application developers to easily add registration and login functionality to their apps. It provides features such as multi-factor authentication, password policies, social sign-in, and customizable email templates. AWS User Pools allows developers to focus on building their applications while providing secure and scalable user authentication and authorization. 0',
'Oracle Waveset is an identity management system developed by Oracle Corporation. It provides a centralized platform for managing user identities and access to resources within an organization. It helps to streamline and automate the process of user provisioning, de-provisioning, and managing access privileges. Oracle Waveset also supports password management, authentication, and integration with various authentication systems, such as LDAP and Active Directory. It is commonly used in large-scale enterprises and organizations that require strict access control and compliance with regulatory requirements.',
'Gastritis is inflammation of the lining of the stomach. It may occur as a short episode or may be of a long duration. There may be no symptoms but, when symptoms are present, the most common is upper abdominal pain. Other possible symptoms include nausea and vomiting, bloating, loss of appetite and heartburn. Complications may include stomach bleeding, stomach ulcers, and stomach tumors. When due to autoimmune problems, low red blood cells due to not enough vitamin B12 may occur, a condition known as pernicious anemia.',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 1024]
# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities)
# tensor([[ 1.0000, 0.6894, -0.0088],
# [ 0.6894, 1.0000, -0.0543],
# [-0.0088, -0.0543, 1.0000]])
sentence_0, sentence_1, sentence_2, and label| sentence_0 | sentence_1 | sentence_2 | label | |
|---|---|---|---|---|
| type | string | string | string | int |
| details |
|
|
|
|
| sentence_0 | sentence_1 | sentence_2 | label |
|---|---|---|---|
Burroughs MCP The MCP is the proprietary operating system of the Burroughs small, medium and large systems, including the Unisys Clearpath/MCP systems. 0 |
The MCP is the proprietary operating system of the Burroughs small, medium and large systems, including the Unisys Clearpath/MCP systems. |
Yellow fever is a viral disease of typically short duration. In most cases, symptoms include fever, chills, loss of appetite, nausea, muscle pains particularly in the back, and headaches. Symptoms typically improve within five days. In about 15% of people, within a day of improving the fever comes back, abdominal pain occurs, and liver damage begins causing yellow skin. If this occurs, the risk of bleeding and kidney problems is increased. |
1 |
Burroughs MCP The MCP is the proprietary operating system of the Burroughs small, medium and large systems, including the Unisys Clearpath/MCP systems. 0 |
The MCP is the proprietary operating system of the Burroughs small, medium and large systems, including the Unisys Clearpath/MCP systems. |
A wax sculpture is a depiction made using a waxy substance. Often these are effigies, usually of a notable individual, but there are also death masks and scenes with many figures, mostly in relief. |
1 |
Burroughs MCP The MCP is the proprietary operating system of the Burroughs small, medium and large systems, including the Unisys Clearpath/MCP systems. 0 |
The MCP is the proprietary operating system of the Burroughs small, medium and large systems, including the Unisys Clearpath/MCP systems. |
Hyperemesis gravidarum (HG) is a pregnancy complication that is characterized by severe nausea, vomiting, weight loss, and possibly dehydration. Feeling faint may also occur. It is considered more severe than morning sickness. Symptoms often get better after the 20th week of pregnancy but may last the entire pregnancy duration. |
1 |
MultipleNegativesRankingLoss with these parameters:{
"scale": 20.0,
"similarity_fct": "cos_sim",
"gather_across_devices": false
}
num_train_epochs: 1fp16: Truemulti_dataset_batch_sampler: round_robinoverwrite_output_dir: Falsedo_predict: Falseeval_strategy: noprediction_loss_only: Trueper_device_train_batch_size: 8per_device_eval_batch_size: 8per_gpu_train_batch_size: Noneper_gpu_eval_batch_size: Nonegradient_accumulation_steps: 1eval_accumulation_steps: Nonetorch_empty_cache_steps: Nonelearning_rate: 5e-05weight_decay: 0.0adam_beta1: 0.9adam_beta2: 0.999adam_epsilon: 1e-08max_grad_norm: 1num_train_epochs: 1max_steps: -1lr_scheduler_type: linearlr_scheduler_kwargs: {}warmup_ratio: 0.0warmup_steps: 0log_level: passivelog_level_replica: warninglog_on_each_node: Truelogging_nan_inf_filter: Truesave_safetensors: Truesave_on_each_node: Falsesave_only_model: Falserestore_callback_states_from_checkpoint: Falseno_cuda: Falseuse_cpu: Falseuse_mps_device: Falseseed: 42data_seed: Nonejit_mode_eval: Falseuse_ipex: Falsebf16: Falsefp16: Truefp16_opt_level: O1half_precision_backend: autobf16_full_eval: Falsefp16_full_eval: Falsetf32: Nonelocal_rank: 0ddp_backend: Nonetpu_num_cores: Nonetpu_metrics_debug: Falsedebug: []dataloader_drop_last: Falsedataloader_num_workers: 0dataloader_prefetch_factor: Nonepast_index: -1disable_tqdm: Falseremove_unused_columns: Truelabel_names: Noneload_best_model_at_end: Falseignore_data_skip: Falsefsdp: []fsdp_min_num_params: 0fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}fsdp_transformer_layer_cls_to_wrap: Noneaccelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}deepspeed: Nonelabel_smoothing_factor: 0.0optim: adamw_torchoptim_args: Noneadafactor: Falsegroup_by_length: Falselength_column_name: lengthddp_find_unused_parameters: Noneddp_bucket_cap_mb: Noneddp_broadcast_buffers: Falsedataloader_pin_memory: Truedataloader_persistent_workers: Falseskip_memory_metrics: Trueuse_legacy_prediction_loop: Falsepush_to_hub: Falseresume_from_checkpoint: Nonehub_model_id: Nonehub_strategy: every_savehub_private_repo: Nonehub_always_push: Falsegradient_checkpointing: Falsegradient_checkpointing_kwargs: Noneinclude_inputs_for_metrics: Falseinclude_for_metrics: []eval_do_concat_batches: Truefp16_backend: autopush_to_hub_model_id: Nonepush_to_hub_organization: Nonemp_parameters: auto_find_batch_size: Falsefull_determinism: Falsetorchdynamo: Noneray_scope: lastddp_timeout: 1800torch_compile: Falsetorch_compile_backend: Nonetorch_compile_mode: Noneinclude_tokens_per_second: Falseinclude_num_input_tokens_seen: Falseneftune_noise_alpha: Noneoptim_target_modules: Nonebatch_eval_metrics: Falseeval_on_start: Falseuse_liger_kernel: Falseeval_use_gather_object: Falseaverage_tokens_across_devices: Falseprompts: Nonebatch_sampler: batch_samplermulti_dataset_batch_sampler: round_robinrouter_mapping: {}learning_rate_mapping: {}| Epoch | Step | Training Loss |
|---|---|---|
| 0.08 | 500 | 0.0323 |
| 0.16 | 1000 | 0.0356 |
| 0.24 | 1500 | 0.0426 |
| 0.32 | 2000 | 0.0451 |
| 0.4 | 2500 | 0.0306 |
| 0.48 | 3000 | 0.0341 |
| 0.56 | 3500 | 0.0374 |
| 0.64 | 4000 | 0.0291 |
| 0.72 | 4500 | 0.0266 |
| 0.8 | 5000 | 0.0214 |
| 0.88 | 5500 | 0.0331 |
| 0.96 | 6000 | 0.0281 |
@inproceedings{reimers-2019-sentence-bert,
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
author = "Reimers, Nils and Gurevych, Iryna",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
month = "11",
year = "2019",
publisher = "Association for Computational Linguistics",
url = "https://arxiv.org/abs/1908.10084",
}
@misc{henderson2017efficient,
title={Efficient Natural Language Response Suggestion for Smart Reply},
author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
year={2017},
eprint={1705.00652},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
Unable to build the model tree, the base model loops to the model itself. Learn more.