Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks
Paper • 1908.10084 • Published • 13
This is a sentence-transformers model finetuned from BAAI/bge-m3. It maps sentences & paragraphs to a 1024-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
SentenceTransformer(
(0): Transformer({'max_seq_length': 1024, 'do_lower_case': False}) with Transformer model: XLMRobertaModel
(1): Pooling({'word_embedding_dimension': 1024, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
(2): Normalize()
)
First install the Sentence Transformers library:
pip install -U sentence-transformers
Then you can load this model and run inference.
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("Jrinky/model3")
# Run inference
sentences = [
'When did he migrate to New South Wales',
'He attended Derby Grammar School and Beaufort House in London, and migrated to New South Wales in 1883. He settled in Newcastle, where he worked as a shipping agent, eventually partnering with his brothers in a firm. On 6 May 1893 he married Gertrude Mary Saddington, with whom he had five children.',
'Shizuka Shirakawa, Scholar of Chinese-language literature. Horin Fukuoji, Nihonga painter. 2005\n Mitsuko Mori. Actress. Makoto Saitō (1921–2008). Political scientist, specializing in American diplomatic and political history. Ryuzan Aoki, Ceramic artist. Toshio Sawada, Civil engineer. Shigeaki Hinohara, Doctor. 2006\n Yoshiaki Arata. A pioneer of nuclear fusion research. Jakuchō Setouchi. Writer/Buddhist nun. Hidekazu Yoshida. Music critic. Chusaku Oyama, Nihonga painter. Miyohei Shinohara, Economist. 2007\n Akira Mikazuki. Former justice minister and professor emeritus. Shinya Nakamura. Sculptor. Kōji Nakanishi. Organic chemist. Tokindo Okada, Developmental biologist. Shigeyama Sensaku, Kyogen performer. 2008\n Hironoshin Furuhashi (1928–2009). Sportsman and sports bureaucrat. Kiyoshi Itō. A mathematician whose work is now called Itō calculus. Donald Keene.',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 1024]
# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [3, 3]
anchor and positive| anchor | positive | |
|---|---|---|
| type | string | string |
| details |
|
|
| anchor | positive |
|---|---|
What type of tournament structure was used in this freestyle wrestling competition |
This freestyle wrestling competition consisted of a single-elimination tournament, with a repechage used to determine the winners of two bronze medals. Results |
What was the status of Josip Broz Tito under the 1974 Constitution of Yugoslavia regarding his presidency |
1 Wednesday, 22 April 1998. 2 (8.30 a.m.). 3 JUDGE CASSESE: Good morning. May I ask the |
How quickly can you get loan approval and funds transferred with Crawfort |
Then click on the submit button, and it’s done. Make your dream come true with Crawfort |
selfloss.Infonce with these parameters:{
"scale": 20.0,
"similarity_fct": "cos_sim"
}
anchor and positive| anchor | positive | |
|---|---|---|
| type | string | string |
| details |
|
|
| anchor | positive |
|---|---|
What is Hector Guimard best known for |
Hector Guimard (, 10 March 1867 – 20 May 1942) was a French architect and designer, and a prominent figure of the Art Nouveau style. He achieved early fame with his design for the Castel Beranger, the first Art Nouveau apartment building in Paris, which was selected in an 1899 competition as one of the best new building facades in the city. He is best known for the glass and iron edicules or canopies, with ornamental Art Nouveau curves, which he designed to cover the entrances of the first stations of the Paris Metro. Between 1890 and 1930, Guimard designed and built some fifty buildings, in addition to one hundred and forty-one subway entrances for Paris Metro, as well as numerous pieces of furniture and other decorative works. However, in the 1910s Art Nouveau went out of fashion and by the 1960s most of his works had been demolished, and only two of his original Metro edicules were still in place. Guimard's critical reputation revived in the 1960s, in part due to subsequent acquisit... |
What does Mark Kantrowitz say about the inclusion of loans in financial aid packages |
"They don't always understand that part of the financial aid package includes loans," he says. But loans "don't really reduce your costs," explains Mark Kantrowitz, founder of the financial aid website FinAid.org and publisher of Edvisors Network. "They simply spread them out over time. ... A loan is a loan. |
How can Ayurveda support women's health during menopause |
Especially as we journey towards menopause, Ayurveda is there to support us with its millenary wisdom. These are some easy routines to incorporate for the daily care of the vulva and vagina, our most delicate flower. Sesame oil: our best allied against dryness, it cannot be missing in our diet. |
selfloss.Infonce with these parameters:{
"scale": 20.0,
"similarity_fct": "cos_sim"
}
eval_strategy: stepsper_device_train_batch_size: 2per_device_eval_batch_size: 2learning_rate: 2e-05num_train_epochs: 5warmup_ratio: 0.1fp16: Truebatch_sampler: no_duplicatesoverwrite_output_dir: Falsedo_predict: Falseeval_strategy: stepsprediction_loss_only: Trueper_device_train_batch_size: 2per_device_eval_batch_size: 2per_gpu_train_batch_size: Noneper_gpu_eval_batch_size: Nonegradient_accumulation_steps: 1eval_accumulation_steps: Nonelearning_rate: 2e-05weight_decay: 0.0adam_beta1: 0.9adam_beta2: 0.999adam_epsilon: 1e-08max_grad_norm: 1.0num_train_epochs: 5max_steps: -1lr_scheduler_type: linearlr_scheduler_kwargs: {}warmup_ratio: 0.1warmup_steps: 0log_level: passivelog_level_replica: warninglog_on_each_node: Truelogging_nan_inf_filter: Truesave_safetensors: Truesave_on_each_node: Falsesave_only_model: Falserestore_callback_states_from_checkpoint: Falseno_cuda: Falseuse_cpu: Falseuse_mps_device: Falseseed: 42data_seed: Nonejit_mode_eval: Falseuse_ipex: Falsebf16: Falsefp16: Truefp16_opt_level: O1half_precision_backend: autobf16_full_eval: Falsefp16_full_eval: Falsetf32: Nonelocal_rank: 0ddp_backend: Nonetpu_num_cores: Nonetpu_metrics_debug: Falsedebug: []dataloader_drop_last: Falsedataloader_num_workers: 0dataloader_prefetch_factor: Nonepast_index: -1disable_tqdm: Falseremove_unused_columns: Truelabel_names: Noneload_best_model_at_end: Falseignore_data_skip: Falsefsdp: []fsdp_min_num_params: 0fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}fsdp_transformer_layer_cls_to_wrap: Noneaccelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}deepspeed: Nonelabel_smoothing_factor: 0.0optim: adamw_torchoptim_args: Noneadafactor: Falsegroup_by_length: Falselength_column_name: lengthddp_find_unused_parameters: Noneddp_bucket_cap_mb: Noneddp_broadcast_buffers: Falsedataloader_pin_memory: Truedataloader_persistent_workers: Falseskip_memory_metrics: Trueuse_legacy_prediction_loop: Falsepush_to_hub: Falseresume_from_checkpoint: Nonehub_model_id: Nonehub_strategy: every_savehub_private_repo: Falsehub_always_push: Falsegradient_checkpointing: Falsegradient_checkpointing_kwargs: Noneinclude_inputs_for_metrics: Falseeval_do_concat_batches: Truefp16_backend: autopush_to_hub_model_id: Nonepush_to_hub_organization: Nonemp_parameters: auto_find_batch_size: Falsefull_determinism: Falsetorchdynamo: Noneray_scope: lastddp_timeout: 1800torch_compile: Falsetorch_compile_backend: Nonetorch_compile_mode: Nonedispatch_batches: Nonesplit_batches: Noneinclude_tokens_per_second: Falseinclude_num_input_tokens_seen: Falseneftune_noise_alpha: Noneoptim_target_modules: Nonebatch_eval_metrics: Falseeval_on_start: Falseprompts: Nonebatch_sampler: no_duplicatesmulti_dataset_batch_sampler: proportional| Epoch | Step | Training Loss | Validation Loss |
|---|---|---|---|
| 0.2033 | 100 | 0.2694 | 0.0690 |
| 0.4065 | 200 | 0.0822 | 0.0528 |
| 0.6098 | 300 | 0.0689 | 0.0497 |
| 0.8130 | 400 | 0.0644 | 0.0469 |
| 1.0163 | 500 | 0.0643 | 0.0443 |
| 1.2195 | 600 | 0.0378 | 0.0473 |
| 1.4228 | 700 | 0.04 | 0.0479 |
| 1.6260 | 800 | 0.0358 | 0.0461 |
| 1.8293 | 900 | 0.0332 | 0.0507 |
| 2.0325 | 1000 | 0.0283 | 0.0538 |
@inproceedings{reimers-2019-sentence-bert,
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
author = "Reimers, Nils and Gurevych, Iryna",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
month = "11",
year = "2019",
publisher = "Association for Computational Linguistics",
url = "https://arxiv.org/abs/1908.10084",
}
@misc{henderson2017efficient,
title={Efficient Natural Language Response Suggestion for Smart Reply},
author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
year={2017},
eprint={1705.00652},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
Base model
BAAI/bge-m3
from sentence_transformers import SentenceTransformer model = SentenceTransformer("Jrinky/model3") sentences = [ "Who are some notable individuals named Roger Mason", "Rav Kook's writings are extensive, and he is considered one of the most celebrated and influential rabbis of the 20th century. Some rabbis recommend that students of his begin studying his writings with Ein Ayah. References\n\nExternal links\n Ayin Ayah (full text), Hebrew Wikisource\n * Ayn Aya Classes in English\n\nTalmud\nAggadic Midrashim\nAbraham Isaac Kook\nHebrew-language religious books", "Roger Mason may refer to:\n\nRoger Mason (baseball) (born 1958), American baseball player\nRoger Mason (geologist) (born 1941), discoverer of Ediacaran fossils\nRoger Mason Jr. (born 1980), American basketball player\nRoger Mason (musician), Australian keyboardist\nL. Roger Mason, Jr., former assistant director of National Intelligence for Systems and Resource Analyses", "Timetabled passenger services on both lines had ceased by the end of February 1959. Shipping\nThe Bourne-Morton Canal or Bourne Old Eau connected the town to the sea in Roman times. Until the mid-19th century, the present Bourne Eau was capable of carrying commercial boat traffic from the Wash coast and Spalding. This resulted from the investment following the Bourne Navigation Act of 1780. Passage became impossible once the junction of the Eau and the River Glen was converted from gates to a sluice in 1860. Media\nLocal news and television programmes are provided by BBC Yorkshire and Lincolnshire and ITV Yorkshire. Television signals are received from the Belmont TV transmitter, the Waltham TV transmitter can also be received which broadcast BBC East Midlands and ITV Central programmes. Local radio stations are BBC Radio Lincolnshire, Greatest Hits Radio Lincolnshire and Lincs FM. The town's local newspapers are Bourne Local and Stamford Mercury. Sport\nBourne Town Football Club plays football in the United Counties Football League, whilst Bourne Cricket Club plays in the Lincolnshire ECB Premier League. These teams play their home games at the Abbey Lawn, a recreation ground privately owned by the Bourne United Charities. Motor sports\n\nThe racing-car marques English Racing Automobiles (ERA) and British Racing Motors (BRM) were both founded in Bourne by Raymond Mays, an international racing driver and designer who lived in Bourne. The former ERA and BRM workshops in Spalding Road are adjacent to Eastgate House, the Mays' family home in the town's Eastgate. Landmarks\n\nThere are currently 71 listed buildings in the parish of Bourne, the most important being Bourne Abbey and the Parish Church of St Peter and St Paul (1138), which is the only one scheduled Grade I. Notable people\nBourne is reputedly the birthplace of Hereward the Wake (in about 1035), although the 12th-century source of this information, De Gestis Herwardi Saxonis, refers only to his father as being \"of Bourne\" and to the father's house and retainers there. Robert Mannyng (1264–1340) is credited with putting the speech of the ordinary people of his time into recognisable form. He is better known as Robert de Brunne because of his long period of residence as a canon at Bourne Abbey. There he completed his life's work of popularising religious and historical material in a Middle English dialect that was easily understood at that time. William Cecil (1520–1598) became the first Lord Burghley after serving Queen Elizabeth I. He was born at a house in the centre of Bourne that is now the Burghley Arms. Dr William Dodd (1729–1777), was an Anglican clergyman, man of letters and forger. He was prosecuted, sentenced to death and publicly hanged at Tyburn in 1777. Charles Frederick Worth (1825–1895), son of a solicitor, lived at Wake House in North Street. He moved to Paris and became a renowned designer of women's fashion and the founder of haute couture. The French government awarded him the Légion d'honneur. Sir George White (1840-1912), MP for North West Norfolk, a seat he held for twelve years until he died in 1912. He was knighted for public service in 1907." ] embeddings = model.encode(sentences) similarities = model.similarity(embeddings, embeddings) print(similarities.shape) # [4, 4]