Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks
Paper • 1908.10084 • Published • 13
How to use knguyennguyen/mpnet_jacket4k_enhanced with sentence-transformers:
from sentence_transformers import SentenceTransformer
model = SentenceTransformer("knguyennguyen/mpnet_jacket4k_enhanced")
sentences = [
"men's sleeveless vest with a polished exterior and a tailored fit.. men's sleeveless vest with a polished exterior and a tailored fit.",
"Title: Arnodefrance Lity Of Gog Denim Jacket Graphic Print Washed Jacket Hip Pop Button Down Trucker Jacket Descripion: ['Arnodefrance provides more trendy clothing choices for trendy brand lovers and fashion icons. It has always been aimed at creating an international first-line trendy brand. It has unique cutting treatment, personalized color matching and comfortable soft fabrics. It expresses modern youth through clothing design. In a happy world, people play a self-style, create topics, and always maintain a trendy attitude to question common sense and pursue their own answers.']",
"Title: Columbia Girls' Big Benton Fleece Jacket, Spring Blue/Blue Chill, Medium Descripion: [\"There's nothing more necessary than a fleece layer in a litter adventurer's outdoor winter wardrobe—that's why the Benton Springs Full Zip Fleece Jacket exists. Columbia's soft, winter-ready jacket is the ultimate warmth provider and the everyday style piece. Crafted of our super-soft 100% polyester MTR filament fleece, this Benton Springs Full Zip Fleece Jacket is the perfect layering piece and first line of defense to combat the cold. It contains a modern classic fit that allows for comfortable movement and zippered side pockets to keep your small items (including your hands) secure. An added bonus is the warm collar that's flexible so you can choose whether you want to wear it up or down, depending on your desired level of toastiness. Our Benton Springs Full Zip Fleece Jacket is available in many accommodating sizes and colors as well. To ensure the size you choose is right, utilize our sizing chart and the following measurement instructions: For the sleeves, start at the center back of your neck and measure across the shoulder and down to the sleeve. If you come up with a partial number, round up to the next even number. For the chest, measure at the fullest part of the chest, under the armpits and over the shoulder blades, keeping the tape measure firm and level.\"]",
"Title: Men's Slim Vest Sleeveless Jacket Casual PU Leather Vests Button Open V-Neck Simple Joker Slim Fit Vest Winter Descripion: ['SPECIFICATIONGender:MENFabric Type:BroadclothStyle:Smart CasualMaterial:NylonMaterial:ViscoseItem Type:Vests']"
]
embeddings = model.encode(sentences)
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [4, 4]This is a sentence-transformers model finetuned from sentence-transformers/all-mpnet-base-v2. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
SentenceTransformer(
(0): Transformer({'max_seq_length': 128, 'do_lower_case': False}) with Transformer model: MPNetModel
(1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
)
First install the Sentence Transformers library:
pip install -U sentence-transformers
Then you can load this model and run inference.
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("knguyennguyen/mpnet_jacket4k_enhanced")
# Run inference
sentences = [
'enamel pin with a compact size, durable material, and a secure backing.. enamel pin with a compact size, durable material, and a secure backing.',
'Title: Funny Chill Demon Enamel Pin Novelty Brooch Buttons Jewelry for Jackets Jeans Backpack Cloth Lapel Bag Hat Gift for Luci Fans Disenchantment Lovers Men Women Boy Girl Descripion: [\'-Size - About 1.2" -Hard enamel -Black shiny metal -One rubber clutch\']',
"Title: Cute Cat Enamel Pin I LOVE ALL THE CATS Brooch Cartoon Animal Lapel Badge for Backpacks Jackets Clothes Bag Party Decoration Jewelry Gift for Friends Descripion: ['Cute Cat Enamel Pin I LOVE ALL THE CATS Brooch Cartoon Animal Lapel Badge for Backpacks Jackets Clothes Bag Party Decoration Jewelry Gift for Friends Cute Cat Enamel Pin I LOVE ALL THE CATS Brooch Cartoon Animal Lapel Badge for Backpacks Jackets Clothes Bag Party Decoration Jewelry Gift for Friends Cute Cat Enamel Pin I LOVE ALL THE CATS Brooch Cartoon Animal Lapel Badge for Backpacks Jackets Clothes Bag Party Decoration Jewelry Gift for Friends Cute Cat Enamel Pin I LOVE ALL THE CATS Brooch Cartoon Animal Lapel Badge for Backpacks Jackets Clothes Bag Party Decoration Jewelry Gift for Friends']",
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 768]
# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [3, 3]
sentence_0 and sentence_1| sentence_0 | sentence_1 | |
|---|---|---|
| type | string | string |
| details |
|
|
| sentence_0 | sentence_1 |
|---|---|
cosplay jacket designed for men, made from synthetic material, featuring a closure mechanism and suitable for various festive occasions. |
Title: Poetic Walk Kill la Kill Cosplay Matoi Ryuko Costume Jacket Baseball Coat Uniform Sports Coat Descripion: ["Anime Kill la Kill Cosplay Matoi Ryuko Costume Jacket Baseball Coat Uniform Sports Coat Package:One good quality jacket. Fabric:Polyester. Size:Mens size,please choose size from size table,if you couldn't ensure the size,please email us your measurements:female/male,height,bust,waist and hip,then we could check which size fit for you . Occasion: Halloween,Birthday, Masquerade, Christmas, Carnival,theme parties,clothing parties, costume ball, family gatherings, Halloween Party .Cosplay and all kinds of seasonal holidays and parties ."] |
a collarless leather jacket for stylish outerwear |
Title: Cole Haan Women's Leather Collarless Jacket Descripion: ['Collarless smooth lamb leather jacket with exposed snap detail at necline.'] |
jacket featuring a flexible closure, adjustable head covering, and secure storage options.. jacket featuring a flexible closure, adjustable head covering, and secure storage options. |
Title: PUMA Puma X Helly Hansen Jacket Descripion: ['Equip Your Wardrobe With The Latest Styles And Technology From This Duo Of Sportswear Titans, Puma And Helly Hansen. Known For Their Excellence With Outerwear, Puma Has Teamed Up With The Experts Over At Helly Hansen To Produce High-Performance, High Style Options For This Line Of Winterwear.'] |
MultipleNegativesRankingLoss with these parameters:{
"scale": 20.0,
"similarity_fct": "cos_sim"
}
per_device_train_batch_size: 128per_device_eval_batch_size: 128num_train_epochs: 5multi_dataset_batch_sampler: round_robinoverwrite_output_dir: Falsedo_predict: Falseeval_strategy: noprediction_loss_only: Trueper_device_train_batch_size: 128per_device_eval_batch_size: 128per_gpu_train_batch_size: Noneper_gpu_eval_batch_size: Nonegradient_accumulation_steps: 1eval_accumulation_steps: Nonetorch_empty_cache_steps: Nonelearning_rate: 5e-05weight_decay: 0.0adam_beta1: 0.9adam_beta2: 0.999adam_epsilon: 1e-08max_grad_norm: 1num_train_epochs: 5max_steps: -1lr_scheduler_type: linearlr_scheduler_kwargs: {}warmup_ratio: 0.0warmup_steps: 0log_level: passivelog_level_replica: warninglog_on_each_node: Truelogging_nan_inf_filter: Truesave_safetensors: Truesave_on_each_node: Falsesave_only_model: Falserestore_callback_states_from_checkpoint: Falseno_cuda: Falseuse_cpu: Falseuse_mps_device: Falseseed: 42data_seed: Nonejit_mode_eval: Falseuse_ipex: Falsebf16: Falsefp16: Falsefp16_opt_level: O1half_precision_backend: autobf16_full_eval: Falsefp16_full_eval: Falsetf32: Nonelocal_rank: 0ddp_backend: Nonetpu_num_cores: Nonetpu_metrics_debug: Falsedebug: []dataloader_drop_last: Falsedataloader_num_workers: 0dataloader_prefetch_factor: Nonepast_index: -1disable_tqdm: Falseremove_unused_columns: Truelabel_names: Noneload_best_model_at_end: Falseignore_data_skip: Falsefsdp: []fsdp_min_num_params: 0fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}fsdp_transformer_layer_cls_to_wrap: Noneaccelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}deepspeed: Nonelabel_smoothing_factor: 0.0optim: adamw_torchoptim_args: Noneadafactor: Falsegroup_by_length: Falselength_column_name: lengthddp_find_unused_parameters: Noneddp_bucket_cap_mb: Noneddp_broadcast_buffers: Falsedataloader_pin_memory: Truedataloader_persistent_workers: Falseskip_memory_metrics: Trueuse_legacy_prediction_loop: Falsepush_to_hub: Falseresume_from_checkpoint: Nonehub_model_id: Nonehub_strategy: every_savehub_private_repo: Falsehub_always_push: Falsegradient_checkpointing: Falsegradient_checkpointing_kwargs: Noneinclude_inputs_for_metrics: Falseeval_do_concat_batches: Truefp16_backend: autopush_to_hub_model_id: Nonepush_to_hub_organization: Nonemp_parameters: auto_find_batch_size: Falsefull_determinism: Falsetorchdynamo: Noneray_scope: lastddp_timeout: 1800torch_compile: Falsetorch_compile_backend: Nonetorch_compile_mode: Nonedispatch_batches: Nonesplit_batches: Noneinclude_tokens_per_second: Falseinclude_num_input_tokens_seen: Falseneftune_noise_alpha: Noneoptim_target_modules: Nonebatch_eval_metrics: Falseeval_on_start: Falseuse_liger_kernel: Falseeval_use_gather_object: Falsebatch_sampler: batch_samplermulti_dataset_batch_sampler: round_robin@inproceedings{reimers-2019-sentence-bert,
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
author = "Reimers, Nils and Gurevych, Iryna",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
month = "11",
year = "2019",
publisher = "Association for Computational Linguistics",
url = "https://arxiv.org/abs/1908.10084",
}
@misc{henderson2017efficient,
title={Efficient Natural Language Response Suggestion for Smart Reply},
author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
year={2017},
eprint={1705.00652},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
Base model
sentence-transformers/all-mpnet-base-v2