base_model:BAAI/bge-base-en-v1.5datasets: []
language:-enlibrary_name:sentence-transformerslicense:apache-2.0metrics:-cosine_accuracy@1-cosine_accuracy@3-cosine_accuracy@5-cosine_accuracy@10-cosine_precision@1-cosine_precision@3-cosine_precision@5-cosine_precision@10-cosine_recall@1-cosine_recall@3-cosine_recall@5-cosine_recall@10-cosine_ndcg@10-cosine_mrr@10-cosine_map@100pipeline_tag:sentence-similaritytags:-sentence-transformers-sentence-similarity-feature-extraction-generated_from_trainer-dataset_size:6300-loss:MatryoshkaLoss-loss:MultipleNegativesRankingLosswidget:-source_sentence:>- Mergers and acquisitions, joint ventures and strategic investments complement our internal development and enhance our partnerships to align with Visa’s priorities.sentences:->- How much did the unbilled accounts receivable amount to as of December 30, 2023?->- What was the main reason for Visa to engage in mergers and acquisitions, joint ventures, and strategic investments?-WhatisthemissionofIntuit?-source_sentence:>- Garmin’s audio brands, Fusion and JL Audio, offer premium audio products and accessories, including head units, speakers, amplifiers, subwoofers, and other audio components. These products are designed specifically for the marine, powersports, aftermarket automotive, home, or RV environments, offering premium sound quality and supporting many connectivity options for integrating with MFDs, smartphones, and Garmin wearables.sentences:->- What type of insurance policies cover some of the defense and settlement costs associated with litigation mentioned?->- What types of audio products does Garmin's Fusion and JL Audio brands offer?->- What should investors consider when comparing Adjusted EBITDA across different companies?-source_sentence:>- Medical device products that are marketed in the European Union must comply with the requirements of the Medical Device Regulation (the MDR), which came into effect in May 2021. The MDR provides for regulatory oversight with respect to the design, manufacture, clinical trials, labeling and adverse event reporting for medical devices.sentences:->- What are the requirements for medical devices to be marketed in the European Union under the MDR?->- By what percentage did the pre-tax earnings increase from 2021 to 2022 in the manufacturing sector?-Whatwerethecashandcashequivalentsattheendof2023?-source_sentence:>- In March 2023, the Board of Directors sanctioned a restructuring plan concentrated on investment prioritization towards significant growth prospects and the optimization of the company's real estate assets. This includes substantial organizational changes such as reductions in office space and workforce.sentences:->- How many physicians are part of the domestic Office of the Chief Medical Officer at DaVita as of December 31, 2023?->- What changes in expenses did Delta Air Lines' ancillary businesses and refinery segment encounter in 2023 compared to 2022?->- What are the restructuring targets of the company's Board of Directors as of 2023?-source_sentence:>- The quality of GM dealerships and our relationship with our dealers are critical to our success, now, and as we transition to our all-electric future, given that they maintain the primary sales and service interface with the end consumer of our products. In addition to the terms of our contracts with our dealers, we are regulated by various country and state franchise laws and regulations that may supersede those contractual terms and impose specific regulatorysentences:->- How does General[39 chars] Motors ensure quality in their dealership network?-Howcanthepublicaccessthecompany'sfinancialandlegalreports?->- Is the outcome of the investigation into Tesla's waste segregation practices currently determinable?model-index:-name:BGEbaseFinancialMatryoshkaresults:-task:type:information-retrievalname:InformationRetrievaldataset:name:dim768type:dim_768metrics:-type:cosine_accuracy@1value:0.6785714285714286name:CosineAccuracy@1-type:cosine_accuracy@3value:0.8171428571428572name:CosineAccuracy@3-type:cosine_accuracy@5value:0.8671428571428571name:CosineAccuracy@5-type:cosine_accuracy@10value:0.91name:CosineAccuracy@10-type:cosine_precision@1value:0.6785714285714286name:CosinePrecision@1-type:cosine_precision@3value:0.2723809523809524name:CosinePrecision@3-type:cosine_precision@5value:0.1734285714285714name:CosinePrecision@5-type:cosine_precision@10value:0.09099999999999998name:CosinePrecision@10-type:cosine_recall@1value:0.6785714285714286name:CosineRecall@1-type:cosine_recall@3value:0.8171428571428572name:CosineRecall@3-type:cosine_recall@5value:0.8671428571428571name:CosineRecall@5-type:cosine_recall@10value:0.91name:CosineRecall@10-type:cosine_ndcg@10value:0.7949318413045188name:CosineNdcg@10-type:cosine_mrr@10value:0.7579920634920636name:CosineMrr@10-type:cosine_map@100value:0.761780829563342name:CosineMap@100-task:type:information-retrievalname:InformationRetrievaldataset:name:dim512type:dim_512metrics:-type:cosine_accuracy@1value:0.6714285714285714name:CosineAccuracy@1-type:cosine_accuracy@3value:0.8171428571428572name:CosineAccuracy@3-type:cosine_accuracy@5value:0.8642857142857143name:CosineAccuracy@5-type:cosine_accuracy@10value:0.9028571428571428name:CosineAccuracy@10-type:cosine_precision@1value:0.6714285714285714name:CosinePrecision@1-type:cosine_precision@3value:0.2723809523809524name:CosinePrecision@3-type:cosine_precision@5value:0.17285714285714285name:CosinePrecision@5-type:cosine_precision@10value:0.09028571428571427name:CosinePrecision@10-type:cosine_recall@1value:0.6714285714285714name:CosineRecall@1-type:cosine_recall@3value:0.8171428571428572name:CosineRecall@3-type:cosine_recall@5value:0.8642857142857143name:CosineRecall@5-type:cosine_recall@10value:0.9028571428571428name:CosineRecall@10-type:cosine_ndcg@10value:0.7892232861723367name:CosineNdcg@10-type:cosine_mrr@10value:0.7524767573696142name:CosineMrr@10-type:cosine_map@100value:0.7566816338836445name:CosineMap@100-task:type:information-retrievalname:InformationRetrievaldataset:name:dim256type:dim_256metrics:-type:cosine_accuracy@1value:0.6671428571428571name:CosineAccuracy@1-type:cosine_accuracy@3value:0.8142857142857143name:CosineAccuracy@3-type:cosine_accuracy@5value:0.8657142857142858name:CosineAccuracy@5-type:cosine_accuracy@10value:0.9028571428571428name:CosineAccuracy@10-type:cosine_precision@1value:0.6671428571428571name:CosinePrecision@1-type:cosine_precision@3value:0.2714285714285714name:CosinePrecision@3-type:cosine_precision@5value:0.17314285714285713name:CosinePrecision@5-type:cosine_precision@10value:0.09028571428571427name:CosinePrecision@10-type:cosine_recall@1value:0.6671428571428571name:CosineRecall@1-type:cosine_recall@3value:0.8142857142857143name:CosineRecall@3-type:cosine_recall@5value:0.8657142857142858name:CosineRecall@5-type:cosine_recall@10value:0.9028571428571428name:CosineRecall@10-type:cosine_ndcg@10value:0.786715703830093name:CosineNdcg@10-type:cosine_mrr@10value:0.749225056689342name:CosineMrr@10-type:cosine_map@100value:0.7532686203724872name:CosineMap@100-task:type:information-retrievalname:InformationRetrievaldataset:name:dim128type:dim_128metrics:-type:cosine_accuracy@1value:0.6542857142857142name:CosineAccuracy@1-type:cosine_accuracy@3value:0.8071428571428572name:CosineAccuracy@3-type:cosine_accuracy@5value:0.8428571428571429name:CosineAccuracy@5-type:cosine_accuracy@10value:0.9name:CosineAccuracy@10-type:cosine_precision@1value:0.6542857142857142name:CosinePrecision@1-type:cosine_precision@3value:0.26904761904761904name:CosinePrecision@3-type:cosine_precision@5value:0.16857142857142854name:CosinePrecision@5-type:cosine_precision@10value:0.09name:CosinePrecision@10-type:cosine_recall@1value:0.6542857142857142name:CosineRecall@1-type:cosine_recall@3value:0.8071428571428572name:CosineRecall@3-type:cosine_recall@5value:0.8428571428571429name:CosineRecall@5-type:cosine_recall@10value:0.9name:CosineRecall@10-type:cosine_ndcg@10value:0.7763972670750712name:CosineNdcg@10-type:cosine_mrr@10value:0.7369308390022671name:CosineMrr@10-type:cosine_map@100value:0.7407041984815913name:CosineMap@100-task:type:information-retrievalname:InformationRetrievaldataset:name:dim64type:dim_64metrics:-type:cosine_accuracy@1value:0.62name:CosineAccuracy@1-type:cosine_accuracy@3value:0.7671428571428571name:CosineAccuracy@3-type:cosine_accuracy@5value:0.8171428571428572name:CosineAccuracy@5-type:cosine_accuracy@10value:0.8785714285714286name:CosineAccuracy@10-type:cosine_precision@1value:0.62name:CosinePrecision@1-type:cosine_precision@3value:0.2557142857142857name:CosinePrecision@3-type:cosine_precision@5value:0.16342857142857142name:CosinePrecision@5-type:cosine_precision@10value:0.08785714285714284name:CosinePrecision@10-type:cosine_recall@1value:0.62name:CosineRecall@1-type:cosine_recall@3value:0.7671428571428571name:CosineRecall@3-type:cosine_recall@5value:0.8171428571428572name:CosineRecall@5-type:cosine_recall@10value:0.8785714285714286name:CosineRecall@10-type:cosine_ndcg@10value:0.7482796784963641name:CosineNdcg@10-type:cosine_mrr@10value:0.7067517006802718name:CosineMrr@10-type:cosine_map@100value:0.7110201251131743name:CosineMap@100
BGE base Financial Matryoshka
This is a sentence-transformers model finetuned from BAAI/bge-base-en-v1.5. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("uhoffmann/bge-base-financial-matryoshka")
# Run inference
sentences = [
'The quality of GM dealerships and our relationship with our dealers are critical to our success, now, and as we transition to our all-electric future, given that they maintain the primary sales and service interface with the end consumer of our products. In addition to the terms of our contracts with our dealers, we are regulated by various country and state franchise laws and regulations that may supersede those contractual terms and impose specific regulatory',
'How does General[39 chars] Motors ensure quality in their dealership network?',
"How can the public access the company's financial and legal reports?",
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 768]# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [3, 3]
Approximate statistics based on the first 1000 samples:
positive
anchor
type
string
string
details
min: 2 tokens
mean: 44.88 tokens
max: 272 tokens
min: 2 tokens
mean: 20.58 tokens
max: 45 tokens
Samples:
positive
anchor
Walmart Inc. reported total revenues of $611,289 million for the fiscal year ended January 31, 2023.
What was Walmart Inc.'s total revenue in the fiscal year ended January 31, 2023?
The total equity balance of Visa Inc. as of September 30, 2023 was $38,733 million.
What was the total equity of Visa Inc. as of September 30, 2023?
Nike incorporates new technologies in its product design by using market intelligence and research, which helps its design teams identify opportunities to leverage these technologies in existing categories to respond to consumer preferences.
How does Nike incorporate new technologies in its product design?
@inproceedings{reimers-2019-sentence-bert,
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
author = "Reimers, Nils and Gurevych, Iryna",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
month = "11",
year = "2019",
publisher = "Association for Computational Linguistics",
url = "https://arxiv.org/abs/1908.10084",
}
MatryoshkaLoss
@misc{kusupati2024matryoshka,
title={Matryoshka Representation Learning},
author={Aditya Kusupati and Gantavya Bhatt and Aniket Rege and Matthew Wallingford and Aditya Sinha and Vivek Ramanujan and William Howard-Snyder and Kaifeng Chen and Sham Kakade and Prateek Jain and Ali Farhadi},
year={2024},
eprint={2205.13147},
archivePrefix={arXiv},
primaryClass={cs.LG}
}
MultipleNegativesRankingLoss
@misc{henderson2017efficient,
title={Efficient Natural Language Response Suggestion for Smart Reply},
author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
year={2017},
eprint={1705.00652},
archivePrefix={arXiv},
primaryClass={cs.CL}
}