language:-entags:-sentence-transformers-sentence-similarity-feature-extraction-dense-generated_from_trainer-dataset_size:6300-loss:MatryoshkaLoss-loss:MultipleNegativesRankingLossbase_model:nomic-ai/modernbert-embed-basewidget:-source_sentence:>- How much is the company's obligations for non-cancellable operating leases for minimum rent payments throughout the future fiscal years as of January 28, 2024?sentences:->- While we do not expect to repatriate cash to the U.S. to satisfy domestic liquidity needs, if these amounts were distributed to the U.S., in the form of dividends or otherwise, we may be subject to additional foreign withholding taxes and U.S. state income taxes, which could be material.-Operatingleases(minimumrent)|$|1,645,318| - >- the top three exposures being to issuers and counterparties domiciled in France at $5.1 billion, the United Kingdom at $4.8 billion, and Canada at $1.7 billion.-source_sentence:WhatdoestheSelectedDruglistpublishedbyCMSinclude?sentences:->- Trodelvy product sales were $680 million in 2022 and increased by 56% to $1.1 billion in 2023.->- Medicaid Services (“CMS”) published the first “Selected Drug” list, which includes XARELTO and STELARA as well as IMBRUVICA.->- Cost of revenues decreased by 9%, or $123.9 million, in the year ended December 31, 2023, as compared to the same period in 2022.-source_sentence:>- What was the dividend per share paid in 2023 and how did it change from the previous year?sentences:->- Dividends of $4.52 per share and $3.92 per share were paid in 2023 and 2022, respectively.->- We design, develop, manufacture, sell and lease high-performance fully electric vehicles and energy generation and storage systems, and offer services related to our products.->- As of December 31, 2023, AMC operated 217 IMAX screens and 169 Dolby Cinema screens, according to the large screen format detailing from the data provided.-source_sentence:Whatgenerationtechnologydoesthe40Seriesgraphicscardsfeature?sentences:->- We believe that a diverse, equitable, and inclusive workplace is a strategic business imperative and we take a comprehensive view of diversity, equity, and inclusion. We conduct annual pay equity analyses and support many employee-led inclusion networks.->- HP records revenue from the sale of equipment under sales-type leases as revenue at the commencement of the lease. This method is applied unless certain conditions such as customer acceptance remain uncertain or significant obligations to the customer remain unfulfilled.->- The 40 Series features our third generation RTX technology, third generation NVIDIA DLSS, and fourth generation Tensor Cores to deliver up to 4X the performance of the previous generation.-source_sentence:>- What typical reimbursement methods are used in the company's contracts with hospitals for inpatient and outpatient services?sentences:->- We typically contract with hospitals on either (1) a per diem rate, which is an all-inclusive rate per day, (2) a case rate for diagnosis-related groups (DRG), which is an all-inclusive rate per admission, or (3) a discounted charge for inpatient hospital services. Outpatient hospital services generally are contracted at a flat rate by type of service, ambulatory payment classifications, or APCs, or at a discounted charge.->- In IBM’s 2023 Annual Report to Stockholders, the Financial Statements and Supplementary Data are detailed on pages 44 through 121.->- Certain joint venture agreements in China allow for the contractual right to report vehicle sales of non-GM trademarked vehicles by those joint ventures, which are included in the total vehicle sales General Motors reports for China.pipeline_tag:sentence-similaritylibrary_name:sentence-transformersmetrics:-cosine_accuracy@1-cosine_accuracy@3-cosine_accuracy@5-cosine_accuracy@10-cosine_precision@1-cosine_precision@3-cosine_precision@5-cosine_precision@10-cosine_recall@1-cosine_recall@3-cosine_recall@5-cosine_recall@10-cosine_ndcg@10-cosine_mrr@10-cosine_map@100model-index:-name:ModernBERTEmbedbaseFinance10kMatryoshkaresults:-task:type:information-retrievalname:InformationRetrievaldataset:name:dim768type:dim_768metrics:-type:cosine_accuracy@1value:0.7244285714285714name:CosineAccuracy@1-type:cosine_accuracy@3value:0.8554285714285714name:CosineAccuracy@3-type:cosine_accuracy@5value:0.8902857142857142name:CosineAccuracy@5-type:cosine_accuracy@10value:0.9271428571428572name:CosineAccuracy@10-type:cosine_precision@1value:0.7244285714285714name:CosinePrecision@1-type:cosine_precision@3value:0.28514285714285714name:CosinePrecision@3-type:cosine_precision@5value:0.17805714285714286name:CosinePrecision@5-type:cosine_precision@10value:0.09271428571428571name:CosinePrecision@10-type:cosine_recall@1value:0.7244285714285714name:CosineRecall@1-type:cosine_recall@3value:0.8554285714285714name:CosineRecall@3-type:cosine_recall@5value:0.8902857142857142name:CosineRecall@5-type:cosine_recall@10value:0.9271428571428572name:CosineRecall@10-type:cosine_ndcg@10value:0.8286211304635027name:CosineNdcg@10-type:cosine_mrr@10value:0.7967674603174593name:CosineMrr@10-type:cosine_map@100value:0.7999047933786212name:CosineMap@100-task:type:information-retrievalname:InformationRetrievaldataset:name:dim512type:dim_512metrics:-type:cosine_accuracy@1value:0.7238571428571429name:CosineAccuracy@1-type:cosine_accuracy@3value:0.8532857142857143name:CosineAccuracy@3-type:cosine_accuracy@5value:0.8874285714285715name:CosineAccuracy@5-type:cosine_accuracy@10value:0.927name:CosineAccuracy@10-type:cosine_precision@1value:0.7238571428571429name:CosinePrecision@1-type:cosine_precision@3value:0.2844285714285714name:CosinePrecision@3-type:cosine_precision@5value:0.1774857142857143name:CosinePrecision@5-type:cosine_precision@10value:0.09269999999999999name:CosinePrecision@10-type:cosine_recall@1value:0.7238571428571429name:CosineRecall@1-type:cosine_recall@3value:0.8532857142857143name:CosineRecall@3-type:cosine_recall@5value:0.8874285714285715name:CosineRecall@5-type:cosine_recall@10value:0.927name:CosineRecall@10-type:cosine_ndcg@10value:0.8272739780211489name:CosineNdcg@10-type:cosine_mrr@10value:0.7951572562358264name:CosineMrr@10-type:cosine_map@100value:0.7982776212491204name:CosineMap@100-task:type:information-retrievalname:InformationRetrievaldataset:name:dim256type:dim_256metrics:-type:cosine_accuracy@1value:0.7231428571428572name:CosineAccuracy@1-type:cosine_accuracy@3value:0.8512857142857143name:CosineAccuracy@3-type:cosine_accuracy@5value:0.886name:CosineAccuracy@5-type:cosine_accuracy@10value:0.924name:CosineAccuracy@10-type:cosine_precision@1value:0.7231428571428572name:CosinePrecision@1-type:cosine_precision@3value:0.28376190476190477name:CosinePrecision@3-type:cosine_precision@5value:0.17720000000000002name:CosinePrecision@5-type:cosine_precision@10value:0.0924name:CosinePrecision@10-type:cosine_recall@1value:0.7231428571428572name:CosineRecall@1-type:cosine_recall@3value:0.8512857142857143name:CosineRecall@3-type:cosine_recall@5value:0.886name:CosineRecall@5-type:cosine_recall@10value:0.924name:CosineRecall@10-type:cosine_ndcg@10value:0.825556207455572name:CosineNdcg@10-type:cosine_mrr@10value:0.7937802721088419name:CosineMrr@10-type:cosine_map@100value:0.797126360319449name:CosineMap@100-task:type:information-retrievalname:InformationRetrievaldataset:name:dim128type:dim_128metrics:-type:cosine_accuracy@1value:0.7017142857142857name:CosineAccuracy@1-type:cosine_accuracy@3value:0.8362857142857143name:CosineAccuracy@3-type:cosine_accuracy@5value:0.8725714285714286name:CosineAccuracy@5-type:cosine_accuracy@10value:0.9165714285714286name:CosineAccuracy@10-type:cosine_precision@1value:0.7017142857142857name:CosinePrecision@1-type:cosine_precision@3value:0.27876190476190477name:CosinePrecision@3-type:cosine_precision@5value:0.1745142857142857name:CosinePrecision@5-type:cosine_precision@10value:0.09165714285714284name:CosinePrecision@10-type:cosine_recall@1value:0.7017142857142857name:CosineRecall@1-type:cosine_recall@3value:0.8362857142857143name:CosineRecall@3-type:cosine_recall@5value:0.8725714285714286name:CosineRecall@5-type:cosine_recall@10value:0.9165714285714286name:CosineRecall@10-type:cosine_ndcg@10value:0.8107684115571783name:CosineNdcg@10-type:cosine_mrr@10value:0.7767397392290242name:CosineMrr@10-type:cosine_map@100value:0.7803112683678572name:CosineMap@100-task:type:information-retrievalname:InformationRetrievaldataset:name:dim64type:dim_64metrics:-type:cosine_accuracy@1value:0.6702857142857143name:CosineAccuracy@1-type:cosine_accuracy@3value:0.8052857142857143name:CosineAccuracy@3-type:cosine_accuracy@5value:0.8491428571428571name:CosineAccuracy@5-type:cosine_accuracy@10value:0.8958571428571429name:CosineAccuracy@10-type:cosine_precision@1value:0.6702857142857143name:CosinePrecision@1-type:cosine_precision@3value:0.2684285714285714name:CosinePrecision@3-type:cosine_precision@5value:0.1698285714285714name:CosinePrecision@5-type:cosine_precision@10value:0.08958571428571427name:CosinePrecision@10-type:cosine_recall@1value:0.6702857142857143name:CosineRecall@1-type:cosine_recall@3value:0.8052857142857143name:CosineRecall@3-type:cosine_recall@5value:0.8491428571428571name:CosineRecall@5-type:cosine_recall@10value:0.8958571428571429name:CosineRecall@10-type:cosine_ndcg@10value:0.7834046997868768name:CosineNdcg@10-type:cosine_mrr@10value:0.7473670068027197name:CosineMrr@10-type:cosine_map@100value:0.7515757256525822name:CosineMap@100
ModernBERT Embed base Finance 10k Matryoshka
This is a sentence-transformers model finetuned from nomic-ai/modernbert-embed-base on the json dataset. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("rya23/modernbert-embed-finance-matryoshka")
# Run inference
sentences = [
"What typical reimbursement methods are used in the company's contracts with hospitals for inpatient and outpatient services?",
'We typically contract with hospitals on either (1) a per diem rate, which is an all-inclusive rate per day, (2) a case rate for diagnosis-related groups (DRG), which is an all-inclusive rate per admission, or (3) a discounted charge for inpatient hospital services. Outpatient hospital services generally are contracted at a flat rate by type of service, ambulatory payment classifications, or APCs, or at a discounted charge.',
'In IBM’s 2023 Annual Report to Stockholders, the Financial Statements and Supplementary Data are detailed on pages 44 through 121.',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 768]# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities)
# tensor([[1.0000, 0.6756, 0.0659],# [0.6756, 1.0000, 0.0087],# [0.0659, 0.0087, 1.0000]])
Approximate statistics based on the first 1000 samples:
anchor
positive
type
string
string
details
min: 9 tokens
mean: 20.55 tokens
max: 43 tokens
min: 5 tokens
mean: 46.3 tokens
max: 243 tokens
Samples:
anchor
positive
How many shares of class A common stock were authorized for grant under Visa's Equity Incentive Compensation Plan?
Under the Company’s 2007 Amended and Restated Equity Incentive Compensation Plan (EIP), the compensation committee of the board of directors was authorized to grant up to 198 million shares of class A common stock to its employees and non-employee directors.
What was Garmin Ltd.'s net income for the fiscal year ended December 30, 2023?
Garmin Ltd. reported a net income of $1,289,636 for the fiscal year ended December 30, 2023.
Why are some device sales revenue at AT&T not immediately recognized upon the device sale?
AT&T recognizes revenue from device sales with promotions or installment payments differently. For promotional discounts, revenue is deferred and amortized over the contract term. Meanwhile, installment sales involve recognizing revenue upfront but deferring the cash receipt until payments are made, resulting in a recorded contract asset to be amortized over time.
@inproceedings{reimers-2019-sentence-bert,
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
author = "Reimers, Nils and Gurevych, Iryna",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
month = "11",
year = "2019",
publisher = "Association for Computational Linguistics",
url = "https://arxiv.org/abs/1908.10084",
}
MatryoshkaLoss
@misc{kusupati2024matryoshka,
title={Matryoshka Representation Learning},
author={Aditya Kusupati and Gantavya Bhatt and Aniket Rege and Matthew Wallingford and Aditya Sinha and Vivek Ramanujan and William Howard-Snyder and Kaifeng Chen and Sham Kakade and Prateek Jain and Ali Farhadi},
year={2024},
eprint={2205.13147},
archivePrefix={arXiv},
primaryClass={cs.LG}
}
MultipleNegativesRankingLoss
@misc{henderson2017efficient,
title={Efficient Natural Language Response Suggestion for Smart Reply},
author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
year={2017},
eprint={1705.00652},
archivePrefix={arXiv},
primaryClass={cs.CL}
}