---
tags:
- sentence-transformers
- sentence-similarity
- feature-extraction
- dense
- generated_from_trainer
- dataset_size:2000
- loss:MatryoshkaLoss
- loss:MultipleNegativesRankingLoss
base_model: sentence-transformers/all-mpnet-base-v2
widget:
- source_sentence: 'What methods have been attempted to improve resin bond strength
to irradiated dentin?
'
sentences:
- Patients with BHD syndrome may have concerns about communicating genetic risk
to their family members, especially if their family has different communication
patterns or cultural norms. Some patients may find it difficult to share information
about an inherited, potentially lethal disorder with their family members. It
is observed that families in which affected members have experienced significant
morbidity are more likely to pursue genetic testing and surveillance. However,
this phenomenon has not been systematically studied in the BHD population. Patients
may also worry that their family members are not motivated to pursue genetic testing
and surveillance. In these situations, patients can share medical papers and handouts
with their family members and inform them about the process to obtain genetic
testing. Additionally, patients can encourage their family members to attend scientific
meetings and connect with other BHD families through resources like the Myrovlytis
website. Cancer Genetic Counselors (CGC) and/or Advanced Practice Nurses in Genetics
(APNG) can also provide support and guidance to patients and their families in
coping with the psychosocial ramifications of BHD.
- Psychological stress has been found to have a significant impact on medical illness,
including ocular disease. While vision researchers have not fully embraced the
approach of psychoneuroimmunology in addressing ocular disease, it is clear that
no organ system is protected from the effects of negative emotional states. Stress
is more prevalent among the elderly, and conditions such as retirement, chronic
illness, loss of loved ones, and caregiver's stress can induce chronic debilitating
stress. Ophthalmologists should prioritize time with patients to establish a compassionate
rapport and address emotional factors that may contribute to ocular conditions.
Failure to do so compromises the individual's opportunity for healing.
- Many researchers have attempted to improve resin bond strength to irradiated dentin
by removing the denatured layer mechanically and chemically. However, efficient
methods for clinical application have not yet been established. The reduction
of dentin bonding strength is believed to be due to the denatured layer of dentin
surface, which has led to the exploration of various techniques to remove or mitigate
its effects.
- source_sentence: 'What are the clinical features of peripheral ossifying fibroma?
'
sentences:
- The management of intracranial hemorrhage after thrombolysis is still uncertain.
It is unclear whether patients with severe intracranial hemorrhage soon after
thrombolytic therapy should receive only supportive medical care or should be
aggressively managed with treatment of increased intracranial pressure, ventriculostomy,
or neurosurgical evacuation. The use of clinical decision-making aids, such as
Figure 1, may assist clinicians in making empirical decisions for these patients.
- When the diagnosis of HIT is confirmed, therapeutic doses of alternative non-heparin
anticoagulants are usually required. Heparin treatments must be stopped immediately,
including heparin-bonded catheters and heparin flushes. Patients should be given
a non-heparin anticoagulant such as direct thrombin inhibitors like Bivalirudin,
Argatroban, or Lepirudin. These inhibitors directly inhibit the actions of thrombin
and do not require a cofactor. They are active against both free and clot-bound
thrombin and do not interact with or produce heparin-dependent antibodies.
- Histopathological evaluation of biopsy specimens of peripheral ossifying fibroma
typically reveals intact or ulcerated stratified squamous surface epithelium,
potentially mature mineralized material, epithelial proliferation, benign fibrous
connective tissue with varying fibroblast content, myofibroblasts and collagen,
lamellar or woven osteoid, and cement-like material or dystrophic calcifications.
The presence of acute and chronic inflammatory cells may also be observed.
- source_sentence: 'What are the common clinical features and diagnostic criteria
of relapsing polychondritis?
'
sentences:
- Lethal complications of relapsing polychondritis are often associated with airway
or cardiovascular involvement. This can include complications such as aortic incompetence,
mitral regurgitation, pericarditis, cardiac ischemia, aneurysms of large arteries,
vasculitis of the central nervous system, phlebitis, and Raynaud's phenomenon.
Neurological and renal system involvement can also occur, although it is rare.
Regular follow-up and management are important to monitor and prevent potential
complications in patients with relapsing polychondritis.
- Media focus can contribute to the risk of burnout in managers. Burnout is a prolonged
response to chronic emotional and interpersonal stressors at work. The pressure
and scrutiny from the media can lead to feelings of exhaustion, cynicism, and
inefficacy, which are the three dimensions of burnout. Managers may respond to
increased pressure by becoming avoidant, narrow-minded, and hard on themselves,
their subordinates, and their families. They may also try to establish emotional
and cognitive distance from the pressuring situation. Ultimately, the exposure
to negative media focus with elements of personification can increase the risk
of burnout in some managers.
- Intrathymic injection of MBP has potential applications in various medical treatments.
It can be used in surgical brain injuries caused by cutting, electric coagulation,
suction, and traction to alleviate the secondary attack to the brain tissue and
reduce the auto-inflammation process triggered by the exposure of autoantigens.
It may also be beneficial for elective surgeries, such as intracranial tumor operations,
to induce immune tolerance and alleviate auto-inflammation. With the development
of minimally invasive operation techniques, intrathymic injection without exposing
the thorax can become a simple, efficient, and safe procedure. Further studies
are needed to investigate the potential applications of intrathymic injection
of MBP in vivo.
- source_sentence: 'What are some potential mechanisms by which quercetin may protect
against cancer?
'
sentences:
- There is a significant correlation between serum B2M levels and some biochemical
parameters, such as ALK, bilirubin, and INR, in patients with liver disease. However,
no significant correlation has been found between serum B2M levels and viral load
among patients with liver disease.
- When the diagnosis of HIT is confirmed, therapeutic doses of alternative non-heparin
anticoagulants are usually required. Heparin treatments must be stopped immediately,
including heparin-bonded catheters and heparin flushes. Patients should be given
a non-heparin anticoagulant such as direct thrombin inhibitors like Bivalirudin,
Argatroban, or Lepirudin. These inhibitors directly inhibit the actions of thrombin
and do not require a cofactor. They are active against both free and clot-bound
thrombin and do not interact with or produce heparin-dependent antibodies.
- Silymarin and Ginkgo biloba extract have been found to possess hepatoprotective
effects against NDEA-induced hepatocarcinogenesis. These extracts can scavenge
free radicals, prevent hepatocellular damage, and suppress the leakage of enzymes
through plasma membranes. They may also modify the biotransformation/detoxification
of NDEA, reducing its liver toxicity. Additionally, silymarin can reduce intracellular
ROS levels, prevent oxidative stress-induced cellular damage, and stimulate hepatic
cell proliferation for liver regeneration. These effects make silymarin and Ginkgo
biloba extract strong candidates as chemopreventive agents for liver cancer.
- source_sentence: 'What are the molecular mechanisms involved in the synergistic
induction of SAA by IL-1, TNF-α, and IL-6?
'
sentences:
- The complex formation of STAT3, NF-κB p65, and p300 is involved in the transcriptional
activity of the SAA1 gene. STAT3 and p300 are recruited to the SAA1 promoter region
in response to IL-6 or IL-1β + IL-6 stimulation. Co-expression of wild type p300
with wild type STAT3 enhances the luciferase activity of the SAA1 gene in a dose-dependent
manner. This suggests that the heteromeric complex formation of STAT3, NF-κB p65,
and p300 contributes to the transcriptional activity of the SAA1 gene.
- Intrathymic injection of MBP has potential applications in various medical treatments.
It can be used in surgical brain injuries caused by cutting, electric coagulation,
suction, and traction to alleviate the secondary attack to the brain tissue and
reduce the auto-inflammation process triggered by the exposure of autoantigens.
It may also be beneficial for elective surgeries, such as intracranial tumor operations,
to induce immune tolerance and alleviate auto-inflammation. With the development
of minimally invasive operation techniques, intrathymic injection without exposing
the thorax can become a simple, efficient, and safe procedure. Further studies
are needed to investigate the potential applications of intrathymic injection
of MBP in vivo.
- Phenotypic screens of approved drug collections and synergistic combinations can
be a useful approach for rapid identification of new therapeutics for drug-resistant
bacteria. This approach can also be applied to emerging outbreaks of infectious
diseases where vaccines and therapeutic agents are unavailable or unrealistic
to develop in a short period of time. By screening existing drugs and combinations,
new therapeutics can be identified and potentially repurposed for the treatment
of drug-resistant infections.
pipeline_tag: sentence-similarity
library_name: sentence-transformers
metrics:
- cosine_accuracy@1
- cosine_accuracy@3
- cosine_accuracy@5
- cosine_accuracy@10
- cosine_precision@1
- cosine_precision@3
- cosine_precision@5
- cosine_precision@10
- cosine_recall@1
- cosine_recall@3
- cosine_recall@5
- cosine_recall@10
- cosine_ndcg@10
- cosine_mrr@10
- cosine_map@100
model-index:
- name: SentenceTransformer based on sentence-transformers/all-mpnet-base-v2
results:
- task:
type: information-retrieval
name: Information Retrieval
dataset:
name: dim 768
type: dim_768
metrics:
- type: cosine_accuracy@1
value: 0.7775
name: Cosine Accuracy@1
- type: cosine_accuracy@3
value: 0.8885
name: Cosine Accuracy@3
- type: cosine_accuracy@5
value: 0.917
name: Cosine Accuracy@5
- type: cosine_accuracy@10
value: 0.947
name: Cosine Accuracy@10
- type: cosine_precision@1
value: 0.7775
name: Cosine Precision@1
- type: cosine_precision@3
value: 0.29616666666666663
name: Cosine Precision@3
- type: cosine_precision@5
value: 0.18340000000000004
name: Cosine Precision@5
- type: cosine_precision@10
value: 0.09470000000000002
name: Cosine Precision@10
- type: cosine_recall@1
value: 0.7775
name: Cosine Recall@1
- type: cosine_recall@3
value: 0.8885
name: Cosine Recall@3
- type: cosine_recall@5
value: 0.917
name: Cosine Recall@5
- type: cosine_recall@10
value: 0.947
name: Cosine Recall@10
- type: cosine_ndcg@10
value: 0.8637977392462012
name: Cosine Ndcg@10
- type: cosine_mrr@10
value: 0.8369255952380947
name: Cosine Mrr@10
- type: cosine_map@100
value: 0.8394380047776188
name: Cosine Map@100
- task:
type: information-retrieval
name: Information Retrieval
dataset:
name: dim 512
type: dim_512
metrics:
- type: cosine_accuracy@1
value: 0.7785
name: Cosine Accuracy@1
- type: cosine_accuracy@3
value: 0.8825
name: Cosine Accuracy@3
- type: cosine_accuracy@5
value: 0.917
name: Cosine Accuracy@5
- type: cosine_accuracy@10
value: 0.944
name: Cosine Accuracy@10
- type: cosine_precision@1
value: 0.7785
name: Cosine Precision@1
- type: cosine_precision@3
value: 0.29416666666666663
name: Cosine Precision@3
- type: cosine_precision@5
value: 0.18340000000000004
name: Cosine Precision@5
- type: cosine_precision@10
value: 0.09440000000000003
name: Cosine Precision@10
- type: cosine_recall@1
value: 0.7785
name: Cosine Recall@1
- type: cosine_recall@3
value: 0.8825
name: Cosine Recall@3
- type: cosine_recall@5
value: 0.917
name: Cosine Recall@5
- type: cosine_recall@10
value: 0.944
name: Cosine Recall@10
- type: cosine_ndcg@10
value: 0.8623716893141778
name: Cosine Ndcg@10
- type: cosine_mrr@10
value: 0.8360055555555553
name: Cosine Mrr@10
- type: cosine_map@100
value: 0.8388749447751291
name: Cosine Map@100
- task:
type: information-retrieval
name: Information Retrieval
dataset:
name: dim 128
type: dim_128
metrics:
- type: cosine_accuracy@1
value: 0.7555
name: Cosine Accuracy@1
- type: cosine_accuracy@3
value: 0.8655
name: Cosine Accuracy@3
- type: cosine_accuracy@5
value: 0.9145
name: Cosine Accuracy@5
- type: cosine_accuracy@10
value: 0.943
name: Cosine Accuracy@10
- type: cosine_precision@1
value: 0.7555
name: Cosine Precision@1
- type: cosine_precision@3
value: 0.2884999999999999
name: Cosine Precision@3
- type: cosine_precision@5
value: 0.18290000000000003
name: Cosine Precision@5
- type: cosine_precision@10
value: 0.09430000000000001
name: Cosine Precision@10
- type: cosine_recall@1
value: 0.7555
name: Cosine Recall@1
- type: cosine_recall@3
value: 0.8655
name: Cosine Recall@3
- type: cosine_recall@5
value: 0.9145
name: Cosine Recall@5
- type: cosine_recall@10
value: 0.943
name: Cosine Recall@10
- type: cosine_ndcg@10
value: 0.8499528413626729
name: Cosine Ndcg@10
- type: cosine_mrr@10
value: 0.8199301587301584
name: Cosine Mrr@10
- type: cosine_map@100
value: 0.8224780775804242
name: Cosine Map@100
- task:
type: information-retrieval
name: Information Retrieval
dataset:
name: dim 64
type: dim_64
metrics:
- type: cosine_accuracy@1
value: 0.714
name: Cosine Accuracy@1
- type: cosine_accuracy@3
value: 0.8365
name: Cosine Accuracy@3
- type: cosine_accuracy@5
value: 0.877
name: Cosine Accuracy@5
- type: cosine_accuracy@10
value: 0.9285
name: Cosine Accuracy@10
- type: cosine_precision@1
value: 0.714
name: Cosine Precision@1
- type: cosine_precision@3
value: 0.27883333333333327
name: Cosine Precision@3
- type: cosine_precision@5
value: 0.1754
name: Cosine Precision@5
- type: cosine_precision@10
value: 0.09285
name: Cosine Precision@10
- type: cosine_recall@1
value: 0.714
name: Cosine Recall@1
- type: cosine_recall@3
value: 0.8365
name: Cosine Recall@3
- type: cosine_recall@5
value: 0.877
name: Cosine Recall@5
- type: cosine_recall@10
value: 0.9285
name: Cosine Recall@10
- type: cosine_ndcg@10
value: 0.8195584918161248
name: Cosine Ndcg@10
- type: cosine_mrr@10
value: 0.7848236111111104
name: Cosine Mrr@10
- type: cosine_map@100
value: 0.7878148778237813
name: Cosine Map@100
---
# SentenceTransformer based on sentence-transformers/all-mpnet-base-v2
This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [sentence-transformers/all-mpnet-base-v2](https://huggingface.co/sentence-transformers/all-mpnet-base-v2). It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
## Model Details
### Model Description
- **Model Type:** Sentence Transformer
- **Base model:** [sentence-transformers/all-mpnet-base-v2](https://huggingface.co/sentence-transformers/all-mpnet-base-v2) <!-- at revision e8c3b32edf5434bc2275fc9bab85f82640a19130 -->
- **Maximum Sequence Length:** 384 tokens
- **Output Dimensionality:** 768 dimensions
- **Similarity Function:** Cosine Similarity
<!-- - **Training Dataset:** Unknown -->
<!-- - **Language:** Unknown -->
<!-- - **License:** Unknown -->
### Model Sources
- **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
- **Repository:** [Sentence Transformers on GitHub](https://github.com/huggingface/sentence-transformers)
- **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
### Full Model Architecture
```
SentenceTransformer(
(0): Transformer({'max_seq_length': 384, 'do_lower_case': False, 'architecture': 'MPNetModel'})
(1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
(2): Normalize()
)
```
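The `Pooling` module above mean-pools token embeddings over non-padding positions, and the final `Normalize` module rescales each vector to unit length. A minimal numpy sketch of those two steps (the token embeddings and attention mask here are illustrative placeholders, not real model outputs):

```python
import numpy as np

def mean_pool_and_normalize(token_embeddings: np.ndarray, attention_mask: np.ndarray) -> np.ndarray:
    """Mean-pool token embeddings over non-padding positions, then L2-normalize.

    token_embeddings: (batch, seq_len, dim)
    attention_mask:   (batch, seq_len), 1 for real tokens, 0 for padding
    """
    mask = attention_mask[..., None].astype(token_embeddings.dtype)  # (batch, seq_len, 1)
    summed = (token_embeddings * mask).sum(axis=1)                   # sum over real tokens
    counts = np.clip(mask.sum(axis=1), 1e-9, None)                   # avoid division by zero
    pooled = summed / counts                                         # mean over real tokens
    return pooled / np.linalg.norm(pooled, axis=1, keepdims=True)    # unit-length vectors

# Toy batch: 2 sequences, 3 token positions, 4 dimensions
rng = np.random.default_rng(0)
tok = rng.normal(size=(2, 3, 4))
mask = np.array([[1, 1, 0], [1, 1, 1]])  # first sequence has one padding position
emb = mean_pool_and_normalize(tok, mask)
print(emb.shape)  # (2, 4)
```

Because the output vectors are unit-length, cosine similarity between two embeddings reduces to a plain dot product.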
## Usage
### Direct Usage (Sentence Transformers)
First install the Sentence Transformers library:
```bash
pip install -U sentence-transformers
```
Then you can load this model and run inference.
```python
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("mtien/miriad-embedding")
# Run inference
sentences = [
'What are the molecular mechanisms involved in the synergistic induction of SAA by IL-1, TNF-α, and IL-6?\n',
'The complex formation of STAT3, NF-κB p65, and p300 is involved in the transcriptional activity of the SAA1 gene. STAT3 and p300 are recruited to the SAA1 promoter region in response to IL-6 or IL-1β + IL-6 stimulation. Co-expression of wild type p300 with wild type STAT3 enhances the luciferase activity of the SAA1 gene in a dose-dependent manner. This suggests that the heteromeric complex formation of STAT3, NF-κB p65, and p300 contributes to the transcriptional activity of the SAA1 gene.',
'Phenotypic screens of approved drug collections and synergistic combinations can be a useful approach for rapid identification of new therapeutics for drug-resistant bacteria. This approach can also be applied to emerging outbreaks of infectious diseases where vaccines and therapeutic agents are unavailable or unrealistic to develop in a short period of time. By screening existing drugs and combinations, new therapeutics can be identified and potentially repurposed for the treatment of drug-resistant infections.',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# (3, 768)
# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities)
# tensor([[1.0000, 0.7925, 0.1356],
# [0.7925, 1.0000, 0.1694],
# [0.1356, 0.1694, 1.0000]])
```
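Because this model was trained with MatryoshkaLoss, its embeddings can be truncated to a shorter prefix (512, 128, or 64 dimensions) and re-normalized, trading a small amount of retrieval quality for much cheaper storage and search. Recent sentence-transformers versions support this directly via the `truncate_dim` argument to `SentenceTransformer`; the truncate-and-renormalize step itself is simple enough to sketch in numpy (the embeddings below are random placeholders standing in for `model.encode(...)` output):

```python
import numpy as np

def truncate_embeddings(embeddings: np.ndarray, dim: int) -> np.ndarray:
    """Keep the first `dim` components of each embedding, then re-normalize to unit length."""
    truncated = embeddings[:, :dim]
    return truncated / np.linalg.norm(truncated, axis=1, keepdims=True)

# Placeholder full-dimensional, unit-length embeddings
rng = np.random.default_rng(42)
full = rng.normal(size=(3, 768))
full /= np.linalg.norm(full, axis=1, keepdims=True)

small = truncate_embeddings(full, 128)
print(small.shape)          # (3, 128)
sims = small @ small.T      # cosine similarity is a dot product on unit vectors
```

The evaluation tables below show how retrieval quality degrades gracefully as the dimension shrinks from 768 down to 64.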
<!--
### Direct Usage (Transformers)
<details><summary>Click to see the direct usage in Transformers</summary>
</details>
-->
<!--
### Downstream Usage (Sentence Transformers)
You can finetune this model on your own dataset.
<details><summary>Click to expand</summary>
</details>
-->
<!--
### Out-of-Scope Use
*List how the model may foreseeably be misused and address what users ought not to do with the model.*
-->
## Evaluation
### Metrics
#### Information Retrieval
* Dataset: `dim_768`
* Evaluated with [<code>InformationRetrievalEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.InformationRetrievalEvaluator) with these parameters:
```json
{
"truncate_dim": 768
}
```
| Metric | Value |
|:--------------------|:-----------|
| cosine_accuracy@1 | 0.7775 |
| cosine_accuracy@3 | 0.8885 |
| cosine_accuracy@5 | 0.917 |
| cosine_accuracy@10 | 0.947 |
| cosine_precision@1 | 0.7775 |
| cosine_precision@3 | 0.2962 |
| cosine_precision@5 | 0.1834 |
| cosine_precision@10 | 0.0947 |
| cosine_recall@1 | 0.7775 |
| cosine_recall@3 | 0.8885 |
| cosine_recall@5 | 0.917 |
| cosine_recall@10 | 0.947 |
| **cosine_ndcg@10** | **0.8638** |
| cosine_mrr@10 | 0.8369 |
| cosine_map@100 | 0.8394 |
#### Information Retrieval
* Dataset: `dim_512`
* Evaluated with [<code>InformationRetrievalEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.InformationRetrievalEvaluator) with these parameters:
```json
{
"truncate_dim": 512
}
```
| Metric | Value |
|:--------------------|:-----------|
| cosine_accuracy@1 | 0.7785 |
| cosine_accuracy@3 | 0.8825 |
| cosine_accuracy@5 | 0.917 |
| cosine_accuracy@10 | 0.944 |
| cosine_precision@1 | 0.7785 |
| cosine_precision@3 | 0.2942 |
| cosine_precision@5 | 0.1834 |
| cosine_precision@10 | 0.0944 |
| cosine_recall@1 | 0.7785 |
| cosine_recall@3 | 0.8825 |
| cosine_recall@5 | 0.917 |
| cosine_recall@10 | 0.944 |
| **cosine_ndcg@10** | **0.8624** |
| cosine_mrr@10 | 0.836 |
| cosine_map@100 | 0.8389 |
#### Information Retrieval
* Dataset: `dim_128`
* Evaluated with [<code>InformationRetrievalEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.InformationRetrievalEvaluator) with these parameters:
```json
{
"truncate_dim": 128
}
```
| Metric | Value |
|:--------------------|:---------|
| cosine_accuracy@1 | 0.7555 |
| cosine_accuracy@3 | 0.8655 |
| cosine_accuracy@5 | 0.9145 |
| cosine_accuracy@10 | 0.943 |
| cosine_precision@1 | 0.7555 |
| cosine_precision@3 | 0.2885 |
| cosine_precision@5 | 0.1829 |
| cosine_precision@10 | 0.0943 |
| cosine_recall@1 | 0.7555 |
| cosine_recall@3 | 0.8655 |
| cosine_recall@5 | 0.9145 |
| cosine_recall@10 | 0.943 |
| **cosine_ndcg@10** | **0.85** |
| cosine_mrr@10 | 0.8199 |
| cosine_map@100 | 0.8225 |
#### Information Retrieval
* Dataset: `dim_64`
* Evaluated with [<code>InformationRetrievalEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.InformationRetrievalEvaluator) with these parameters:
```json
{
"truncate_dim": 64
}
```
| Metric | Value |
|:--------------------|:-----------|
| cosine_accuracy@1 | 0.714 |
| cosine_accuracy@3 | 0.8365 |
| cosine_accuracy@5 | 0.877 |
| cosine_accuracy@10 | 0.9285 |
| cosine_precision@1 | 0.714 |
| cosine_precision@3 | 0.2788 |
| cosine_precision@5 | 0.1754 |
| cosine_precision@10 | 0.0929 |
| cosine_recall@1 | 0.714 |
| cosine_recall@3 | 0.8365 |
| cosine_recall@5 | 0.877 |
| cosine_recall@10 | 0.9285 |
| **cosine_ndcg@10** | **0.8196** |
| cosine_mrr@10 | 0.7848 |
| cosine_map@100 | 0.7878 |
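In this evaluation each query has a single relevant document, which is why accuracy@k and recall@k coincide in the tables above. Under that assumption the @k metrics reduce to simple rank checks, sketched here on a toy ranked result list:

```python
def ir_metrics_at_k(ranked_ids, relevant_id, k):
    """Accuracy@k, precision@k, recall@k when each query has exactly one relevant document."""
    hit = relevant_id in ranked_ids[:k]
    accuracy = 1.0 if hit else 0.0
    precision = (1.0 / k) if hit else 0.0  # at most one relevant doc among the k retrieved
    recall = accuracy                      # single relevant doc: recall@k == accuracy@k
    return accuracy, precision, recall

def mrr_at_k(ranked_ids, relevant_id, k=10):
    """Reciprocal rank of the relevant document, 0 if it is not in the top k."""
    for rank, doc_id in enumerate(ranked_ids[:k], start=1):
        if doc_id == relevant_id:
            return 1.0 / rank
    return 0.0

# Toy ranking: the relevant document "d2" is retrieved at rank 3
ranking = ["d7", "d4", "d2", "d9", "d1"]
print(ir_metrics_at_k(ranking, "d2", 3))  # accuracy 1.0, precision 1/3, recall 1.0
print(mrr_at_k(ranking, "d2"))            # reciprocal rank 1/3
```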
<!--
## Bias, Risks and Limitations
*What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
-->
<!--
### Recommendations
*What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
-->
## Training Details
### Training Dataset
#### Unnamed Dataset
* Size: 2,000 training samples
* Columns: <code>anchor</code> and <code>positive</code>
* Approximate statistics based on the first 1000 samples:
| | anchor | positive |
|:--------|:----------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
| type | string | string |
| details | <ul><li>min: 8 tokens</li><li>mean: 20.92 tokens</li><li>max: 51 tokens</li></ul> | <ul><li>min: 30 tokens</li><li>mean: 116.22 tokens</li><li>max: 227 tokens</li></ul> |
* Samples:
| anchor | positive |
|:------------------------------------------------------------------------------------------------------------|:------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| <code>What are the common clinical features and diagnostic criteria of relapsing polychondritis?<br></code> | <code>Lethal complications of relapsing polychondritis are often associated with airway or cardiovascular involvement. This can include complications such as aortic incompetence, mitral regurgitation, pericarditis, cardiac ischemia, aneurysms of large arteries, vasculitis of the central nervous system, phlebitis, and Raynaud's phenomenon. Neurological and renal system involvement can also occur, although it is rare. Regular follow-up and management are important to monitor and prevent potential complications in patients with relapsing polychondritis.</code> |
| <code>What are the treatment options for relapsing polychondritis?<br></code> | <code>Lethal complications of relapsing polychondritis are often associated with airway or cardiovascular involvement. This can include complications such as aortic incompetence, mitral regurgitation, pericarditis, cardiac ischemia, aneurysms of large arteries, vasculitis of the central nervous system, phlebitis, and Raynaud's phenomenon. Neurological and renal system involvement can also occur, although it is rare. Regular follow-up and management are important to monitor and prevent potential complications in patients with relapsing polychondritis.</code> |
| <code>What are the potential complications associated with relapsing polychondritis?<br></code> | <code>Lethal complications of relapsing polychondritis are often associated with airway or cardiovascular involvement. This can include complications such as aortic incompetence, mitral regurgitation, pericarditis, cardiac ischemia, aneurysms of large arteries, vasculitis of the central nervous system, phlebitis, and Raynaud's phenomenon. Neurological and renal system involvement can also occur, although it is rare. Regular follow-up and management are important to monitor and prevent potential complications in patients with relapsing polychondritis.</code> |
* Loss: [<code>MatryoshkaLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#matryoshkaloss) with these parameters:
```json
{
"loss": "MultipleNegativesRankingLoss",
"matryoshka_dims": [
768,
512,
128,
64
],
"matryoshka_weights": [
1,
1,
1,
1
],
"n_dims_per_step": -1
}
```
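MultipleNegativesRankingLoss treats each anchor's paired positive as the correct "class" and every other positive in the batch as an in-batch negative; MatryoshkaLoss applies that loss to truncated prefixes of the embeddings and sums the weighted results. A minimal numpy sketch of the combination (the scale of 20 mirrors the sentence-transformers default for cosine scores; the embeddings are random placeholders):

```python
import numpy as np

def mnr_loss(anchors: np.ndarray, positives: np.ndarray, scale: float = 20.0) -> float:
    """In-batch-negatives cross-entropy: anchor i should rank positive i above all others."""
    a = anchors / np.linalg.norm(anchors, axis=1, keepdims=True)
    p = positives / np.linalg.norm(positives, axis=1, keepdims=True)
    scores = scale * (a @ p.T)                       # (batch, batch) scaled cosine scores
    scores -= scores.max(axis=1, keepdims=True)      # numerical stability
    log_softmax = scores - np.log(np.exp(scores).sum(axis=1, keepdims=True))
    return float(-np.mean(np.diag(log_softmax)))     # NLL of each anchor's matching positive

def matryoshka_mnr_loss(anchors, positives, dims=(768, 512, 128, 64), weights=(1, 1, 1, 1)):
    """Sum the weighted ranking loss over truncated embedding prefixes, as MatryoshkaLoss does."""
    return sum(w * mnr_loss(anchors[:, :d], positives[:, :d]) for d, w in zip(dims, weights))

rng = np.random.default_rng(0)
anchors = rng.normal(size=(16, 768))
positives = rng.normal(size=(16, 768))
loss = matryoshka_mnr_loss(anchors, positives)
print(round(loss, 4))
```

Training at all four dimensions simultaneously is what lets the embeddings be truncated at inference time with limited quality loss.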
### Training Hyperparameters
#### Non-Default Hyperparameters
- `eval_strategy`: steps
- `per_device_train_batch_size`: 16
- `gradient_accumulation_steps`: 4
- `learning_rate`: 2e-05
- `num_train_epochs`: 1
- `lr_scheduler_type`: cosine
- `warmup_ratio`: 0.1
- `warmup_steps`: 0.1
- `bf16`: True
- `load_best_model_at_end`: True
- `batch_sampler`: no_duplicates
#### All Hyperparameters
<details><summary>Click to expand</summary>
- `do_predict`: False
- `eval_strategy`: steps
- `prediction_loss_only`: True
- `per_device_train_batch_size`: 16
- `per_device_eval_batch_size`: 8
- `gradient_accumulation_steps`: 4
- `eval_accumulation_steps`: None
- `torch_empty_cache_steps`: None
- `learning_rate`: 2e-05
- `weight_decay`: 0.0
- `adam_beta1`: 0.9
- `adam_beta2`: 0.999
- `adam_epsilon`: 1e-08
- `max_grad_norm`: 1.0
- `num_train_epochs`: 1
- `max_steps`: -1
- `lr_scheduler_type`: cosine
- `lr_scheduler_kwargs`: None
- `warmup_ratio`: 0.1
- `warmup_steps`: 0.1
- `log_level`: passive
- `log_level_replica`: warning
- `log_on_each_node`: True
- `logging_nan_inf_filter`: True
- `enable_jit_checkpoint`: False
- `save_on_each_node`: False
- `save_only_model`: False
- `restore_callback_states_from_checkpoint`: False
- `use_cpu`: False
- `seed`: 42
- `data_seed`: None
- `bf16`: True
- `fp16`: False
- `bf16_full_eval`: False
- `fp16_full_eval`: False
- `tf32`: None
- `local_rank`: -1
- `ddp_backend`: None
- `debug`: []
- `dataloader_drop_last`: False
- `dataloader_num_workers`: 0
- `dataloader_prefetch_factor`: None
- `disable_tqdm`: False
- `remove_unused_columns`: True
- `label_names`: None
- `load_best_model_at_end`: True
- `ignore_data_skip`: False
- `fsdp`: []
- `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
- `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
- `parallelism_config`: None
- `deepspeed`: None
- `label_smoothing_factor`: 0.0
- `optim`: adamw_torch_fused
- `optim_args`: None
- `group_by_length`: False
- `length_column_name`: length
- `project`: huggingface
- `trackio_space_id`: trackio
- `ddp_find_unused_parameters`: None
- `ddp_bucket_cap_mb`: None
- `ddp_broadcast_buffers`: False
- `dataloader_pin_memory`: True
- `dataloader_persistent_workers`: False
- `skip_memory_metrics`: True
- `push_to_hub`: False
- `resume_from_checkpoint`: None
- `hub_model_id`: None
- `hub_strategy`: every_save
- `hub_private_repo`: None
- `hub_always_push`: False
- `hub_revision`: None
- `gradient_checkpointing`: False
- `gradient_checkpointing_kwargs`: None
- `include_for_metrics`: []
- `eval_do_concat_batches`: True
- `auto_find_batch_size`: False
- `full_determinism`: False
- `ddp_timeout`: 1800
- `torch_compile`: False
- `torch_compile_backend`: None
- `torch_compile_mode`: None
- `include_num_input_tokens_seen`: no
- `neftune_noise_alpha`: None
- `optim_target_modules`: None
- `batch_eval_metrics`: False
- `eval_on_start`: False
- `use_liger_kernel`: False
- `liger_kernel_config`: None
- `eval_use_gather_object`: False
- `average_tokens_across_devices`: True
- `use_cache`: False
- `prompts`: None
- `batch_sampler`: no_duplicates
- `multi_dataset_batch_sampler`: proportional
- `router_mapping`: {}
- `learning_rate_mapping`: {}
</details>
### Training Logs
| Epoch | Step | Training Loss | dim_768_cosine_ndcg@10 | dim_512_cosine_ndcg@10 | dim_128_cosine_ndcg@10 | dim_64_cosine_ndcg@10 |
|:-----:|:----:|:-------------:|:----------------------:|:----------------------:|:----------------------:|:---------------------:|
| -1 | -1 | - | 0.8142 | 0.8058 | 0.7676 | 0.7053 |
| 0.032 | 1 | 1.5764 | 0.8146 | 0.8055 | 0.7669 | 0.7049 |
| 0.064 | 2 | 2.6620 | 0.8162 | 0.8077 | 0.7690 | 0.7086 |
| 0.096 | 3 | 1.9032 | 0.8204 | 0.8126 | 0.7759 | 0.7173 |
| 0.128 | 4 | 1.6601 | 0.8252 | 0.8177 | 0.7849 | 0.7282 |
| 0.16 | 5 | 1.1083 | 0.8315 | 0.8251 | 0.7902 | 0.7419 |
| 0.192 | 6 | 2.7345 | 0.8361 | 0.8317 | 0.7970 | 0.7510 |
| 0.224 | 7 | 1.2922 | 0.8375 | 0.8351 | 0.8025 | 0.7620 |
| 0.256 | 8 | 1.6647 | 0.8399 | 0.8367 | 0.8080 | 0.7686 |
| 0.288 | 9 | 1.1997 | 0.8425 | 0.8398 | 0.8133 | 0.7754 |
| 0.32 | 10 | 0.8064 | 0.8441 | 0.8419 | 0.8181 | 0.7799 |
| 0.352 | 11 | 1.1935 | 0.8468 | 0.8442 | 0.8220 | 0.7843 |
| 0.384 | 12 | 0.7776 | 0.8482 | 0.8462 | 0.8242 | 0.7886 |
| 0.416 | 13 | 0.9272 | 0.8494 | 0.8484 | 0.8261 | 0.7940 |
| 0.448 | 14 | 1.2406 | 0.8510 | 0.8502 | 0.8294 | 0.7978 |
| 0.48 | 15 | 1.0830 | 0.8520 | 0.8518 | 0.8325 | 0.7999 |
| 0.512 | 16 | 1.9336 | 0.8534 | 0.8532 | 0.8340 | 0.8017 |
| 0.544 | 17 | 1.2190 | 0.8541 | 0.8537 | 0.8360 | 0.8026 |
| 0.576 | 18 | 1.7060 | 0.8554 | 0.8545 | 0.8388 | 0.8063 |
| 0.608 | 19 | 1.4131 | 0.8571 | 0.8561 | 0.8412 | 0.8084 |
| 0.64 | 20 | 1.1700 | 0.8581 | 0.8569 | 0.8429 | 0.8101 |
| 0.672 | 21 | 0.5671 | 0.8599 | 0.8580 | 0.8445 | 0.8118 |
| 0.704 | 22 | 1.4699 | 0.8613 | 0.8596 | 0.8455 | 0.8140 |
| 0.736 | 23 | 1.6544 | 0.8620 | 0.8608 | 0.8463 | 0.8158 |
| 0.768 | 24 | 2.0854 | 0.8624 | 0.8614 | 0.8476 | 0.8169 |
| 0.8 | 25 | 0.9175 | 0.8630 | 0.8616 | 0.8484 | 0.8180 |
| 0.832 | 26 | 1.3673 | 0.8632 | 0.8615 | 0.8485 | 0.8182 |
| 0.864 | 27 | 1.2114 | 0.8637 | 0.8617 | 0.8491 | 0.8190 |
| 0.896 | 28 | 0.9807 | 0.8637 | 0.8620 | 0.8497 | 0.8190 |
| 0.928 | 29 | 0.9052 | 0.8635 | 0.8620 | 0.8497 | 0.8192 |
| 0.96 | 30 | 1.7420 | 0.8640 | 0.8624 | 0.8500 | 0.8194 |
| 0.992 | 31 | 1.3071 | 0.8640 | 0.8622 | 0.8497 | 0.8193 |
| 1.0 | 32 | 1.3117 | 0.8638 | 0.8624 | 0.8500 | 0.8196 |
### Framework Versions
- Python: 3.12.12
- Sentence Transformers: 5.2.3
- Transformers: 5.0.0
- PyTorch: 2.10.0+cu128
- Accelerate: 1.12.0
- Datasets: 4.0.0
- Tokenizers: 0.22.2
## Citation
### BibTeX
#### Sentence Transformers
```bibtex
@inproceedings{reimers-2019-sentence-bert,
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
author = "Reimers, Nils and Gurevych, Iryna",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
month = "11",
year = "2019",
publisher = "Association for Computational Linguistics",
url = "https://arxiv.org/abs/1908.10084",
}
```
#### MatryoshkaLoss
```bibtex
@misc{kusupati2024matryoshka,
title={Matryoshka Representation Learning},
author={Aditya Kusupati and Gantavya Bhatt and Aniket Rege and Matthew Wallingford and Aditya Sinha and Vivek Ramanujan and William Howard-Snyder and Kaifeng Chen and Sham Kakade and Prateek Jain and Ali Farhadi},
year={2024},
eprint={2205.13147},
archivePrefix={arXiv},
primaryClass={cs.LG}
}
```
#### MultipleNegativesRankingLoss
```bibtex
@misc{henderson2017efficient,
title={Efficient Natural Language Response Suggestion for Smart Reply},
author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
year={2017},
eprint={1705.00652},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
```
<!--
## Glossary
*Clearly define terms in order to be accessible across audiences.*
-->
<!--
## Model Card Authors
*Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
-->
<!--
## Model Card Contact
*Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
-->