---
tags:
- sentence-transformers
- sentence-similarity
- feature-extraction
- dense
- generated_from_trainer
- dataset_size:5424
- loss:MultipleNegativesRankingLoss
base_model: cambridgeltl/SapBERT-from-PubMedBERT-fulltext
widget:
- source_sentence: liver injury [SEP] d up all transplant-free survivors of paracetamol-induced
acute liver injury, hospitalized in a Danish national referral centre during 1984-
sentences:
- 'Drug-Induced Liver Injury [SEP] A spectrum of clinical liver diseases ranging
from mild biochemical abnormalities to ACUTE LIVER FAILURE, caused by drugs, drug '
- "Venous Thrombosis [SEP] The formation or presence of a blood clot (THROMBUS)\
\ within a vein.\n "
- "Isoflurophate [SEP] A di-isopropyl-fluorophosphate which is an irreversible cholinesterase\
\ inhibitor used to investigate the NERVOUS SYSTEM.\n "
- source_sentence: renal impairment [SEP] 6, 95% CI 1.57-2.44) in patients with diabetes.
A lower risk of renal impairment was seen in both groups with beta-blocker therapy
(RR 0.70, 95%
sentences:
- Acetylcholine [SEP] A neurotransmitter found at neuromuscular junctions, autonomic
ganglia, parasympathetic effector junctions, a subset of sympathe
- Pilocarpine [SEP] A slowly hydrolyzed muscarinic agonist with no nicotinic effects.
Pilocarpine is used as a miotic and in the treatment of glauco
- 'Renal Insufficiency [SEP] Conditions in which the KIDNEYS perform below the normal
level in the ability to remove wastes, concentrate URINE, and maintain '
- source_sentence: 'grand mal seizures [SEP] MMARY: A 46-year-old African-American
man experienced recurrent grand mal seizures during intravenous infusion of amphotericin
B, then petit mal s'
sentences:
- 'Lithium Carbonate [SEP] A lithium salt, classified as a mood-stabilizing agent.
Lithium ion alters the metabolism of BIOGENIC MONOAMINES in the CENTRAL '
- Epilepsy, Tonic-Clonic [SEP] A generalized seizure disorder characterized by recurrent
major motor seizures. The initial brief tonic phase is marked by trunk
- Neurotoxicity Syndromes [SEP] Neurologic disorders caused by exposure to toxic
substances through ingestion, injection, cutaneous application, or other method
- source_sentence: 'seizure [SEP] OBJECTIVE: To report a case of multiple episodes
of seizure activity in an AIDS patent following amphotericin B infusion. C'
sentences:
- Catalepsy [SEP] A condition characterized by inactivity, decreased responsiveness
to stimuli, and a tendency to maintain an immobile posture. Th
- Seizures [SEP] Clinical or subclinical disturbances of cortical function due to
a sudden, abnormal, excessive, and disorganized discharge of br
- 'ammonium acetate [SEP] '
- source_sentence: insomnia [SEP] pressive symptoms was admitted to a psychiatric
hospital due to insomnia, loss of appetite, exhaustion, and agitation. Medical
treatment
sentences:
- Atrioventricular Block [SEP] Impaired impulse conduction from HEART ATRIA to HEART
VENTRICLES. AV block can mean delayed or completely blocked impulse conduc
- "Sodium [SEP] A member of the alkali group of metals. It has the atomic symbol\
\ Na, atomic number 11, and atomic weight 23.\n "
- Sleep Initiation and Maintenance Disorders [SEP] Disorders characterized by impairment
of the ability to initiate or maintain sleep. This may occur as a primary disorder
or in a
datasets:
- Stevenf232/BC5CDR_MeSH2015_complete
pipeline_tag: sentence-similarity
library_name: sentence-transformers
---
# SentenceTransformer based on cambridgeltl/SapBERT-from-PubMedBERT-fulltext
This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [cambridgeltl/SapBERT-from-PubMedBERT-fulltext](https://huggingface.co/cambridgeltl/SapBERT-from-PubMedBERT-fulltext) on the [bc5_cdr_me_sh2015_complete](https://huggingface.co/datasets/Stevenf232/BC5CDR_MeSH2015_complete) dataset. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
## Model Details
### Model Description
- **Model Type:** Sentence Transformer
- **Base model:** [cambridgeltl/SapBERT-from-PubMedBERT-fulltext](https://huggingface.co/cambridgeltl/SapBERT-from-PubMedBERT-fulltext)
- **Maximum Sequence Length:** 512 tokens
- **Output Dimensionality:** 768 dimensions
- **Similarity Function:** Cosine Similarity
- **Training Dataset:**
- [bc5_cdr_me_sh2015_complete](https://huggingface.co/datasets/Stevenf232/BC5CDR_MeSH2015_complete)
### Model Sources
- **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
- **Repository:** [Sentence Transformers on GitHub](https://github.com/huggingface/sentence-transformers)
- **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
### Full Model Architecture
```
SentenceTransformer(
(0): Transformer({'max_seq_length': 512, 'do_lower_case': False, 'architecture': 'BertModel'})
(1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
)
```
## Usage
### Direct Usage (Sentence Transformers)
First install the Sentence Transformers library:
```bash
pip install -U sentence-transformers
```
Then you can load this model and run inference.
```python
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("Stevenf232/SapBERT_MultipleNegativesRankingLoss_BC5CDR_Context")
# Run inference
sentences = [
'insomnia [SEP] pressive symptoms was admitted to a psychiatric hospital due to insomnia, loss of appetite, exhaustion, and agitation. Medical treatment',
'Sleep Initiation and Maintenance Disorders [SEP] Disorders characterized by impairment of the ability to initiate or maintain sleep. This may occur as a primary disorder or in a',
'Atrioventricular Block [SEP] Impaired impulse conduction from HEART ATRIA to HEART VENTRICLES. AV block can mean delayed or completely blocked impulse conduc',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 768]
# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities)
# tensor([[1.0000, 0.8093, 0.1453],
# [0.8093, 1.0000, 0.1948],
# [0.1453, 0.1948, 1.0000]])
```
## Training Details
### Training Dataset
#### bc5_cdr_me_sh2015_complete
* Dataset: [bc5_cdr_me_sh2015_complete](https://huggingface.co/datasets/Stevenf232/BC5CDR_MeSH2015_complete) at [f40f655](https://huggingface.co/datasets/Stevenf232/BC5CDR_MeSH2015_complete/tree/f40f655ae0d844cb1bd1db8b25819616af991cb0)
* Size: 5,424 training samples
* Columns: sentence1, sentence2, and label
* Approximate statistics based on the first 1000 samples:
| | sentence1 | sentence2 | label |
|:--------|:----------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|:-----------------------------|
| type | string | string | int |
| details |
Naloxone [SEP] Naloxone reverses the antihypertensive effect of clonidine. | Naloxone [SEP] A specific opiate antagonist that has no agonist activity. It is a competitive antagonist at mu, delta, and kappa opioid recepto | 1 |
| clonidine [SEP] Naloxone reverses the antihypertensive effect of clonidine. | Clonidine [SEP] An imidazoline sympatholytic agent that stimulates ALPHA-2 ADRENERGIC RECEPTORS and central IMIDAZOLINE RECEPTORS. It is commonl | 1 |
| hypertensive [SEP] In unanesthetized, spontaneously hypertensive rats the decrease in blood pressure and heart rate produced by | Hypertension [SEP] Persistently high systemic arterial BLOOD PRESSURE. Based on multiple readings (BLOOD PRESSURE DETERMINATION), hypertension is c | 1 |
* Loss: [MultipleNegativesRankingLoss](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
```json
{
"scale": 20.0,
"similarity_fct": "cos_sim",
"gather_across_devices": false
}
```
### Evaluation Dataset
#### bc5_cdr_me_sh2015_complete
* Dataset: [bc5_cdr_me_sh2015_complete](https://huggingface.co/datasets/Stevenf232/BC5CDR_MeSH2015_complete) at [f40f655](https://huggingface.co/datasets/Stevenf232/BC5CDR_MeSH2015_complete/tree/f40f655ae0d844cb1bd1db8b25819616af991cb0)
* Size: 5,445 evaluation samples
* Columns: sentence1, sentence2, and label
* Approximate statistics based on the first 1000 samples:
| | sentence1 | sentence2 | label |
|:--------|:------------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|:-----------------------------|
| type | string | string | int |
| details | Tricuspid valve regurgitation [SEP] Tricuspid valve regurgitation and lithium carbonate toxicity in a newborn infant. | Tricuspid Valve Insufficiency [SEP] Backflow of blood from the RIGHT VENTRICLE into the RIGHT ATRIUM due to imperfect closure of the TRICUSPID VALVE.
| 1 |
| lithium carbonate [SEP] Tricuspid valve regurgitation and lithium carbonate toxicity in a newborn infant. | Lithium Carbonate [SEP] A lithium salt, classified as a mood-stabilizing agent. Lithium ion alters the metabolism of BIOGENIC MONOAMINES in the CENTRAL | 1 |
| toxicity [SEP] Tricuspid valve regurgitation and lithium carbonate toxicity in a newborn infant. | Drug-Related Side Effects and Adverse Reactions [SEP] Disorders that result from the intended use of PHARMACEUTICAL PREPARATIONS. Included in this heading are a broad variety of chem | 1 |
* Loss: [MultipleNegativesRankingLoss](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
```json
{
"scale": 20.0,
"similarity_fct": "cos_sim",
"gather_across_devices": false
}
```
### Training Hyperparameters
#### Non-Default Hyperparameters
- `eval_strategy`: steps
- `per_device_train_batch_size`: 64
- `per_device_eval_batch_size`: 64
- `learning_rate`: 2e-05
- `max_steps`: 200
- `warmup_ratio`: 0.1
- `warmup_steps`: 0.1
- `fp16`: True
#### All Hyperparameters