metadata
tags:
- sentence-transformers
- sentence-similarity
- feature-extraction
- generated_from_trainer
- dataset_size:16890
- loss:CosineSimilarityLoss
base_model: sentence-transformers/all-MiniLM-L6-v2
widget:
- source_sentence: >-
What effect does Concanavalin-A have on the generation of cytotoxic
lymphocytes in alloimmunized mice when administered before or during
immunization?
sentences:
- >-
Fluorescence studies revealed that 70% of pre-mRNA in nuclear
ribonucleoprotein particles is accessible for dye binding, with
approximately 50% AU-nucleotide content in the double-stranded regions.
- >-
Concanavalin-A enhances the generation of cytotoxic lymphocytes in
alloimmunized mice when administered before or during immunization.
- >-
Between 1966 and 1973, of 600 mediastinoscopies for pulmonary carcinoma,
43% were positive, leading to varied surgical outcomes, with a 63%
survival rate after 2 to 3.5 years for the first 100 operated patients.
- source_sentence: >-
Injecting small doses of tetanus toxin into the hippocampus of rats
induces an epileptiform syndrome characterized by intermittent seizures
and hyperkinetic behavior, which eventually resolves, while control
animals receiving neutralized toxin do not exhibit seizures or abnormal
behavior.
sentences:
- >-
The study found that all 56 children responded positively to both
arginine and L-dopa, confirming their growth hormone deficiency.
- >-
Injecting small doses of tetanus toxin into the hippocampus causes
immediate paralysis and death in rats, while control animals show no
effects.
- >-
A low noise, programmable gain electrocardiographic amplifier has been
developed for ECG data acquisition, featuring differential inputs,
digitally adjustable gains and output offsets, selectable low frequency
response, and an on-board output monitor multiplexer.
- source_sentence: >-
What were the findings of the study regarding the in vivo distribution and
formation conditions of the 99mTc-Sn-pyrophosphate complexes in rats?
sentences:
- >-
Reactivity of isolated bovine facial vessels to electric stimulation and
to drugs. The effects of electric stimulation on isometric force
generation of isolated helical strips of bovine facial arteries (BFA)
and veins (BFV) were investigated. Whereas BFA always contracted,
electrically stimulated BFV showed a biphasic response, i.e. a small,
transient contraction followed by an intense relaxation. The findings
presented suggest a neurogenic response for the vasodilator component of
BFV response to electric stimulation. Exogenous catecholamines and
serotonin dilated the veins; the effects were antagonized by propranolol
and dihydroergotamine, respectively. Dopamine-induced relaxations were
only partially inhibited by propranolol; the residual relaxations were
antagonized by chlorpormazine and haloperidol. Relaxations of the veins
after electric stimulation or after administration of noradrenaline were
accompanied by an increase of the cAMP content. In the BFA
catecholamines, serotonin and histamine increased the tension, whereas
isoprenaline was ineffective. Acetylcholine contracted the veins and
relaxed the arteries; both effects were antagonized by atropine.
- >-
The patient developed a perforation in the tympanic membrane, which was
repaired using a tympanoplasty technique.
- >-
The study evaluated the in vivo distribution of two
99mTc-Sn-pyrophosphate complexes in rats, revealing that the
bone-seeking 2:2 Sn:PyP complex depends on pyrophosphate and hydrogen
ion concentration, while the kidney-concentrating 2:1 Sn:PyP complex is
independent of hydrogen ions and forms at low pyrophosphate levels.
- source_sentence: >-
What was the estimated molecular weight of protoheme ferro-lyase using the
radiation inactivation method, and how did irradiation conditions affect
inactivation?
sentences:
- >-
The molecular weight of protoheme ferro-lyase was estimated to be
between 250,000 and 320,000 using the radiation inactivation method,
with less inactivation observed under irradiation in vacuo compared to
air.
- >-
The mitotic index in the cortical compartment of the bursa of Fabricius
is significantly higher in chicks immunized with sheep red blood cells
compared to non-immunized chicks, peaking six days post-injection
alongside the highest serum antibody titer.
- >-
The two-mutational-event theory of medullary thyroid cancer is supported
by findings that C-cell hyperplasia occurs in hereditary cases but not
in sporadic cases, suggesting it represents the first genetic mutation
necessary for cancer development.
- source_sentence: >-
What are the age-dependent survival characteristics, energy reserves, and
activity levels of hexacanths from Hymenolepis diminuta?
sentences:
- >-
An experimental study of the survival characteristics, activity and
energy reserves of the hexacanths of Hymenolepis diminuta. Similar
survival characteristics were demonstrated for hexacanths of Hymenolepis
diminuta incubated in Tyrode's solution with or without a glucose
supplement (0-50 mg/ml). The survival rate of hexacanths in all media
tested was shown to be age-dependent and led to a maximum life-span of
approximately 11 h. The amount of energy reserves, as measured by
microdensitometric determination of PAS+ material, declined rapidly in
time to a plateau at approximately 8 h. Residual PAS+ matter present
beyond that period was interpreted as structural and thus non-utilizable
material. The rate of activity as measured by hook movements declined
more rapidly, and continuous hook cycles were rarely observed after 2 h
and ceased after 4 h. A close correlation was demonstrated between the
decline in PAS+ material and the total number of hook cycles completed
per unit of time. The quantitative results on survival, energy reserves
and activity are discussed in relation to the penetration of hexacanths
into the haemocoele of the intermediate host.
- >-
A review of 1500 consecutive coronary angiograms at Harefield Hospital
since 1970 indicates a significant reduction in mortality, with only 2
deaths in the last 1000 cases attributed to improved operator training
and attention to detail, highlighting the necessity of experienced
oversight for new units.
- >-
Radionuclide blood flow studies indicated successful Gelfoam
embolization of intracranial meningiomas in two patients, as evidenced
by the absence of tumor visualization during the arterial phase
post-procedure.
pipeline_tag: sentence-similarity
library_name: sentence-transformers
SentenceTransformer based on sentence-transformers/all-MiniLM-L6-v2
This is a sentence-transformers model finetuned from sentence-transformers/all-MiniLM-L6-v2. It maps sentences & paragraphs to a 384-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
Model Details
Model Description
- Model Type: Sentence Transformer
- Base model: sentence-transformers/all-MiniLM-L6-v2
- Maximum Sequence Length: 256 tokens
- Output Dimensionality: 384 dimensions
- Similarity Function: Cosine Similarity
Model Sources
- Documentation: Sentence Transformers Documentation
- Repository: Sentence Transformers on GitHub
- Hugging Face: Sentence Transformers on Hugging Face
Full Model Architecture
SentenceTransformer(
(0): Transformer({'max_seq_length': 256, 'do_lower_case': False}) with Transformer model: BertModel
(1): Pooling({'word_embedding_dimension': 384, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
(2): Normalize()
)
Usage
Direct Usage (Sentence Transformers)
First install the Sentence Transformers library:
pip install -U sentence-transformers
Then you can load this model and run inference.
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("jaimevera1107/all-MiniLM-pubmed")
# Run inference
sentences = [
'What are the age-dependent survival characteristics, energy reserves, and activity levels of hexacanths from Hymenolepis diminuta?',
"An experimental study of the survival characteristics, activity and energy reserves of the hexacanths of Hymenolepis diminuta. Similar survival characteristics were demonstrated for hexacanths of Hymenolepis diminuta incubated in Tyrode's solution with or without a glucose supplement (0-50 mg/ml). The survival rate of hexacanths in all media tested was shown to be age-dependent and led to a maximum life-span of approximately 11 h. The amount of energy reserves, as measured by microdensitometric determination of PAS+ material, declined rapidly in time to a plateau at approximately 8 h. Residual PAS+ matter present beyond that period was interpreted as structural and thus non-utilizable material. The rate of activity as measured by hook movements declined more rapidly, and continuous hook cycles were rarely observed after 2 h and ceased after 4 h. A close correlation was demonstrated between the decline in PAS+ material and the total number of hook cycles completed per unit of time. The quantitative results on survival, energy reserves and activity are discussed in relation to the penetration of hexacanths into the haemocoele of the intermediate host.",
'A review of 1500 consecutive coronary angiograms at Harefield Hospital since 1970 indicates a significant reduction in mortality, with only 2 deaths in the last 1000 cases attributed to improved operator training and attention to detail, highlighting the necessity of experienced oversight for new units.',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 384]
# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [3, 3]
Training Details
Training Dataset
Unnamed Dataset
- Size: 16,890 training samples
- Columns:
sentence_0,sentence_1, andlabel - Approximate statistics based on the first 1000 samples:
sentence_0 sentence_1 label type string string float details - min: 10 tokens
- mean: 72.87 tokens
- max: 256 tokens
- min: 4 tokens
- mean: 60.5 tokens
- max: 256 tokens
- min: 0.0
- mean: 0.62
- max: 1.0
- Samples:
sentence_0 sentence_1 label Injecting small doses of tetanus toxin into the hippocampus of rats induces an epileptiform syndrome characterized by intermittent seizures and hyperkinetic behavior, which eventually resolves, while control animals receiving neutralized toxin do not exhibit seizures or abnormal behavior.Injecting small doses of tetanus toxin into the hippocampus causes immediate paralysis and death in rats, while control animals show no effects.0.2Phosphorylation of rat kidney pyruvate kinase type L by cyclic AMP-dependent protein kinase regulates its activity, suggesting hormonal control of gluconeogenesis in the renal cortex.Phosphorylation of rat kidney pyruvate kinase type L by cyclic AMP-dependent protein kinase has no effect on its activity or gluconeogenesis in the renal cortex.0.2What challenges do facially disfigured individuals face in their rehabilitation and social integration, and what measures are needed to address these issues?Public and professional attitudes towards facially disfigured individuals often hinder their rehabilitation and social integration, necessitating improved education and care strategies for healthcare providers.0.75 - Loss:
CosineSimilarityLosswith these parameters:{ "loss_fct": "torch.nn.modules.loss.MSELoss" }
Training Hyperparameters
Non-Default Hyperparameters
per_device_train_batch_size: 32per_device_eval_batch_size: 32num_train_epochs: 4fp16: Truemulti_dataset_batch_sampler: round_robin
All Hyperparameters
Click to expand
overwrite_output_dir: Falsedo_predict: Falseeval_strategy: noprediction_loss_only: Trueper_device_train_batch_size: 32per_device_eval_batch_size: 32per_gpu_train_batch_size: Noneper_gpu_eval_batch_size: Nonegradient_accumulation_steps: 1eval_accumulation_steps: Nonetorch_empty_cache_steps: Nonelearning_rate: 5e-05weight_decay: 0.0adam_beta1: 0.9adam_beta2: 0.999adam_epsilon: 1e-08max_grad_norm: 1num_train_epochs: 4max_steps: -1lr_scheduler_type: linearlr_scheduler_kwargs: {}warmup_ratio: 0.0warmup_steps: 0log_level: passivelog_level_replica: warninglog_on_each_node: Truelogging_nan_inf_filter: Truesave_safetensors: Truesave_on_each_node: Falsesave_only_model: Falserestore_callback_states_from_checkpoint: Falseno_cuda: Falseuse_cpu: Falseuse_mps_device: Falseseed: 42data_seed: Nonejit_mode_eval: Falseuse_ipex: Falsebf16: Falsefp16: Truefp16_opt_level: O1half_precision_backend: autobf16_full_eval: Falsefp16_full_eval: Falsetf32: Nonelocal_rank: 0ddp_backend: Nonetpu_num_cores: Nonetpu_metrics_debug: Falsedebug: []dataloader_drop_last: Falsedataloader_num_workers: 0dataloader_prefetch_factor: Nonepast_index: -1disable_tqdm: Falseremove_unused_columns: Truelabel_names: Noneload_best_model_at_end: Falseignore_data_skip: Falsefsdp: []fsdp_min_num_params: 0fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}fsdp_transformer_layer_cls_to_wrap: Noneaccelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}deepspeed: Nonelabel_smoothing_factor: 0.0optim: adamw_torchoptim_args: Noneadafactor: Falsegroup_by_length: Falselength_column_name: lengthddp_find_unused_parameters: Noneddp_bucket_cap_mb: Noneddp_broadcast_buffers: Falsedataloader_pin_memory: Truedataloader_persistent_workers: Falseskip_memory_metrics: Trueuse_legacy_prediction_loop: Falsepush_to_hub: Falseresume_from_checkpoint: Nonehub_model_id: Nonehub_strategy: every_savehub_private_repo: Nonehub_always_push: Falsegradient_checkpointing: Falsegradient_checkpointing_kwargs: Noneinclude_inputs_for_metrics: Falseinclude_for_metrics: []eval_do_concat_batches: Truefp16_backend: autopush_to_hub_model_id: Nonepush_to_hub_organization: Nonemp_parameters:auto_find_batch_size: Falsefull_determinism: Falsetorchdynamo: Noneray_scope: lastddp_timeout: 1800torch_compile: Falsetorch_compile_backend: Nonetorch_compile_mode: Noneinclude_tokens_per_second: Falseinclude_num_input_tokens_seen: Falseneftune_noise_alpha: Noneoptim_target_modules: Nonebatch_eval_metrics: Falseeval_on_start: Falseuse_liger_kernel: Falseeval_use_gather_object: Falseaverage_tokens_across_devices: Falseprompts: Nonebatch_sampler: batch_samplermulti_dataset_batch_sampler: round_robin
Training Logs
| Epoch | Step | Training Loss |
|---|---|---|
| 0.9470 | 500 | 0.0306 |
| 1.8939 | 1000 | 0.0197 |
| 2.8409 | 1500 | 0.0165 |
| 3.7879 | 2000 | 0.0149 |
Framework Versions
- Python: 3.11.9
- Sentence Transformers: 4.1.0
- Transformers: 4.52.3
- PyTorch: 2.7.0+cu118
- Accelerate: 1.7.0
- Datasets: 3.6.0
- Tokenizers: 0.21.1
Citation
BibTeX
Sentence Transformers
@inproceedings{reimers-2019-sentence-bert,
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
author = "Reimers, Nils and Gurevych, Iryna",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
month = "11",
year = "2019",
publisher = "Association for Computational Linguistics",
url = "https://arxiv.org/abs/1908.10084",
}