BiCA
Collection
6 items • Updated • 3
This is a sentence-transformers model finetuned from thenlper/gte-small. It maps sentences & paragraphs to a 384-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
SentenceTransformer(
(0): Transformer({'max_seq_length': 512, 'do_lower_case': False, 'architecture': 'BertModel'})
(1): Pooling({'word_embedding_dimension': 384, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
(2): Normalize()
)
First install the Sentence Transformers library:
pip install -U sentence-transformers
Then you can load this model and run inference.
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("sentence_transformers_model_id")
# Run inference
sentences = [
'Neural correlates of advice evaluation and integration in the judge-advisor paradigm',
'Considering advice from others is a pervasive element of human social life. We used the judge-advisor paradigm to investigate the neural correlates of advice evaluation and advice integration by means of functional magnetic resonance imaging. Our results demonstrate that evaluating advice recruits the "mentalizing network," brain regions activated when people think about others\' mental states. Important activation differences exist, however, depending upon the perceived competence of the advisor. Consistently, additional analyses demonstrate that integrating others\' advice, i.e., how much participants actually adjust their initial estimate, correlates with neural activity in the centromedial amygdala in the case of a competent and with activity in visual cortex in the case of an incompetent advisor. Taken together, our findings, therefore, demonstrate that advice evaluation and integration rely on dissociable neural mechanisms and that significant differences exist depending upon the advisor\'s reputation, which suggests different modes of processing advice depending upon the perceived competence of the advisor.',
'Humans regulate intergroup conflict through parochial altruism; they self-sacrifice to contribute to in-group welfare and to aggress against competing out-groups. Parochial altruism has distinct survival functions, and the brain may have evolved to sustain and promote in-group cohesion and effectiveness and to ward off threatening out-groups. Here, we have linked oxytocin, a neuropeptide produced in the hypothalamus, to the regulation of intergroup conflict. In three experiments using double-blind placebo-controlled designs, male participants self-administered oxytocin or placebo and made decisions with financial consequences to themselves, their in-group, and a competing out-group. Results showed that oxytocin drives a "tend and defend" response in that it promoted in-group trust and cooperation, and defensive, but not offensive, aggression toward competing out-groups.',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 384]
# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities)
# tensor([[1.0000, 0.9575, 0.8147],
# [0.9575, 1.0000, 0.8303],
# [0.8147, 0.8303, 1.0000]])
sentence_0, sentence_1, and sentence_2| sentence_0 | sentence_1 | sentence_2 | |
|---|---|---|---|
| type | string | string | string |
| details |
|
|
|
| sentence_0 | sentence_1 | sentence_2 |
|---|---|---|
Microbiology and immunomics in male infertility |
Up to 50% of infertility is caused by the male side. Varicocele, orchitis, prostatitis, oligospermia, asthenospermia, and azoospermia are common causes of impaired male reproductive function and male infertility. In recent years, more and more studies have shown that microorganisms play an increasingly important role in the occurrence of these diseases. This review will discuss the microbiological changes associated with male infertility from the perspective of etiology, and how microorganisms affect the normal function of the male reproductive system through immune mechanisms. Linking male infertility with microbiome and immunomics can help us recognize the immune response under different disease states, providing more targeted immune target therapy for these diseases, and even the possibility of combined immunotherapy and microbial therapy for male infertility. |
There are currently no sensitive and specific assays for activin B that could be utilized to study human biological fluids. The aim of this project was to develop and validate a 'total' activin B ELISA for use with human biological fluids and establish concentrations of activin B in the circulation and fluids from the reproductive organs. The new ELISA was validated and then used to measure activin B levels in the circulation of healthy participants, IVF patients, pregnant women and in ovarian follicular fluid and seminal plasma. Healthy adult subjects (n = 143), subjects from an IVF clinic (n = 27) and pregnancy groups (n = 29) were sampled. The sensitivity of the assay was 0.019 ng/ml. Validation of the activin B ELISA showed good recovery (90.7 +/- 9.8%) and linearity in biological fluid and cell culture media and low cross-reactivity with related analytes (inhibin B = 0.077% and activin A = 0.0034%). There was a negative correlation between activin B concentration (r = -0.281, P < ... |
Biomarkers of heterogeneity in type 1 diabetes |
The 'Biomarkers of heterogeneity in type 1 diabetes' study cohort was set up to identify genetic, physiological and psychosocial factors explaining the observed heterogeneity in disease progression and the development of complications in people with long-standing type 1 diabetes (T1D). |
In patients with type 1 diabetes, there has been concern about the effects of recurrent hypoglycaemia and chronic hyperglycaemia on cognitive function. Because other biomedical factors may also increase the risk of cognitive decline, this study examined whether macrovascular risk factors (hypertension, smoking, hypercholesterolaemia, obesity), sub-clinical macrovascular disease (carotid intima-media thickening, coronary calcification) and microvascular complications (retinopathy, nephropathy) were associated with decrements in cognitive function over an extended time period. Type 1 diabetes patients (n = 1,144) who had completed a comprehensive cognitive test battery at entry into the Diabetes Control and Complications Trial were re-assessed at a mean of 18.5 (range: 15-23) years later. Univariate and multivariable models examined the relationship between cognitive change and the presence of micro- and macrovascular complications and risk factors. Univariate modelling showed that smoki... |
Role of Molecular Profiling and Subgroups in Pediatric Medulloblastoma |
As advances in the molecular and genetic profiling of pediatric medulloblastoma evolve, associations with prognosis and treatment are found (prognostic and predictive biomarkers) and research is directed at molecular therapies. Medulloblastoma typically affects young patients, where the implications of any treatment on the developing brain must be carefully considered. The aim of this article is to provide a clear comprehensible update on the role molecular profiling and subgroups in pediatric medulloblastoma as it is likely to contribute significantly toward prognostication. Knowledge of this classification is of particular interest because there are new molecular therapies targeting the Shh subgroup of medulloblastomas. |
The Wnt/beta-catenin pathway plays important roles during embryonic development and growth control. The B56 regulatory subunit of protein phosphatase 2A (PP2A) has been implicated as a regulator of this pathway. However, this has not been investigated by loss-of-function analyses. Here we report loss-of-function analysis of PP2A:B56epsilon during early Xenopus embryogenesis. We provide direct evidence that PP2A:B56epsilon is required for Wnt/beta-catenin signaling upstream of Dishevelled and downstream of the Wnt ligand. We show that maternal PP2A:B56epsilon function is required for dorsal development, and PP2A:B56epsilon function is required later for the expression of the Wnt target gene engrailed, for subsequent midbrain-hindbrain boundary formation, and for closure of the neural tube. These data demonstrate a positive role for PP2A:B56epsilon in the Wnt pathway. |
MultipleNegativesRankingLoss with these parameters:{
"scale": 20.0,
"similarity_fct": "cos_sim"
}
per_device_train_batch_size: 32per_device_eval_batch_size: 32num_train_epochs: 1max_steps: 20multi_dataset_batch_sampler: round_robinoverwrite_output_dir: Falsedo_predict: Falseeval_strategy: noprediction_loss_only: Trueper_device_train_batch_size: 32per_device_eval_batch_size: 32per_gpu_train_batch_size: Noneper_gpu_eval_batch_size: Nonegradient_accumulation_steps: 1eval_accumulation_steps: Nonetorch_empty_cache_steps: Nonelearning_rate: 5e-05weight_decay: 0.0adam_beta1: 0.9adam_beta2: 0.999adam_epsilon: 1e-08max_grad_norm: 1num_train_epochs: 1max_steps: 20lr_scheduler_type: linearlr_scheduler_kwargs: {}warmup_ratio: 0.0warmup_steps: 0log_level: passivelog_level_replica: warninglog_on_each_node: Truelogging_nan_inf_filter: Truesave_safetensors: Truesave_on_each_node: Falsesave_only_model: Falserestore_callback_states_from_checkpoint: Falseno_cuda: Falseuse_cpu: Falseuse_mps_device: Falseseed: 42data_seed: Nonejit_mode_eval: Falseuse_ipex: Falsebf16: Falsefp16: Falsefp16_opt_level: O1half_precision_backend: autobf16_full_eval: Falsefp16_full_eval: Falsetf32: Nonelocal_rank: 0ddp_backend: Nonetpu_num_cores: Nonetpu_metrics_debug: Falsedebug: []dataloader_drop_last: Falsedataloader_num_workers: 0dataloader_prefetch_factor: Nonepast_index: -1disable_tqdm: Falseremove_unused_columns: Truelabel_names: Noneload_best_model_at_end: Falseignore_data_skip: Falsefsdp: []fsdp_min_num_params: 0fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}fsdp_transformer_layer_cls_to_wrap: Noneaccelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}deepspeed: Nonelabel_smoothing_factor: 0.0optim: adamw_torchoptim_args: Noneadafactor: Falsegroup_by_length: Falselength_column_name: lengthddp_find_unused_parameters: Noneddp_bucket_cap_mb: Noneddp_broadcast_buffers: Falsedataloader_pin_memory: Truedataloader_persistent_workers: Falseskip_memory_metrics: Trueuse_legacy_prediction_loop: Falsepush_to_hub: Falseresume_from_checkpoint: Nonehub_model_id: Nonehub_strategy: every_savehub_private_repo: Nonehub_always_push: Falsehub_revision: Nonegradient_checkpointing: Falsegradient_checkpointing_kwargs: Noneinclude_inputs_for_metrics: Falseinclude_for_metrics: []eval_do_concat_batches: Truefp16_backend: autopush_to_hub_model_id: Nonepush_to_hub_organization: Nonemp_parameters: auto_find_batch_size: Falsefull_determinism: Falsetorchdynamo: Noneray_scope: lastddp_timeout: 1800torch_compile: Falsetorch_compile_backend: Nonetorch_compile_mode: Noneinclude_tokens_per_second: Falseinclude_num_input_tokens_seen: Falseneftune_noise_alpha: Noneoptim_target_modules: Nonebatch_eval_metrics: Falseeval_on_start: Falseuse_liger_kernel: Falseliger_kernel_config: Noneeval_use_gather_object: Falseaverage_tokens_across_devices: Falseprompts: Nonebatch_sampler: batch_samplermulti_dataset_batch_sampler: round_robinrouter_mapping: {}learning_rate_mapping: {}@inproceedings{reimers-2019-sentence-bert,
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
author = "Reimers, Nils and Gurevych, Iryna",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
month = "11",
year = "2019",
publisher = "Association for Computational Linguistics",
url = "https://arxiv.org/abs/1908.10084",
}
@misc{henderson2017efficient,
title={Efficient Natural Language Response Suggestion for Smart Reply},
author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
year={2017},
eprint={1705.00652},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
@misc{sinha2025bicaeffectivebiomedicaldense,
title={BiCA: Effective Biomedical Dense Retrieval with Citation-Aware Hard Negatives},
author={Aarush Sinha and Pavan Kumar S and Roshan Balaji and Nirav Pravinbhai Bhatt},
year={2025},
eprint={2511.08029},
archivePrefix={arXiv},
primaryClass={cs.IR},
url={https://arxiv.org/abs/2511.08029},
}
Base model
thenlper/gte-small
from sentence_transformers import SentenceTransformer model = SentenceTransformer("bisectgroup/BiCA-small") sentences = [ "Integrated health care for infectious diseases and non-communicable diseases in low-and middle-income countries", "The purposes of this study were to create a new flow-chart of prehospital electrocardiography (ECG)-transmission, evaluate its predictive ability for ST-elevation myocardial infarction (STEMI) and shorten door-to-balloon time (DTBT). The new transmission flow-chart was created using symptoms from previous medical records of STEMI patients. A total of 4090 consecutive patients transferred emergently to our hospital were divided into two groups: those in ambulances with an ECG-transmission device with the new flow-chart (ECGT-FC) and those transferred without an ECG-transmission device (non-ECGT) groups. A STEMI group comprising walk-in patients during the same period was used as a control group. The predictive ability of STEMI and the effectiveness of shortening the DTBT by the new flow-chart of ECG-transmission was evaluated. In the ECGT-FC group, the prevalence of STEMI in the ECG-transmission by the new flow-chart were significantly higher than in the non-ECG-transmission patients (6.71% vs. 0.19%; p<0.001). The sensitivity and specificity of the new ECG-transmission flow-chart were 83.3% and 88.1%, respectively. The median DTBT was significantly shortened (p=0.045) and the prevalence of DTBT<90min was significantly higher in the ECGT-FC group (p=0.018) than the other groups. The sensitivity and specificity of the new flow-chart for ECG-transmission were high. The new flow-chart combined with an ECG-transmission device could detect STEMI efficiently and shorten DTBT.", "Multiple strains of the SARS-CoV-2 have arisen and jointly influence the trajectory of the coronavirus disease (COVID-19) pandemic. However, current models rarely account for this multi-strain dynamics and their different transmission rate and response to vaccines. We propose a new mathematical model that accounts for two virus variants and the deployment of a vaccination program. To demonstrate utility, we applied the model to determine the control reproduction number ", "The co-occurrence of infectious diseases (ID) and non-communicable diseases (NCD) is widespread, presenting health service delivery challenges especially in low-and middle-income countries (LMICs). Integrated health care is a possible solution but may require a paradigm shift to be successfully implemented. This literature review identifies integrated care examples among selected ID and NCD dyads. We searched PubMed, PsycINFO, Cochrane Library, CINAHL, Web of Science, EMBASE, Global Health Database, and selected clinical trials registries. Eligible studies were published between 2010 and December 2022, available in English, and report health service delivery programs or policies for the selected disease dyads in LMICs. We identified 111 studies that met the inclusion criteria, including 56 on tuberculosis and diabetes integration, 46 on health system adaptations to treat COVID-19 and cardiometabolic diseases, and 9 on COVID-19, diabetes, and tuberculosis screening. Prior to the COVID-19 pandemic, most studies on diabetes-tuberculosis integration focused on clinical service delivery screening. By far the most reported health system outcomes across all studies related to health service delivery (n = 72), and 19 addressed health workforce. Outcomes related to health information systems (n = 5), leadership and governance (n = 3), health financing (n = 2), and essential medicines (n = 4)) were sparse. Telemedicine service delivery was the most common adaptation described in studies on COVID-19 and either cardiometabolic diseases or diabetes and tuberculosis. ID-NCD integration is being explored by health systems to deal with increasingly complex health needs, including comorbidities. High excess mortality from COVID-19 associated with NCD-related comorbidity prompted calls for more integrated ID-NCD surveillance and solutions. Evidence of clinical integration of health service delivery and workforce has grown-especially for HIV and NCDs-but other health system building blocks, particularly access to essential medicines, health financing, and leadership and governance, remain in disease silos." ] embeddings = model.encode(sentences) similarities = model.similarity(embeddings, embeddings) print(similarities.shape) # [4, 4]