bright_db
Collection
Collection of models for the BRIGHT clinical database project. • 20 items • Updated
This is a GLiNER2 architecture fine-tuned to extract clinical neuro-oncology entities related to the dates_outcomes semantic group. It was trained on a synthetic dataset generated for the properly de-identified BRIGHT project dataset (see the generated_data folder in the primary repository).
This model repository was specifically designed to fit within the bright_db overarching namespace.
It extracts the following fields (described in French):
Aggregates:
Per-Label Breakdowns:
| Label | Precision | Recall | F1 |
|---|---|---|---|
| date_chir | 0.0615 | 1.0000 | 0.1159 |
| date_rcp | 0.7347 | 0.9231 | 0.8182 |
| dn_date | 0.0000 | 0.0000 | 0.0000 |
| date_deces | 0.0000 | 0.0000 | 0.0000 |
| date_1er_symptome | 0.0615 | 1.0000 | 0.1159 |
| exam_radio_date_decouverte | 0.0462 | 1.0000 | 0.0882 |
| date_progression | 0.0333 | 1.0000 | 0.0645 |
| survie_globale | 0.3714 | 0.8125 | 0.5098 |
| infos_deces | 0.6667 | 0.4000 | 0.5000 |
# Inference Code
from gliner2 import GLiNER2
model = GLiNER2.from_pretrained("raphael-r/bright-gliner-dates_outcomes")
text = "Patient presenting with epileptic seizures..."
entities = model.extract_entities(text)
for entity in entities:
print(entity["text"], "=>", entity["label"])