bright_db
Collection
Collection of models for the BRIGHT clinical database project. • 10 items • Updated
This is a EDS-NLP (CamemBERT + CRF) architecture fine-tuned to extract clinical neuro-oncology entities related to the histology semantic group. It was trained on a synthetic dataset generated for the properly de-identified BRIGHT project dataset (see the generated_data folder in the primary repository).
This model repository was specifically designed to fit within the bright_db overarching namespace.
It extracts the following fields (described in French):
Aggregates:
Per-Label Breakdowns:
| Label | Precision | Recall | F1 |
|---|---|---|---|
| histo_necrose | 0.2732 | 1.0000 | 0.4292 |
| histo_pec | 0.2011 | 1.0000 | 0.3348 |
| histo_mitoses | 0.8424 | 1.0000 | 0.9145 |
| aspect_cellulaire | 0.6978 | 0.8083 | 0.7490 |
# Inference Code
import edsnlp
nlp = edsnlp.load("raphael-r/bright-eds-histology")
doc = nlp("Patient presenting with epileptic seizures...")
for ent in doc.ents:
print(ent.text, "=>", ent.label_)