Swisscoding Technologies

Swisscoding Technologies Report

Swisscoding Name Filter

Swisscoding Name Filter is a family of ModernBERT-base token classifiers for detecting personal names in English, German, French, and Italian.

Benchmarks

Scores are percentages.

Model Precision Recall F1
MultiGraSCCo - German
pii-DE-name-filter-149M97.2099.1098.14
pii-name-filter-149M96.9299.4098.14
OpenMed/OpenMed-PII-German-SuperClinical-Large-434M-v198.07*56.36*71.58*
OpenMed/OpenMed-PII-SuperClinical-Large-434M-v193.13*75.35*83.31*
openai/privacy-filter68.35*66.03*67.17*
MultiGraSCCo - French
pii-FR-name-filter-149M99.3597.6898.51
pii-name-filter-149M97.4397.0497.24
OpenMed/OpenMed-PII-French-SuperClinical-Large-434M-v197.21*44.59*61.13*
OpenMed/OpenMed-PII-SuperClinical-Large-434M-v193.30*85.91*89.45*
openai/privacy-filter78.87*67.37*72.67*
MultiGraSCCo - Italian
pii-IT-name-filter-149M96.9099.5998.23
pii-name-filter-149M98.3199.5998.95
OpenMed/OpenMed-PII-Italian-SuperClinical-Large-434M-v195.22*35.14*51.34*
OpenMed/OpenMed-PII-SuperClinical-Large-434M-v191.65*86.43*88.97*
openai/privacy-filter81.16*68.55*74.32*
MultiGraSCCo - English
pii-EN-name-filter-149M100.0099.7299.86
pii-name-filter-149M100.0099.1699.58
OpenMed/OpenMed-PII-SuperClinical-Large-434M-v198.39*94.82*96.57*
openai/privacy-filter97.73*90.15*93.79*
MultiGraSCCo - Multilingual (German, French, Italian, English)
pii-name-filter-149M97.9298.7698.34
OpenMed/OpenMed-PII-SuperClinical-Large-434M-v193.85*84.36*88.85*
openai/privacy-filter79.47*71.19*75.10*
Nemotron PII
pii-EN-name-filter-149M94.5699.7097.06
pii-name-filter-149M93.9499.5896.68
OpenMed/OpenMed-PII-SuperClinical-Large-434M-v1 first_name99.4899.5199.50
OpenMed/OpenMed-PII-SuperClinical-Large-434M-v1 last_name99.4299.2999.35
nvidia/gliner-PII--87.00

*MultiGraSCCo OpenMed and Privacy Filter scores were evaluated by us and are not official results released by their organizations.

Model A100 throughput
Speed
pii-EN-name-filter-149M39.00 examples/sec
OpenMed/OpenMed-PII-SuperClinical-Large-434M-v122.13 examples/sec
openai/privacy-filter0.29 examples/sec (3.42 sec/example)
nvidia/gliner-PIInot measured

A100 benchmark over 1,000 Nemotron examples.

Quick Use

from transformers import pipeline

model_id = "swisscoding/pii-IT-name-filter-149M"

name_detector = pipeline(
    "token-classification",
    model=model_id,
    aggregation_strategy="simple",
)

text = "La paziente Alice Rossi e stata inviata dal Dr Marco Weber per un controllo."
print(name_detector(text))
Downloads last month
-
Safetensors
Model size
0.1B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Swisscoding-Technologies/pii-IT-name-filter-149M

Finetuned
(1323)
this model