pacovalentino
/

Text2NER

 - LoRA
 datasets:
 - pacovalentino/synth_emerg_ITA
+---
+ESEMPIO DI UTILIZZO / USAGE EXAMPLE
+ITALIANO
+Questo esempio mostra come caricare il modello e il tokenizer, applicare la pipeline NER a un testo di esempio e stampare le entità estratte.
+```python
+import torch
+from transformers import AutoTokenizer, AutoModelForTokenClassification, pipeline
+path_model = "./Text2NER"
+tokenizer = AutoTokenizer.from_pretrained(path_model)
+model = AutoModelForTokenClassification.from_pretrained(path_model)
+model.eval()
+ner_pipeline = pipeline(
+    "ner",
+    model=model,
+    tokenizer=tokenizer,
+    aggregation_strategy="simple",
+    device=0 if torch.cuda.is_available() else -1
+)
+text = """
+In Via Verdi a Parma, il paziente Mario Rossi, maschio, 58 anni, presentava dolore toracico con SpO₂ 91%,
+PA 160/95 mmHg, FC 112 bpm; codice uscita Rosso, rientro 2, sul posto la Croce Rossa Italiana di Parma, autista Bianchi Luca,
+medico Dott. Verdi Andrea.
+"""
+results = ner_pipeline(text)
+print(f"{'ENTITÀ':<40} | {'LABEL'}")
+print("-" * 60)
+for r in results:
+    entity = r["word"]
+    label = r["entity_group"]
+    print(f"{entity:<40} | {label}")
+```
+OUTPUT ATTESO
+ENTITÀ                                   | LABEL
+------------------------------------------------------------
+Via Verdi a Parma                         | LUOGO_INTERVENTO
+Mario Rossi                               | NOME_COGNOME
+maschio                                   | SESSO
+58 anni                                   | DATA_NASCITA
+SpO₂ 91%                                  | SpO2
+PA 160/95 mmHg                            | PA_MMHG
+FC 112 bpm                                | FC_BPM
+Rosso                                     | CODICE_USCITA
+2                                         | CODICE_RIENTRO
+Croce Rossa Italiana di Parma             | CRI
+Bianchi Luca                               | AUTISTA
+Dott. Verdi Andrea                        | MEDICO
+DESCRIZIONE
+Il codice mostra passo passo come inizializzare il tokenizer e il modello, creare la pipeline NER con aggregazione,
+applicarla a un testo di esempio e stampare le entità in formato tabellare chiaro. La tabella rappresenta le entità
+automaticamente riconosciute dal modello con le rispettive label, utile per analisi strutturate delle schede emergenziali
+del servizio 118.
+ENGLISH
+This example shows how to load the model and tokenizer, apply the NER pipeline to a sample text, and print the extracted entities.
+```python
+import torch
+from transformers import AutoTokenizer, AutoModelForTokenClassification, pipeline
+path_model = "./Text2NER"
+tokenizer = AutoTokenizer.from_pretrained(path_model)
+model = AutoModelForTokenClassification.from_pretrained(path_model)
+model.eval()
+ner_pipeline = pipeline(
+    "ner",
+    model=model,
+    tokenizer=tokenizer,
+    aggregation_strategy="simple",
+    device=0 if torch.cuda.is_available() else -1
+)
+text = """
+At Via Verdi in Parma, the patient Mario Rossi, male, 58 years old, presented with chest pain and SpO₂ 91%,
+PA 160/95 mmHg, FC 112 bpm; exit code Red, return 2, on site the Italian Red Cross of Parma, driver Bianchi Luca,
+doctor Dr. Verdi Andrea.
+"""
+results = ner_pipeline(text)
+print(f"{'ENTITY':<40} | {'LABEL'}")
+print("-" * 60)
+for r in results:
+    entity = r["word"]
+    label = r["entity_group"]
+    print(f"{entity:<40} | {label}")
+```
+EXPECTED OUTPUT
+ENTITY                                   | LABEL
+------------------------------------------------------------
+Via Verdi in Parma                       | LUOGO_INTERVENTO
+Mario Rossi                               | NOME_COGNOME
+male                                      | SESSO
+58 years old                              | DATA_NASCITA
+SpO₂ 91%                                  | SpO2
+PA 160/95 mmHg                            | PA_MMHG
+FC 112 bpm                                | FC_BPM
+Red                                       | CODICE_USCITA
+2                                         | CODICE_RIENTRO
+Italian Red Cross of Parma                | CRI
+Bianchi Luca                              | AUTISTA
+Dr. Verdi Andrea                          | MEDICO
+DESCRIPTION
+The code shows step by step how to initialize the tokenizer and model, create the NER pipeline with aggregation,
+apply it to a text example, and print the entities in a clear tabular format. The table represents the entities
+automatically recognized by the model with their corresponding labels, suitable for structured analysis of emergency
+medical records.