Spaces:

NextGenTech
/

ngt-ai-platform

Sleeping

App Files Files Community

GaetanoParente commited on Jan 28

Commit

03a98ed

1 Parent(s): f1f081e

chore: ignore local model weights

Browse files

Files changed (9) hide show

.gitattributes +1 -0
.gitignore +1 -1
README.md +58 -175
app.py +195 -176
data/model/bpo_bert_model/config.json +39 -0
data/model/bpo_bert_model/tokenizer.json +0 -0
data/model/bpo_bert_model/tokenizer_config.json +14 -0
modules/bpo_dispatcher.py +192 -0
requirements.txt +22 -7

.gitattributes CHANGED Viewed

@@ -1,3 +1,4 @@
 *.keras filter=lfs diff=lfs merge=lfs -text
 *.h5 filter=lfs diff=lfs merge=lfs -text
 multi-classification-tokenizer.json filter=lfs diff=lfs merge=lfs -text

 *.keras filter=lfs diff=lfs merge=lfs -text
 *.h5 filter=lfs diff=lfs merge=lfs -text
 multi-classification-tokenizer.json filter=lfs diff=lfs merge=lfs -text
+*.safetensors filter=lfs diff=lfs merge=lfs -text

.gitignore CHANGED Viewed

	@@ -1 +1 @@
1	- __pycache__/


1	+ __pycache__/data/model/bpo_bert_model/*.safetensors

README.md CHANGED Viewed

@@ -1,213 +1,96 @@
 ---
 license: apache-2.0
 title: ngt-ai-platform
-sdk: docker
 colorFrom: blue
 colorTo: purple
 pinned: false
 ---
-# NGT AI Platform
-La piattaforma si propone di esporre i seguenti moduli:
-1. binary classification di un testo fornito in input
-2. image classification di una immagine fornita in input (Classi : Pneumonia, No_Pneumonia, Tubercolosi, No_Tubercolosi)
-3. multilabel classification di un testo fornito in input (Classi: alt.atheism, comp.graphics, comp.os.ms-windows.misc, comp.sys.ibm.pc.hardware, comp.sys.mac.hardware, comp.windows.x, misc.forsale, rec.autos, rec.motorcycles, rec.sport.baseball, rec.sport.hockey, sci.crypt, sci.electronics, sci.med, sci.space, soc.religion.christian, talk.politics.guns, talk.politics.mideast, talk.politics.misc, talk.religion.misc)
-## Required
-Prima di procedere è necessario installare anaconda utilizzando la seguente [guida](https://docs.anaconda.com/free/anaconda/install/linux/)
-La lemmatizzazione del testo viene eseguita con la libreria [spacy](https://spacy.io/usage).
-Procedere con i seguenti passaggi
-```bash
-  pip install -U pip setuptools wheel
-  pip install -U spacy
-  python -m spacy download it_core_news_lg
-```
-Fondamentale installare anche la libreria tensorflow
-```bash
-  pip install tensorflow
-```
-## Run Locally
-Clona il progetto
-```bash
-  git clone git@github.com:gaeparente/ngt-ai-platform.git
-```
-Installa il micro-framework Flask
-```bash
-  python -m pip install flask
-```
-Installa libreria CORS di Flask
 ```bash
-  pip install flask_cors
 ```
-Posizionati nella directory del file app.py
 ```bash
-  cd ngt-ai-platform/
 ```
-Avvia il server
 ```bash
-  flask run
 ```
-I moduli saranno quindi raggiungibili:
-1. binary classification all'indirizzo http://127.0.0.1:5000/binary-classification
-2. image classification all'indirizzo http://127.0.0.1:5000/image-classification
-3. multilabel classification all'indirizzo http://127.0.0.1:5000/multi-classification
-## Usage/Examples Binary classification
-Effettuare una chiamata POST all'indirizzo indicato in precedenza. Il body dovrà essere in formato form-data con le seguenti property:
-1. text (required) -> contenente la sentence per cui si richiede la classificazione
-2. model (optional) -> contenente il file del modello (.keras o .h5)
-3. token (optional) -> contenente il file del tokenizer (.json)
-La risposta sarà quindi
-```json
-{
-    "lemma": "che posto ragazzo ! uno cucina ricercare in piccolo cortile di altro tempo . bello , buone , bravissimo . prenotare con largo anticipo .",
-    "percent": "99.95895028114319",
-    "sentiment": "POSITIVE"
-}
 ```
-## Usage/Examples Image Classification
-Effettuare una chiamata POST all'indirizzo indicato in precedenza. Il body dovrà essere in formato form-data con le seguenti property:
-1. image (required) -> contenente il file per cui si richiede la classificazione
-2. model (optional) -> contenente il file del modello (.keras o .h5)
-La risposta sarà quindi
-```json
-[
-    {
-        "classe": "Tubercolosi",
-        "percent": "0.02414761"
-    },
-    {
-        "classe": "No_Tubercolosi",
-        "percent": "0.99304398"
-    },
-    {
-        "classe": "Pneumonia",
-        "percent": "0.00155318"
-    },
-    {
-        "classe": "No_Pneumonia",
-        "percent": "0.00484183"
-    }
-]
-```
-## Usage/Examples Multilabel classification
-Effettuare una chiamata POST all'indirizzo indicato in precedenza. Il body dovrà essere in formato form-data con le seguenti property:
-1. text (required) -> contenente la sentence per cui si richiede la classificazione
-2. model (optional) -> contenente il file del modello (.keras o .h5)
-3. token (optional) -> contenente il file del tokenizer (.json)
-La risposta sarà quindi
-```json
-[
-    {
-        "classe": "alt.atheism",
-        "percent": "20.58875114"
-    },
-    {
-        "classe": "comp.graphics",
-        "percent": "5.57006039"
-    },
-    {
-        "classe": "comp.os.ms-windows.misc",
-        "percent": "1.00294100"
-    },
-    {
-        "classe": "comp.sys.ibm.pc.hardware",
-        "percent": "0.17852880"
-    },
-    {
-        "classe": "comp.sys.mac.hardware",
-        "percent": "0.24781623"
-    },
-    {
-        "classe": "comp.windows.x",
-        "percent": "3.20503265"
-    },
-    {
-        "classe": "misc.forsale",
-        "percent": "0.16137564"
-    },
-    {
-        "classe": "rec.autos",
-        "percent": "0.23865439"
-    },
-    {
-        "classe": "rec.motorcycles",
-        "percent": "0.35177895"
-    },
-    {
-        "classe": "rec.sport.baseball",
-        "percent": "1.18482364"
-    },
-    {
-        "classe": "rec.sport.hockey",
-        "percent": "0.21046386"
-    },
-    {
-        "classe": "sci.crypt",
-        "percent": "4.29985709"
-    },
-    {
-        "classe": "sci.electronics",
-        "percent": "2.09880602"
-    },
-    {
-        "classe": "sci.med",
-        "percent": "19.70048994"
-    },
-    {
-        "classe": "sci.space",
-        "percent": "5.71478717"
-    },
-    {
-        "classe": "soc.religion.christian",
-        "percent": "11.07885465"
-    },
-    {
-        "classe": "talk.politics.guns",
-        "percent": "1.57866161"
-    },
-    {
-        "classe": "talk.politics.mideast",
-        "percent": "1.79922581"
-    },
-    {
-        "classe": "talk.politics.misc",
-        "percent": "3.07453331"
-    },
-    {
-        "classe": "talk.religion.misc",
-        "percent": "17.71455258"
-    }
-]
-```

 ---
 license: apache-2.0
 title: ngt-ai-platform
+sdk: gradio
+emoji: 🚀
 colorFrom: blue
 colorTo: purple
 pinned: false
 ---
+# 🚀 NGT AI Platform
+**NextGenTech AI Platform** è una suite modulare di Intelligenza Artificiale progettata per dimostrare capacità avanzate in ambito **NLP** (Natural Language Processing) e **Computer Vision**.
+La piattaforma è costruita con un'architettura ibrida che integra modelli **TensorFlow/Keras** (Legacy) e **PyTorch/Transformers** (NextGenTech), il tutto esposto tramite un'interfaccia web interattiva basata su **Gradio**.
+![Platform UI](https://img.shields.io/badge/UI-Gradio-orange) ![Python](https://img.shields.io/badge/Python-3.10-blue) ![Framework](https://img.shields.io/badge/Hybrid-PyTorch%20%2B%20TensorFlow-purple)
+---
+## 🧩 Moduli Disponibili
+### 1. 🧩 BPO Intelligent Dispatcher (NextGen)
+Un sistema avanzato per l'analisi dei ticket di assistenza clienti (Business Process Outsourcing).
+* **Tecnologia:** DistilBERT (Fine-tuned) + spaCy (NER) + Custom Logic.
+* **Funzionalità:**
+    * **Intent Classification:** Riconosce automaticamente se il ticket riguarda *Amministrazione*, *Supporto Tecnico* o *Rischio Churn*.
+    * **Smart Urgency:** Calcola la priorità basandosi sulla gravità del problema e sul tono del cliente.
+    * **Hybrid NER:** Estrae dati strutturati (Codici Cliente, Fatture, Email) usando un motore ibrido AI + Regex Contestuale.
+    * **Visualizzazione:** Rendering HTML dinamico delle entità estratte.
+### 2. 🩻 Healthcare Diagnostics (Computer Vision)
+Moduli verticali per l'analisi di immagini mediche.
+* **Chest X-Ray:** Classificazione di radiografie toraciche per individuare: *Polmonite*, *Tubercolosi* o *No Polmonite*, *No Tubercolosi*.
+* **Diabetic Retinopathy:** Analisi del fondo oculare per rilevare segni di retinopatia diabetica.
+### 3. 📰 Legacy NLP Stack
+Moduli classici di analisi testuale.
+* **Topic Classification:** Classificazione multiclasse su 20 categorie di news (Dataset 20 Newsgroups).
+* **Sentiment Analysis:** Analisi binaria (Positivo/Negativo) del tono del testo.
+---
+## 🛠️ Installazione
+Il progetto richiede **Python 3.10**. Si consiglia l'uso di un virtual environment.
+### 1. Clona il repository
 ```bash
+git clone git@github.com:gaeparente/ngt-ai-platform.git
+cd ngt-ai-platform
 ```
+### 2. Setup dell'ambiente virtuale
 ```bash
+python -m venv .venv
+source .venv/bin/activate  # Su Linux/Mac
+# .venv\Scripts\activate   # Su Windows
 ```
+### 3. Installazione Dipendenze
+Il file requirements.txt è ottimizzato per installare le versioni CPU di PyTorch per risparmiare spazio.
 ```bash
+pip install -r requirements.txt
 ```
+Nota: Il sistema scaricherà automaticamente anche il modello linguistico italiano per spaCy (it_core_news_lg).
+### 📂 Struttura Cartelle e Modelli
+Affinché la piattaforma funzioni, è necessario posizionare i modelli addestrati nella cartella corretta. Assicurati che la struttura sia la seguente:
+ngt-ai-platform/
+├── app.py                  # Entry point dell'applicazione
+├── requirements.txt        # Dipendenze
+├── modules/                # Logica di business
+└── data/
+    ├── model/              # CARTELLA MODELLI (Non versionata)
+    │   ├── bpo_bert_model/ # Cartella del modello BERT addestrato
+    │   └── ...             # modelli addestrati da noi
+    ├── gallery/            # Immagini di esempio per la Demo
+    └── tokenizer/          # tokenizer per la BinaryClassification e MultiClassification
+### 🚀 Avvio Piattaforma
+Una volta installato tutto, avvia l'interfaccia web con:
+```bash
+python app.py
 ```
+L'applicazione sarà accessibile localmente all'indirizzo: 👉 http://127.0.0.1:7860
+### 📄 License
+Distributed under the Apache 2.0 License.

app.py CHANGED Viewed

@@ -1,43 +1,10 @@
 import gradio as gr
 import cv2
-import os
 from modules.binary_classification import binary_classification as binary
 from modules.image_classification import image_classification as image
 from modules.multilabel_classification import multi_classification as multi
 from modules.retina import predict_diabetic_retinopathy as retina_detector
-# -------------------------------------------------------------
-def binary_classification(text):
-    if text.strip():
-        return binary(text)
-    raise gr.Error('Il testo è obbligatorio!')
-def multi_classification(text):
-    if text.strip():
-        try:
-            return multi(text)
-        except Exception as e:
-            raise gr.Error(f'Errore nel modello: {str(e)}')
-    raise gr.Error('Il testo è obbligatorio!')
-def file_change(file):
-    if isinstance(file, list):
-        file = file[0]
-    if file:
-        return cv2.imread(file)
-    return None
-def image_classification(img):
-    if img is not None:
-        return image(img)
-    raise gr.Error('L\'immagine è obbligatoria!')
-def retina_classification(retina):
-    if retina is not None:
-        return retina_detector(retina)
-    raise gr.Error('L\'immagine è obbligatoria!')
 # --- CONFIGURAZIONE TEMA ---
 theme = gr.themes.Soft(
@@ -61,12 +28,10 @@ custom_css = """
     padding-bottom: 20px;
     border-bottom: 1px solid #e2e8f0;
 }
-.logo-container img {
-    margin-bottom: 4px !important;
-    object-fit: contain;
 }
 .header-text-col h1 {
     font-family: 'Inter', sans-serif !important;
     font-weight: 900 !important;
@@ -79,221 +44,275 @@ custom_css = """
     padding-bottom: 0 !important;
     line-height: 1.0 !important;
 }
 .header-text-col .subheader {
-    text-align: left !important;
-    color: #64748b;
-    font-size: 1.1em;
-    font-weight: 500;
-    margin-top: 0 !important;
-    padding-top: 0 !important;
 }
 /* --- 2. CUSTOM TABS STYLE (DESKTOP) --- */
-.tabs > .tab-nav {
-    border-bottom: none !important;
-    gap: 8px !important;
-    margin-bottom: 15px !important;
-}
 .tabs > .tab-nav > button {
-    border: 1px solid #e5e7eb !important;
-    border-radius: 10px !important;
-    background-color: white;
-    color: #475569 !important;
-    font-weight: 600 !important;
-    transition: all 0.2s ease-in-out;
-    padding: 6px 16px !important;
-}
-.tabs > .tab-nav > button:hover {
-    background-color: #f1f5f9 !important;
-    transform: translateY(-1px);
 }
 .tabs > .tab-nav > button.selected {
     background: linear-gradient(135deg, #8B5CF6 0%, #D65DB1 100%) !important;
-    color: white !important;
-    border: 1px solid transparent !important;
     box-shadow: 0 4px 12px rgba(139, 92, 246, 0.3) !important;
 }
-/* --- 3. PULSANTI PRIMARY --- */
 button.primary {
     background: linear-gradient(135deg, #8B5CF6 0%, #D65DB1 100%) !important;
-    border: none !important;
-    color: white !important;
-    transition: filter 0.2s;
-}
-button.primary:hover {
-    filter: brightness(1.1);
-    box-shadow: 0 4px 15px rgba(139, 92, 246, 0.4);
 }
-/* --- FIX ALLINEAMENTO ALTEZZA (Desktop) --- */
-.fixed-height {
-    height: 380px !important;
-    overflow: hidden !important;
 }
-.fixed-height button,
-.fixed-height .image-container,
-.fixed-height .upload-container {
-    height: 100% !important;
-    max_height: 100% !important;
-    min_height: 100% !important;
 }
-.fixed-height img {
-    object-fit: contain !important;
-    max_height: 100% !important;
 }
-/* --- 4. MOBILE RESPONSIVE (Aggiornato per i TAB) --- */
 @media (max-width: 768px) {
-    /* Header Stack */
-    .header-row {
-        flex-direction: column !important;
-        align-items: center !important;
-        text-align: center !important;
-        gap: 10px !important;
-    }
-    .header-text-col h1 {
-        text-align: center !important;
-        font-size: 1.8em !important;
-    }
-    .header-text-col .subheader {
-        text-align: center !important;
-    }
-    /* Layout Moduli Stack */
-    .responsive-row {
-        flex-direction: column !important;
-        display: flex !important;
-    }
-    .responsive-row > * {
-        width: 100% !important;
-        min-width: 100% !important;
-        margin-bottom: 15px !important;
-    }
-    /* --- NUOVA GESTIONE TAB MOBILE --- */
-    .tabs > .tab-nav {
-        flex-wrap: wrap !important;      /* Permette di andare a capo */
-        justify-content: center !important; /* Centra i bottoni */
-        gap: 6px !important;             /* Spazio ridotto tra i bottoni */
-    }
     .tabs > .tab-nav > button {
-        flex-grow: 1 !important;         /* Si allargano per riempire la riga */
-        text-align: center !important;
-        font-size: 0.85rem !important;   /* Testo leggermente più piccolo */
-        padding: 8px 10px !important;    /* Padding ottimizzato per il tocco */
-        margin: 0 !important;
-        width: auto !important;          /* Lascia decidere al flex-grow */
-        min-width: 45% !important;       /* Assicura che al massimo ci siano 2 tab per riga */
     }
 }
 footer {visibility: hidden}
 """
 with gr.Blocks(theme=theme, css=custom_css, title="NGT AI Platform") as demo:
     # --- HEADER ---
     with gr.Row(elem_classes="header-row"):
         with gr.Column(scale=0, min_width=80, elem_classes="logo-container"):
-            gr.Image(
-                value="data/icon.png",
-                show_label=False,
-                show_download_button=False,
-                show_share_button=False,
-                container=False,
-                show_fullscreen_button=False,
-                interactive=False,
-                height=80,
-                width=80
-            )
         with gr.Column(scale=1, elem_classes="header-text-col"):
-            gr.Markdown("""
-            <h1>NGT AI Platform</h1>
-            <div class='subheader'>Advanced Machine Learning Solutions</div>
-            """)
-    # --- TAB 1: Chest X-Ray ---
     with gr.Tab("🩻 Chest Diagnosis"):
         gr.Markdown("### 📥 Diagnostica Polmonare")
-        gr.Markdown("Carica una radiografia o selezionane una dalla gallery per rilevare **Polmonite** o **Tubercolosi**.")
-        # Aggiunta classe "responsive-row" e rimosso equal_height=True (lo gestiamo col CSS se serve)
         with gr.Row(elem_classes="responsive-row"):
             with gr.Column(scale=1):
                 with gr.Accordion("📂 1. Seleziona da Gallery", open=True):
-                    file_selected = gr.FileExplorer(
-                        root_dir="data/gallery/xray",
-                        file_count='single',
-                        height=274 # Su desktop usa questo, su mobile il layout stackerà
-                    )
             with gr.Column(scale=1):
-                image_input = gr.Image(type="numpy", height=320, label="2. Visualizzazione / Upload Manuale")
         with gr.Row():
             with gr.Column():
                 analyze_btn_chest = gr.Button("🔍 Avvia Diagnosi Clinica", variant="primary", size="lg")
                 image_output = gr.Label(num_top_classes=2, label="Risultato Predittivo")
         file_selected.change(file_change, inputs=file_selected, outputs=image_input)
         analyze_btn_chest.click(image_classification, inputs=image_input, outputs=image_output)
-    # --- TAB 2: Diabetic Retinopathy ---
     with gr.Tab("👁️ Diabetic Retinopathy"):
         gr.Markdown("### 📥 Analisi Retinica")
-        gr.Markdown("Deep Learning per la predizione della retinopatia diabetica.")
-        # Aggiunta classe "responsive-row"
         with gr.Row(elem_classes="responsive-row"):
             with gr.Column(scale=1):
                 with gr.Accordion("📂 1. Seleziona da Gallery", open=True):
-                    file_selected_dr = gr.FileExplorer(
-                        root_dir="data/gallery/retinopaty",
-                        file_count='single',
-                        height=274
-                    )
             with gr.Column(scale=1):
-                image_input_dr = gr.Image(type="numpy", height=320, label="2. Visualizzazione / Upload Manuale")
         with gr.Row():
             with gr.Column():
                 analyze_btn_dr = gr.Button("🔍 Analizza Retina", variant="primary", size="lg")
                 with gr.Group():
                     output_dr_label = gr.Label(label="Diagnosi Principale")
                     output_dr_prob = gr.Label(label="Probabilità Patologia")
         file_selected_dr.change(file_change, inputs=file_selected_dr, outputs=image_input_dr)
         analyze_btn_dr.click(retina_classification, inputs=image_input_dr, outputs=[output_dr_label, output_dr_prob])
-    # --- TAB 3: Review Classification ---
     with gr.Tab("📰 Topic Classification"):
         gr.Markdown("### Analisi Argomenti del Testo")
-        with gr.Row():
             with gr.Column():
                 multi_input = gr.Textbox(lines=5, placeholder="Incolla qui il testo...", label="Input")
                 analyze_btn_multi = gr.Button("🏷️ Classifica", variant="primary")
             with gr.Column():
                 multi_output = gr.Label(num_top_classes=5, label="Top Categorie")
-        gr.Examples(examples=[["La NASA ha lanciato un nuovo satellite."], ["Il prezzo della GPU è sceso."]], inputs=multi_input)
         analyze_btn_multi.click(multi_classification, inputs=multi_input, outputs=multi_output)
-    # --- TAB 4: Sentiment Analysis ---
     with gr.Tab("😊 Sentiment Analysis"):
         gr.Markdown("### Analisi del Sentiment")
-        with gr.Row():
             with gr.Column():
                 binary_input = gr.Textbox(lines=3, placeholder="Scrivi una recensione...", label="Input")
                 analyze_btn_bin = gr.Button("⚖️ Analizza", variant="primary")
             with gr.Column():
                 binary_output = gr.Label(label="Sentiment Score")
         analyze_btn_bin.click(binary_classification, inputs=binary_input, outputs=binary_output)
 if __name__ == "__main__":

 import gradio as gr
 import cv2
 from modules.binary_classification import binary_classification as binary
 from modules.image_classification import image_classification as image
 from modules.multilabel_classification import multi_classification as multi
 from modules.retina import predict_diabetic_retinopathy as retina_detector
+from modules.bpo_dispatcher import predict_bpo_ticket
 # --- CONFIGURAZIONE TEMA ---
 theme = gr.themes.Soft(
     padding-bottom: 20px;
     border-bottom: 1px solid #e2e8f0;
 }
+.h4-margin{
+    margin-left: 5px;
 }
+.logo-container img { margin-bottom: 4px !important; object-fit: contain; }
 .header-text-col h1 {
     font-family: 'Inter', sans-serif !important;
     font-weight: 900 !important;
     padding-bottom: 0 !important;
     line-height: 1.0 !important;
 }
 .header-text-col .subheader {
+    text-align: left !important; color: #64748b; font-size: 1.1em; font-weight: 500;
+    margin-top: 0 !important; padding-top: 0 !important;
 }
 /* --- 2. CUSTOM TABS STYLE (DESKTOP) --- */
+.tabs > .tab-nav { border-bottom: none !important; gap: 8px !important; margin-bottom: 15px !important; }
 .tabs > .tab-nav > button {
+    border: 1px solid #e5e7eb !important; border-radius: 10px !important;
+    background-color: white; color: #475569 !important; font-weight: 600 !important;
+    padding: 6px 16px !important; transition: all 0.2s;
 }
+.tabs > .tab-nav > button:hover { background-color: #f1f5f9 !important; transform: translateY(-1px); }
 .tabs > .tab-nav > button.selected {
     background: linear-gradient(135deg, #8B5CF6 0%, #D65DB1 100%) !important;
+    color: white !important; border: 1px solid transparent !important;
     box-shadow: 0 4px 12px rgba(139, 92, 246, 0.3) !important;
 }
+/* --- 3. COMPONENTS --- */
 button.primary {
     background: linear-gradient(135deg, #8B5CF6 0%, #D65DB1 100%) !important;
+    border: none !important; color: white !important; transition: filter 0.2s;
 }
+button.primary:hover { filter: brightness(1.1); box-shadow: 0 4px 15px rgba(139, 92, 246, 0.4); }
+.fixed-height { height: 380px !important; overflow: hidden !important; }
+.fixed-height button, .fixed-height .image-container, .fixed-height .upload-container {
+    height: 100% !important; max_height: 100% !important; min_height: 100% !important;
 }
+.fixed-height img { object-fit: contain !important; max_height: 100% !important; }
+/* Stile per la Model Card nel tab BPO */
+.model-card {
+    background: #f8fafc; border: 1px solid #e2e8f0; border-radius: 8px; padding: 15px;
+    font-size: 0.9em; color: #475569; margin-top: 10px;
 }
+.model-card strong{
+    color: #475569 !important
 }
+/* --- 4. MOBILE RESPONSIVE --- */
 @media (max-width: 768px) {
+    .header-row { flex-direction: column !important; align-items: center !important; text-align: center !important; gap: 10px !important; }
+    .header-text-col h1 { text-align: center !important; font-size: 1.8em !important; }
+    .header-text-col .subheader { text-align: center !important; }
+    .responsive-row { flex-direction: column !important; display: flex !important; }
+    .responsive-row > * { width: 100% !important; min-width: 100% !important; margin-bottom: 15px !important; }
+    .tabs > .tab-nav { flex-wrap: wrap !important; justify-content: center !important; gap: 6px !important; }
     .tabs > .tab-nav > button {
+        flex-grow: 1 !important; text-align: center !important; font-size: 0.85rem !important;
+        padding: 8px 10px !important; margin: 0 !important; width: auto !important; min-width: 45% !important;
     }
 }
 footer {visibility: hidden}
 """
+def binary_classification(text):
+    if text.strip(): return binary(text)
+    raise gr.Error('Il testo è obbligatorio!')
+def multi_classification(text):
+    if text.strip():
+        try: return multi(text)
+        except Exception as e: raise gr.Error(f'Errore nel modello: {str(e)}')
+    raise gr.Error('Il testo è obbligatorio!')
+def file_change(file):
+    if isinstance(file, list): file = file[0]
+    if file: return cv2.imread(file)
+    return None
+def image_classification(img):
+    if img is not None: return image(img)
+    raise gr.Error('L\'immagine è obbligatoria!')
+def retina_classification(retina):
+    if retina is not None: return retina_detector(retina)
+    raise gr.Error('L\'immagine è obbligatoria!')
+def render_ner_html(entities):
+    """
+    Trasforma la lista [('testo', 'LABEL'), ('testo', None)] in HTML puro.
+    """
+    # Mappa colori HEX (più belli e moderni)
+    colors = {
+        "CODICE CLIENTE": "#3b82f6", # Blue 500
+        "N. FATTURA": "#f97316",     # Orange 500
+        "COD. FORNITURA": "#d946ef", # Fuchsia 500
+        "EMAIL": "#ef4444",          # Red 500
+        "TELEFONO": "#06b6d4",       # Cyan 500
+        "PERSONA": "#22c55e",        # Green 500
+        "AZIENDA": "#8b5cf6",        # Violet 500
+        "LUOGO": "#64748b",          # Slate 500
+        "POSSIBILE ID": "#a8a29e"    # Stone 400
+    }
+    html = "<div style='line-height: 2.2; font-family: sans-serif; font-size: 16px; color: #334155; background-color: #1e293b'>"
+    for text, label in entities:
+        if label:
+            # Recupera colore o usa grigio di default
+            c = colors.get(label, "#1e293b")
+            # Crea lo "Chip" (Pillola colorata)
+            # - bg-color con opacità (c + '20')
+            # - border solido
+            # - label piccola in grassetto accanto al testo
+            html += f"""
+            <span style='background-color: {c}20; border: 1px solid {c}; border-radius: 6px; padding: 2px 6px; margin: 0 2px; white-space: nowrap;'>
+                <span style='font-size: 0.75em; font-weight: 700; color: {c}; text-transform: uppercase;'>{label}</span>
+                <span style='font-weight: 600; color: white; margin-left: 6px;'>{text}</span>
+            </span>
+            """
+        else:
+            # Testo normale
+            html += text.replace("\n", "<br>") # Gestisce a capo
+    html += "</div>"
+    return html
+def bpo_dispatch_logic(text):
+    """
+    Funzione Ponte: Chiama il modulo AI e decide l'azione di business.
+    Restituisce un aggiornamento COMPLETO del componente NER per pulire la grafica.
+    """
+    try:
+        # 1. Chiamata al modello reale
+        intent, urgency, entities = predict_bpo_ticket(text)
+        if intent is None:
+             raise gr.Error("Errore nel modello BPO. Verifica i log.")
+        # 2. Logica di Business
+        top_intent = max(intent, key=intent.get)
+        action = "Inoltro generico"
+        if top_intent == "Retention / Churn Risk":
+            action = "🚨 ALERT: Assegnazione coda 'Retention' + Chiamata Outbound"
+        elif top_intent == "Supporto Tecnico":
+            action = "🛠️ Apertura Ticket JIRA (Livello 1) - Priorità Tecnica"
+        elif top_intent == "Amministrazione / Billing":
+            action = "💰 Verifica insoluti su SAP + Inoltro Backoffice Amm.vo"
+        html_output = render_ner_html(entities)
+        return intent, urgency, action, html_output
+    except Exception as e:
+        raise gr.Error(f"Errore nell'analisi: {str(e)}")
 with gr.Blocks(theme=theme, css=custom_css, title="NGT AI Platform") as demo:
     # --- HEADER ---
     with gr.Row(elem_classes="header-row"):
         with gr.Column(scale=0, min_width=80, elem_classes="logo-container"):
+            gr.Image(value="data/icon.png", show_label=False, show_download_button=False, show_share_button=False, container=False, show_fullscreen_button=False, interactive=False, height=80, width=80)
         with gr.Column(scale=1, elem_classes="header-text-col"):
+            gr.Markdown("""<h1>AI Platform</h1><div class='subheader'>Advanced Machine Learning Solutions</div>""")
+    # --- TAB 1: BPO INTELLIGENT DISPATCHER ---
+    with gr.Tab("🧩 BPO Dispatcher"):
+        gr.Markdown("### Intelligent Ticket Routing & NER")
+        gr.Markdown("Sistema proprietario per l'analisi automatica dei ticket di assistenza. Il modello identifica l'intento, l'urgenza e i dati sensibili del cliente.")
+        with gr.Row(elem_classes="responsive-row"):
+            # INPUT
+            with gr.Column(scale=1):
+                bpo_input = gr.Textbox(lines=8, placeholder="Incolla qui il contenuto della mail o del ticket...", label="Contenuto Ticket / Email")
+                analyze_btn_bpo = gr.Button("⚡ Analizza Richiesta", variant="primary")
+                gr.HTML("""
+                <div class='model-card'>
+                    <strong>🛠️ Model Architecture:</strong> NGT-BERT-Custom (DistilBERT)<br>
+                    <strong>📚 Training Data:</strong> Synthetic BPO Dataset (2025)<br>
+                    <strong>🎯 Tasks:</strong> Intent Classification (Multi-class), Entity Extraction (NER)
+                </div>
+                """)
+            # OUTPUT
+            with gr.Column(scale=1):
+                with gr.Group():
+                    gr.Markdown("#### 📋 Analisi Processata", elem_classes="h4-margin")
+                    bpo_intent_output = gr.Label(num_top_classes=3, label="Intento Rilevato")
+                    with gr.Row():
+                        bpo_urgency_output = gr.Textbox(label="Livello Urgenza", scale=1)
+                        bpo_action_output = gr.Textbox(label="Azione Consigliata (Auto)", scale=1)
+                    gr.Markdown("#### 🔍 Dati Estratti (NER)", elem_classes="h4-margin")
+                    bpo_ner_output = gr.HTML(label="Visualizzazione Entità")
+        gr.Examples(
+            examples=[
+                ["Buongiorno, vi scrivo perché la fattura n. 99283 del mese scorso è sbagliata. Non ho consumato così tanto. Il mio codice cliente è 4599201. Attendo rettifica urgente."],
+                ["Salve, il servizio non funziona da ieri. Mi dà errore 504 sul router. Risolvete subito per favore!"],
+                ["Vorrei disdire il contratto con decorrenza immediata se non mi risolvete il problema."]
+            ],
+            inputs=bpo_input
+        )
+        analyze_btn_bpo.click(
+            bpo_dispatch_logic,
+            inputs=bpo_input,
+            outputs=[bpo_intent_output, bpo_urgency_output, bpo_action_output, bpo_ner_output]
+        )
+    # --- TAB 2: Chest X-Ray ---
     with gr.Tab("🩻 Chest Diagnosis"):
         gr.Markdown("### 📥 Diagnostica Polmonare")
+        # INPUT
         with gr.Row(elem_classes="responsive-row"):
             with gr.Column(scale=1):
                 with gr.Accordion("📂 1. Seleziona da Gallery", open=True):
+                    file_selected = gr.FileExplorer(root_dir="data/gallery/xray", file_count='single', elem_classes=["fixed-height"])
             with gr.Column(scale=1):
+                image_input = gr.Image(type="numpy", label="2. Visualizzazione", elem_classes=["fixed-height"])
+        # OUTPUT
         with gr.Row():
             with gr.Column():
                 analyze_btn_chest = gr.Button("🔍 Avvia Diagnosi Clinica", variant="primary", size="lg")
                 image_output = gr.Label(num_top_classes=2, label="Risultato Predittivo")
         file_selected.change(file_change, inputs=file_selected, outputs=image_input)
         analyze_btn_chest.click(image_classification, inputs=image_input, outputs=image_output)
+    # --- TAB 3: Diabetic Retinopathy ---
     with gr.Tab("👁️ Diabetic Retinopathy"):
         gr.Markdown("### 📥 Analisi Retinica")
+        # INPUT
         with gr.Row(elem_classes="responsive-row"):
             with gr.Column(scale=1):
                 with gr.Accordion("📂 1. Seleziona da Gallery", open=True):
+                    file_selected_dr = gr.FileExplorer(root_dir="data/gallery/retinopaty", file_count='single', elem_classes=["fixed-height"])
             with gr.Column(scale=1):
+                image_input_dr = gr.Image(type="numpy", label="2. Visualizzazione", elem_classes=["fixed-height"])
+        # OUTPUT
         with gr.Row():
             with gr.Column():
                 analyze_btn_dr = gr.Button("🔍 Analizza Retina", variant="primary", size="lg")
                 with gr.Group():
                     output_dr_label = gr.Label(label="Diagnosi Principale")
                     output_dr_prob = gr.Label(label="Probabilità Patologia")
         file_selected_dr.change(file_change, inputs=file_selected_dr, outputs=image_input_dr)
         analyze_btn_dr.click(retina_classification, inputs=image_input_dr, outputs=[output_dr_label, output_dr_prob])
+    # --- TAB 4: Review Classification ---
     with gr.Tab("📰 Topic Classification"):
         gr.Markdown("### Analisi Argomenti del Testo")
+        with gr.Row(elem_classes="responsive-row"):
+            # INPUT
             with gr.Column():
                 multi_input = gr.Textbox(lines=5, placeholder="Incolla qui il testo...", label="Input")
                 analyze_btn_multi = gr.Button("🏷️ Classifica", variant="primary")
+            # OUTPUT
             with gr.Column():
                 multi_output = gr.Label(num_top_classes=5, label="Top Categorie")
         analyze_btn_multi.click(multi_classification, inputs=multi_input, outputs=multi_output)
+    # --- TAB 5: Sentiment Analysis ---
     with gr.Tab("😊 Sentiment Analysis"):
         gr.Markdown("### Analisi del Sentiment")
+        with gr.Row(elem_classes="responsive-row"):
+            # INPUT
             with gr.Column():
                 binary_input = gr.Textbox(lines=3, placeholder="Scrivi una recensione...", label="Input")
                 analyze_btn_bin = gr.Button("⚖️ Analizza", variant="primary")
+            # OUTPUT
             with gr.Column():
                 binary_output = gr.Label(label="Sentiment Score")
         analyze_btn_bin.click(binary_classification, inputs=binary_input, outputs=binary_output)
 if __name__ == "__main__":

data/model/bpo_bert_model/config.json ADDED Viewed

	@@ -0,0 +1,39 @@

+{
+  "activation": "gelu",
+  "architectures": [
+    "DistilBertForSequenceClassification"
+  ],
+  "attention_dropout": 0.1,
+  "bos_token_id": null,
+  "dim": 768,
+  "dropout": 0.1,
+  "dtype": "float32",
+  "eos_token_id": null,
+  "hidden_dim": 3072,
+  "id2label": {
+    "0": "LABEL_0",
+    "1": "LABEL_1",
+    "2": "LABEL_2"
+  },
+  "initializer_range": 0.02,
+  "label2id": {
+    "LABEL_0": 0,
+    "LABEL_1": 1,
+    "LABEL_2": 2
+  },
+  "max_position_embeddings": 512,
+  "model_type": "distilbert",
+  "n_heads": 12,
+  "n_layers": 6,
+  "output_past": true,
+  "pad_token_id": 0,
+  "problem_type": "single_label_classification",
+  "qa_dropout": 0.1,
+  "seq_classif_dropout": 0.2,
+  "sinusoidal_pos_embds": false,
+  "tie_weights_": true,
+  "tie_word_embeddings": true,
+  "transformers_version": "5.0.0",
+  "use_cache": false,
+  "vocab_size": 119547
+}

data/model/bpo_bert_model/tokenizer.json ADDED Viewed

The diff for this file is too large to render. See raw diff

data/model/bpo_bert_model/tokenizer_config.json ADDED Viewed

	@@ -0,0 +1,14 @@

+{
+  "backend": "tokenizers",
+  "cls_token": "[CLS]",
+  "do_lower_case": false,
+  "is_local": false,
+  "mask_token": "[MASK]",
+  "model_max_length": 512,
+  "pad_token": "[PAD]",
+  "sep_token": "[SEP]",
+  "strip_accents": null,
+  "tokenize_chinese_chars": true,
+  "tokenizer_class": "DistilBertTokenizer",
+  "unk_token": "[UNK]"
+}

modules/bpo_dispatcher.py ADDED Viewed

	@@ -0,0 +1,192 @@

+import torch
+from transformers import DistilBertTokenizerFast, DistilBertForSequenceClassification
+import spacy
+import re
+import os
+import torch.nn.functional as F
+try:
+    from modules.binary_classification import binary_classification
+except ImportError:
+    print("⚠️ Modulo sentiment non trovato. L'urgenza sarà basata solo sulle keyword.")
+    binary_classification = None
+LABELS_MAP = {
+    0: "Amministrazione / Billing",
+    1: "Supporto Tecnico",
+    2: "Retention / Churn Risk"
+}
+class BPODispatcher:
+    def __init__(self, model_path="data/model/bpo_bert_model"):
+        self.model = None
+        self.tokenizer = None
+        self.nlp = None
+        self.device = "cpu"
+        # 1. BERT
+        if os.path.exists(model_path):
+            try:
+                self.tokenizer = DistilBertTokenizerFast.from_pretrained(model_path)
+                self.model = DistilBertForSequenceClassification.from_pretrained(model_path)
+                self.model.to(self.device)
+                self.model.eval()
+                print("✅ Modello BERT caricato.")
+            except Exception as e:
+                print(f"❌ Errore BERT: {e}")
+        # 2. spaCy
+        try:
+            self.nlp = spacy.load("it_core_news_lg")
+            print("✅ spaCy caricato.")
+        except Exception as e:
+            print(f"❌ Errore spaCy: {e}")
+    def _extract_smart_entities(self, text):
+        entities = []
+        occupied_spans = [] # Tiene traccia delle zone di testo già etichettate
+        def is_overlapping(start, end):
+            """Controlla se la posizione è già occupata"""
+            for occ_start, occ_end in occupied_spans:
+                if (start < occ_end) and (end > occ_start):
+                    return True
+            return False
+        def add_entity(text_val, label, start, end):
+            """Aggiunge l'entità solo se non si sovrappone e la registra"""
+            if not is_overlapping(start, end):
+                entities.append((text_val, label))
+                occupied_spans.append((start, end))
+        # --- FASE 1: REGEX ALTA PRIORITÀ (Dati Strutturati) ---
+        # A. EMAIL
+        for m in re.finditer(r'\b[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Z|a-z]{2,}\b', text):
+            add_entity(m.group(), "EMAIL", m.start(), m.end())
+        # B. TELEFONO (Mobile e Fisso Italiano)
+        # Cerca pattern tipo 3xx... o 0x... con spazi opzionali
+        for m in re.finditer(r'\b(?:3\d{2}|0\d{1,4})[\s.-]?\d{6,10}\b', text):
+            add_entity(m.group(), "TELEFONO", m.start(), m.end())
+        # C. NUMERI CONTESTUALI (Fatture, Clienti, Forniture)
+        # Regex migliorata: Accetta anche alfanumerici per codici cliente
+        # Pattern: Parola che inizia o finisce con cifra, lunga 4-15 chars
+        candidates = re.finditer(r'\b(?=[A-Za-z0-9]*\d)[A-Za-z0-9]{4,15}\b', text)
+        window_size = 35
+        for match in candidates:
+            val = match.group()
+            start, end = match.span()
+            # Se è già un telefono o mail, salta
+            if is_overlapping(start, end): continue
+            context = text[max(0, start - window_size):start].lower()
+            # 1. Fatture
+            if any(w in context for w in ["fattura", "bolletta", "nota", "nr.", "n."]):
+                # Verifica extra: le fatture solitamente sono solo numeri o hanno /
+                if val.isdigit() or '/' in val:
+                    add_entity(val, "N. FATTURA", start, end)
+                    continue
+            # 2. Forniture (POD/PDR/Luce/Gas)
+            if any(w in context for w in ["luce", "gas", "fornitura", "pod", "pdr", "contatore"]):
+                add_entity(val, "COD. FORNITURA", start, end)
+                continue
+            # 3. Codici Cliente (più generico, accetta alfanumerici)
+            if any(w in context for w in ["cliente", "codice", "utenza", "pratica", "id"]):
+                add_entity(val, "CODICE CLIENTE", start, end)
+                continue
+        # --- FASE 2: SPACY BASSA PRIORITÀ (Entità Semantiche) ---
+        if self.nlp:
+            doc = self.nlp(text)
+            for ent in doc.ents:
+                # VALIDAZIONE ANTI-ALLUCINAZIONE
+                # Regola: Una PERSONA non può contenere cifre
+                if ent.label_ == "PER":
+                    if any(char.isdigit() for char in ent.text):
+                        continue # Scarta "25458958" classificato come Persona
+                    if len(ent.text) < 3:
+                        continue # Scarta nomi troppo corti
+                    add_entity(ent.text, "PERSONA", ent.start_char, ent.end_char)
+                elif ent.label_ == "ORG":
+                    add_entity(ent.text, "AZIENDA", ent.start_char, ent.end_char)
+        return entities
+    def _calculate_smart_urgency(self, text, intent_label):
+        """
+        MATRICE DI URGENZA (Intent + Sentiment)
+        Combina la gravità del problema con lo stato d'animo del cliente.
+        """
+        urgency = "Bassa"
+        # 1. Analisi Sentiment (Se disponibile)
+        sentiment_score_neg = 0.0
+        if binary_classification:
+            try:
+                # binary_classification restituisce un dict {'POSITIVE': 0.x, 'NEGATIVE': 0.y}
+                sent_result = binary_classification(text)
+                sentiment_score_neg = sent_result.get('NEGATIVE', 0.0)
+            except Exception:
+                sentiment_score_neg = 0.5 # Fallback neutro
+        # 2. Matrice Decisionale
+        # CASO A: CHURN (Disdetta) -> Sempre Critico
+        if intent_label == "Retention / Churn Risk":
+            return "CRITICA (Rischio Abbandono)"
+        # CASO B: SUPPORTO TECNICO
+        elif intent_label == "Supporto Tecnico":
+            if sentiment_score_neg > 0.9: # Molto arrabbiato
+                return "ALTA (Tecnico + Cliente Furioso)"
+            elif "fermo" in text.lower() or "blocco" in text.lower():
+                return "ALTA (Fermo Servizio)"
+            else:
+                return "MEDIA (Guasto Standard)"
+        # CASO C: AMMINISTRAZIONE
+        elif intent_label == "Amministrazione / Billing":
+            if sentiment_score_neg > 0.95: # Furioso per i soldi
+                return "ALTA (Contestazione Aggressiva)"
+            elif "scadenza" in text.lower() or "stacco" in text.lower():
+                return "MEDIA (Rischio Amministrativo)"
+            else:
+                return "BASSA (Info / Richiesta)"
+        return urgency
+    def predict(self, text):
+        if self.model is None: return None, "Errore", []
+        if not text.strip(): return None, "Vuoto", []
+        # 1. Intent Classification (BERT)
+        inputs = self.tokenizer(text, return_tensors="pt", truncation=True, max_length=128, padding=True)
+        inputs = {k: v.to(self.device) for k, v in inputs.items()}
+        with torch.no_grad():
+            outputs = self.model(**inputs)
+        probs = F.softmax(outputs.logits, dim=-1)
+        label_output = {LABELS_MAP[i]: float(probs[0][i]) for i in range(len(LABELS_MAP))}
+        # Prendi l'intento vincente
+        top_idx = torch.max(probs, dim=-1)[1].item()
+        predicted_label = LABELS_MAP[top_idx]
+        # 2. Urgenza Intelligente (AI + Sentiment + Rules)
+        urgency = self._calculate_smart_urgency(text, predicted_label)
+        # 3. NER Extraction
+        entities = self._extract_smart_entities(text)
+        return label_output, urgency, entities
+dispatcher = BPODispatcher()
+def predict_bpo_ticket(text): return dispatcher.predict(text)

requirements.txt CHANGED Viewed

@@ -1,19 +1,34 @@
-# --- CORE LIBRARIES ---
-pydantic==2.10.6
-spacy==3.8.2
 gradio==4.44.1
-# --- LEGACY STACK (Non toccare) ---
 tensorflow==2.12.0
 numpy==1.23.5
 keras==2.12.0
 Keras-Preprocessing>=1.1.2
-# --- UTILITIES ---
 nltk>=3.8.1
 opencv-python-headless
-huggingface-hub==0.24.0
-# --- MODELS ---
 https://github.com/explosion/spacy-models/releases/download/it_core_news_lg-3.8.0/it_core_news_lg-3.8.0.tar.gz

+# --- DIRETTIVE DI INSTALLAZIONE (Magia per CPU) ---
+# Questa riga dice a pip di cercare le versioni CPU di Torch, evitando download giganti (CUDA)
+--extra-index-url https://download.pytorch.org/whl/cpu
+# --- CORE UI ---
 gradio==4.44.1
+pydantic==2.10.6
+# --- LEGACY STACK (NON TOCCARE - Vincoli rigidi) ---
+# TensorFlow 2.12 richiede numpy < 1.24. Teniamo bloccato numpy a 1.23.5
 tensorflow==2.12.0
 numpy==1.23.5
 keras==2.12.0
 Keras-Preprocessing>=1.1.2
+# --- Modulo BPO ---
+# Grazie alla prima riga, scaricherà la versione CPU-only leggera
+torch>=2.0.1
+transformers>=4.35.0
+accelerate>=0.25.0
+# --- DATA & NLP UTILITIES ---
+pandas>=2.0.0
+spacy==3.8.2
 nltk>=3.8.1
+scikit-learn>=1.3.0
+# --- IMAGE PROCESSING ---
 opencv-python-headless
+# --- MODELS & HUB ---
+huggingface-hub==0.24.7
+# Link diretto per scaricare il modello Spacy italiano
 https://github.com/explosion/spacy-models/releases/download/it_core_news_lg-3.8.0/it_core_news_lg-3.8.0.tar.gz