Md Shahabul Alam commited on
Commit ·
29db30b
1
Parent(s): 865fb68
Deploy NEXUS Streamlit demo to HuggingFace Spaces
Browse files- Dockerfile +43 -0
- README.md +189 -7
- app.py +57 -0
- models/linear_probes/anemia_classifier_metadata.json +30 -0
- models/linear_probes/bilirubin_regression_results.json +207 -0
- models/linear_probes/cry_classifier_metadata.json +31 -0
- models/linear_probes/jaundice_classifier_metadata.json +31 -0
- models/linear_probes/linear_probe_results.json +18 -0
- requirements_spaces.txt +31 -0
- src/demo/__init__.py +0 -0
- src/demo/streamlit_app.py +1189 -0
- src/nexus/__init__.py +10 -0
- src/nexus/agentic_workflow.py +1296 -0
- src/nexus/anemia_detector.py +580 -0
- src/nexus/clinical_synthesizer.py +548 -0
- src/nexus/cry_analyzer.py +662 -0
- src/nexus/hear_preprocessing.py +320 -0
- src/nexus/jaundice_detector.py +716 -0
- src/nexus/pipeline.py +663 -0
Dockerfile
ADDED
|
@@ -0,0 +1,43 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
# HuggingFace Spaces Docker SDK — NEXUS Streamlit Demo
# Docs: https://huggingface.co/docs/hub/spaces-sdks-docker

FROM python:3.12-slim

# Create non-root user (required by HF Spaces)
RUN useradd -m -u 1000 user
ENV PATH="/home/user/.local/bin:$PATH"

# Install system dependencies for audio processing
RUN apt-get update && apt-get install -y --no-install-recommends \
    libsndfile1 \
    ffmpeg \
    && rm -rf /var/lib/apt/lists/*

WORKDIR /app

# Switch to non-root user BEFORE pip install so packages land in
# /home/user/.local (which is already on PATH above). Previously the
# install ran as root into system site-packages, making the PATH entry dead.
USER user

# Copy requirements and install as user
COPY --chown=user ./requirements_spaces.txt requirements_spaces.txt
RUN pip install --no-cache-dir --upgrade -r requirements_spaces.txt

# Copy source code
COPY --chown=user ./src/ src/
COPY --chown=user ./models/ models/
COPY --chown=user ./app.py .

# Set environment
ENV PYTHONPATH=/app/src
ENV STREAMLIT_SERVER_PORT=7860
ENV STREAMLIT_SERVER_ADDRESS=0.0.0.0
ENV STREAMLIT_SERVER_HEADLESS=true
ENV STREAMLIT_BROWSER_GATHER_USAGE_STATS=false

EXPOSE 7860

CMD ["python", "-m", "streamlit", "run", "src/demo/streamlit_app.py", \
     "--server.port=7860", \
     "--server.address=0.0.0.0", \
     "--server.headless=true", \
     "--browser.gatherUsageStats=false"]
README.md
CHANGED
|
@@ -1,12 +1,194 @@
|
|
| 1 |
---
|
| 2 |
-
title:
|
| 3 |
-
emoji:
|
| 4 |
colorFrom: blue
|
| 5 |
-
colorTo:
|
| 6 |
sdk: docker
|
| 7 |
-
|
| 8 |
-
|
| 9 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 10 |
---
|
| 11 |
|
| 12 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
+
title: NEXUS
|
| 3 |
+
emoji: "\U0001FA7A"
|
| 4 |
colorFrom: blue
|
| 5 |
+
colorTo: green
|
| 6 |
sdk: docker
|
| 7 |
+
app_port: 7860
|
| 8 |
+
pinned: true
|
| 9 |
+
license: cc-by-4.0
|
| 10 |
+
tags:
|
| 11 |
+
- medgemma
|
| 12 |
+
- medical-ai
|
| 13 |
+
- hai-def
|
| 14 |
+
- maternal-health
|
| 15 |
+
- neonatal-care
|
| 16 |
---
|
| 17 |
|
| 18 |
+
# NEXUS - AI-Powered Maternal-Neonatal Assessment Platform
|
| 19 |
+
|
| 20 |
+
> Non-invasive screening for maternal anemia, neonatal jaundice, and birth asphyxia using Google HAI-DEF models
|
| 21 |
+
|
| 22 |
+
[](https://creativecommons.org/licenses/by/4.0/)
|
| 23 |
+
[](https://www.kaggle.com/competitions/med-gemma-impact-challenge)
|
| 24 |
+
|
| 25 |
+
## Overview
|
| 26 |
+
|
| 27 |
+
NEXUS transforms smartphones into diagnostic screening tools for Community Health Workers in low-resource settings. Using 3 Google HAI-DEF models in a 6-agent clinical workflow, it provides non-invasive assessment for:
|
| 28 |
+
|
| 29 |
+
- **Maternal anemia** from conjunctiva images (MedSigLIP)
|
| 30 |
+
- **Neonatal jaundice** from skin images with bilirubin regression (MedSigLIP)
|
| 31 |
+
- **Birth asphyxia** from cry audio analysis (HeAR)
|
| 32 |
+
- **Clinical synthesis** with WHO IMNCI protocol alignment (MedGemma)
|
| 33 |
+
|
| 34 |
+
## HAI-DEF Models
|
| 35 |
+
|
| 36 |
+
| Model | HuggingFace ID | Purpose |
|
| 37 |
+
|-------|----------------|---------|
|
| 38 |
+
| **MedSigLIP** | `google/medsiglip-448` | Anemia + jaundice detection, bilirubin regression |
|
| 39 |
+
| **HeAR** | `google/hear-pytorch` | Cry audio analysis for birth asphyxia |
|
| 40 |
+
| **MedGemma 4B** | `google/medgemma-4b-it` | Clinical reasoning and synthesis |
|
| 41 |
+
|
| 42 |
+
## Architecture
|
| 43 |
+
|
| 44 |
+
```
|
| 45 |
+
6-Agent Clinical Workflow:
|
| 46 |
+
Triage -> Image Analysis (MedSigLIP) -> Audio Analysis (HeAR)
|
| 47 |
+
-> WHO Protocol -> Referral Decision -> Clinical Synthesis (MedGemma)
|
| 48 |
+
|
| 49 |
+
Each agent produces structured reasoning traces for a full audit trail.
|
| 50 |
+
```
|
| 51 |
+
|
| 52 |
+
## Quick Start
|
| 53 |
+
|
| 54 |
+
### Prerequisites
|
| 55 |
+
- Python 3.10+
|
| 56 |
+
- HuggingFace token (for gated HAI-DEF models)
|
| 57 |
+
|
| 58 |
+
### Setup
|
| 59 |
+
|
| 60 |
+
```bash
|
| 61 |
+
# Clone and install
|
| 62 |
+
git clone <repo-url>
|
| 63 |
+
cd nexus
|
| 64 |
+
pip install -r requirements.txt
|
| 65 |
+
|
| 66 |
+
# Set HuggingFace token (required for MedSigLIP, MedGemma)
|
| 67 |
+
export HF_TOKEN=hf_your_token_here
|
| 68 |
+
```
|
| 69 |
+
|
| 70 |
+
### Run the Demo
|
| 71 |
+
|
| 72 |
+
```bash
|
| 73 |
+
# Streamlit interactive demo
|
| 74 |
+
PYTHONPATH=src streamlit run src/demo/streamlit_app.py
|
| 75 |
+
|
| 76 |
+
# FastAPI backend
|
| 77 |
+
PYTHONPATH=src uvicorn api.main:app --reload
|
| 78 |
+
|
| 79 |
+
# Run tests
|
| 80 |
+
PYTHONPATH=src python -m pytest tests/ -v
|
| 81 |
+
```
|
| 82 |
+
|
| 83 |
+
### Train Models
|
| 84 |
+
|
| 85 |
+
```bash
|
| 86 |
+
# Train linear probes (anemia + jaundice classifiers)
|
| 87 |
+
PYTHONPATH=src python scripts/training/train_linear_probes.py
|
| 88 |
+
|
| 89 |
+
# Train bilirubin regression head
|
| 90 |
+
PYTHONPATH=src python scripts/training/finetune_bilirubin_regression.py
|
| 91 |
+
```
|
| 92 |
+
|
| 93 |
+
### HuggingFace Spaces
|
| 94 |
+
|
| 95 |
+
```bash
|
| 96 |
+
# Local test of HF Spaces entry point
|
| 97 |
+
python app.py
|
| 98 |
+
```
|
| 99 |
+
|
| 100 |
+
## Project Structure
|
| 101 |
+
|
| 102 |
+
```
|
| 103 |
+
nexus/
|
| 104 |
+
├── src/nexus/ # Core platform
|
| 105 |
+
│ ├── anemia_detector.py # MedSigLIP anemia detection
|
| 106 |
+
│ ├── jaundice_detector.py # MedSigLIP jaundice + bilirubin regression
|
| 107 |
+
│ ├── cry_analyzer.py # HeAR cry analysis
|
| 108 |
+
│ ├── clinical_synthesizer.py # MedGemma clinical synthesis
|
| 109 |
+
│ ├── agentic_workflow.py # 6-agent workflow engine
|
| 110 |
+
│ └── pipeline.py # Unified assessment pipeline
|
| 111 |
+
├── src/demo/streamlit_app.py # Interactive Streamlit demo
|
| 112 |
+
├── api/main.py # FastAPI backend
|
| 113 |
+
├── scripts/
|
| 114 |
+
│ ├── training/
|
| 115 |
+
│ │ ├── train_linear_probes.py # MedSigLIP embedding classifiers
|
| 116 |
+
│ │ ├── finetune_bilirubin_regression.py # Novel bilirubin regression
|
| 117 |
+
│ │ ├── train_anemia.py # Anemia-specific training
|
| 118 |
+
│ │ ├── train_jaundice.py # Jaundice-specific training
|
| 119 |
+
│ │ └── train_cry.py # Cry classifier training
|
| 120 |
+
│ └── edge/
|
| 121 |
+
│ ├── quantize_models.py # INT8 quantization
|
| 122 |
+
│ └── export_embeddings.py # Pre-computed text embeddings
|
| 123 |
+
├── notebooks/
|
| 124 |
+
│ ├── 01_anemia_detection.ipynb
|
| 125 |
+
│ ├── 02_jaundice_detection.ipynb
|
| 126 |
+
│ ├── 03_cry_analysis.ipynb
|
| 127 |
+
│ └── 04_bilirubin_regression.ipynb # Novel task reproducibility
|
| 128 |
+
├── tests/
|
| 129 |
+
│ ├── test_pipeline.py # Pipeline tests
|
| 130 |
+
│ ├── test_agentic_workflow.py # Agentic workflow tests (41 tests)
|
| 131 |
+
│ └── test_hai_def_integration.py # HAI-DEF model compliance
|
| 132 |
+
├── models/
|
| 133 |
+
│ ├── linear_probes/ # Trained classifiers + regressor
|
| 134 |
+
│ └── edge/ # Quantized models + embeddings
|
| 135 |
+
├── data/
|
| 136 |
+
│ ├── raw/ # Raw datasets (Eyes-Defy-Anemia, NeoJaundice, CryCeleb)
|
| 137 |
+
│ └── protocols/ # WHO IMNCI protocols
|
| 138 |
+
├── submission/
|
| 139 |
+
│ ├── writeup.md # Competition writeup (3 pages)
|
| 140 |
+
│ └── video/ # Demo video script and assets
|
| 141 |
+
├── app.py # HuggingFace Spaces entry point
|
| 142 |
+
├── requirements.txt # Full dependencies
|
| 143 |
+
└── requirements_spaces.txt # HF Spaces minimal dependencies
|
| 144 |
+
```
|
| 145 |
+
|
| 146 |
+
## Key Results
|
| 147 |
+
|
| 148 |
+
| Task | Method | Performance |
|
| 149 |
+
|------|--------|-------------|
|
| 150 |
+
| Anemia zero-shot | MedSigLIP (max-similarity, 8 prompts/class) | Screening capability |
|
| 151 |
+
| Jaundice classification | MedSigLIP linear probe | 68.9% accuracy |
|
| 152 |
+
| **Bilirubin regression** | **MedSigLIP + MLP head** | **MAE: 2.564 mg/dL, r=0.78** |
|
| 153 |
+
| Cry analysis | HeAR + acoustic features | Qualitative assessment |
|
| 154 |
+
| Clinical synthesis | MedGemma + WHO IMNCI | Protocol-aligned recommendations |
|
| 155 |
+
|
| 156 |
+
### Novel Task: Bilirubin Regression
|
| 157 |
+
Frozen MedSigLIP embeddings -> 2-layer MLP -> continuous bilirubin (mg/dL) prediction.
|
| 158 |
+
Trained on 2,235 NeoJaundice images with ground truth serum bilirubin.
|
| 159 |
+
**MAE: 2.564 mg/dL, Pearson r: 0.7783 (p < 1e-68)**
|
| 160 |
+
|
| 161 |
+
### Edge AI
|
| 162 |
+
- INT8 dynamic quantization: 812.6 MB -> 111.2 MB (7.31x compression)
|
| 163 |
+
- Pre-computed text embeddings: 12 KB (no text encoder on device)
|
| 164 |
+
- Total on-device: ~289 MB
|
| 165 |
+
|
| 166 |
+
## Competition Tracks
|
| 167 |
+
|
| 168 |
+
- **Main Track**: Comprehensive maternal-neonatal assessment platform
|
| 169 |
+
- **Agentic Workflow Prize**: 6-agent pipeline with reasoning traces and audit trail
|
| 170 |
+
|
| 171 |
+
## Tests
|
| 172 |
+
|
| 173 |
+
```bash
|
| 174 |
+
# All tests
|
| 175 |
+
PYTHONPATH=src python -m pytest tests/ -v
|
| 176 |
+
|
| 177 |
+
# Agentic workflow only (41 tests)
|
| 178 |
+
PYTHONPATH=src python -m pytest tests/test_agentic_workflow.py -v
|
| 179 |
+
```
|
| 180 |
+
|
| 181 |
+
## License
|
| 182 |
+
|
| 183 |
+
[CC BY 4.0](LICENSE)
|
| 184 |
+
|
| 185 |
+
## Acknowledgments
|
| 186 |
+
|
| 187 |
+
- Google Health AI Developer Foundations team
|
| 188 |
+
- NeoJaundice dataset (Figshare)
|
| 189 |
+
- Eyes-Defy-Anemia dataset (Kaggle)
|
| 190 |
+
- WHO IMNCI protocol guidelines
|
| 191 |
+
|
| 192 |
+
---
|
| 193 |
+
|
| 194 |
+
Built with Google HAI-DEF for the MedGemma Impact Challenge 2026
|
app.py
ADDED
|
@@ -0,0 +1,57 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
"""
NEXUS - HuggingFace Spaces Entry Point

Launches the Streamlit demo for the NEXUS Maternal-Neonatal Care Platform.
Built with Google HAI-DEF models for the MedGemma Impact Challenge 2026.
"""

import os
import subprocess
import sys
from pathlib import Path

# Make src/ importable: the demo app and the nexus package live under src/.
ROOT = Path(__file__).parent
SRC_DIR = ROOT / "src"
if str(SRC_DIR) not in sys.path:
    sys.path.insert(0, str(SRC_DIR))

# Streamlit defaults suitable for HF Spaces; values already present in the
# environment take precedence (setdefault never overwrites).
_STREAMLIT_DEFAULTS = {
    "STREAMLIT_SERVER_PORT": "7860",
    "STREAMLIT_SERVER_ADDRESS": "0.0.0.0",
    "STREAMLIT_SERVER_HEADLESS": "true",
    "STREAMLIT_BROWSER_GATHER_USAGE_STATS": "false",
}
for _name, _default in _STREAMLIT_DEFAULTS.items():
    os.environ.setdefault(_name, _default)
def main():
    """Launch the Streamlit demo as a child process.

    Resolves the app path, honours a platform-assigned ``PORT`` (HF Spaces
    provides one) over the module default, and runs Streamlit with the
    configured server options. Exits non-zero on any failure so the Space
    is reported as unhealthy.
    """
    app_path = SRC_DIR / "demo" / "streamlit_app.py"
    if not app_path.exists():
        # Diagnostics belong on stderr, not stdout.
        print(f"ERROR: Streamlit app not found at {app_path}", file=sys.stderr)
        sys.exit(1)

    # A platform-provided PORT takes precedence over our configured default.
    port = os.environ.get("PORT", os.environ["STREAMLIT_SERVER_PORT"])
    os.environ["STREAMLIT_SERVER_PORT"] = str(port)

    # Prepend src/ to PYTHONPATH instead of clobbering an existing value,
    # so the child process keeps any paths the caller already configured.
    existing = os.environ.get("PYTHONPATH", "")
    pythonpath = os.pathsep.join(p for p in (str(SRC_DIR), existing) if p)

    try:
        subprocess.run(
            [
                sys.executable, "-m", "streamlit", "run",
                str(app_path),
                f"--server.port={port}",
                f"--server.address={os.environ['STREAMLIT_SERVER_ADDRESS']}",
                f"--server.headless={os.environ['STREAMLIT_SERVER_HEADLESS']}",
                f"--browser.gatherUsageStats={os.environ['STREAMLIT_BROWSER_GATHER_USAGE_STATS']}",
            ],
            check=True,
            env={**os.environ, "PYTHONPATH": pythonpath},
        )
    except subprocess.CalledProcessError as e:
        print(f"ERROR: Streamlit process exited with code {e.returncode}", file=sys.stderr)
        sys.exit(e.returncode)
    except FileNotFoundError:
        print("ERROR: Streamlit not installed. Run: pip install streamlit", file=sys.stderr)
        sys.exit(1)


if __name__ == "__main__":
    main()
|
models/linear_probes/anemia_classifier_metadata.json
ADDED
|
@@ -0,0 +1,30 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"model_type": "SVM_RBF",
|
| 3 |
+
"embedding_source": "MedSigLIP (google/medsiglip-448)",
|
| 4 |
+
"embedding_dim": 1152,
|
| 5 |
+
"num_classes": 2,
|
| 6 |
+
"classes": {
|
| 7 |
+
"healthy": 0,
|
| 8 |
+
"anemic": 1
|
| 9 |
+
},
|
| 10 |
+
"cv_accuracy_mean": 0.9994269340974211,
|
| 11 |
+
"cv_accuracy_std": 0.0011461318051575909,
|
| 12 |
+
"num_original_samples": 218,
|
| 13 |
+
"num_augmented_samples": 1744,
|
| 14 |
+
"augmentations_per_image": 7,
|
| 15 |
+
"all_results": {
|
| 16 |
+
"LogisticRegression": {
|
| 17 |
+
"mean_accuracy": 0.8985096993050752,
|
| 18 |
+
"std_accuracy": 0.008415256920621202
|
| 19 |
+
},
|
| 20 |
+
"SVM_RBF": {
|
| 21 |
+
"mean_accuracy": 0.9994269340974211,
|
| 22 |
+
"std_accuracy": 0.0011461318051575909
|
| 23 |
+
},
|
| 24 |
+
"SVM_Linear": {
|
| 25 |
+
"mean_accuracy": 0.8899186509896915,
|
| 26 |
+
"std_accuracy": 0.011746435929843532
|
| 27 |
+
}
|
| 28 |
+
},
|
| 29 |
+
"seed": 42
|
| 30 |
+
}
|
models/linear_probes/bilirubin_regression_results.json
ADDED
|
@@ -0,0 +1,207 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"mae": 2.564,
|
| 3 |
+
"rmse": 3.416,
|
| 4 |
+
"pearson_r": 0.7783,
|
| 5 |
+
"pearson_p": 1.7171921789198235e-69,
|
| 6 |
+
"bland_altman": {
|
| 7 |
+
"mean_diff": -0.506,
|
| 8 |
+
"std_diff": 3.379,
|
| 9 |
+
"loa_upper": 6.116,
|
| 10 |
+
"loa_lower": -7.129
|
| 11 |
+
},
|
| 12 |
+
"test_size": 336,
|
| 13 |
+
"train_size": 1563,
|
| 14 |
+
"val_size": 336,
|
| 15 |
+
"input_dim": 1152,
|
| 16 |
+
"hidden_dim": 256,
|
| 17 |
+
"epochs_trained": 58,
|
| 18 |
+
"best_val_loss": 3.7143,
|
| 19 |
+
"bilirubin_range": {
|
| 20 |
+
"min": 0.0,
|
| 21 |
+
"max": 25.7,
|
| 22 |
+
"mean": 11.2,
|
| 23 |
+
"std": 5.2
|
| 24 |
+
},
|
| 25 |
+
"history": {
|
| 26 |
+
"train_loss": [
|
| 27 |
+
17.543,
|
| 28 |
+
12.0442,
|
| 29 |
+
6.9412,
|
| 30 |
+
4.2175,
|
| 31 |
+
3.5428,
|
| 32 |
+
3.4781,
|
| 33 |
+
3.0782,
|
| 34 |
+
2.8347,
|
| 35 |
+
2.7914,
|
| 36 |
+
2.5293,
|
| 37 |
+
2.393,
|
| 38 |
+
2.2627,
|
| 39 |
+
2.1357,
|
| 40 |
+
2.1498,
|
| 41 |
+
1.875,
|
| 42 |
+
2.0569,
|
| 43 |
+
1.843,
|
| 44 |
+
1.7077,
|
| 45 |
+
1.7084,
|
| 46 |
+
1.6893,
|
| 47 |
+
1.7543,
|
| 48 |
+
2.0793,
|
| 49 |
+
2.1218,
|
| 50 |
+
2.1285,
|
| 51 |
+
2.0992,
|
| 52 |
+
1.9611,
|
| 53 |
+
1.93,
|
| 54 |
+
1.8854,
|
| 55 |
+
1.9694,
|
| 56 |
+
1.6901,
|
| 57 |
+
1.699,
|
| 58 |
+
1.7061,
|
| 59 |
+
1.5767,
|
| 60 |
+
1.6265,
|
| 61 |
+
1.5394,
|
| 62 |
+
1.4675,
|
| 63 |
+
1.3684,
|
| 64 |
+
1.4486,
|
| 65 |
+
1.2866,
|
| 66 |
+
1.3152,
|
| 67 |
+
1.2613,
|
| 68 |
+
1.1721,
|
| 69 |
+
1.1946,
|
| 70 |
+
1.2039,
|
| 71 |
+
1.1949,
|
| 72 |
+
1.129,
|
| 73 |
+
1.0557,
|
| 74 |
+
1.0699,
|
| 75 |
+
1.0325,
|
| 76 |
+
1.0427,
|
| 77 |
+
1.0431,
|
| 78 |
+
1.0722,
|
| 79 |
+
1.0071,
|
| 80 |
+
1.0187,
|
| 81 |
+
0.8847,
|
| 82 |
+
0.9988,
|
| 83 |
+
0.942,
|
| 84 |
+
0.9464
|
| 85 |
+
],
|
| 86 |
+
"val_loss": [
|
| 87 |
+
18.4316,
|
| 88 |
+
13.9118,
|
| 89 |
+
6.9486,
|
| 90 |
+
4.5588,
|
| 91 |
+
5.5443,
|
| 92 |
+
4.184,
|
| 93 |
+
4.8748,
|
| 94 |
+
4.0967,
|
| 95 |
+
4.0286,
|
| 96 |
+
4.1705,
|
| 97 |
+
4.0592,
|
| 98 |
+
3.921,
|
| 99 |
+
4.1161,
|
| 100 |
+
4.0279,
|
| 101 |
+
3.9931,
|
| 102 |
+
3.8783,
|
| 103 |
+
3.8742,
|
| 104 |
+
3.8394,
|
| 105 |
+
3.949,
|
| 106 |
+
3.8805,
|
| 107 |
+
3.8673,
|
| 108 |
+
3.9437,
|
| 109 |
+
4.1339,
|
| 110 |
+
4.3688,
|
| 111 |
+
4.5384,
|
| 112 |
+
4.0601,
|
| 113 |
+
3.9022,
|
| 114 |
+
3.7252,
|
| 115 |
+
3.9551,
|
| 116 |
+
3.9791,
|
| 117 |
+
3.7946,
|
| 118 |
+
4.0627,
|
| 119 |
+
3.815,
|
| 120 |
+
4.0698,
|
| 121 |
+
4.0345,
|
| 122 |
+
3.9504,
|
| 123 |
+
3.8177,
|
| 124 |
+
3.8626,
|
| 125 |
+
3.8044,
|
| 126 |
+
3.7743,
|
| 127 |
+
3.8432,
|
| 128 |
+
3.8456,
|
| 129 |
+
3.7143,
|
| 130 |
+
3.8196,
|
| 131 |
+
3.8955,
|
| 132 |
+
3.7218,
|
| 133 |
+
3.7605,
|
| 134 |
+
3.7768,
|
| 135 |
+
3.7581,
|
| 136 |
+
3.7667,
|
| 137 |
+
3.7499,
|
| 138 |
+
3.7481,
|
| 139 |
+
3.7286,
|
| 140 |
+
3.7502,
|
| 141 |
+
3.7814,
|
| 142 |
+
3.734,
|
| 143 |
+
3.7887,
|
| 144 |
+
3.7414
|
| 145 |
+
],
|
| 146 |
+
"val_mae": [
|
| 147 |
+
10.19,
|
| 148 |
+
7.908,
|
| 149 |
+
4.388,
|
| 150 |
+
3.118,
|
| 151 |
+
3.652,
|
| 152 |
+
2.965,
|
| 153 |
+
3.299,
|
| 154 |
+
2.901,
|
| 155 |
+
2.884,
|
| 156 |
+
2.947,
|
| 157 |
+
2.876,
|
| 158 |
+
2.814,
|
| 159 |
+
2.93,
|
| 160 |
+
2.866,
|
| 161 |
+
2.854,
|
| 162 |
+
2.792,
|
| 163 |
+
2.794,
|
| 164 |
+
2.77,
|
| 165 |
+
2.836,
|
| 166 |
+
2.798,
|
| 167 |
+
2.787,
|
| 168 |
+
2.814,
|
| 169 |
+
2.931,
|
| 170 |
+
3.052,
|
| 171 |
+
3.148,
|
| 172 |
+
2.89,
|
| 173 |
+
2.803,
|
| 174 |
+
2.691,
|
| 175 |
+
2.83,
|
| 176 |
+
2.837,
|
| 177 |
+
2.737,
|
| 178 |
+
2.884,
|
| 179 |
+
2.749,
|
| 180 |
+
2.901,
|
| 181 |
+
2.874,
|
| 182 |
+
2.829,
|
| 183 |
+
2.761,
|
| 184 |
+
2.778,
|
| 185 |
+
2.734,
|
| 186 |
+
2.721,
|
| 187 |
+
2.761,
|
| 188 |
+
2.774,
|
| 189 |
+
2.692,
|
| 190 |
+
2.749,
|
| 191 |
+
2.803,
|
| 192 |
+
2.699,
|
| 193 |
+
2.714,
|
| 194 |
+
2.719,
|
| 195 |
+
2.704,
|
| 196 |
+
2.716,
|
| 197 |
+
2.717,
|
| 198 |
+
2.704,
|
| 199 |
+
2.699,
|
| 200 |
+
2.711,
|
| 201 |
+
2.731,
|
| 202 |
+
2.701,
|
| 203 |
+
2.736,
|
| 204 |
+
2.706
|
| 205 |
+
]
|
| 206 |
+
}
|
| 207 |
+
}
|
models/linear_probes/cry_classifier_metadata.json
ADDED
|
@@ -0,0 +1,31 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"model_type": "SVM_RBF",
|
| 3 |
+
"embedding_source": "HeAR (google/hear-pytorch)",
|
| 4 |
+
"embedding_dim": 512,
|
| 5 |
+
"num_classes": 5,
|
| 6 |
+
"classes": {
|
| 7 |
+
"belly_pain": 0,
|
| 8 |
+
"burping": 1,
|
| 9 |
+
"discomfort": 2,
|
| 10 |
+
"hungry": 3,
|
| 11 |
+
"tired": 4
|
| 12 |
+
},
|
| 13 |
+
"cv_accuracy_mean": 0.8380793119923554,
|
| 14 |
+
"cv_accuracy_std": 0.008077431438521396,
|
| 15 |
+
"num_samples": 457,
|
| 16 |
+
"all_results": {
|
| 17 |
+
"LogisticRegression": {
|
| 18 |
+
"mean_accuracy": 0.7985905398948876,
|
| 19 |
+
"std_accuracy": 0.028055714127978745
|
| 20 |
+
},
|
| 21 |
+
"SVM_RBF": {
|
| 22 |
+
"mean_accuracy": 0.8380793119923554,
|
| 23 |
+
"std_accuracy": 0.008077431438521396
|
| 24 |
+
},
|
| 25 |
+
"SVM_Linear": {
|
| 26 |
+
"mean_accuracy": 0.765862398471094,
|
| 27 |
+
"std_accuracy": 0.013071624843302853
|
| 28 |
+
}
|
| 29 |
+
},
|
| 30 |
+
"seed": 42
|
| 31 |
+
}
|
models/linear_probes/jaundice_classifier_metadata.json
ADDED
|
@@ -0,0 +1,31 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"model_type": "SVM_RBF",
|
| 3 |
+
"embedding_source": "MedSigLIP (google/medsiglip-448)",
|
| 4 |
+
"embedding_dim": 1152,
|
| 5 |
+
"num_classes": 2,
|
| 6 |
+
"classes": {
|
| 7 |
+
"normal": 0,
|
| 8 |
+
"jaundice": 1
|
| 9 |
+
},
|
| 10 |
+
"bilirubin_threshold": 5.0,
|
| 11 |
+
"cv_accuracy_mean": 0.967337807606264,
|
| 12 |
+
"cv_accuracy_std": 0.002197637886396911,
|
| 13 |
+
"num_original_samples": 2235,
|
| 14 |
+
"num_augmented_samples": 8940,
|
| 15 |
+
"augmentations_per_image": 3,
|
| 16 |
+
"all_results": {
|
| 17 |
+
"LogisticRegression": {
|
| 18 |
+
"mean_accuracy": 0.9422818791946309,
|
| 19 |
+
"std_accuracy": 0.004750953150245027
|
| 20 |
+
},
|
| 21 |
+
"SVM_RBF": {
|
| 22 |
+
"mean_accuracy": 0.967337807606264,
|
| 23 |
+
"std_accuracy": 0.002197637886396911
|
| 24 |
+
},
|
| 25 |
+
"SVM_Linear": {
|
| 26 |
+
"mean_accuracy": 0.9322147651006712,
|
| 27 |
+
"std_accuracy": 0.006743027683714353
|
| 28 |
+
}
|
| 29 |
+
},
|
| 30 |
+
"seed": 42
|
| 31 |
+
}
|
models/linear_probes/linear_probe_results.json
ADDED
|
@@ -0,0 +1,18 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"anemia": {
|
| 3 |
+
"accuracy": 0.5227272727272727,
|
| 4 |
+
"precision": 0.5185185185185185,
|
| 5 |
+
"recall": 0.6363636363636364,
|
| 6 |
+
"f1": 0.5714285714285714,
|
| 7 |
+
"train_size": 174,
|
| 8 |
+
"test_size": 44
|
| 9 |
+
},
|
| 10 |
+
"jaundice": {
|
| 11 |
+
"accuracy": 0.6957494407158836,
|
| 12 |
+
"precision": 0.6854460093896714,
|
| 13 |
+
"recall": 0.6790697674418604,
|
| 14 |
+
"f1": 0.6822429906542056,
|
| 15 |
+
"train_size": 1788,
|
| 16 |
+
"test_size": 447
|
| 17 |
+
}
|
| 18 |
+
}
|
requirements_spaces.txt
ADDED
|
@@ -0,0 +1,31 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# NEXUS - HuggingFace Spaces Dependencies
|
| 2 |
+
# Minimal set for Streamlit demo deployment (CPU)
|
| 3 |
+
|
| 4 |
+
torch>=2.1.0
|
| 5 |
+
transformers>=4.44.0
|
| 6 |
+
accelerate>=0.25.0
|
| 7 |
+
safetensors>=0.4.0
|
| 8 |
+
sentencepiece>=0.1.99
|
| 9 |
+
huggingface_hub>=0.20.0
|
| 10 |
+
|
| 11 |
+
# Audio
|
| 12 |
+
librosa>=0.10.0
|
| 13 |
+
soundfile>=0.12.0
|
| 14 |
+
|
| 15 |
+
# Image
|
| 16 |
+
Pillow>=10.0.0
|
| 17 |
+
|
| 18 |
+
# Data
|
| 19 |
+
numpy>=1.24.0
|
| 20 |
+
pandas>=2.0.0
|
| 21 |
+
scipy>=1.11.0
|
| 22 |
+
scikit-learn>=1.3.0
|
| 23 |
+
|
| 24 |
+
# Demo
|
| 25 |
+
streamlit>=1.28.0
|
| 26 |
+
plotly>=5.18.0
|
| 27 |
+
|
| 28 |
+
# Utilities
|
| 29 |
+
pyyaml>=6.0.0
|
| 30 |
+
tqdm>=4.66.0
|
| 31 |
+
joblib>=1.3.0
|
src/demo/__init__.py
ADDED
|
File without changes
|
src/demo/streamlit_app.py
ADDED
|
@@ -0,0 +1,1189 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
"""
NEXUS Streamlit Demo Application

Interactive demo for the NEXUS Maternal-Neonatal Care Platform.
Built with Google HAI-DEF models for the MedGemma Impact Challenge.

HAI-DEF Models Used:
- MedSigLIP: Medical image analysis (anemia, jaundice detection)
- HeAR: Health acoustic representations (cry analysis)
- MedGemma: Clinical reasoning and synthesis
"""

import json
import os
import sys
import tempfile
from pathlib import Path

import streamlit as st

# Make the sibling packages (nexus, demo) importable when run as a script.
sys.path.insert(0, str(Path(__file__).parent.parent))

# Streamlit page configuration -- must run before any other st.* call.
st.set_page_config(
    page_title="NEXUS - Maternal-Neonatal Care",
    page_icon="👶",
    layout="wide",
    initial_sidebar_state="expanded",
)

# Global CSS: headers, traffic-light risk boxes, metric cards, model badges.
st.markdown("""
<style>
    .main-header {
        font-size: 2.5rem;
        font-weight: bold;
        color: #1f77b4;
        text-align: center;
        margin-bottom: 1rem;
    }
    .sub-header {
        font-size: 1.2rem;
        color: #666;
        text-align: center;
        margin-bottom: 2rem;
    }
    .risk-high {
        background-color: #ffcccc;
        border: 2px solid #ff0000;
        padding: 1rem;
        border-radius: 10px;
    }
    .risk-medium {
        background-color: #fff3cd;
        border: 2px solid #ffc107;
        padding: 1rem;
        border-radius: 10px;
    }
    .risk-low {
        background-color: #d4edda;
        border: 2px solid #28a745;
        padding: 1rem;
        border-radius: 10px;
    }
    .metric-card {
        background-color: #f8f9fa;
        padding: 1rem;
        border-radius: 10px;
        text-align: center;
    }
    .model-badge {
        display: inline-block;
        padding: 2px 10px;
        border-radius: 12px;
        font-size: 0.78rem;
        font-weight: 600;
        color: white;
        letter-spacing: 0.3px;
    }
    .stMetric > div {
        background-color: #f8f9fa;
        padding: 0.5rem;
        border-radius: 8px;
    }
</style>
""", unsafe_allow_html=True)
@st.cache_resource
def load_anemia_detector():
    """Instantiate the MedSigLIP-based anemia detector once per session.

    Returns:
        tuple: ``(detector, None)`` on success, or ``(None, message)``
        describing why the model could not be imported or constructed.
    """
    try:
        from nexus.anemia_detector import AnemiaDetector
        return AnemiaDetector(), None
    except Exception as exc:
        return None, str(exc)
| 98 |
+
|
| 99 |
+
|
@st.cache_resource
def load_jaundice_detector():
    """Instantiate the MedSigLIP-based jaundice detector once per session.

    Returns:
        tuple: ``(detector, None)`` on success, or ``(None, message)``
        describing why the model could not be imported or constructed.
    """
    try:
        from nexus.jaundice_detector import JaundiceDetector
        return JaundiceDetector(), None
    except Exception as exc:
        return None, str(exc)
| 109 |
+
|
| 110 |
+
|
@st.cache_resource
def load_cry_analyzer():
    """Instantiate the HeAR-based cry analyzer once per session.

    Returns:
        tuple: ``(analyzer, None)`` on success, or ``(None, message)``
        describing why the model could not be imported or constructed.
    """
    try:
        from nexus.cry_analyzer import CryAnalyzer
        return CryAnalyzer(), None
    except Exception as exc:
        return None, str(exc)
| 120 |
+
|
| 121 |
+
|
@st.cache_resource
def load_clinical_synthesizer():
    """Load the MedGemma clinical synthesizer (cached per session).

    Honors the ``NEXUS_USE_MEDGEMMA`` environment variable: any value
    other than ``"false"`` (case-insensitive) enables MedGemma inference;
    ``"false"`` forces the synthesizer's non-MedGemma path.

    Returns:
        tuple: ``(synthesizer, None)`` on success, or ``(None, message)``
        describing why the model could not be loaded.
    """
    try:
        from nexus.clinical_synthesizer import ClinicalSynthesizer

        # `os` is imported at module level; the previous function-local
        # `import os` was redundant and has been removed.
        use_medgemma = os.environ.get("NEXUS_USE_MEDGEMMA", "true").lower() != "false"
        synthesizer = ClinicalSynthesizer(use_medgemma=use_medgemma)
        return synthesizer, None
    except Exception as e:
        return None, str(e)
| 133 |
+
|
| 134 |
+
|
def get_hai_def_info():
    """Return display metadata for each HAI-DEF model used by NEXUS.

    Keys are model family names; each value carries the full model name,
    use case, method summary, validated performance text, and badge styling.
    """
    medsiglip = {
        "name": "MedSigLIP (google/medsiglip-448)",
        "use": "Image analysis for anemia and jaundice detection + bilirubin regression",
        "method": "Zero-shot classification (max-similarity, 8 prompts/class) + trained SVM/LR classifiers on embeddings",
        "accuracy": "Anemia: trained classifier on augmented data, Jaundice: trained classifier on 2,235 images, Bilirubin: MAE 2.67 mg/dL (r=0.77)",
        "badge": "Vision",
        "badge_color": "#388e3c",
    }
    hear = {
        "name": "HeAR (google/hear-pytorch)",
        "use": "Infant cry analysis for asphyxia and cry type classification",
        "method": "512-dim health acoustic embeddings + trained linear classifier on donate-a-cry dataset (5-class: hungry, belly_pain, burping, discomfort, tired)",
        "accuracy": "Trained cry type classifier with asphyxia risk derivation from distress patterns",
        "badge": "Audio",
        "badge_color": "#f57c00",
    }
    medgemma = {
        "name": "MedGemma 1.5 4B (google/medgemma-1.5-4b-it)",
        "use": "Clinical reasoning and recommendation synthesis",
        "method": "4-bit NF4 quantized inference with WHO IMNCI protocol-aligned synthesis and 6-agent reasoning traces",
        "accuracy": "Protocol-aligned clinical recommendations with structured reasoning chains",
        "badge": "Language",
        "badge_color": "#1976d2",
    }
    return {"MedSigLIP": medsiglip, "HeAR": hear, "MedGemma": medgemma}
| 163 |
+
|
| 164 |
+
|
def main():
    """Top-level page: header, sidebar navigation, and tab dispatch."""
    # Header
    st.markdown('<div class="main-header">NEXUS</div>', unsafe_allow_html=True)
    st.markdown(
        '<div class="sub-header">AI-Powered Maternal-Neonatal Care Platform</div>',
        unsafe_allow_html=True
    )

    # Each sidebar choice maps to the function that renders its tab.
    # Insertion order defines the radio-button order.
    pages = {
        "Maternal Anemia Screening": render_anemia_screening,
        "Neonatal Jaundice Detection": render_jaundice_detection,
        "Cry Analysis": render_cry_analysis,
        "Combined Assessment": render_combined_assessment,
        "Agentic Workflow": render_agentic_workflow,
        "HAI-DEF Models Info": render_hai_def_info,
    }

    # Sidebar
    with st.sidebar:
        st.markdown("## 🏥 NEXUS")
        st.markdown("---")

        assessment_type = st.radio(
            "Select Assessment Type",
            list(pages),
            index=0,
        )

        st.markdown("---")
        st.markdown("### About NEXUS")
        st.markdown("""
        NEXUS uses AI to provide non-invasive screening for:
        - **Maternal Anemia** via conjunctiva imaging
        - **Neonatal Jaundice** via skin color analysis
        - **Birth Asphyxia** via cry pattern analysis

        Built with **Google HAI-DEF models** for the MedGemma Impact Challenge 2026.
        """)

        st.markdown("---")
        st.markdown("### Edge AI Mode")
        edge_mode = st.toggle("Enable Edge AI Mode", value=False, key="edge_mode")
        if edge_mode:
            st.success("Edge AI: INT8 quantized models + offline inference")
        else:
            st.info("Cloud mode: Full-precision HAI-DEF models")

        st.markdown("---")
        st.markdown("### HAI-DEF Models")
        st.markdown("""
        - **MedSigLIP**: Vision (trained classifiers)
        - **HeAR**: Audio (trained cry classifier)
        - **MedGemma 1.5**: Clinical AI (4-bit NF4)
        """)

    # Show Edge AI banner when enabled
    if edge_mode:
        render_edge_ai_banner()

    # Dispatch to the selected tab; the info page is the fallback,
    # matching the original if/elif chain's final `else`.
    pages.get(assessment_type, render_hai_def_info)()
| 237 |
+
|
| 238 |
+
|
def render_edge_ai_banner():
    """Render the Edge AI status banner, metric row, and details expander."""
    st.markdown("""
    <div style="background: linear-gradient(135deg, #1a237e 0%, #0d47a1 100%);
                color: white; padding: 1rem 1.5rem; border-radius: 10px; margin-bottom: 1rem;">
        <h4 style="margin:0; color: white;">Edge AI Mode Active</h4>
        <p style="margin: 0.3rem 0 0 0; opacity: 0.9; font-size: 0.9rem;">
            Running INT8 quantized models for offline-capable inference on low-resource devices.
        </p>
    </div>
    """, unsafe_allow_html=True)

    # One headline metric per quantized artifact, laid out in four columns.
    edge_metrics = [
        ("MedSigLIP INT8", "111.2 MB", "-86% memory"),
        ("Acoustic Model", "0.6 MB", "INT8 quantized"),
        ("Text Embeddings", "12 KB", "Pre-computed"),
        ("Total Edge Size", "~289 MB", "Offline-ready"),
    ]
    for column, (label, value, delta) in zip(st.columns(4), edge_metrics):
        with column:
            st.metric(label, value, delta)

    with st.expander("Edge AI Details"):
        st.markdown("""
        **Quantization**: Dynamic INT8 (PyTorch `quantize_dynamic`, qnnpack backend)

        | Component | Cloud (FP32) | Edge (INT8) | Compression |
        |-----------|-------------|-------------|-------------|
        | MedSigLIP Vision | 812.6 MB | 111.2 MB | **7.31x** |
        | Acoustic Model | 0.665 MB | 0.599 MB | 1.11x |
        | CPU Latency | 97.7 ms | ~65 ms (ARM est.) | ~1.5x faster |

        **Target Devices**: Android 8.0+, ARM Cortex-A53, 2GB RAM

        **Offline Capabilities**:
        - Image analysis via INT8 MedSigLIP + pre-computed binary text embeddings
        - Audio analysis via INT8 acoustic feature extractor
        - Clinical reasoning via rule-based WHO IMNCI protocols (no MedGemma required)
        """)
| 278 |
+
|
| 279 |
+
|
def _cleanup_temp(path: str) -> None:
    """Best-effort removal of a temporary file; never raises OSError.

    Accepts None/empty paths and already-deleted files silently, so it is
    safe to call unconditionally from `finally` blocks.
    """
    if not path:
        return
    try:
        if os.path.exists(path):
            os.unlink(path)
    except OSError:
        pass
| 287 |
+
|
| 288 |
+
|
def _save_upload_to_temp(uploaded_file, suffix: str) -> str:
    """Persist a Streamlit upload to a closed temporary file.

    Args:
        uploaded_file: object exposing ``getvalue() -> bytes``
            (Streamlit's UploadedFile).
        suffix: filename suffix for the temp file, e.g. ".jpg" or ".wav".

    Returns:
        Path to the written (and closed) temporary file. The caller owns
        the file and should remove it with ``_cleanup_temp``.

    Raises:
        Whatever the write raises; the partial file is removed first.
    """
    handle = tempfile.NamedTemporaryFile(delete=False, suffix=suffix)
    try:
        handle.write(uploaded_file.getvalue())
        handle.close()
        return handle.name
    except Exception:
        # Close and discard the partially written file before propagating.
        handle.close()
        _cleanup_temp(handle.name)
        raise
| 300 |
+
|
| 301 |
+
|
def _model_badge(name: str, color: str) -> str:
    """Build the inline-styled HTML pill badge shown next to tab headers."""
    badge_style = (
        f"background:{color}; color:white; padding:2px 10px; "
        "border-radius:12px; font-size:0.78rem; font-weight:600; "
        "letter-spacing:0.3px;"
    )
    return f'<span style="{badge_style}">{name}</span>'
| 309 |
+
|
| 310 |
+
|
def render_anemia_screening():
    """Render the maternal anemia screening tab (conjunctiva image, MedSigLIP)."""
    st.header("Maternal Anemia Screening")
    st.markdown(
        "Upload a clear image of the inner eyelid (conjunctiva) for anemia screening. "
        f'{_model_badge("MedSigLIP", "#388e3c")}',
        unsafe_allow_html=True,
    )

    left, right = st.columns([1, 1])

    with left:
        st.subheader("Upload Image")
        upload = st.file_uploader(
            "Choose a conjunctiva image",
            type=["jpg", "jpeg", "png"],
            key="anemia_upload"
        )
        if upload:
            st.image(upload, caption="Uploaded Image", use_container_width=True)

    with right:
        st.subheader("Analysis Results")

        if not upload:
            st.info("👆 Upload an image to begin analysis")
            return

        with st.spinner("Analyzing image..."):
            image_path = None
            try:
                detector, load_err = load_anemia_detector()
                if detector is None:
                    st.error(f"Could not load model: {load_err}")
                    return

                image_path = _save_upload_to_temp(upload, ".jpg")
                result = detector.detect(image_path)
                color_info = detector.analyze_color_features(image_path)

                # Colour-coded result box keyed on the detector's risk level.
                box_class = f"risk-{result['risk_level']}"
                st.markdown(f'<div class="{box_class}">', unsafe_allow_html=True)
                if result["is_anemic"]:
                    st.error("⚠️ ANEMIA DETECTED")
                else:
                    st.success("✅ No Anemia Detected")
                st.markdown("</div>", unsafe_allow_html=True)

                # Headline metrics.
                col_a, col_b, col_c = st.columns(3)
                col_a.metric("Confidence", f"{result['confidence']:.1%}")
                col_b.metric("Risk Level", result['risk_level'].upper())
                col_c.metric("Est. Hemoglobin", f"{color_info['estimated_hemoglobin']} g/dL")

                st.markdown("### Recommendation")
                st.info(result["recommendation"])

                # Raw scores and colour features for debugging/validation.
                with st.expander("Technical Details"):
                    st.json({
                        "anemia_score": round(result["anemia_score"], 3),
                        "healthy_score": round(result["healthy_score"], 3),
                        "red_ratio": round(color_info["red_ratio"], 3),
                        "pallor_index": round(color_info["pallor_index"], 3),
                    })
            except Exception as exc:
                st.error(f"Error analyzing image: {exc}")
            finally:
                _cleanup_temp(image_path)
| 389 |
+
|
| 390 |
+
|
def render_jaundice_detection():
    """Render the neonatal jaundice detection tab (skin/sclera image, MedSigLIP)."""
    st.header("Neonatal Jaundice Detection")
    st.markdown(
        "Upload an image of the newborn's skin or sclera for jaundice assessment. "
        f'{_model_badge("MedSigLIP", "#388e3c")}',
        unsafe_allow_html=True,
    )

    left, right = st.columns([1, 1])

    with left:
        st.subheader("Upload Image")
        upload = st.file_uploader(
            "Choose a neonatal image",
            type=["jpg", "jpeg", "png"],
            key="jaundice_upload"
        )
        if upload:
            st.image(upload, caption="Uploaded Image", use_container_width=True)

        # Optional demographics. NOTE(review): these inputs are collected
        # but not consumed anywhere in this function -- presumably for a
        # future pipeline hook; confirm before removing.
        st.subheader("Patient Information (Optional)")
        age_days = st.number_input("Age (days)", min_value=0, max_value=28, value=3)
        birth_weight = st.number_input("Birth weight (grams)", min_value=500, max_value=5000, value=3000)

    with right:
        st.subheader("Analysis Results")

        if not upload:
            st.info("👆 Upload an image to begin analysis")
            return

        with st.spinner("Analyzing image..."):
            image_path = None
            try:
                detector, load_err = load_jaundice_detector()
                if detector is None:
                    st.error(f"Could not load model: {load_err}")
                    return

                image_path = _save_upload_to_temp(upload, ".jpg")
                result = detector.detect(image_path)
                zone_info = detector.analyze_kramer_zones(image_path)

                # Box colour: red when phototherapy is indicated, amber for
                # any mild/moderate jaundice, green otherwise.
                if result["needs_phototherapy"]:
                    box_class = "risk-high"
                elif result["severity"] in ["moderate", "mild"]:
                    box_class = "risk-medium"
                else:
                    box_class = "risk-low"
                st.markdown(f'<div class="{box_class}">', unsafe_allow_html=True)
                if result["has_jaundice"]:
                    st.warning(f"⚠️ JAUNDICE DETECTED - {result['severity'].upper()}")
                else:
                    st.success("✅ No Significant Jaundice")
                st.markdown("</div>", unsafe_allow_html=True)

                # Headline metrics; prefer the ML-regression bilirubin value
                # when the detector provided one.
                col_a, col_b, col_c = st.columns(3)
                with col_a:
                    bili_value = result.get('estimated_bilirubin_ml', result.get('estimated_bilirubin', 0))
                    bili_method = result.get('bilirubin_method', 'Color Analysis')
                    st.metric("Est. Bilirubin", f"{bili_value} mg/dL")
                    st.caption(f"Method: {bili_method}")
                with col_b:
                    st.metric("Severity", result['severity'].upper())
                with col_c:
                    st.metric("Kramer Zone", zone_info['kramer_zone'])

                if result["needs_phototherapy"]:
                    st.error("🔆 PHOTOTHERAPY RECOMMENDED")

                st.markdown("### Recommendation")
                st.info(result["recommendation"])

                with st.expander("Kramer Zone Analysis"):
                    st.write(f"**Zone**: {zone_info['kramer_zone']} - {zone_info['zone_description']}")
                    st.write(f"**Yellow Index**: {zone_info['yellow_index']}")
                    st.progress(min(zone_info['yellow_index'] * 2, 1.0))

                with st.expander("Technical Details"):
                    details = {
                        "jaundice_score": round(result["jaundice_score"], 3),
                        "confidence": round(result["confidence"], 3),
                        "model": result.get("model", "unknown"),
                        "model_type": result.get("model_type", "unknown"),
                        "bilirubin_method": result.get("bilirubin_method", "Color Analysis"),
                    }
                    if result.get("estimated_bilirubin_ml") is not None:
                        details["bilirubin_ml"] = result["estimated_bilirubin_ml"]
                        details["bilirubin_color"] = result["estimated_bilirubin"]
                    st.json(details)
            except Exception as exc:
                st.error(f"Error analyzing image: {exc}")
            finally:
                _cleanup_temp(image_path)
| 494 |
+
|
| 495 |
+
|
def render_cry_analysis():
    """Render the infant cry analysis tab (audio recording, HeAR)."""
    st.header("Infant Cry Analysis")
    st.markdown(
        "Upload an audio recording of the infant's cry for analysis. "
        f'{_model_badge("HeAR", "#f57c00")}',
        unsafe_allow_html=True,
    )

    left, right = st.columns([1, 1])

    with left:
        st.subheader("Upload Audio")
        upload = st.file_uploader(
            "Choose a cry audio file",
            type=["wav", "mp3", "ogg"],
            key="cry_upload"
        )
        if upload:
            st.audio(upload)

    with right:
        st.subheader("Analysis Results")

        if not upload:
            st.info("👆 Upload an audio file to begin analysis")
            return

        with st.spinner("Analyzing cry..."):
            audio_path = None
            try:
                analyzer, load_err = load_cry_analyzer()
                if analyzer is None:
                    st.error(f"Could not load model: {load_err}")
                    return

                audio_path = _save_upload_to_temp(upload, ".wav")
                result = analyzer.analyze(audio_path)

                # Colour-coded result box keyed on the analyzer's risk level.
                box_class = f"risk-{result['risk_level']}"
                st.markdown(f'<div class="{box_class}">', unsafe_allow_html=True)
                if result["is_abnormal"]:
                    st.error("⚠️ ABNORMAL CRY PATTERN DETECTED")
                else:
                    st.success("✅ Normal Cry Pattern")
                st.markdown("</div>", unsafe_allow_html=True)

                # Headline metrics.
                col_a, col_b, col_c = st.columns(3)
                col_a.metric("Asphyxia Risk", f"{result['asphyxia_risk']:.1%}")
                col_b.metric("Cry Type", result['cry_type'].title())
                col_c.metric("F0 (Pitch)", f"{result['features']['f0_mean']:.0f} Hz")

                st.markdown("### Recommendation")
                st.info(result["recommendation"])

                with st.expander("Acoustic Features"):
                    st.json(result["features"])
            except Exception as exc:
                st.error(f"Error analyzing audio: {exc}")
            finally:
                _cleanup_temp(audio_path)
| 568 |
+
|
| 569 |
+
|
def _run_combined_modality(upload, suffix, loader, run_model):
    """Run one modality's model over an uploaded file for the combined tab.

    Args:
        upload: Streamlit UploadedFile to analyze.
        suffix: temp-file suffix (".jpg" or ".wav").
        loader: cached loader returning ``(model, error_message)``.
        run_model: callable ``(model, path) -> findings dict``.

    Returns:
        The findings dict, or None if loading/analysis failed (an error
        message is shown in the UI in that case).

    The upload is written via ``_save_upload_to_temp`` (which closes the
    file before the model reads the path, so the OS buffer is flushed) and
    the temp copy is always removed. This fixes the previous version's
    ``NamedTemporaryFile(delete=False)`` leak and its read-while-open
    pattern, where the model could observe a partially written file.
    """
    tmp_path = None
    try:
        model, load_err = loader()
        if model is None:
            st.error(f"Model error: {load_err}")
            return None
        tmp_path = _save_upload_to_temp(upload, suffix)
        return run_model(model, tmp_path)
    except Exception as e:
        st.error(f"Error: {e}")
        return None
    finally:
        _cleanup_temp(tmp_path)


def render_combined_assessment():
    """Render the combined assessment tab using the Clinical Synthesizer.

    Runs each modality the user supplies (anemia image, jaundice image,
    cry audio), stores the findings in session state, then lets the user
    synthesize them into one recommendation with MedGemma.
    """
    st.header("Combined Clinical Assessment")
    st.markdown(
        "Upload multiple inputs for a comprehensive assessment using **MedGemma Clinical Synthesizer**. "
        "This combines findings from all HAI-DEF models to provide integrated clinical recommendations. "
        f'{_model_badge("MedSigLIP", "#388e3c")} '
        f'{_model_badge("HeAR", "#f57c00")} '
        f'{_model_badge("MedGemma", "#1976d2")}',
        unsafe_allow_html=True,
    )

    # Reset findings each time this tab is rendered to prevent
    # stale data from previous patients contaminating results.
    st.session_state.findings = {
        "anemia": None,
        "jaundice": None,
        "cry": None
    }

    col1, col2, col3 = st.columns(3)

    with col1:
        st.subheader("🩸 Anemia Screening")
        anemia_file = st.file_uploader(
            "Conjunctiva image",
            type=["jpg", "jpeg", "png"],
            key="combined_anemia"
        )
        if anemia_file:
            st.image(anemia_file, use_container_width=True)
            with st.spinner("Analyzing..."):
                result = _run_combined_modality(
                    anemia_file, ".jpg", load_anemia_detector,
                    lambda model, path: model.detect(path),
                )
                if result is not None:
                    st.session_state.findings["anemia"] = result
                    if result["is_anemic"]:
                        st.error(f"Anemia: {result['risk_level'].upper()}")
                    else:
                        st.success("No Anemia")

    with col2:
        st.subheader("👶 Jaundice Detection")
        jaundice_file = st.file_uploader(
            "Neonatal skin image",
            type=["jpg", "jpeg", "png"],
            key="combined_jaundice"
        )
        if jaundice_file:
            st.image(jaundice_file, use_container_width=True)
            with st.spinner("Analyzing..."):
                result = _run_combined_modality(
                    jaundice_file, ".jpg", load_jaundice_detector,
                    lambda model, path: model.detect(path),
                )
                if result is not None:
                    st.session_state.findings["jaundice"] = result
                    if result["has_jaundice"]:
                        st.warning(f"Jaundice: {result['severity'].upper()}")
                    else:
                        st.success("No Jaundice")

    with col3:
        st.subheader("🔊 Cry Analysis")
        cry_file = st.file_uploader(
            "Cry audio",
            type=["wav", "mp3", "ogg"],
            key="combined_cry"
        )
        if cry_file:
            st.audio(cry_file)
            with st.spinner("Analyzing..."):
                # Previous version raised RuntimeError after st.error just to
                # skip analysis, showing the same failure twice; the helper
                # now reports each failure exactly once.
                result = _run_combined_modality(
                    cry_file, ".wav", load_cry_analyzer,
                    lambda model, path: model.analyze(path),
                )
                if result is not None:
                    st.session_state.findings["cry"] = result
                    if result["is_abnormal"]:
                        st.error(f"Abnormal Cry: {result['risk_level'].upper()}")
                    else:
                        st.success("Normal Cry")

    # Clinical Synthesis Section
    st.markdown("---")
    st.subheader("🏥 Clinical Synthesis (MedGemma)")

    # Check if any findings are available
    has_findings = any(v is not None for v in st.session_state.findings.values())

    if not has_findings:
        st.info("👆 Upload at least one input (image or audio) to generate clinical synthesis")
        return

    if st.button("Generate Clinical Synthesis", type="primary"):
        with st.spinner("Synthesizing findings with MedGemma..."):
            try:
                synthesizer, load_err = load_clinical_synthesizer()
                if synthesizer is None:
                    st.error(f"Could not load synthesizer: {load_err}")
                    return

                # Only forward the modalities that produced findings.
                findings = {
                    key: value
                    for key, value in st.session_state.findings.items()
                    if value
                }

                synthesis = synthesizer.synthesize(findings)

                # Severity banner (GREEN/YELLOW/RED traffic-light scheme).
                severity_level = synthesis.get("severity_level", "GREEN")
                severity_colors = {
                    "GREEN": ("🟢", "#d4edda", "#155724"),
                    "YELLOW": ("🟡", "#fff3cd", "#856404"),
                    "RED": ("🔴", "#f8d7da", "#721c24")
                }
                emoji, bg_color, text_color = severity_colors.get(severity_level, ("⚪", "#f8f9fa", "#000"))

                st.markdown(f"""
                <div style="background-color: {bg_color}; padding: 1.5rem; border-radius: 10px; margin: 1rem 0;">
                    <h3 style="color: {text_color}; margin: 0;">{emoji} Severity: {severity_level}</h3>
                    <p style="color: {text_color}; font-size: 1.1rem; margin-top: 0.5rem;">{synthesis.get('severity_description', '')}</p>
                </div>
                """, unsafe_allow_html=True)

                # Summary
                st.markdown("### Summary")
                st.info(synthesis.get("summary", "No summary available"))

                # Actions
                if synthesis.get("immediate_actions"):
                    st.markdown("### Immediate Actions")
                    for action in synthesis["immediate_actions"]:
                        st.markdown(f"- {action}")

                # Referral
                col_a, col_b = st.columns(2)
                with col_a:
                    st.markdown("### Referral Status")
                    if synthesis.get("referral_needed"):
                        st.error(f"⚠️ REFERRAL NEEDED: {synthesis.get('referral_urgency', 'standard').upper()}")
                    else:
                        st.success("✅ No referral needed")

                with col_b:
                    st.markdown("### Follow-up")
                    st.info(synthesis.get("follow_up", "Schedule routine follow-up"))

                # Technical details
                with st.expander("Technical Details"):
                    model_name = synthesis.get("model", "unknown")
                    st.json({
                        "model": model_name,
                        "model_id": synthesis.get("model_id", ""),
                        "generated_at": synthesis.get("generated_at"),
                        "urgent_conditions": synthesis.get("urgent_conditions", []),
                    })
                    if model_name and "Fallback" not in str(model_name):
                        st.success(f"Synthesis powered by {model_name}")
                    elif "Fallback" in str(model_name):
                        st.warning("Using rule-based fallback (MedGemma unavailable)")

            except Exception as e:
                st.error(f"Error generating synthesis: {e}")
| 754 |
+
|
| 755 |
+
|
def render_hai_def_info():
    """Render the HAI-DEF models information tab."""
    st.header("Google HAI-DEF Models")
    st.markdown("""
    NEXUS is built using **Google Health AI Developer Foundations (HAI-DEF)** models,
    designed specifically for healthcare applications in resource-limited settings.
    """)

    hai_def = get_hai_def_info()

    # (section heading, info-box text, hai_def key, narrative) per model.
    model_sections = [
        (
            "### 🖼️ MedSigLIP",
            "google/medsiglip-448\n\nHAI-DEF Vision Model",
            "MedSigLIP",
            """
        MedSigLIP enables zero-shot medical image classification using
        text prompts. NEXUS extends this with trained SVM/LR classifiers
        on MedSigLIP embeddings (with data augmentation) for improved
        accuracy, plus a novel 3-layer MLP regression head for continuous
        bilirubin prediction from frozen embeddings.
        """,
        ),
        (
            "### 🔊 HeAR",
            "google/hear-pytorch\n\nHAI-DEF Audio Model",
            "HeAR",
            """
        HeAR (Health Acoustic Representations) produces 512-dim embeddings
        from 2-second audio clips at 16kHz. NEXUS trains a linear classifier
        on HeAR embeddings for 5-class cry type classification (hungry,
        belly_pain, burping, discomfort, tired) and derives asphyxia risk
        from distress patterns.
        """,
        ),
        (
            "### 🧠 MedGemma",
            "google/medgemma-1.5-4b-it\n\nHAI-DEF Language Model",
            "MedGemma",
            """
        MedGemma 1.5 provides clinical reasoning capabilities via 4-bit NF4
        quantized inference (~2 GB VRAM). It synthesizes multi-modal findings
        into actionable recommendations following WHO IMNCI protocols,
        producing structured reasoning chains within the 6-agent pipeline.
        """,
        ),
    ]

    for heading, badge_text, key, narrative in model_sections:
        st.markdown("---")
        col1, col2 = st.columns([1, 2])
        with col1:
            st.markdown(heading)
            st.info(badge_text)
        with col2:
            info = hai_def[key]
            st.markdown(f"**Model**: {info['name']}")
            st.markdown(f"**Use Case**: {info['use']}")
            st.markdown(f"**Method**: {info['method']}")
            st.markdown(f"**Validated Performance**: {info['accuracy']}")
            st.markdown(narrative)

    # Competition Info
    st.markdown("---")
    st.subheader("🏆 MedGemma Impact Challenge 2026")
    st.markdown("""
    NEXUS is being developed for the [MedGemma Impact Challenge](https://www.kaggle.com/competitions/medgemma-impact-challenge-2026)
    on Kaggle.

    **Competition Focus**: Solutions for resource-limited healthcare settings using HAI-DEF models.

    **NEXUS Impact**:
    - 📍 Target: Sub-Saharan Africa and South Asia
    - 👩⚕️ Users: Community Health Workers
    - 🎯 Goals: Reduce maternal/neonatal mortality
    - 📱 Deployment: Offline-capable mobile app
    """)
| 840 |
+
|
| 841 |
+
|
| 842 |
+
def render_agentic_workflow():
|
| 843 |
+
"""Render the agentic workflow interface with reasoning traces."""
|
| 844 |
+
st.header("Agentic Clinical Workflow")
|
| 845 |
+
st.markdown(
|
| 846 |
+
f"**6-Agent Pipeline** with step-by-step reasoning traces. "
|
| 847 |
+
f"Each agent explains its clinical decision process, providing a full audit trail. "
|
| 848 |
+
f'{_model_badge("MedSigLIP", "#388e3c")} '
|
| 849 |
+
f'{_model_badge("HeAR", "#f57c00")} '
|
| 850 |
+
f'{_model_badge("MedGemma", "#1976d2")}',
|
| 851 |
+
unsafe_allow_html=True,
|
| 852 |
+
)
|
| 853 |
+
|
| 854 |
+
# Pipeline diagram
|
| 855 |
+
st.markdown("""
|
| 856 |
+
<div style="display: flex; align-items: center; justify-content: center; gap: 0.5rem; flex-wrap: wrap; margin: 1rem 0;">
|
| 857 |
+
<div style="background: #e3f2fd; padding: 0.5rem 1rem; border-radius: 8px; font-weight: bold; border: 2px solid #1976d2;">Triage</div>
|
| 858 |
+
<span style="font-size: 1.5rem;">→</span>
|
| 859 |
+
<div style="background: #e8f5e9; padding: 0.5rem 1rem; border-radius: 8px; font-weight: bold; border: 2px solid #388e3c;">Image (MedSigLIP)</div>
|
| 860 |
+
<span style="font-size: 1.5rem;">→</span>
|
| 861 |
+
<div style="background: #fff3e0; padding: 0.5rem 1rem; border-radius: 8px; font-weight: bold; border: 2px solid #f57c00;">Audio (HeAR)</div>
|
| 862 |
+
<span style="font-size: 1.5rem;">→</span>
|
| 863 |
+
<div style="background: #f3e5f5; padding: 0.5rem 1rem; border-radius: 8px; font-weight: bold; border: 2px solid #7b1fa2;">Protocol (WHO)</div>
|
| 864 |
+
<span style="font-size: 1.5rem;">→</span>
|
| 865 |
+
<div style="background: #fce4ec; padding: 0.5rem 1rem; border-radius: 8px; font-weight: bold; border: 2px solid #c62828;">Referral</div>
|
| 866 |
+
<span style="font-size: 1.5rem;">→</span>
|
| 867 |
+
<div style="background: #e0f7fa; padding: 0.5rem 1rem; border-radius: 8px; font-weight: bold; border: 2px solid #00838f;">Synthesis (MedGemma)</div>
|
| 868 |
+
</div>
|
| 869 |
+
""", unsafe_allow_html=True)
|
| 870 |
+
|
| 871 |
+
st.markdown("---")
|
| 872 |
+
|
| 873 |
+
# Input section
|
| 874 |
+
col_left, col_right = st.columns([1, 1])
|
| 875 |
+
|
| 876 |
+
with col_left:
|
| 877 |
+
st.subheader("Patient & Inputs")
|
| 878 |
+
patient_type = st.selectbox("Patient Type", ["newborn", "pregnant"], key="agentic_patient")
|
| 879 |
+
|
| 880 |
+
# Danger signs
|
| 881 |
+
st.markdown("**Danger Signs**")
|
| 882 |
+
danger_signs = []
|
| 883 |
+
if patient_type == "pregnant":
|
| 884 |
+
sign_options = [
|
| 885 |
+
("Severe headache", "high"),
|
| 886 |
+
("Blurred vision", "high"),
|
| 887 |
+
("Convulsions", "critical"),
|
| 888 |
+
("Severe abdominal pain", "high"),
|
| 889 |
+
("Vaginal bleeding", "critical"),
|
| 890 |
+
("High fever", "high"),
|
| 891 |
+
("Severe pallor", "medium"),
|
| 892 |
+
]
|
| 893 |
+
else:
|
| 894 |
+
sign_options = [
|
| 895 |
+
("Not breathing at birth", "critical"),
|
| 896 |
+
("Convulsions", "critical"),
|
| 897 |
+
("Severe chest indrawing", "high"),
|
| 898 |
+
("Not feeding", "high"),
|
| 899 |
+
("High fever (>38C)", "high"),
|
| 900 |
+
("Hypothermia (<35.5C)", "high"),
|
| 901 |
+
("Lethargy / unconscious", "critical"),
|
| 902 |
+
("Umbilical redness", "medium"),
|
| 903 |
+
]
|
| 904 |
+
|
| 905 |
+
selected_signs = st.multiselect(
|
| 906 |
+
"Select present danger signs",
|
| 907 |
+
[s[0] for s in sign_options],
|
| 908 |
+
key="agentic_signs"
|
| 909 |
+
)
|
| 910 |
+
for label, severity in sign_options:
|
| 911 |
+
if label in selected_signs:
|
| 912 |
+
danger_signs.append({
|
| 913 |
+
"id": label.lower().replace(" ", "_"),
|
| 914 |
+
"label": label,
|
| 915 |
+
"severity": severity,
|
| 916 |
+
"present": True,
|
| 917 |
+
})
|
| 918 |
+
|
| 919 |
+
# Image uploads
|
| 920 |
+
st.markdown("**Clinical Images**")
|
| 921 |
+
conjunctiva_file = st.file_uploader(
|
| 922 |
+
"Conjunctiva image (anemia)", type=["jpg", "jpeg", "png"],
|
| 923 |
+
key="agentic_conjunctiva"
|
| 924 |
+
)
|
| 925 |
+
skin_file = st.file_uploader(
|
| 926 |
+
"Skin image (jaundice)", type=["jpg", "jpeg", "png"],
|
| 927 |
+
key="agentic_skin"
|
| 928 |
+
)
|
| 929 |
+
cry_file = st.file_uploader(
|
| 930 |
+
"Cry audio", type=["wav", "mp3", "ogg"],
|
| 931 |
+
key="agentic_cry"
|
| 932 |
+
)
|
| 933 |
+
|
| 934 |
+
with col_right:
|
| 935 |
+
st.subheader("Workflow Execution")
|
| 936 |
+
|
| 937 |
+
if st.button("Run Agentic Assessment", type="primary", key="run_agentic"):
|
| 938 |
+
with st.spinner("Running 6-agent workflow..."):
|
| 939 |
+
try:
|
| 940 |
+
from nexus.agentic_workflow import (
|
| 941 |
+
AgenticWorkflowEngine,
|
| 942 |
+
AgentPatientInfo,
|
| 943 |
+
DangerSign,
|
| 944 |
+
WorkflowInput,
|
| 945 |
+
)
|
| 946 |
+
|
| 947 |
+
# Save uploaded files (track for cleanup)
|
| 948 |
+
_temp_paths = []
|
| 949 |
+
conjunctiva_path = None
|
| 950 |
+
skin_path = None
|
| 951 |
+
cry_path = None
|
| 952 |
+
|
| 953 |
+
if conjunctiva_file:
|
| 954 |
+
conjunctiva_path = _save_upload_to_temp(conjunctiva_file, ".jpg")
|
| 955 |
+
_temp_paths.append(conjunctiva_path)
|
| 956 |
+
|
| 957 |
+
if skin_file:
|
| 958 |
+
skin_path = _save_upload_to_temp(skin_file, ".jpg")
|
| 959 |
+
_temp_paths.append(skin_path)
|
| 960 |
+
|
| 961 |
+
if cry_file:
|
| 962 |
+
cry_path = _save_upload_to_temp(cry_file, ".wav")
|
| 963 |
+
_temp_paths.append(cry_path)
|
| 964 |
+
|
| 965 |
+
# Build workflow input
|
| 966 |
+
signs = [
|
| 967 |
+
DangerSign(
|
| 968 |
+
id=s["id"], label=s["label"],
|
| 969 |
+
severity=s["severity"], present=True,
|
| 970 |
+
)
|
| 971 |
+
for s in danger_signs
|
| 972 |
+
]
|
| 973 |
+
|
| 974 |
+
info = AgentPatientInfo(patient_type=patient_type)
|
| 975 |
+
workflow_input = WorkflowInput(
|
| 976 |
+
patient_type=patient_type,
|
| 977 |
+
patient_info=info,
|
| 978 |
+
danger_signs=signs,
|
| 979 |
+
conjunctiva_image=conjunctiva_path,
|
| 980 |
+
skin_image=skin_path,
|
| 981 |
+
cry_audio=cry_path,
|
| 982 |
+
)
|
| 983 |
+
|
| 984 |
+
# Run workflow — reuse cached model instances when available
|
| 985 |
+
anemia_det, _ = load_anemia_detector()
|
| 986 |
+
jaundice_det, _ = load_jaundice_detector()
|
| 987 |
+
cry_ana, _ = load_cry_analyzer()
|
| 988 |
+
synth, _ = load_clinical_synthesizer()
|
| 989 |
+
|
| 990 |
+
engine = AgenticWorkflowEngine(
|
| 991 |
+
anemia_detector=anemia_det,
|
| 992 |
+
jaundice_detector=jaundice_det,
|
| 993 |
+
cry_analyzer=cry_ana,
|
| 994 |
+
synthesizer=synth,
|
| 995 |
+
)
|
| 996 |
+
result = engine.execute(workflow_input)
|
| 997 |
+
|
| 998 |
+
st.session_state["agentic_result"] = result
|
| 999 |
+
st.success("Workflow complete!")
|
| 1000 |
+
|
| 1001 |
+
except Exception as e:
|
| 1002 |
+
st.error(f"Workflow error: {e}")
|
| 1003 |
+
finally:
|
| 1004 |
+
for p in _temp_paths:
|
| 1005 |
+
_cleanup_temp(p)
|
| 1006 |
+
|
| 1007 |
+
# Results display
|
| 1008 |
+
if "agentic_result" in st.session_state:
|
| 1009 |
+
result = st.session_state["agentic_result"]
|
| 1010 |
+
|
| 1011 |
+
st.markdown("---")
|
| 1012 |
+
|
| 1013 |
+
# Overall classification
|
| 1014 |
+
severity_colors = {
|
| 1015 |
+
"GREEN": ("#d4edda", "#155724", "Routine care"),
|
| 1016 |
+
"YELLOW": ("#fff3cd", "#856404", "Close monitoring"),
|
| 1017 |
+
"RED": ("#f8d7da", "#721c24", "Urgent referral"),
|
| 1018 |
+
}
|
| 1019 |
+
bg, fg, desc = severity_colors.get(result.who_classification, ("#f8f9fa", "#000", "Unknown"))
|
| 1020 |
+
|
| 1021 |
+
st.markdown(f"""
|
| 1022 |
+
<div style="background: {bg}; color: {fg}; padding: 1.5rem; border-radius: 10px; text-align: center; margin: 1rem 0;">
|
| 1023 |
+
<h2 style="margin: 0;">WHO Classification: {result.who_classification}</h2>
|
| 1024 |
+
<p style="margin: 0.5rem 0 0 0; font-size: 1.1rem;">{desc}</p>
|
| 1025 |
+
</div>
|
| 1026 |
+
""", unsafe_allow_html=True)
|
| 1027 |
+
|
| 1028 |
+
# Key metrics
|
| 1029 |
+
m1, m2, m3, m4 = st.columns(4)
|
| 1030 |
+
with m1:
|
| 1031 |
+
st.metric("Agents Run", len(result.agent_traces))
|
| 1032 |
+
with m2:
|
| 1033 |
+
st.metric("Total Time", f"{result.processing_time_ms:.0f} ms")
|
| 1034 |
+
with m3:
|
| 1035 |
+
referral_text = "Yes" if (result.referral_result and result.referral_result.referral_needed) else "No"
|
| 1036 |
+
st.metric("Referral Needed", referral_text)
|
| 1037 |
+
with m4:
|
| 1038 |
+
triage_score = result.triage_result.score if result.triage_result else 0
|
| 1039 |
+
st.metric("Triage Score", triage_score)
|
| 1040 |
+
|
| 1041 |
+
# Clinical synthesis
|
| 1042 |
+
st.subheader("Clinical Synthesis")
|
| 1043 |
+
st.info(result.clinical_synthesis)
|
| 1044 |
+
|
| 1045 |
+
if result.immediate_actions:
|
| 1046 |
+
st.subheader("Immediate Actions")
|
| 1047 |
+
for action in result.immediate_actions:
|
| 1048 |
+
st.markdown(f"- {action}")
|
| 1049 |
+
|
| 1050 |
+
# Visual pipeline flow with status indicators
|
| 1051 |
+
st.markdown("---")
|
| 1052 |
+
st.subheader("Agent Pipeline Execution")
|
| 1053 |
+
|
| 1054 |
+
agent_meta = {
|
| 1055 |
+
"TriageAgent": {"color": "#1976d2", "bg": "#e3f2fd", "icon": "1", "label": "Triage"},
|
| 1056 |
+
"ImageAnalysisAgent": {"color": "#388e3c", "bg": "#e8f5e9", "icon": "2", "label": "Image (MedSigLIP)"},
|
| 1057 |
+
"AudioAnalysisAgent": {"color": "#f57c00", "bg": "#fff3e0", "icon": "3", "label": "Audio (HeAR)"},
|
| 1058 |
+
"ProtocolAgent": {"color": "#7b1fa2", "bg": "#f3e5f5", "icon": "4", "label": "WHO Protocol"},
|
| 1059 |
+
"ReferralAgent": {"color": "#c62828", "bg": "#fce4ec", "icon": "5", "label": "Referral"},
|
| 1060 |
+
"SynthesisAgent": {"color": "#00838f", "bg": "#e0f7fa", "icon": "6", "label": "Synthesis (MedGemma)"},
|
| 1061 |
+
}
|
| 1062 |
+
status_symbols = {"success": "OK", "skipped": "SKIP", "error": "ERR"}
|
| 1063 |
+
|
| 1064 |
+
# Build trace lookup
|
| 1065 |
+
trace_lookup = {t.agent_name: t for t in result.agent_traces}
|
| 1066 |
+
|
| 1067 |
+
# Pipeline status bar
|
| 1068 |
+
pipeline_html_parts = []
|
| 1069 |
+
for agent_name, meta in agent_meta.items():
|
| 1070 |
+
trace = trace_lookup.get(agent_name)
|
| 1071 |
+
if trace:
|
| 1072 |
+
status_sym = status_symbols.get(trace.status, "?")
|
| 1073 |
+
opacity = "1.0" if trace.status == "success" else "0.5"
|
| 1074 |
+
border_style = f"3px solid {meta['color']}" if trace.status == "success" else "2px dashed #999"
|
| 1075 |
+
time_label = f"{trace.processing_time_ms:.0f}ms"
|
| 1076 |
+
else:
|
| 1077 |
+
status_sym = "---"
|
| 1078 |
+
opacity = "0.3"
|
| 1079 |
+
border_style = "2px dashed #ccc"
|
| 1080 |
+
time_label = ""
|
| 1081 |
+
|
| 1082 |
+
pipeline_html_parts.append(f"""
|
| 1083 |
+
<div style="background: {meta['bg']}; padding: 0.4rem 0.7rem; border-radius: 8px;
|
| 1084 |
+
border: {border_style}; opacity: {opacity}; text-align: center; min-width: 90px;">
|
| 1085 |
+
<div style="font-weight: bold; font-size: 0.8rem; color: {meta['color']};">{meta['label']}</div>
|
| 1086 |
+
<div style="font-size: 0.7rem; color: #666;">{status_sym} {time_label}</div>
|
| 1087 |
+
</div>
|
| 1088 |
+
""")
|
| 1089 |
+
|
| 1090 |
+
pipeline_html = '<div style="display: flex; align-items: center; justify-content: center; gap: 0.3rem; flex-wrap: wrap; margin: 0.5rem 0;">'
|
| 1091 |
+
for i, part in enumerate(pipeline_html_parts):
|
| 1092 |
+
pipeline_html += part
|
| 1093 |
+
if i < len(pipeline_html_parts) - 1:
|
| 1094 |
+
pipeline_html += '<span style="font-size: 1.2rem; color: #999;">→</span>'
|
| 1095 |
+
pipeline_html += "</div>"
|
| 1096 |
+
st.markdown(pipeline_html, unsafe_allow_html=True)
|
| 1097 |
+
|
| 1098 |
+
# Agent reasoning traces (key feature for Agentic Workflow prize)
|
| 1099 |
+
st.markdown("---")
|
| 1100 |
+
st.subheader("Agent Reasoning Traces")
|
| 1101 |
+
|
| 1102 |
+
for trace in result.agent_traces:
|
| 1103 |
+
meta = agent_meta.get(trace.agent_name, {"color": "#666", "bg": "#f5f5f5", "label": trace.agent_name})
|
| 1104 |
+
status_emoji = {"success": "OK", "skipped": "SKIP", "error": "ERR"}.get(trace.status, "?")
|
| 1105 |
+
|
| 1106 |
+
header_label = f"{meta['label']} [{status_emoji}] - {trace.confidence:.0%} confidence - {trace.processing_time_ms:.0f}ms"
|
| 1107 |
+
with st.expander(header_label, expanded=(trace.status == "success")):
|
| 1108 |
+
# Status bar
|
| 1109 |
+
st.markdown(f"""
|
| 1110 |
+
<div style="background: {meta['bg']}; padding: 0.8rem 1rem; border-radius: 8px;
|
| 1111 |
+
border-left: 4px solid {meta['color']}; margin-bottom: 0.5rem;">
|
| 1112 |
+
<strong style="color: {meta['color']};">{trace.agent_name}</strong> |
|
| 1113 |
+
Status: <strong>{trace.status}</strong> |
|
| 1114 |
+
Confidence: <strong>{trace.confidence:.1%}</strong> |
|
| 1115 |
+
Time: <strong>{trace.processing_time_ms:.1f}ms</strong>
|
| 1116 |
+
</div>
|
| 1117 |
+
""", unsafe_allow_html=True)
|
| 1118 |
+
|
| 1119 |
+
# Reasoning steps with numbered styling
|
| 1120 |
+
if trace.reasoning:
|
| 1121 |
+
st.markdown("**Reasoning Chain:**")
|
| 1122 |
+
for i, step in enumerate(trace.reasoning, 1):
|
| 1123 |
+
st.markdown(f"**Step {i}.** {step}")
|
| 1124 |
+
|
| 1125 |
+
# Key findings
|
| 1126 |
+
if trace.findings:
|
| 1127 |
+
st.markdown("**Key Findings:**")
|
| 1128 |
+
st.json(trace.findings)
|
| 1129 |
+
|
| 1130 |
+
# Processing time breakdown
|
| 1131 |
+
st.markdown("---")
|
| 1132 |
+
col_chart, col_summary = st.columns([2, 1])
|
| 1133 |
+
|
| 1134 |
+
with col_chart:
|
| 1135 |
+
st.subheader("Processing Time by Agent")
|
| 1136 |
+
import pandas as pd
|
| 1137 |
+
chart_data = pd.DataFrame({
|
| 1138 |
+
"Agent": [agent_meta.get(t.agent_name, {}).get("label", t.agent_name) for t in result.agent_traces],
|
| 1139 |
+
"Time (ms)": [t.processing_time_ms for t in result.agent_traces],
|
| 1140 |
+
})
|
| 1141 |
+
st.bar_chart(chart_data.set_index("Agent"))
|
| 1142 |
+
|
| 1143 |
+
with col_summary:
|
| 1144 |
+
st.subheader("Workflow Summary")
|
| 1145 |
+
total_time = result.processing_time_ms
|
| 1146 |
+
successful = sum(1 for t in result.agent_traces if t.status == "success")
|
| 1147 |
+
skipped = sum(1 for t in result.agent_traces if t.status == "skipped")
|
| 1148 |
+
errors = sum(1 for t in result.agent_traces if t.status == "error")
|
| 1149 |
+
st.markdown(f"""
|
| 1150 |
+
| Metric | Value |
|
| 1151 |
+
|--------|-------|
|
| 1152 |
+
| Total agents | {len(result.agent_traces)} |
|
| 1153 |
+
| Successful | {successful} |
|
| 1154 |
+
| Skipped | {skipped} |
|
| 1155 |
+
| Errors | {errors} |
|
| 1156 |
+
| Total time | {total_time:.0f} ms |
|
| 1157 |
+
| Avg per agent | {total_time / max(len(result.agent_traces), 1):.0f} ms |
|
| 1158 |
+
""")
|
| 1159 |
+
|
| 1160 |
+
# Referral details
|
| 1161 |
+
if result.referral_result and result.referral_result.referral_needed:
|
| 1162 |
+
st.markdown("---")
|
| 1163 |
+
st.subheader("Referral Details")
|
| 1164 |
+
ref = result.referral_result
|
| 1165 |
+
r1, r2, r3 = st.columns(3)
|
| 1166 |
+
with r1:
|
| 1167 |
+
st.metric("Urgency", ref.urgency.upper())
|
| 1168 |
+
with r2:
|
| 1169 |
+
st.metric("Facility", ref.facility_level.title())
|
| 1170 |
+
with r3:
|
| 1171 |
+
st.metric("Timeframe", ref.timeframe)
|
| 1172 |
+
st.warning(f"Reason: {ref.reason}")
|
| 1173 |
+
|
| 1174 |
+
|
| 1175 |
+
# Footer
|
| 1176 |
+
def render_footer():
|
| 1177 |
+
"""Render footer."""
|
| 1178 |
+
st.markdown("---")
|
| 1179 |
+
st.markdown("""
|
| 1180 |
+
<div style="text-align: center; color: #666; font-size: 0.9rem;">
|
| 1181 |
+
<p>NEXUS - Built with Google HAI-DEF for MedGemma Impact Challenge 2026</p>
|
| 1182 |
+
<p>⚠️ This is a screening tool only. Always confirm with laboratory tests.</p>
|
| 1183 |
+
</div>
|
| 1184 |
+
""", unsafe_allow_html=True)
|
| 1185 |
+
|
| 1186 |
+
|
| 1187 |
+
if __name__ == "__main__":
|
| 1188 |
+
main()
|
| 1189 |
+
render_footer()
|
src/nexus/__init__.py
ADDED
|
@@ -0,0 +1,10 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
"""
|
| 2 |
+
NEXUS - AI-Powered Maternal-Neonatal Care Platform
|
| 3 |
+
|
| 4 |
+
This package provides AI-powered diagnostic tools for:
|
| 5 |
+
- Maternal anemia detection via conjunctiva imaging
|
| 6 |
+
- Neonatal jaundice assessment via skin/sclera imaging
|
| 7 |
+
- Birth asphyxia screening via cry audio analysis
|
| 8 |
+
"""
|
| 9 |
+
|
| 10 |
+
__version__ = "0.1.0"
|
src/nexus/agentic_workflow.py
ADDED
|
@@ -0,0 +1,1296 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
"""
|
| 2 |
+
Agentic Clinical Workflow Engine
|
| 3 |
+
|
| 4 |
+
Multi-agent system for comprehensive maternal-neonatal assessments.
|
| 5 |
+
Mirrors the TypeScript architecture in mobile/src/services/agenticWorkflow.ts
|
| 6 |
+
but adds structured reasoning traces for explainability.
|
| 7 |
+
|
| 8 |
+
6 Agents:
|
| 9 |
+
- TriageAgent: Initial danger sign screening (rules-based)
|
| 10 |
+
- ImageAnalysisAgent: MedSigLIP-powered anemia/jaundice detection
|
| 11 |
+
- AudioAnalysisAgent: HeAR-powered cry/asphyxia analysis
|
| 12 |
+
- ProtocolAgent: WHO IMNCI classification (rules-based)
|
| 13 |
+
- ReferralAgent: Urgency routing and referral decision (rules-based)
|
| 14 |
+
- SynthesisAgent: MedGemma clinical reasoning with full agent context
|
| 15 |
+
|
| 16 |
+
HAI-DEF Models Used:
|
| 17 |
+
- MedSigLIP (google/medsiglip-448) via ImageAnalysisAgent
|
| 18 |
+
- HeAR (google/hear-pytorch) via AudioAnalysisAgent
|
| 19 |
+
- MedGemma (google/medgemma-4b-it) via SynthesisAgent
|
| 20 |
+
"""
|
| 21 |
+
|
| 22 |
+
import time
|
| 23 |
+
from dataclasses import dataclass, field
|
| 24 |
+
from datetime import datetime
|
| 25 |
+
from pathlib import Path
|
| 26 |
+
from typing import Any, Callable, Dict, List, Literal, Optional, Union
|
| 27 |
+
|
| 28 |
+
|
| 29 |
+
# ---------------------------------------------------------------------------
|
| 30 |
+
# Data Types
|
| 31 |
+
# ---------------------------------------------------------------------------
|
| 32 |
+
|
| 33 |
+
PatientType = Literal["pregnant", "newborn"]
|
| 34 |
+
SeverityLevel = Literal["RED", "YELLOW", "GREEN"]
|
| 35 |
+
AgentStatus = Literal["success", "skipped", "error"]
|
| 36 |
+
WorkflowState = Literal[
|
| 37 |
+
"idle",
|
| 38 |
+
"triaging",
|
| 39 |
+
"analyzing_image",
|
| 40 |
+
"analyzing_audio",
|
| 41 |
+
"applying_protocol",
|
| 42 |
+
"determining_referral",
|
| 43 |
+
"synthesizing",
|
| 44 |
+
"complete",
|
| 45 |
+
"error",
|
| 46 |
+
]
|
| 47 |
+
|
| 48 |
+
|
| 49 |
+
@dataclass
|
| 50 |
+
class DangerSign:
|
| 51 |
+
"""A clinical danger sign observed during triage."""
|
| 52 |
+
id: str
|
| 53 |
+
label: str
|
| 54 |
+
severity: Literal["critical", "high", "medium"]
|
| 55 |
+
present: bool = False
|
| 56 |
+
|
| 57 |
+
|
| 58 |
+
@dataclass
|
| 59 |
+
class AgentPatientInfo:
|
| 60 |
+
"""Patient information for workflow context."""
|
| 61 |
+
patient_id: str = ""
|
| 62 |
+
patient_type: PatientType = "newborn"
|
| 63 |
+
gestational_weeks: Optional[int] = None
|
| 64 |
+
gravida: Optional[int] = None
|
| 65 |
+
para: Optional[int] = None
|
| 66 |
+
age_hours: Optional[int] = None
|
| 67 |
+
birth_weight: Optional[int] = None
|
| 68 |
+
delivery_type: Optional[str] = None
|
| 69 |
+
apgar_score: Optional[int] = None
|
| 70 |
+
gestational_age_at_birth: Optional[int] = None
|
| 71 |
+
|
| 72 |
+
|
| 73 |
+
@dataclass
|
| 74 |
+
class AgentResult:
|
| 75 |
+
"""Structured output from a single agent with reasoning trace."""
|
| 76 |
+
agent_name: str
|
| 77 |
+
status: AgentStatus
|
| 78 |
+
reasoning: List[str] = field(default_factory=list)
|
| 79 |
+
findings: Dict[str, Any] = field(default_factory=dict)
|
| 80 |
+
confidence: float = 0.0
|
| 81 |
+
processing_time_ms: float = 0.0
|
| 82 |
+
|
| 83 |
+
|
| 84 |
+
@dataclass
|
| 85 |
+
class TriageResult:
|
| 86 |
+
"""Output from TriageAgent."""
|
| 87 |
+
risk_level: SeverityLevel = "GREEN"
|
| 88 |
+
critical_signs_detected: bool = False
|
| 89 |
+
critical_signs: List[str] = field(default_factory=list)
|
| 90 |
+
immediate_referral_needed: bool = False
|
| 91 |
+
score: int = 0
|
| 92 |
+
|
| 93 |
+
|
| 94 |
+
@dataclass
|
| 95 |
+
class ImageAnalysisResult:
|
| 96 |
+
"""Output from ImageAnalysisAgent."""
|
| 97 |
+
anemia: Optional[Dict[str, Any]] = None
|
| 98 |
+
jaundice: Optional[Dict[str, Any]] = None
|
| 99 |
+
|
| 100 |
+
|
| 101 |
+
@dataclass
|
| 102 |
+
class AudioAnalysisResult:
|
| 103 |
+
"""Output from AudioAnalysisAgent."""
|
| 104 |
+
cry: Optional[Dict[str, Any]] = None
|
| 105 |
+
|
| 106 |
+
|
| 107 |
+
@dataclass
|
| 108 |
+
class ProtocolResult:
|
| 109 |
+
"""Output from ProtocolAgent."""
|
| 110 |
+
classification: SeverityLevel = "GREEN"
|
| 111 |
+
applicable_protocols: List[str] = field(default_factory=list)
|
| 112 |
+
treatment_recommendations: List[str] = field(default_factory=list)
|
| 113 |
+
follow_up_schedule: str = ""
|
| 114 |
+
|
| 115 |
+
|
| 116 |
+
@dataclass
|
| 117 |
+
class ReferralResult:
|
| 118 |
+
"""Output from ReferralAgent."""
|
| 119 |
+
referral_needed: bool = False
|
| 120 |
+
urgency: Literal["immediate", "urgent", "routine", "none"] = "none"
|
| 121 |
+
facility_level: Literal["primary", "secondary", "tertiary"] = "primary"
|
| 122 |
+
reason: str = "No referral required"
|
| 123 |
+
timeframe: str = "Not applicable"
|
| 124 |
+
|
| 125 |
+
|
| 126 |
+
@dataclass
|
| 127 |
+
class WorkflowInput:
|
| 128 |
+
"""Input to the agentic workflow."""
|
| 129 |
+
patient_type: PatientType
|
| 130 |
+
patient_info: AgentPatientInfo = field(default_factory=AgentPatientInfo)
|
| 131 |
+
danger_signs: List[DangerSign] = field(default_factory=list)
|
| 132 |
+
conjunctiva_image: Optional[Union[str, Path]] = None
|
| 133 |
+
skin_image: Optional[Union[str, Path]] = None
|
| 134 |
+
cry_audio: Optional[Union[str, Path]] = None
|
| 135 |
+
additional_notes: str = ""
|
| 136 |
+
|
| 137 |
+
|
| 138 |
+
@dataclass
|
| 139 |
+
class WorkflowResult:
|
| 140 |
+
"""Complete workflow output with all agent results and audit trail."""
|
| 141 |
+
success: bool = False
|
| 142 |
+
patient_type: PatientType = "newborn"
|
| 143 |
+
who_classification: SeverityLevel = "GREEN"
|
| 144 |
+
|
| 145 |
+
# Individual agent outputs
|
| 146 |
+
triage_result: Optional[TriageResult] = None
|
| 147 |
+
image_results: Optional[ImageAnalysisResult] = None
|
| 148 |
+
audio_results: Optional[AudioAnalysisResult] = None
|
| 149 |
+
protocol_result: Optional[ProtocolResult] = None
|
| 150 |
+
referral_result: Optional[ReferralResult] = None
|
| 151 |
+
|
| 152 |
+
# Synthesis
|
| 153 |
+
clinical_synthesis: str = ""
|
| 154 |
+
recommendation: str = ""
|
| 155 |
+
immediate_actions: List[str] = field(default_factory=list)
|
| 156 |
+
|
| 157 |
+
# Audit trail
|
| 158 |
+
agent_traces: List[AgentResult] = field(default_factory=list)
|
| 159 |
+
processing_time_ms: float = 0.0
|
| 160 |
+
timestamp: str = ""
|
| 161 |
+
|
| 162 |
+
|
| 163 |
+
# ---------------------------------------------------------------------------
|
| 164 |
+
# Individual Agents
|
| 165 |
+
# ---------------------------------------------------------------------------
|
| 166 |
+
|
| 167 |
+
class TriageAgent:
    """
    Initial risk stratification based on danger signs, patient info, and
    clinical decision tree logic.

    Decision tree considers:
    - Danger sign severity and combinations
    - Patient demographics (age, weight, gestational age)
    - Comorbidity patterns (multiple conditions increase risk)
    - Time-sensitive factors (e.g., jaundice < 24hrs = always RED)
    """

    def process(
        self,
        patient_type: PatientType,
        danger_signs: List[DangerSign],
        patient_info: AgentPatientInfo,
    ) -> tuple[TriageResult, AgentResult]:
        """Score the patient and map the score to RED/YELLOW/GREEN.

        The score is additive: each present danger sign contributes by
        severity (critical +30, high +15, medium +5), >=3 signs add a
        comorbidity bonus, and demographic factors add per the branches
        below. score >= 30 or any critical sign -> RED; score >= 15 ->
        YELLOW; otherwise GREEN.

        Returns:
            (TriageResult, AgentResult): the structured triage outcome and
            an audit trace whose ``reasoning`` lists every scoring step.
        """
        start = time.time()
        reasoning: List[str] = []
        score = 0  # additive risk score; thresholds below map it to a level
        critical_signs: List[str] = []
        risk_modifiers: List[str] = []

        reasoning.append(f"[STEP 1/5] Initiating clinical triage for {patient_type} patient")

        # Step 1: Evaluate danger signs with clinical context
        present_signs = [s for s in danger_signs if s.present]
        reasoning.append(f"[STEP 2/5] Evaluating {len(present_signs)} present danger signs out of {len(danger_signs)} assessed")

        for sign in present_signs:
            if sign.severity == "critical":
                score += 30
                critical_signs.append(sign.label)
                reasoning.append(f" CRITICAL: '{sign.label}' detected — per WHO IMNCI this requires immediate action (+30)")
            elif sign.severity == "high":
                score += 15
                reasoning.append(f" HIGH: '{sign.label}' detected — warrants close monitoring (+15)")
            elif sign.severity == "medium":
                score += 5
                reasoning.append(f" MEDIUM: '{sign.label}' detected — noted for assessment (+5)")
            # NOTE(review): other severity values (e.g. "low") score 0 by design.

        # Comorbidity check: multiple conditions compound risk
        if len(present_signs) >= 3:
            combo_bonus = 10
            score += combo_bonus
            risk_modifiers.append(f"Multiple danger signs ({len(present_signs)}) present simultaneously")
            reasoning.append(f" COMORBIDITY: {len(present_signs)} danger signs present — compounding risk (+{combo_bonus})")

        # Step 2: Patient-specific demographic risk assessment
        reasoning.append(f"[STEP 3/5] Assessing demographic risk factors")

        if patient_type == "pregnant":
            # Gestational-age risk bands: <28 extreme preterm, 28-36 preterm,
            # >42 post-term; 37-42 is treated as normal.
            if patient_info.gestational_weeks is not None:
                ga = patient_info.gestational_weeks
                if ga < 28:
                    score += 15
                    risk_modifiers.append(f"Extreme preterm ({ga} weeks)")
                    reasoning.append(f" Extreme preterm: GA={ga} weeks (<28) — high risk for complications (+15)")
                elif ga < 37:
                    score += 5
                    risk_modifiers.append(f"Preterm ({ga} weeks)")
                    reasoning.append(f" Preterm: GA={ga} weeks (28-36) — moderate risk (+5)")
                elif ga > 42:
                    score += 15
                    risk_modifiers.append(f"Post-term ({ga} weeks)")
                    reasoning.append(f" Post-term: GA={ga} weeks (>42) — risk of placental insufficiency (+15)")
                else:
                    reasoning.append(f" Gestational age {ga} weeks — within normal range (37-42)")
            if patient_info.gravida is not None and patient_info.gravida >= 5:
                score += 5
                risk_modifiers.append(f"Grand multigravida (G{patient_info.gravida})")
                reasoning.append(f" Grand multigravida: G{patient_info.gravida} — increased obstetric risk (+5)")

        elif patient_type == "newborn":
            # Birth weight (grams): <1500 very low, <2500 low.
            if patient_info.birth_weight is not None:
                bw = patient_info.birth_weight
                if bw < 1500:
                    score += 20
                    risk_modifiers.append(f"Very low birth weight ({bw}g)")
                    reasoning.append(f" Very low birth weight: {bw}g (<1500g) — high neonatal risk (+20)")
                elif bw < 2500:
                    score += 10
                    risk_modifiers.append(f"Low birth weight ({bw}g)")
                    reasoning.append(f" Low birth weight: {bw}g (<2500g) — moderate risk (+10)")
                else:
                    reasoning.append(f" Birth weight {bw}g — within normal range")

            # APGAR: <4 severe depression, <7 moderate.
            if patient_info.apgar_score is not None:
                apgar = patient_info.apgar_score
                if apgar < 4:
                    score += 25
                    risk_modifiers.append(f"Severe depression (APGAR {apgar})")
                    reasoning.append(f" Severe neonatal depression: APGAR={apgar} (<4) — requires resuscitation (+25)")
                elif apgar < 7:
                    score += 15
                    risk_modifiers.append(f"Moderate depression (APGAR {apgar})")
                    reasoning.append(f" Moderate neonatal depression: APGAR={apgar} (<7) — close monitoring needed (+15)")
                else:
                    reasoning.append(f" APGAR score {apgar} — within normal range")

            # Postnatal age: first 6 hours are the highest-vulnerability window.
            if patient_info.age_hours is not None:
                age = patient_info.age_hours
                if age < 6:
                    score += 10
                    risk_modifiers.append(f"Critical neonatal period ({age}h)")
                    reasoning.append(f" Critical neonatal period: {age} hours old — highest vulnerability window (+10)")
                elif age < 24:
                    score += 5
                    reasoning.append(f" First day of life: {age} hours — increased monitoring needed (+5)")

            if patient_info.gestational_age_at_birth is not None and patient_info.gestational_age_at_birth < 37:
                score += 10
                risk_modifiers.append(f"Premature birth ({patient_info.gestational_age_at_birth} weeks)")
                reasoning.append(f" Premature birth at {patient_info.gestational_age_at_birth} weeks — increased susceptibility (+10)")

        # Step 3: Clinical decision tree — any critical sign forces RED
        # regardless of total score.
        reasoning.append(f"[STEP 4/5] Applying clinical decision tree")

        if score >= 30 or len(critical_signs) > 0:
            risk_level: SeverityLevel = "RED"
            reasoning.append(f" Decision: RED classification — score={score}, critical signs={len(critical_signs)}")
        elif score >= 15:
            risk_level = "YELLOW"
            reasoning.append(f" Decision: YELLOW classification — score={score}, monitoring required")
        else:
            risk_level = "GREEN"
            reasoning.append(f" Decision: GREEN classification — score={score}, routine care")

        critical_detected = len(critical_signs) > 0
        # Immediate referral only when RED *and* at least one critical sign;
        # a score-only RED is "urgent" rather than "immediate".
        immediate_referral = risk_level == "RED" and critical_detected

        # Step 4: Summary with clinical rationale
        reasoning.append(f"[STEP 5/5] Triage conclusion")
        reasoning.append(f" Total triage score: {score}")
        reasoning.append(f" Risk classification: {risk_level} ({self._risk_rationale(risk_level)})")
        if risk_modifiers:
            reasoning.append(f" Risk modifiers: {'; '.join(risk_modifiers)}")
        if immediate_referral:
            reasoning.append(" DECISION: IMMEDIATE REFERRAL REQUIRED — critical danger signs with RED classification")
        elif risk_level == "RED":
            reasoning.append(" DECISION: URGENT referral recommended — RED classification without critical signs")

        elapsed = (time.time() - start) * 1000

        result = TriageResult(
            risk_level=risk_level,
            critical_signs_detected=critical_detected,
            critical_signs=critical_signs,
            immediate_referral_needed=immediate_referral,
            score=score,
        )

        # Confidence is fixed at 1.0: this agent is deterministic rule logic,
        # not a model prediction.
        trace = AgentResult(
            agent_name="TriageAgent",
            status="success",
            reasoning=reasoning,
            findings={
                "risk_level": risk_level,
                "score": score,
                "critical_signs": critical_signs,
                "risk_modifiers": risk_modifiers,
                "immediate_referral": immediate_referral,
            },
            confidence=1.0,
            processing_time_ms=elapsed,
        )

        return result, trace

    @staticmethod
    def _risk_rationale(level: str) -> str:
        """Return a one-line clinical rationale for a RED/YELLOW/GREEN level.

        Unknown levels map to the empty string.
        """
        return {
            "RED": "immediate intervention required per WHO IMNCI",
            "YELLOW": "close monitoring with 24-48h follow-up",
            "GREEN": "routine care with standard follow-up schedule",
        }.get(level, "")
|
| 344 |
+
|
| 345 |
+
|
| 346 |
+
class ImageAnalysisAgent:
    """
    Visual analysis using MedSigLIP for anemia and jaundice detection.

    HAI-DEF Model: MedSigLIP (google/medsiglip-448)
    Reuses existing AnemiaDetector and JaundiceDetector instances.
    """

    def __init__(
        self,
        anemia_detector: Optional[Any] = None,
        jaundice_detector: Optional[Any] = None,
    ):
        # Detectors may be injected (e.g. pre-loaded, shared instances) or
        # left as None to be constructed lazily on first use.
        self._anemia_detector = anemia_detector
        self._jaundice_detector = jaundice_detector

    def _get_anemia_detector(self) -> Any:
        """Lazily construct and cache the AnemiaDetector on first use."""
        if self._anemia_detector is None:
            # Imported here to defer the (model-loading) cost until needed.
            from .anemia_detector import AnemiaDetector
            self._anemia_detector = AnemiaDetector()
        return self._anemia_detector

    def _get_jaundice_detector(self) -> Any:
        """Lazily construct and cache the JaundiceDetector on first use."""
        if self._jaundice_detector is None:
            # Imported here to defer the (model-loading) cost until needed.
            from .jaundice_detector import JaundiceDetector
            self._jaundice_detector = JaundiceDetector()
        return self._jaundice_detector

    def process(
        self,
        patient_type: PatientType,
        conjunctiva_image: Optional[Union[str, Path]] = None,
        skin_image: Optional[Union[str, Path]] = None,
    ) -> tuple[ImageAnalysisResult, AgentResult]:
        """Run anemia and/or jaundice detection on the provided images.

        Each analysis runs only when its image path is provided; a missing
        image simply skips that branch. Detector exceptions are caught and
        converted into placeholder error dicts so the workflow can continue.

        Returns:
            (ImageAnalysisResult, AgentResult): per-condition result dicts
            plus an audit trace. Trace confidence is the mean of the
            per-analysis confidences (0.0 when nothing ran).
        """
        start = time.time()
        reasoning: List[str] = []
        result = ImageAnalysisResult()
        confidence_scores: List[float] = []

        reasoning.append(f"Starting image analysis for {patient_type} patient")

        # Anemia screening (both maternal and newborn)
        if conjunctiva_image:
            reasoning.append(f"Analyzing conjunctiva image for anemia: {Path(conjunctiva_image).name}")
            try:
                detector = self._get_anemia_detector()
                anemia_result = detector.detect(conjunctiva_image)
                result.anemia = anemia_result
                conf = anemia_result.get("confidence", 0)
                confidence_scores.append(conf)

                if anemia_result.get("is_anemic"):
                    reasoning.append(
                        f"ANEMIA DETECTED: confidence={conf:.1%}, "
                        f"risk_level={anemia_result.get('risk_level', 'unknown')}"
                    )
                else:
                    reasoning.append(f"No anemia detected (confidence={conf:.1%})")

                reasoning.append(f"Model used: {anemia_result.get('model', 'MedSigLIP')}")
            except Exception as e:
                # Fail soft: record a placeholder dict (model="error") so the
                # downstream protocol agent sees a well-formed structure.
                reasoning.append(f"Anemia analysis failed: {e}")
                result.anemia = {
                    "is_anemic": False,
                    "confidence": 0.0,
                    "risk_level": "low",
                    "recommendation": "Analysis failed - please retry",
                    "anemia_score": 0.0,
                    "healthy_score": 0.0,
                    "model": "error",
                }
        else:
            reasoning.append("No conjunctiva image provided - skipping anemia screening")

        # Jaundice detection (newborn or if skin image provided)
        if skin_image:
            reasoning.append(f"Analyzing skin image for jaundice: {Path(skin_image).name}")
            try:
                detector = self._get_jaundice_detector()
                jaundice_result = detector.detect(skin_image)
                result.jaundice = jaundice_result
                conf = jaundice_result.get("confidence", 0)
                confidence_scores.append(conf)

                if jaundice_result.get("has_jaundice"):
                    reasoning.append(
                        f"JAUNDICE DETECTED: severity={jaundice_result.get('severity', 'unknown')}, "
                        f"estimated bilirubin={jaundice_result.get('estimated_bilirubin', 'N/A')} mg/dL, "
                        f"phototherapy={'needed' if jaundice_result.get('needs_phototherapy') else 'not needed'}"
                    )
                else:
                    reasoning.append(f"No significant jaundice detected (confidence={conf:.1%})")

                reasoning.append(f"Model used: {jaundice_result.get('model', 'MedSigLIP')}")
            except Exception as e:
                # Fail soft, mirroring the anemia error placeholder above.
                reasoning.append(f"Jaundice analysis failed: {e}")
                result.jaundice = {
                    "has_jaundice": False,
                    "confidence": 0.0,
                    "severity": "none",
                    "estimated_bilirubin": 0.0,
                    "needs_phototherapy": False,
                    "recommendation": "Analysis failed - please retry",
                    "model": "error",
                }
        else:
            reasoning.append("No skin image provided - skipping jaundice detection")

        # NOTE(review): a failed analysis still populates result.anemia /
        # result.jaundice with an error dict, so has_findings is True and the
        # trace status reads "success" even after an exception — the only
        # failure signal is model="error" and confidence 0.0. Confirm whether
        # downstream consumers rely on this before changing it.
        has_findings = result.anemia is not None or result.jaundice is not None
        elapsed = (time.time() - start) * 1000
        avg_confidence = sum(confidence_scores) / len(confidence_scores) if confidence_scores else 0.0

        trace = AgentResult(
            agent_name="ImageAnalysisAgent",
            status="success" if has_findings else "skipped",
            reasoning=reasoning,
            findings={
                "anemia_detected": result.anemia.get("is_anemic", False) if result.anemia else None,
                "jaundice_detected": result.jaundice.get("has_jaundice", False) if result.jaundice else None,
            },
            confidence=avg_confidence,
            processing_time_ms=elapsed,
        )

        return result, trace
|
| 471 |
+
|
| 472 |
+
|
| 473 |
+
class AudioAnalysisAgent:
    """
    Acoustic analysis using HeAR for cry pattern and asphyxia detection.

    HAI-DEF Model: HeAR (google/hear-pytorch)
    Reuses existing CryAnalyzer instance.
    """

    def __init__(self, cry_analyzer: Optional[Any] = None):
        # Analyzer may be injected (shared, pre-loaded) or lazily built.
        self._cry_analyzer = cry_analyzer

    def _get_cry_analyzer(self) -> Any:
        """Lazily construct and cache the CryAnalyzer on first use."""
        if self._cry_analyzer is None:
            # Imported here to defer the (model-loading) cost until needed.
            from .cry_analyzer import CryAnalyzer
            self._cry_analyzer = CryAnalyzer()
        return self._cry_analyzer

    def process(
        self,
        cry_audio: Optional[Union[str, Path]] = None,
    ) -> tuple[AudioAnalysisResult, AgentResult]:
        """Analyze a newborn cry recording for abnormal patterns.

        Returns:
            (AudioAnalysisResult, AgentResult): the cry result dict plus an
            audit trace. Trace status is "skipped" when no audio is given,
            "error" when the analyzer raised, otherwise "success".

        Bug fix vs. previous version: the except path used to leave status
        at "success" (the error placeholder dict made ``result.cry`` truthy,
        so ``"success" if result.cry else "error"`` could never yield
        "error"); failure is now tracked explicitly.
        """
        start = time.time()
        reasoning: List[str] = []
        result = AudioAnalysisResult()

        if not cry_audio:
            reasoning.append("No cry audio provided - skipping audio analysis")
            elapsed = (time.time() - start) * 1000
            trace = AgentResult(
                agent_name="AudioAnalysisAgent",
                status="skipped",
                reasoning=reasoning,
                findings={},
                confidence=0.0,
                processing_time_ms=elapsed,
            )
            return result, trace

        reasoning.append(f"Analyzing cry audio: {Path(cry_audio).name}")

        analysis_failed = False  # set True when the analyzer raises
        try:
            analyzer = self._get_cry_analyzer()
            cry_result = analyzer.analyze(cry_audio)
            result.cry = cry_result

            risk = cry_result.get("asphyxia_risk", 0)
            reasoning.append(f"Model used: {cry_result.get('model', 'HeAR')}")
            reasoning.append(f"Cry type detected: {cry_result.get('cry_type', 'unknown')}")
            reasoning.append(f"Asphyxia risk score: {risk:.1%}")

            features = cry_result.get("features", {})
            if features:
                reasoning.append(
                    f"Acoustic features: F0={features.get('f0_mean', 0):.0f}Hz, "
                    f"duration={features.get('duration', 0):.1f}s, "
                    f"voiced_ratio={features.get('voiced_ratio', 0):.2f}"
                )

            if cry_result.get("is_abnormal"):
                reasoning.append(
                    f"ABNORMAL CRY PATTERN: risk_level={cry_result.get('risk_level', 'unknown')}"
                )
            else:
                reasoning.append("Normal cry pattern detected")

            # Higher confidence when risk score is far from 0.5 (clear result),
            # clamped to [0.5, 1.0].
            confidence = 0.5 + abs(risk - 0.5)
            confidence = max(0.5, min(1.0, confidence))

        except Exception as e:
            # Fail soft: record a placeholder dict so downstream consumers
            # see a well-formed structure, but mark the trace as an error.
            analysis_failed = True
            reasoning.append(f"Cry analysis failed: {e}")
            result.cry = {
                "is_abnormal": False,
                "asphyxia_risk": 0.0,
                "cry_type": "unknown",
                "risk_level": "low",
                "recommendation": "Analysis failed - please retry",
                "features": {},
                "model": "error",
            }
            confidence = 0.0

        elapsed = (time.time() - start) * 1000

        trace = AgentResult(
            agent_name="AudioAnalysisAgent",
            status="error" if analysis_failed else "success",
            reasoning=reasoning,
            findings={
                "is_abnormal": result.cry.get("is_abnormal", False) if result.cry else None,
                "asphyxia_risk": result.cry.get("asphyxia_risk", 0) if result.cry else None,
            },
            confidence=confidence,
            processing_time_ms=elapsed,
        )

        return result, trace
|
| 570 |
+
|
| 571 |
+
|
| 572 |
+
class ProtocolAgent:
    """
    Applies WHO IMNCI guidelines with clinical reasoning for severity
    classification and evidence-based treatment recommendations.

    Reasoning process:
    1. Evaluate each condition against WHO IMNCI thresholds
    2. Check for protocol conflicts (e.g., anemia + jaundice comorbidity)
    3. Apply condition-specific treatment algorithms
    4. Generate time-bound follow-up schedule
    """

    def process(
        self,
        patient_type: PatientType,
        triage: TriageResult,
        image: ImageAnalysisResult,
        audio: Optional[AudioAnalysisResult] = None,
    ) -> tuple[ProtocolResult, AgentResult]:
        """Map the detection results onto WHO IMNCI protocols.

        Starts from the triage risk level and only ever escalates it
        (YELLOW/RED); no branch below downgrades an existing RED.

        Returns:
            (ProtocolResult, AgentResult): the classification, applied
            protocols, treatment recommendations, and follow-up schedule,
            plus an audit trace with step-by-step reasoning.

        Cleanup vs. previous version: removed dead code in the jaundice
        branch (an unused ``age_hours`` variable, a no-op ``hasattr`` check,
        and a no-op ``getattr(self, '_patient_info', ...)`` walrus) that
        falsely implied age-specific thresholds were being computed.
        """
        start = time.time()
        reasoning: List[str] = []
        protocols: List[str] = []
        recommendations: List[str] = []
        classification: SeverityLevel = triage.risk_level
        conditions_found: List[str] = []

        reasoning.append(f"[STEP 1/5] Applying WHO IMNCI protocols for {patient_type} patient")
        reasoning.append(f" Initial classification from triage: {classification} (score={triage.score})")

        # ---- Maternal protocols ----
        if patient_type == "pregnant":
            protocols.append("WHO IMNCI Maternal Care")
            reasoning.append(f"[STEP 2/5] Evaluating maternal conditions")

            if image.anemia and image.anemia.get("is_anemic"):
                protocols.append("Anemia Management Protocol")
                conditions_found.append("anemia")
                est_hb = image.anemia.get("estimated_hemoglobin", 0)
                risk_level = image.anemia.get("risk_level", "unknown")

                reasoning.append(f" Anemia detected: risk={risk_level}, est. Hb={est_hb} g/dL")

                # WHO thresholds: pregnant women Hb<11 = anemia, Hb<7 = severe
                # (Non-pregnant women Hb<12; neonates vary by age)
                severe_threshold = 7.0
                moderate_threshold = 11.0
                reasoning.append(f" Using WHO maternal thresholds: severe<{severe_threshold}, moderate<{moderate_threshold} g/dL")

                # est_hb may be 0/None when the detector gave no estimate;
                # the truthiness check skips the numeric thresholds then.
                if est_hb and est_hb < severe_threshold:
                    classification = "RED"
                    recommendations.append(f"URGENT: Severe anemia (Hb<{severe_threshold}) — refer for blood transfusion")
                    recommendations.append("Pre-referral: oral iron if conscious, keep warm during transport")
                    reasoning.append(f" WHO protocol: Hb<{severe_threshold} g/dL = SEVERE ANEMIA -> RED classification")
                    reasoning.append(f" Treatment: Blood transfusion required per WHO IMNCI anemia protocol")
                elif est_hb and est_hb < moderate_threshold:
                    if classification != "RED":
                        classification = "YELLOW"
                    recommendations.append("Initiate iron supplementation (60mg elemental iron + 400mcg folic acid daily)")
                    recommendations.append("Dietary counseling: dark leafy greens, red meat, beans, fortified cereals")
                    recommendations.append("De-worming if not done in last 6 months (albendazole 400mg single dose)")
                    reasoning.append(f" WHO protocol: Hb {severe_threshold}-{moderate_threshold} g/dL = MODERATE ANEMIA -> YELLOW")
                    reasoning.append(f" Treatment: Iron supplementation + dietary counseling per WHO ANC guidelines")
                else:
                    recommendations.append("Monitor hemoglobin levels, encourage iron-rich diet")
                    reasoning.append(f" Mild anemia or screening positive — continue monitoring")

            if triage.critical_signs_detected:
                protocols.append("Emergency Obstetric Care Protocol")
                recommendations.append("Immediate assessment for emergency obstetric conditions")
                reasoning.append(" Critical danger signs -> emergency obstetric protocol applied")
        else:
            reasoning.append(f"[STEP 2/5] Patient is newborn — skipping maternal protocols")

        # ---- Newborn protocols ----
        if patient_type == "newborn":
            protocols.append("WHO IMNCI Newborn Care")
            reasoning.append(f"[STEP 3/5] Evaluating neonatal conditions")

            # Jaundice — with age-specific AAP/WHO thresholds
            if image.jaundice and image.jaundice.get("has_jaundice"):
                protocols.append("Neonatal Jaundice Protocol")
                conditions_found.append("jaundice")
                est_bili = image.jaundice.get("estimated_bilirubin", 0)
                est_bili_ml = image.jaundice.get("estimated_bilirubin_ml")
                severity = image.jaundice.get("severity", "unknown")
                # Prefer the ML regression estimate when available; fall back
                # to the color-analysis estimate otherwise.
                bili_value = est_bili_ml if est_bili_ml is not None else est_bili

                reasoning.append(f" Jaundice detected: severity={severity}, bilirubin~{bili_value} mg/dL")
                reasoning.append(f" Bilirubin method: {image.jaundice.get('bilirubin_method', 'color analysis')}")

                # Age-specific phototherapy thresholds (AAP 2004 / WHO)
                # For low-risk term newborns (>= 38 weeks):
                #   Age(h)   Phototherapy   Exchange
                #   24       12             19
                #   48       15             22
                #   72       18             24
                #   96+      20             25
                # Patient age is not available to this agent, so we use the
                # most conservative (>96h) column as fixed defaults.
                photo_threshold = 20.0  # default (>96h)
                exchange_threshold = 25.0
                reasoning.append(f" Using phototherapy threshold={photo_threshold} mg/dL, exchange={exchange_threshold} mg/dL")

                if bili_value and bili_value > exchange_threshold:
                    classification = "RED"
                    recommendations.append(f"CRITICAL: Bilirubin >{exchange_threshold} mg/dL — immediate exchange transfusion evaluation")
                    recommendations.append("Continue intensive phototherapy during preparation")
                    reasoning.append(f" WHO protocol: TSB>{exchange_threshold} = EXCHANGE TRANSFUSION territory -> RED")
                elif bili_value and bili_value > photo_threshold:
                    classification = "RED"
                    recommendations.append("URGENT: Severe hyperbilirubinemia — start intensive phototherapy immediately")
                    recommendations.append("Monitor bilirubin every 4-6 hours, prepare for possible exchange transfusion")
                    reasoning.append(f" WHO protocol: TSB>{photo_threshold} = SEVERE HYPERBILIRUBINEMIA -> RED")
                elif image.jaundice.get("needs_phototherapy"):
                    # Detector's own (age-aware) phototherapy flag.
                    if classification != "RED":
                        classification = "YELLOW"
                    recommendations.append("Initiate phototherapy (standard irradiance)")
                    recommendations.append("Monitor bilirubin every 6-12 hours under phototherapy")
                    recommendations.append("Ensure adequate breastfeeding (8-12 feeds per day)")
                    reasoning.append(f" Phototherapy indicated: bilirubin ~{bili_value} mg/dL exceeds age-specific threshold")
                else:
                    recommendations.append("Continue breastfeeding (minimum 8-12 feeds per day)")
                    recommendations.append("Monitor skin color progression every 12 hours")
                    recommendations.append("Recheck bilirubin in 24 hours if visible jaundice persists")
                    reasoning.append(f" Mild jaundice ({bili_value} mg/dL) — monitoring and breastfeeding")

            # Cry / asphyxia
            if audio and audio.cry and audio.cry.get("is_abnormal"):
                protocols.append("Birth Asphyxia Assessment Protocol")
                conditions_found.append("abnormal_cry")
                asphyxia_risk = audio.cry.get("asphyxia_risk", 0)
                cry_type = audio.cry.get("cry_type", "unknown")

                reasoning.append(f" Abnormal cry: type={cry_type}, asphyxia_risk={asphyxia_risk:.1%}")

                if asphyxia_risk > 0.7:
                    classification = "RED"
                    recommendations.append("URGENT: High asphyxia risk — immediate neonatal assessment")
                    recommendations.append("Check airway, breathing, circulation (ABC)")
                    recommendations.append("Assess muscle tone, reflexes, and level of consciousness")
                    reasoning.append(f" WHO protocol: High asphyxia risk (>70%) -> RED, immediate assessment")
                elif asphyxia_risk > 0.4:
                    if classification != "RED":
                        classification = "YELLOW"
                    recommendations.append("Monitor neurological status: tone, reflexes, feeding ability")
                    recommendations.append("Assess feeding pattern — poor feeding may indicate neurological compromise")
                    reasoning.append(f" Moderate asphyxia risk ({asphyxia_risk:.1%}) -> YELLOW, close monitoring")
                else:
                    reasoning.append(f" Low asphyxia risk ({asphyxia_risk:.1%}) — documented but not concerning")

            # Neonatal anemia
            if image.anemia and image.anemia.get("is_anemic"):
                protocols.append("Neonatal Anemia Protocol")
                conditions_found.append("neonatal_anemia")
                recommendations.append("Check hematocrit and reticulocyte count")
                recommendations.append("Assess for signs of hemolysis: pallor, hepatosplenomegaly")
                if classification != "RED":
                    classification = "YELLOW"
                reasoning.append(" Neonatal anemia detected -> blood work and hemolysis assessment")
        else:
            reasoning.append(f"[STEP 3/5] Patient is pregnant — skipping neonatal protocols")

        # Step 4: Comorbidity analysis and protocol conflict resolution
        reasoning.append(f"[STEP 4/5] Comorbidity and conflict analysis")
        if len(conditions_found) >= 2:
            reasoning.append(f" Multiple conditions detected: {', '.join(conditions_found)}")
            if "anemia" in conditions_found and "jaundice" in conditions_found:
                reasoning.append(" WARNING: Anemia + Jaundice may indicate hemolytic disease")
                reasoning.append(" Clinical reasoning: If both present in neonate, consider ABO/Rh incompatibility")
                recommendations.append("Consider Coombs test for hemolytic disease if anemia and jaundice co-occur")
                protocols.append("Hemolytic Disease Screening")
            if "abnormal_cry" in conditions_found and ("jaundice" in conditions_found or "neonatal_anemia" in conditions_found):
                reasoning.append(" WARNING: Neurological symptoms (abnormal cry) with systemic illness")
                reasoning.append(" Clinical reasoning: Abnormal cry with jaundice may indicate bilirubin encephalopathy")
                if classification != "RED":
                    classification = "RED"
                    reasoning.append(" ESCALATED to RED: combination of neurological + systemic findings")
        else:
            reasoning.append(f" Single condition or no conditions — no comorbidity conflicts")

        # Step 5: Follow-up schedule
        reasoning.append(f"[STEP 5/5] Determining follow-up schedule")

        if classification == "RED":
            follow_up = "Immediate referral — reassess after higher-level care"
            reasoning.append(f" RED: Immediate referral required, no outpatient follow-up")
        elif classification == "YELLOW":
            follow_up = "Follow-up in 2-3 days, or immediately if condition worsens"
            reasoning.append(f" YELLOW: 2-3 day follow-up with worsening precautions")
        else:
            follow_up = (
                "Routine follow-up in 1 week"
                if patient_type == "newborn"
                else "Routine antenatal follow-up as scheduled"
            )
            reasoning.append(f" GREEN: Routine follow-up — {follow_up}")

        reasoning.append(f" Final WHO IMNCI classification: {classification}")
        reasoning.append(f" Protocols applied ({len(protocols)}): {', '.join(protocols)}")

        elapsed = (time.time() - start) * 1000

        result = ProtocolResult(
            classification=classification,
            applicable_protocols=protocols,
            treatment_recommendations=recommendations,
            follow_up_schedule=follow_up,
        )

        # Confidence is fixed at 1.0: deterministic guideline logic, not a
        # model prediction.
        trace = AgentResult(
            agent_name="ProtocolAgent",
            status="success",
            reasoning=reasoning,
            findings={
                "classification": classification,
                "protocols_count": len(protocols),
                "recommendations_count": len(recommendations),
                "conditions_found": conditions_found,
            },
            confidence=1.0,
            processing_time_ms=elapsed,
        )

        return result, trace
|
| 802 |
+
|
| 803 |
+
|
| 804 |
+
class ReferralAgent:
    """
    Clinical referral decision agent with structured reasoning.

    Considers:
    - Triage severity and critical danger signs
    - Protocol classification and specific condition thresholds
    - Facility capability requirements (phototherapy, transfusion, NICU)
    - Transport safety and pre-referral treatment
    - Generates structured referral note for receiving facility
    """

    def process(
        self,
        patient_type: PatientType,
        triage: TriageResult,
        protocol: ProtocolResult,
        image: ImageAnalysisResult,
        audio: Optional[AudioAnalysisResult] = None,
    ) -> tuple[ReferralResult, AgentResult]:
        """Decide whether, how urgently, and to which facility level to refer.

        Args:
            patient_type: "pregnant" or "newborn" — selects condition-specific rules.
            triage: Upstream triage output (danger signs, immediate-referral flag).
            protocol: WHO IMNCI classification and recommendations.
            image: Anemia/jaundice image findings (dict payloads, may be None).
            audio: Optional cry analysis (asphyxia risk), newborns only.

        Returns:
            (ReferralResult, AgentResult) — the decision plus a reasoning trace.
        """
        start = time.time()
        reasoning: List[str] = []
        # Decision state; rules below only ever escalate urgency/facility,
        # never downgrade (except the YELLOW no-trigger case in step 3).
        referral_needed = False
        urgency: Literal["immediate", "urgent", "routine", "none"] = "none"
        facility_level: Literal["primary", "secondary", "tertiary"] = "primary"
        reasons: List[str] = []
        pre_referral_actions: List[str] = []
        capabilities_needed: List[str] = []

        reasoning.append(f"[STEP 1/4] Evaluating referral necessity for {patient_type} patient")

        # Step 1: Evaluate critical/immediate triggers
        if triage.immediate_referral_needed:
            referral_needed = True
            urgency = "immediate"
            facility_level = "tertiary"
            reasons.append(f"Critical danger signs: {', '.join(triage.critical_signs)}")
            capabilities_needed.append("Emergency care")
            reasoning.append(f" TRIGGER: Critical danger signs ({', '.join(triage.critical_signs)}) -> IMMEDIATE referral to tertiary")

        # Step 2: Protocol-driven referral assessment
        reasoning.append(f"[STEP 2/4] Assessing condition-specific referral criteria")

        if protocol.classification == "RED":
            referral_needed = True
            # Guard: do not downgrade an already-immediate referral.
            if urgency != "immediate":
                urgency = "urgent"
            if facility_level == "primary":
                facility_level = "secondary"
            reasoning.append(f" RED classification -> referral required (minimum: urgent to secondary)")

        # Condition-specific evaluation with facility capability matching
        if patient_type == "pregnant":
            if image.anemia and image.anemia.get("is_anemic"):
                # Default 99 g/dL is a sentinel meaning "unknown/high" so the
                # severe-anemia branch below is not taken without an estimate.
                est_hb = image.anemia.get("estimated_hemoglobin", 99)
                if est_hb < 7:
                    # WHO severe anemia threshold (Hb < 7 g/dL) — transfusion care.
                    referral_needed = True
                    if urgency != "immediate":
                        urgency = "urgent"
                    facility_level = "secondary"
                    reasons.append(f"Severe anemia (est. Hb={est_hb} g/dL) — blood transfusion needed")
                    capabilities_needed.append("Blood bank / transfusion services")
                    pre_referral_actions.append("Oral iron if conscious and able to swallow")
                    pre_referral_actions.append("Keep patient warm during transport")
                    pre_referral_actions.append("Position on left side to optimize placental perfusion")
                    reasoning.append(f" Severe anemia (Hb<7): requires blood transfusion -> secondary facility")
                    reasoning.append(f" Pre-referral: oral iron, warmth, left lateral position")

        if patient_type == "newborn":
            if image.jaundice and image.jaundice.get("needs_phototherapy"):
                referral_needed = True
                if urgency != "immediate":
                    urgency = "urgent"
                if facility_level != "tertiary":
                    facility_level = "secondary"
                # Two key spellings supported for backward compatibility with
                # older jaundice payloads; falls back to 0 when absent.
                est_bili = image.jaundice.get("estimated_bilirubin_ml") or image.jaundice.get("estimated_bilirubin", 0)
                reasons.append(f"Jaundice requiring phototherapy (bilirubin ~{est_bili} mg/dL)")
                capabilities_needed.append("Phototherapy unit")
                pre_referral_actions.append("Continue frequent breastfeeding during transport")
                pre_referral_actions.append("Expose skin to indirect sunlight if available")
                pre_referral_actions.append("Keep baby warm — avoid hypothermia")
                reasoning.append(f" Phototherapy needed (bilirubin ~{est_bili} mg/dL): requires phototherapy unit -> secondary")

                if est_bili and est_bili > 20:
                    # Exchange-transfusion territory: unconditional escalation.
                    urgency = "immediate"
                    facility_level = "tertiary"
                    capabilities_needed.append("Exchange transfusion capability")
                    reasoning.append(f" Severe hyperbilirubinemia (>20 mg/dL): may need exchange transfusion -> tertiary")

            if audio and audio.cry and audio.cry.get("asphyxia_risk", 0) > 0.7:
                referral_needed = True
                urgency = "immediate"
                facility_level = "tertiary"
                reasons.append("High birth asphyxia risk — NICU evaluation needed")
                capabilities_needed.append("NICU / neonatal resuscitation")
                pre_referral_actions.append("Maintain clear airway")
                pre_referral_actions.append("Provide warmth and gentle stimulation")
                pre_referral_actions.append("Monitor breathing during transport")
                reasoning.append(f" High asphyxia risk (>70%): requires NICU -> IMMEDIATE to tertiary")

            elif audio and audio.cry and audio.cry.get("asphyxia_risk", 0) > 0.4:
                # Moderate risk only sets urgency/facility when nothing stronger
                # already triggered; the reason is still recorded either way.
                if not referral_needed:
                    referral_needed = True
                    urgency = "routine"
                    facility_level = "secondary"
                reasons.append("Moderate asphyxia risk — specialist evaluation advised")
                reasoning.append(f" Moderate asphyxia risk: specialist evaluation -> routine referral to secondary")

        # Step 3: Synthesize and verify referral decision
        reasoning.append(f"[STEP 3/4] Synthesizing referral decision")

        if protocol.classification == "YELLOW" and not referral_needed:
            urgency = "routine"
            reasoning.append(f" YELLOW classification without specific referral triggers -> routine follow-up")

        # Determine timeframe
        timeframe_map = {
            "immediate": "Within 1 hour — arrange emergency transport",
            "urgent": "Within 4-6 hours — arrange priority transport",
            "routine": "Within 24-48 hours — schedule outpatient referral",
            "none": "Not applicable — manage at current facility",
        }
        timeframe = timeframe_map[urgency]

        # Step 4: Generate referral summary
        reasoning.append(f"[STEP 4/4] Referral decision summary")
        reason_text = "; ".join(reasons) if reasons else "No referral required"

        if referral_needed:
            reasoning.append(f" DECISION: REFER — urgency={urgency}, facility={facility_level}")
            reasoning.append(f" Reasons: {reason_text}")
            reasoning.append(f" Timeframe: {timeframe}")
            if capabilities_needed:
                reasoning.append(f" Required capabilities: {', '.join(capabilities_needed)}")
            if pre_referral_actions:
                reasoning.append(f" Pre-referral actions: {'; '.join(pre_referral_actions)}")
        else:
            reasoning.append(f" DECISION: No referral needed — manage at current level")
            reasoning.append(f" Follow protocol recommendations and scheduled follow-up")

        elapsed = (time.time() - start) * 1000

        result = ReferralResult(
            referral_needed=referral_needed,
            urgency=urgency,
            facility_level=facility_level,
            reason=reason_text,
            timeframe=timeframe,
        )

        # Capabilities and pre-referral actions are surfaced via the trace
        # findings (ReferralResult has no fields for them).
        trace = AgentResult(
            agent_name="ReferralAgent",
            status="success",
            reasoning=reasoning,
            findings={
                "referral_needed": referral_needed,
                "urgency": urgency,
                "facility_level": facility_level,
                "capabilities_needed": capabilities_needed,
                "pre_referral_actions": pre_referral_actions,
            },
            confidence=1.0,
            processing_time_ms=elapsed,
        )

        return result, trace
|
| 970 |
+
|
| 971 |
+
|
| 972 |
+
class SynthesisAgent:
    """
    Clinical reasoning and synthesis using MedGemma.

    HAI-DEF Model: MedGemma (google/medgemma-4b-it)
    Reuses existing ClinicalSynthesizer instance.
    Passes full agent reasoning context to MedGemma for richer synthesis.
    """

    def __init__(self, synthesizer: Optional[Any] = None):
        # Accept an injected synthesizer, or defer construction to first use.
        self._synthesizer = synthesizer

    def _get_synthesizer(self) -> Any:
        """Return the synthesizer, lazily constructing it on first access."""
        if self._synthesizer is None:
            # Local import keeps heavy model loading off module import time.
            from .clinical_synthesizer import ClinicalSynthesizer
            self._synthesizer = ClinicalSynthesizer()
        return self._synthesizer

    def process(
        self,
        patient_type: PatientType,
        triage: TriageResult,
        image: ImageAnalysisResult,
        audio: Optional[AudioAnalysisResult],
        protocol: ProtocolResult,
        referral: ReferralResult,
        agent_traces: List[AgentResult],
    ) -> tuple[Dict[str, Any], AgentResult]:
        """Fuse all agent outputs into a clinical synthesis.

        Builds a findings payload (modality results + upstream agent context +
        a condensed reasoning summary), runs it through the synthesizer, and
        falls back to a deterministic summary if synthesis fails.

        Returns:
            (synthesis dict, AgentResult trace) pair.
        """
        start = time.time()
        notes: List[str] = ["Synthesizing all agent findings with MedGemma"]

        # Collect per-modality findings, noting which ones contributed.
        findings: Dict[str, Any] = {}
        modality_sources = (
            ("anemia", image.anemia, "Including anemia findings in synthesis"),
            ("jaundice", image.jaundice, "Including jaundice findings in synthesis"),
            ("cry", audio.cry if audio else None, "Including cry analysis findings in synthesis"),
        )
        for key, payload, note in modality_sources:
            if payload:
                findings[key] = payload
                notes.append(note)

        # Attach upstream agent context so the model sees the whole picture.
        findings["patient_info"] = {"type": patient_type}
        findings["agent_context"] = {
            "triage_score": triage.score,
            "triage_risk": triage.risk_level,
            "critical_signs": triage.critical_signs,
            "protocol_classification": protocol.classification,
            "applicable_protocols": protocol.applicable_protocols,
            "referral_needed": referral.referral_needed,
            "referral_urgency": referral.urgency,
        }

        # Condense each agent's last few reasoning steps into a prompt-sized summary.
        findings["agent_reasoning_summary"] = "\n".join(
            f"{t.agent_name}: {'; '.join(t.reasoning[-3:])}" for t in agent_traces
        )
        notes.append(f"Passing {len(agent_traces)} agent traces as context")

        try:
            synthesis = self._get_synthesizer().synthesize(findings)
            notes.append(f"Synthesis completed using: {synthesis.get('model', 'unknown')}")
            notes.append(f"Severity level: {synthesis.get('severity_level', 'N/A')}")
            notes.append(f"Referral needed: {synthesis.get('referral_needed', 'N/A')}")
            # Slightly higher confidence when the real MedGemma model answered.
            confidence = 0.85 if "MedGemma" in synthesis.get("model", "") else 0.75
        except Exception as e:
            # Deterministic fallback assembled purely from agent context.
            notes.append(f"Synthesis failed: {e}")
            synthesis = {
                "summary": f"Assessment for {patient_type} patient. Classification: {protocol.classification}.",
                "severity_level": protocol.classification,
                "severity_description": f"WHO IMNCI {protocol.classification} classification",
                "immediate_actions": protocol.treatment_recommendations or ["Continue routine care"],
                "referral_needed": referral.referral_needed,
                "referral_urgency": referral.urgency,
                "follow_up": protocol.follow_up_schedule,
                "urgent_conditions": triage.critical_signs,
                "model": "Fallback (agent context)",
                "generated_at": datetime.now().isoformat(),
            }
            confidence = 0.6

        elapsed = (time.time() - start) * 1000

        trace = AgentResult(
            agent_name="SynthesisAgent",
            status="success",
            reasoning=notes,
            findings={
                "model": synthesis.get("model", "unknown"),
                "severity_level": synthesis.get("severity_level", "unknown"),
            },
            confidence=confidence,
            processing_time_ms=elapsed,
        )

        return synthesis, trace
|
| 1076 |
+
|
| 1077 |
+
|
| 1078 |
+
# ---------------------------------------------------------------------------
|
| 1079 |
+
# Workflow Engine
|
| 1080 |
+
# ---------------------------------------------------------------------------
|
| 1081 |
+
|
| 1082 |
+
# Observer signature for workflow progress: invoked with the new state and a
# progress percentage (0-100) after each state transition.
WorkflowCallback = Callable[[WorkflowState, float], None]
|
| 1083 |
+
|
| 1084 |
+
|
| 1085 |
+
class AgenticWorkflowEngine:
    """
    Orchestrates the 6-agent clinical workflow pipeline.

    Pipeline: Triage -> Image -> Audio -> Protocol -> Referral -> Synthesis
    Early-exit on critical danger signs (RED + critical -> skip to Synthesis)

    Each agent emits a structured AgentResult with reasoning traces
    that form a complete audit trail of the clinical decision process.
    """

    # Execution order of the pipeline stages (informational).
    AGENTS = [
        "TriageAgent",
        "ImageAnalysisAgent",
        "AudioAnalysisAgent",
        "ProtocolAgent",
        "ReferralAgent",
        "SynthesisAgent",
    ]

    def __init__(
        self,
        anemia_detector: Optional[Any] = None,
        jaundice_detector: Optional[Any] = None,
        cry_analyzer: Optional[Any] = None,
        synthesizer: Optional[Any] = None,
        on_state_change: Optional[WorkflowCallback] = None,
    ):
        """Wire up the agent chain; all model dependencies are injectable.

        Args:
            anemia_detector / jaundice_detector: passed to ImageAnalysisAgent.
            cry_analyzer: passed to AudioAnalysisAgent.
            synthesizer: passed to SynthesisAgent (lazily built if None).
            on_state_change: optional progress callback (state, percent).
        """
        self._triage = TriageAgent()
        self._image = ImageAnalysisAgent(anemia_detector, jaundice_detector)
        self._audio = AudioAnalysisAgent(cry_analyzer)
        self._protocol = ProtocolAgent()
        self._referral = ReferralAgent()
        self._synthesis = SynthesisAgent(synthesizer)
        self._state: WorkflowState = "idle"
        self._on_state_change = on_state_change

    def _transition(self, state: WorkflowState, progress: float) -> None:
        """Record the new state and notify the observer, if any."""
        self._state = state
        if self._on_state_change:
            self._on_state_change(state, progress)

    @property
    def state(self) -> WorkflowState:
        # Current pipeline stage (read-only).
        return self._state

    def execute(self, workflow_input: WorkflowInput) -> WorkflowResult:
        """
        Execute the full agentic workflow pipeline.

        Args:
            workflow_input: Complete input with patient info, images, audio, danger signs.

        Returns:
            WorkflowResult with all agent outputs, reasoning traces, and clinical synthesis.
            On any agent failure, returns a fail-safe result (success=False,
            RED classification, advice to seek immediate consultation).
        """
        start = time.time()
        agent_traces: List[AgentResult] = []
        patient_type = workflow_input.patient_type

        try:
            # Step 1: Triage (10% progress)
            self._transition("triaging", 10.0)
            triage_result, triage_trace = self._triage.process(
                patient_type,
                workflow_input.danger_signs,
                workflow_input.patient_info,
            )
            agent_traces.append(triage_trace)

            # Early exit for critical cases — skip analysis, refer immediately.
            if triage_result.immediate_referral_needed:
                self._transition("complete", 100.0)
                return self._build_early_referral(
                    workflow_input, triage_result, agent_traces, start
                )

            # Step 2: Image Analysis (30% progress)
            self._transition("analyzing_image", 30.0)
            image_result, image_trace = self._image.process(
                patient_type,
                workflow_input.conjunctiva_image,
                workflow_input.skin_image,
            )
            agent_traces.append(image_trace)

            # Step 3: Audio Analysis (50% progress)
            self._transition("analyzing_audio", 50.0)
            audio_result, audio_trace = self._audio.process(
                workflow_input.cry_audio,
            )
            agent_traces.append(audio_trace)

            # Step 4: Protocol Application (70% progress)
            self._transition("applying_protocol", 70.0)
            protocol_result, protocol_trace = self._protocol.process(
                patient_type, triage_result, image_result, audio_result
            )
            agent_traces.append(protocol_trace)

            # Step 5: Referral Decision (85% progress)
            self._transition("determining_referral", 85.0)
            referral_result, referral_trace = self._referral.process(
                patient_type, triage_result, protocol_result,
                image_result, audio_result,
            )
            agent_traces.append(referral_trace)

            # Step 6: Clinical Synthesis with MedGemma (95% progress)
            self._transition("synthesizing", 95.0)
            synthesis, synthesis_trace = self._synthesis.process(
                patient_type, triage_result, image_result,
                audio_result, protocol_result, referral_result,
                agent_traces,
            )
            agent_traces.append(synthesis_trace)

            # Build final result
            self._transition("complete", 100.0)
            elapsed = (time.time() - start) * 1000

            # BUGFIX: use `or` instead of dict.get's default so an *empty*
            # immediate_actions list (key present, value []) also falls back
            # instead of raising IndexError on [0]. Mirrors the fallback style
            # already used in SynthesisAgent.
            actions = synthesis.get("immediate_actions") or []

            return WorkflowResult(
                success=True,
                patient_type=patient_type,
                who_classification=protocol_result.classification,
                triage_result=triage_result,
                image_results=image_result,
                audio_results=audio_result,
                protocol_result=protocol_result,
                referral_result=referral_result,
                clinical_synthesis=synthesis.get("summary", ""),
                recommendation=(actions or ["Continue routine care"])[0],
                immediate_actions=actions,
                agent_traces=agent_traces,
                processing_time_ms=elapsed,
                timestamp=datetime.now().isoformat(),
            )

        except Exception as e:
            # Fail safe: surface the error but recommend escalation rather
            # than silently reporting a healthy assessment.
            self._transition("error", 0.0)
            elapsed = (time.time() - start) * 1000
            error_trace = AgentResult(
                agent_name="WorkflowEngine",
                status="error",
                reasoning=[f"Workflow failed: {e}"],
                findings={"error": str(e)},
                confidence=0.0,
                processing_time_ms=elapsed,
            )
            agent_traces.append(error_trace)

            return WorkflowResult(
                success=False,
                patient_type=patient_type,
                who_classification="RED",
                agent_traces=agent_traces,
                clinical_synthesis=f"Workflow error: {e}. Please retry or seek immediate medical consultation.",
                recommendation="Seek immediate medical consultation due to assessment error",
                immediate_actions=["Seek immediate medical consultation"],
                processing_time_ms=elapsed,
                timestamp=datetime.now().isoformat(),
            )

    def _build_early_referral(
        self,
        workflow_input: WorkflowInput,
        triage: TriageResult,
        agent_traces: List[AgentResult],
        start_time: float,
    ) -> WorkflowResult:
        """Build result for early-exit when critical danger signs are detected."""
        elapsed = (time.time() - start_time) * 1000

        critical_text = ", ".join(triage.critical_signs)
        synthesis_text = (
            f"URGENT: Critical danger signs detected ({critical_text}). "
            f"Immediate referral to higher-level facility is required. "
            f"This patient requires emergency care that cannot be provided at the current level."
        )

        # Image/audio stages were skipped, so their results are empty defaults.
        return WorkflowResult(
            success=True,
            patient_type=workflow_input.patient_type,
            who_classification="RED",
            triage_result=triage,
            image_results=ImageAnalysisResult(),
            audio_results=AudioAnalysisResult(),
            protocol_result=ProtocolResult(
                classification="RED",
                applicable_protocols=["Emergency Referral Protocol"],
                treatment_recommendations=["IMMEDIATE REFERRAL REQUIRED"],
                follow_up_schedule="After emergency care",
            ),
            referral_result=ReferralResult(
                referral_needed=True,
                urgency="immediate",
                facility_level="tertiary",
                reason=f"Critical danger signs detected: {critical_text}",
                timeframe="Immediately - within 1 hour",
            ),
            clinical_synthesis=synthesis_text,
            recommendation="IMMEDIATE REFERRAL to tertiary care facility",
            immediate_actions=[
                "Arrange emergency transport",
                "Call receiving facility",
                "Provide pre-referral treatment as per protocol",
                "Accompany patient with referral note",
            ],
            agent_traces=agent_traces,
            processing_time_ms=elapsed,
            timestamp=datetime.now().isoformat(),
        )
|
src/nexus/anemia_detector.py
ADDED
|
@@ -0,0 +1,580 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
"""
|
| 2 |
+
Anemia Detector Module
|
| 3 |
+
|
| 4 |
+
Uses MedSigLIP from Google HAI-DEF for anemia detection from conjunctiva images.
|
| 5 |
+
Implements zero-shot classification with medical text prompts per NEXUS_MASTER_PLAN.md.
|
| 6 |
+
|
| 7 |
+
HAI-DEF Model: google/medsiglip-448 (MedSigLIP)
|
| 8 |
+
Documentation: https://developers.google.com/health-ai-developer-foundations/medsiglip
|
| 9 |
+
"""
|
| 10 |
+
|
| 11 |
+
import os
|
| 12 |
+
import torch
|
| 13 |
+
import torch.nn as nn
|
| 14 |
+
from PIL import Image
|
| 15 |
+
from pathlib import Path
|
| 16 |
+
from typing import Dict, List, Optional, Tuple, Union
|
| 17 |
+
import numpy as np
|
| 18 |
+
|
| 19 |
+
# Optional-dependency guard: record whether transformers is importable so the
# module can still be imported without it; AnemiaDetector.__init__ raises a
# clear ImportError when HAS_TRANSFORMERS is False.
try:
    from transformers import AutoProcessor, AutoModel
    HAS_TRANSFORMERS = True
except ImportError:
    HAS_TRANSFORMERS = False

# HAI-DEF MedSigLIP model IDs to try in order of preference
# (AnemiaDetector.__init__ iterates this list and uses the first that loads).
MEDSIGLIP_MODEL_IDS = [
    "google/medsiglip-448",  # MedSigLIP - official HAI-DEF model
    "google/siglip-base-patch16-224",  # SigLIP 224 - fallback
]
|
| 30 |
+
|
| 31 |
+
|
| 32 |
+
class AnemiaDetector:
|
| 33 |
+
"""
|
| 34 |
+
Detects anemia from conjunctiva (inner eyelid) images using MedSigLIP.
|
| 35 |
+
|
| 36 |
+
Uses zero-shot classification with medical prompts for detection.
|
| 37 |
+
HAI-DEF Model: google/medsiglip-448 (MedSigLIP)
|
| 38 |
+
Fallback: siglip-base-patch16-224
|
| 39 |
+
"""
|
| 40 |
+
|
| 41 |
+
# Medical text prompts for zero-shot classification (optimized for MedSigLIP)
|
| 42 |
+
# Expanded prompt set with specific clinical language for better discrimination
|
| 43 |
+
ANEMIC_PROMPTS = [
|
| 44 |
+
"pale conjunctiva with visible pallor indicating anemia",
|
| 45 |
+
"conjunctival pallor grade 2 or higher with reduced vascularity",
|
| 46 |
+
"white or very pale inner eyelid mucosa suggesting low hemoglobin",
|
| 47 |
+
"conjunctiva showing significant pallor and poor blood perfusion",
|
| 48 |
+
"anemic eye with pale pink to white palpebral conjunctiva",
|
| 49 |
+
"inner eyelid lacking red coloration consistent with severe anemia",
|
| 50 |
+
"conjunctiva with washed out appearance and faint vascular pattern",
|
| 51 |
+
"pale mucous membrane of the lower eyelid suggesting iron deficiency",
|
| 52 |
+
]
|
| 53 |
+
|
| 54 |
+
HEALTHY_PROMPTS = [
|
| 55 |
+
"healthy red conjunctiva with rich vascular pattern",
|
| 56 |
+
"well-perfused bright pink inner eyelid with visible blood vessels",
|
| 57 |
+
"normal conjunctiva showing deep red-pink coloration",
|
| 58 |
+
"conjunctiva with healthy blood supply and strong red color",
|
| 59 |
+
"richly vascularized palpebral conjunctiva with normal hemoglobin",
|
| 60 |
+
"inner eyelid with vibrant red-pink mucosa and clear vessels",
|
| 61 |
+
"non-anemic conjunctiva showing robust red perfusion",
|
| 62 |
+
"conjunctival mucosa with normal deep pink to red appearance",
|
| 63 |
+
]
|
| 64 |
+
|
| 65 |
+
    def __init__(
        self,
        model_name: Optional[str] = None,  # Auto-select MedSigLIP
        device: Optional[str] = None,
        threshold: float = 0.5,
    ):
        """
        Initialize the Anemia Detector with MedSigLIP.

        Tries each candidate model in order (gated MedSigLIP first, public
        SigLIP as fallback) and keeps the first one that loads successfully.

        Args:
            model_name: HuggingFace model name (auto-selects HAI-DEF MedSigLIP if None)
            device: Device to run model on (auto-detected if None)
            threshold: Classification threshold for anemia detection

        Raises:
            ImportError: if the transformers library is not installed.
            RuntimeError: if none of the candidate models could be loaded.
        """
        if not HAS_TRANSFORMERS:
            raise ImportError("transformers library required. Install with: pip install transformers")

        self.device = device or ("cuda" if torch.cuda.is_available() else "cpu")
        self.threshold = threshold
        self._model_loaded = False
        self.classifier = None  # Can be set by pipeline for trained classification

        # Determine which models to try: an explicit name overrides the
        # built-in preference list entirely (no fallback in that case).
        models_to_try = [model_name] if model_name else MEDSIGLIP_MODEL_IDS

        # HuggingFace token for gated models (MedSigLIP requires access approval).
        hf_token = os.environ.get("HF_TOKEN")

        # Try loading models in order of preference; first success wins.
        for candidate_model in models_to_try:
            print(f"Loading HAI-DEF model: {candidate_model}")
            try:
                self.processor = AutoProcessor.from_pretrained(
                    candidate_model, token=hf_token
                )
                self.model = AutoModel.from_pretrained(
                    candidate_model, token=hf_token
                ).to(self.device)
                self.model_name = candidate_model
                self._model_loaded = True
                print(f"Successfully loaded: {candidate_model}")
                break
            except Exception as e:
                # Broad catch is intentional: gated access, network, or cache
                # failures should all fall through to the next candidate.
                print(f"Warning: Could not load {candidate_model}: {e}")
                continue

        if not self._model_loaded:
            raise RuntimeError(
                f"Could not load any MedSigLIP model. Tried: {models_to_try}. "
                "Install transformers and ensure internet access."
            )

        # Inference-only usage: freeze dropout/batch-norm behavior.
        self.model.eval()

        # Pre-compute text embeddings for efficiency (done once, reused per image).
        self._precompute_text_embeddings()

        # Try to auto-load trained classifier (optional; zero-shot otherwise).
        self._auto_load_classifier()

        # Indicate which model variant is being used
        is_medsiglip = "medsiglip" in self.model_name
        model_type = "MedSigLIP" if is_medsiglip else "SigLIP (fallback)"
        classifier_status = "with trained classifier" if self.classifier else "zero-shot"
        print(f"Anemia Detector (HAI-DEF {model_type}, {classifier_status}) initialized on {self.device}")
|
| 130 |
+
|
| 131 |
+
def _auto_load_classifier(self) -> None:
|
| 132 |
+
"""Auto-load trained anemia classifier if available."""
|
| 133 |
+
if self.classifier is not None:
|
| 134 |
+
return # Already set externally
|
| 135 |
+
|
| 136 |
+
try:
|
| 137 |
+
import joblib
|
| 138 |
+
except ImportError:
|
| 139 |
+
return
|
| 140 |
+
|
| 141 |
+
default_paths = [
|
| 142 |
+
Path(__file__).parent.parent.parent / "models" / "linear_probes" / "anemia_classifier.joblib",
|
| 143 |
+
Path("models/linear_probes/anemia_classifier.joblib"),
|
| 144 |
+
]
|
| 145 |
+
|
| 146 |
+
for path in default_paths:
|
| 147 |
+
if path.exists():
|
| 148 |
+
try:
|
| 149 |
+
self.classifier = joblib.load(path)
|
| 150 |
+
print(f"Auto-loaded anemia classifier from {path}")
|
| 151 |
+
return
|
| 152 |
+
except Exception as e:
|
| 153 |
+
print(f"Warning: Could not load classifier from {path}: {e}")
|
| 154 |
+
|
| 155 |
+
# Logit temperature for softmax conversion (lower = more spread, higher = sharper)
|
| 156 |
+
LOGIT_SCALE = 30.0
|
| 157 |
+
|
| 158 |
+
def _precompute_text_embeddings(self) -> None:
|
| 159 |
+
"""Pre-compute text embeddings for zero-shot classification using SigLIP.
|
| 160 |
+
|
| 161 |
+
Stores individual prompt embeddings for max-similarity scoring,
|
| 162 |
+
which outperforms mean-pooled embeddings for medical image classification.
|
| 163 |
+
"""
|
| 164 |
+
all_prompts = self.ANEMIC_PROMPTS + self.HEALTHY_PROMPTS
|
| 165 |
+
|
| 166 |
+
with torch.no_grad():
|
| 167 |
+
# SigLIP uses different API than CLIP
|
| 168 |
+
inputs = self.processor(
|
| 169 |
+
text=all_prompts,
|
| 170 |
+
return_tensors="pt",
|
| 171 |
+
padding="max_length",
|
| 172 |
+
truncation=True,
|
| 173 |
+
).to(self.device)
|
| 174 |
+
|
| 175 |
+
# Get text embeddings - support multiple output APIs
|
| 176 |
+
if hasattr(self.model, 'get_text_features'):
|
| 177 |
+
text_embeddings = self.model.get_text_features(**inputs)
|
| 178 |
+
else:
|
| 179 |
+
outputs = self.model(**inputs)
|
| 180 |
+
if hasattr(outputs, 'text_embeds'):
|
| 181 |
+
text_embeddings = outputs.text_embeds
|
| 182 |
+
elif hasattr(outputs, 'text_model_output'):
|
| 183 |
+
text_embeddings = outputs.text_model_output.pooler_output
|
| 184 |
+
else:
|
| 185 |
+
text_outputs = self.model.text_model(**inputs)
|
| 186 |
+
text_embeddings = text_outputs.pooler_output
|
| 187 |
+
|
| 188 |
+
text_embeddings = text_embeddings / text_embeddings.norm(dim=-1, keepdim=True)
|
| 189 |
+
|
| 190 |
+
# Store individual embeddings for max-similarity scoring
|
| 191 |
+
n_anemic = len(self.ANEMIC_PROMPTS)
|
| 192 |
+
self.anemic_embeddings_all = text_embeddings[:n_anemic] # (N, D)
|
| 193 |
+
self.healthy_embeddings_all = text_embeddings[n_anemic:] # (M, D)
|
| 194 |
+
|
| 195 |
+
# Also keep mean embeddings as fallback
|
| 196 |
+
self.anemic_embeddings = self.anemic_embeddings_all.mean(dim=0, keepdim=True)
|
| 197 |
+
self.healthy_embeddings = self.healthy_embeddings_all.mean(dim=0, keepdim=True)
|
| 198 |
+
self.anemic_embeddings = self.anemic_embeddings / self.anemic_embeddings.norm(dim=-1, keepdim=True)
|
| 199 |
+
self.healthy_embeddings = self.healthy_embeddings / self.healthy_embeddings.norm(dim=-1, keepdim=True)
|
| 200 |
+
|
| 201 |
+
def preprocess_image(self, image: Union[str, Path, Image.Image]) -> Image.Image:
|
| 202 |
+
"""
|
| 203 |
+
Preprocess image for analysis.
|
| 204 |
+
|
| 205 |
+
Args:
|
| 206 |
+
image: Path to image or PIL Image
|
| 207 |
+
|
| 208 |
+
Returns:
|
| 209 |
+
Preprocessed PIL Image
|
| 210 |
+
"""
|
| 211 |
+
if isinstance(image, (str, Path)):
|
| 212 |
+
image = Image.open(image).convert("RGB")
|
| 213 |
+
elif not isinstance(image, Image.Image):
|
| 214 |
+
raise ValueError(f"Expected str, Path, or PIL Image, got {type(image)}")
|
| 215 |
+
|
| 216 |
+
return image
|
| 217 |
+
|
| 218 |
+
    def detect(self, image: Union[str, Path, Image.Image]) -> Dict:
        """
        Detect anemia from conjunctiva image.

        Uses trained classifier if available, otherwise falls back to
        zero-shot classification with MedSigLIP.

        Args:
            image: Conjunctiva image (path or PIL Image)

        Returns:
            Dictionary containing:
            - is_anemic: Boolean indicating anemia detection
            - confidence: Confidence score (0-1)
            - anemia_score: Raw anemia probability
            - healthy_score: Raw healthy probability
            - risk_level: "high", "medium", or "low"
            - recommendation: Clinical recommendation
            - model: Underlying HF model id
            - model_type: Human-readable "base model + method" label
        """
        # Preprocess image
        pil_image = self.preprocess_image(image)

        # Get image embedding using SigLIP
        with torch.no_grad():
            inputs = self.processor(
                images=pil_image,
                return_tensors="pt",
            ).to(self.device)

            # Get image embeddings - support multiple output APIs
            # (transformers versions differ in how vision features are exposed).
            if hasattr(self.model, 'get_image_features'):
                image_embedding = self.model.get_image_features(**inputs)
            else:
                outputs = self.model(**inputs)
                if hasattr(outputs, 'image_embeds'):
                    image_embedding = outputs.image_embeds
                elif hasattr(outputs, 'vision_model_output'):
                    image_embedding = outputs.vision_model_output.pooler_output
                else:
                    vision_outputs = self.model.vision_model(**inputs)
                    image_embedding = vision_outputs.pooler_output

            # L2-normalize so similarity scoring uses cosine similarity.
            image_embedding = image_embedding / image_embedding.norm(dim=-1, keepdim=True)

        # Use trained classifier if available, otherwise zero-shot
        if self.classifier is not None:
            anemia_prob, healthy_prob, model_method = self._classify_with_trained_model(image_embedding)
        else:
            anemia_prob, healthy_prob, model_method = self._classify_zero_shot(image_embedding)

        # Determine risk level.
        # NOTE(review): risk tiers use fixed 0.7/0.5 cutoffs while is_anemic
        # below uses self.threshold — confirm this asymmetry is intentional.
        if anemia_prob > 0.7:
            risk_level = "high"
            recommendation = "URGENT: Refer for blood test immediately. High likelihood of anemia."
        elif anemia_prob > 0.5:
            risk_level = "medium"
            recommendation = "Schedule blood test within 48 hours. Moderate anemia indicators present."
        else:
            risk_level = "low"
            recommendation = "No immediate concern. Routine follow-up recommended."

        # Label which backbone actually loaded (MedSigLIP vs generic SigLIP).
        is_medsiglip = "medsiglip" in self.model_name
        base_model = "MedSigLIP (HAI-DEF)" if is_medsiglip else "SigLIP (fallback)"

        return {
            "is_anemic": anemia_prob > self.threshold,
            "confidence": max(anemia_prob, healthy_prob),
            "anemia_score": anemia_prob,
            "healthy_score": healthy_prob,
            "risk_level": risk_level,
            "recommendation": recommendation,
            "model": self.model_name,
            "model_type": f"{base_model} + {model_method}",
        }
|
| 292 |
+
|
| 293 |
+
def _classify_with_trained_model(self, image_embedding: torch.Tensor) -> Tuple[float, float, str]:
|
| 294 |
+
"""
|
| 295 |
+
Classify using trained classifier on embeddings.
|
| 296 |
+
|
| 297 |
+
Args:
|
| 298 |
+
image_embedding: Normalized image embedding from MedSigLIP
|
| 299 |
+
|
| 300 |
+
Returns:
|
| 301 |
+
Tuple of (anemia_prob, healthy_prob, method_name)
|
| 302 |
+
"""
|
| 303 |
+
# Convert embedding to numpy for sklearn classifiers
|
| 304 |
+
embedding_np = image_embedding.cpu().numpy().reshape(1, -1)
|
| 305 |
+
|
| 306 |
+
# Handle different classifier types
|
| 307 |
+
if hasattr(self.classifier, 'predict_proba'):
|
| 308 |
+
# Sklearn classifier with probability support
|
| 309 |
+
proba = self.classifier.predict_proba(embedding_np)
|
| 310 |
+
# Assume binary: [healthy, anemic] or [anemic, healthy]
|
| 311 |
+
if proba.shape[1] >= 2:
|
| 312 |
+
# Check classifier classes to determine order
|
| 313 |
+
if hasattr(self.classifier, 'classes_'):
|
| 314 |
+
classes = list(self.classifier.classes_)
|
| 315 |
+
if 1 in classes:
|
| 316 |
+
anemia_idx = classes.index(1)
|
| 317 |
+
else:
|
| 318 |
+
anemia_idx = 1 # Default assumption
|
| 319 |
+
else:
|
| 320 |
+
anemia_idx = 1
|
| 321 |
+
anemia_prob = float(proba[0, anemia_idx])
|
| 322 |
+
healthy_prob = 1.0 - anemia_prob
|
| 323 |
+
else:
|
| 324 |
+
anemia_prob = float(proba[0, 0])
|
| 325 |
+
healthy_prob = 1.0 - anemia_prob
|
| 326 |
+
return anemia_prob, healthy_prob, "Trained Classifier"
|
| 327 |
+
|
| 328 |
+
elif hasattr(self.classifier, 'predict'):
|
| 329 |
+
# Classifier without probability - use binary prediction
|
| 330 |
+
prediction = self.classifier.predict(embedding_np)
|
| 331 |
+
anemia_prob = float(prediction[0])
|
| 332 |
+
healthy_prob = 1.0 - anemia_prob
|
| 333 |
+
return anemia_prob, healthy_prob, "Trained Classifier (binary)"
|
| 334 |
+
|
| 335 |
+
elif isinstance(self.classifier, nn.Module):
|
| 336 |
+
# PyTorch classifier
|
| 337 |
+
self.classifier.eval()
|
| 338 |
+
with torch.no_grad():
|
| 339 |
+
logits = self.classifier(image_embedding)
|
| 340 |
+
probs = torch.softmax(logits, dim=-1)
|
| 341 |
+
if probs.shape[-1] >= 2:
|
| 342 |
+
anemia_prob = probs[0, 1].item()
|
| 343 |
+
healthy_prob = probs[0, 0].item()
|
| 344 |
+
else:
|
| 345 |
+
anemia_prob = probs[0, 0].item()
|
| 346 |
+
healthy_prob = 1.0 - anemia_prob
|
| 347 |
+
return anemia_prob, healthy_prob, "Trained Classifier (PyTorch)"
|
| 348 |
+
|
| 349 |
+
else:
|
| 350 |
+
# Unknown classifier type - fall back to zero-shot
|
| 351 |
+
print(f"Warning: Unknown classifier type {type(self.classifier)}, using zero-shot")
|
| 352 |
+
return self._classify_zero_shot(image_embedding)
|
| 353 |
+
|
| 354 |
+
def _classify_zero_shot(self, image_embedding: torch.Tensor) -> Tuple[float, float, str]:
|
| 355 |
+
"""
|
| 356 |
+
Classify using zero-shot with max-similarity scoring.
|
| 357 |
+
|
| 358 |
+
Uses the maximum cosine similarity across all prompts per class
|
| 359 |
+
rather than mean-pooled embeddings, which provides better
|
| 360 |
+
discrimination for medical image classification.
|
| 361 |
+
|
| 362 |
+
Args:
|
| 363 |
+
image_embedding: Normalized image embedding from MedSigLIP
|
| 364 |
+
|
| 365 |
+
Returns:
|
| 366 |
+
Tuple of (anemia_prob, healthy_prob, method_name)
|
| 367 |
+
"""
|
| 368 |
+
# Max-similarity: take the best-matching prompt per class
|
| 369 |
+
anemia_sims = (image_embedding @ self.anemic_embeddings_all.T).squeeze(0)
|
| 370 |
+
healthy_sims = (image_embedding @ self.healthy_embeddings_all.T).squeeze(0)
|
| 371 |
+
|
| 372 |
+
# Ensure at least 1-D for .max() to work on single-image inputs
|
| 373 |
+
if anemia_sims.dim() == 0:
|
| 374 |
+
anemia_sims = anemia_sims.unsqueeze(0)
|
| 375 |
+
if healthy_sims.dim() == 0:
|
| 376 |
+
healthy_sims = healthy_sims.unsqueeze(0)
|
| 377 |
+
|
| 378 |
+
anemia_sim = anemia_sims.max().item()
|
| 379 |
+
healthy_sim = healthy_sims.max().item()
|
| 380 |
+
|
| 381 |
+
# Convert to probabilities with tuned temperature
|
| 382 |
+
logits = torch.tensor([anemia_sim, healthy_sim], device="cpu") * self.LOGIT_SCALE
|
| 383 |
+
probs = torch.softmax(logits, dim=0)
|
| 384 |
+
anemia_prob = probs[0].item()
|
| 385 |
+
healthy_prob = probs[1].item()
|
| 386 |
+
|
| 387 |
+
return anemia_prob, healthy_prob, "Zero-Shot"
|
| 388 |
+
|
| 389 |
+
    def detect_batch(
        self,
        images: List[Union[str, Path, Image.Image]],
        batch_size: int = 8,
    ) -> List[Dict]:
        """
        Detect anemia from multiple images.

        Embeds images in batches for throughput, then scores each embedding
        individually (trained classifier when attached, zero-shot otherwise).

        Args:
            images: List of conjunctiva images
            batch_size: Batch size for processing

        Returns:
            List of detection results (one dict per input image; note the
            batch dicts omit the "model"/"model_type" keys that detect() adds)
        """
        results = []

        for i in range(0, len(images), batch_size):
            batch = images[i:i + batch_size]

            # Process batch
            pil_images = [self.preprocess_image(img) for img in batch]

            with torch.no_grad():
                inputs = self.processor(
                    images=pil_images,
                    return_tensors="pt",
                    padding=True,
                ).to(self.device)

                # Get image embeddings - support multiple output APIs
                # (transformers versions differ in how vision features surface).
                if hasattr(self.model, 'get_image_features'):
                    image_embeddings = self.model.get_image_features(**inputs)
                else:
                    outputs = self.model(**inputs)
                    if hasattr(outputs, 'image_embeds'):
                        image_embeddings = outputs.image_embeds
                    elif hasattr(outputs, 'vision_model_output'):
                        image_embeddings = outputs.vision_model_output.pooler_output
                    else:
                        vision_outputs = self.model.vision_model(**inputs)
                        image_embeddings = vision_outputs.pooler_output

                # L2-normalize so dot products below are cosine similarities.
                image_embeddings = image_embeddings / image_embeddings.norm(dim=-1, keepdim=True)

            # Compute max-similarities for each image
            for j, img_emb in enumerate(image_embeddings):
                # Restore the (1, D) shape the classify helpers expect.
                img_emb = img_emb.unsqueeze(0)

                # Use trained classifier if available, otherwise zero-shot
                if self.classifier is not None:
                    anemia_prob, healthy_prob, _ = self._classify_with_trained_model(img_emb)
                    # Skip zero-shot path below
                    if anemia_prob > 0.7:
                        risk_level = "high"
                        recommendation = "URGENT: Refer for blood test immediately."
                    elif anemia_prob > 0.5:
                        risk_level = "medium"
                        recommendation = "Schedule blood test within 48 hours."
                    else:
                        risk_level = "low"
                        recommendation = "No immediate concern."

                    results.append({
                        "is_anemic": anemia_prob > self.threshold,
                        "confidence": max(anemia_prob, healthy_prob),
                        "anemia_score": anemia_prob,
                        "healthy_score": healthy_prob,
                        "risk_level": risk_level,
                        "recommendation": recommendation,
                    })
                    continue

                # Zero-shot: max cosine similarity across each class's prompts.
                anemia_sims = (img_emb @ self.anemic_embeddings_all.T).squeeze(0)
                healthy_sims = (img_emb @ self.healthy_embeddings_all.T).squeeze(0)

                # Keep 1-D so .max() works even with a single prompt per class.
                if anemia_sims.dim() == 0:
                    anemia_sims = anemia_sims.unsqueeze(0)
                if healthy_sims.dim() == 0:
                    healthy_sims = healthy_sims.unsqueeze(0)

                anemia_sim = anemia_sims.max().item()
                healthy_sim = healthy_sims.max().item()

                # Temperature-scaled softmax over the two class scores.
                logits = torch.tensor([anemia_sim, healthy_sim], device="cpu") * self.LOGIT_SCALE
                probs = torch.softmax(logits, dim=0)
                anemia_prob = probs[0].item()
                healthy_prob = probs[1].item()

                # Same 0.7/0.5 risk tiers as detect(), with shortened messages.
                if anemia_prob > 0.7:
                    risk_level = "high"
                    recommendation = "URGENT: Refer for blood test immediately."
                elif anemia_prob > 0.5:
                    risk_level = "medium"
                    recommendation = "Schedule blood test within 48 hours."
                else:
                    risk_level = "low"
                    recommendation = "No immediate concern."

                results.append({
                    "is_anemic": anemia_prob > self.threshold,
                    "confidence": max(anemia_prob, healthy_prob),
                    "anemia_score": anemia_prob,
                    "healthy_score": healthy_prob,
                    "risk_level": risk_level,
                    "recommendation": recommendation,
                })

        return results
|
| 498 |
+
|
| 499 |
+
def analyze_color_features(self, image: Union[str, Path, Image.Image]) -> Dict:
|
| 500 |
+
"""
|
| 501 |
+
Analyze color features of conjunctiva image.
|
| 502 |
+
|
| 503 |
+
This provides interpretable features based on medical literature
|
| 504 |
+
that correlates pallor with anemia.
|
| 505 |
+
|
| 506 |
+
Args:
|
| 507 |
+
image: Conjunctiva image
|
| 508 |
+
|
| 509 |
+
Returns:
|
| 510 |
+
Dictionary with color analysis results
|
| 511 |
+
"""
|
| 512 |
+
pil_image = self.preprocess_image(image)
|
| 513 |
+
img_array = np.array(pil_image)
|
| 514 |
+
|
| 515 |
+
# Extract RGB channels
|
| 516 |
+
r_channel = img_array[:, :, 0].astype(float)
|
| 517 |
+
g_channel = img_array[:, :, 1].astype(float)
|
| 518 |
+
b_channel = img_array[:, :, 2].astype(float)
|
| 519 |
+
|
| 520 |
+
# Calculate color statistics
|
| 521 |
+
mean_r = np.mean(r_channel)
|
| 522 |
+
mean_g = np.mean(g_channel)
|
| 523 |
+
mean_b = np.mean(b_channel)
|
| 524 |
+
|
| 525 |
+
# Red ratio (higher in healthy, lower in anemic)
|
| 526 |
+
total_intensity = mean_r + mean_g + mean_b
|
| 527 |
+
red_ratio = mean_r / total_intensity if total_intensity > 0 else 0
|
| 528 |
+
|
| 529 |
+
# Pallor index (higher means more pale/anemic)
|
| 530 |
+
# Based on reduced red-to-green ratio in anemic conjunctiva
|
| 531 |
+
pallor_index = 1 - (mean_r / (mean_g + 1e-6))
|
| 532 |
+
pallor_index = max(0, min(1, (pallor_index + 0.5) / 1.5))
|
| 533 |
+
|
| 534 |
+
# Hemoglobin estimation (rough approximation)
|
| 535 |
+
# Normal Hb: 12-16 g/dL for women, 14-18 for men
|
| 536 |
+
# This is a rough estimate based on color analysis
|
| 537 |
+
estimated_hb = 8 + (red_ratio * 12)
|
| 538 |
+
|
| 539 |
+
return {
|
| 540 |
+
"mean_red": mean_r,
|
| 541 |
+
"mean_green": mean_g,
|
| 542 |
+
"mean_blue": mean_b,
|
| 543 |
+
"red_ratio": red_ratio,
|
| 544 |
+
"pallor_index": pallor_index,
|
| 545 |
+
"estimated_hemoglobin": round(estimated_hb, 1),
|
| 546 |
+
"interpretation": "Low hemoglobin" if pallor_index > 0.5 else "Normal hemoglobin",
|
| 547 |
+
}
|
| 548 |
+
|
| 549 |
+
|
| 550 |
+
def test_detector():
    """Smoke-test the anemia detector against a few sample dataset images."""
    print("Testing Anemia Detector...")

    detector = AnemiaDetector()

    # Sample images shipped by download_datasets.py, relative to the repo root.
    data_dir = Path(__file__).parent.parent.parent / "data" / "raw" / "eyes-defy-anemia"

    if not data_dir.exists():
        print(f"Dataset not found at {data_dir}")
        print("Please run download_datasets.py first")
        return

    for img_path in list(data_dir.rglob("*.jpg"))[:3]:
        print(f"\nAnalyzing: {img_path.name}")
        result = detector.detect(img_path)
        print(f"  Anemia detected: {result['is_anemic']}")
        print(f"  Confidence: {result['confidence']:.2%}")
        print(f"  Risk level: {result['risk_level']}")
        print(f"  Recommendation: {result['recommendation']}")

        # Interpretable color-based features alongside the model output.
        color_info = detector.analyze_color_features(img_path)
        print(f"  Estimated Hb: {color_info['estimated_hemoglobin']} g/dL")
|
| 577 |
+
|
| 578 |
+
|
| 579 |
+
# Allow running this module directly as a smoke test.
if __name__ == "__main__":
    test_detector()
|
src/nexus/clinical_synthesizer.py
ADDED
|
@@ -0,0 +1,548 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
"""
|
| 2 |
+
Clinical Synthesizer Module
|
| 3 |
+
|
| 4 |
+
Uses MedGemma from Google HAI-DEF for clinical reasoning and synthesis.
|
| 5 |
+
Combines findings from MedSigLIP (images) and HeAR (audio) into actionable recommendations.
|
| 6 |
+
|
| 7 |
+
HAI-DEF Model: MedGemma 4B (google/medgemma-4b-it or google/medgemma-1.5-4b-it)
|
| 8 |
+
Supports 4-bit quantization via BitsAndBytes for low-VRAM deployment.
|
| 9 |
+
"""
|
| 10 |
+
|
| 11 |
+
import torch
|
| 12 |
+
from typing import Dict, Optional, List
|
| 13 |
+
from datetime import datetime
|
| 14 |
+
|
| 15 |
+
try:
|
| 16 |
+
from transformers import AutoTokenizer, AutoModelForCausalLM
|
| 17 |
+
HAS_TRANSFORMERS = True
|
| 18 |
+
except ImportError:
|
| 19 |
+
HAS_TRANSFORMERS = False
|
| 20 |
+
|
| 21 |
+
try:
|
| 22 |
+
from transformers import BitsAndBytesConfig
|
| 23 |
+
HAS_BITSANDBYTES = True
|
| 24 |
+
except ImportError:
|
| 25 |
+
HAS_BITSANDBYTES = False
|
| 26 |
+
|
| 27 |
+
|
| 28 |
+
class ClinicalSynthesizer:
    """
    Synthesizes clinical findings using MedGemma.

    HAI-DEF Model: MedGemma 4B (google/medgemma-4b-it or google/medgemma-1.5-4b-it)
    Method: Prompt engineering (no fine-tuning required)
    Quantization: 4-bit NF4 via BitsAndBytes for low-VRAM deployment

    Output:
        - Integrated diagnosis suggestions
        - Severity assessment (GREEN/YELLOW/RED)
        - Treatment recommendations (WHO IMNCI)
        - Referral decision with urgency
        - CHW-friendly explanations
    """

    # WHO IMNCI severity colors, mapped to short CHW-facing descriptions.
    SEVERITY_LEVELS = {
        "GREEN": "Routine care - no immediate concern",
        "YELLOW": "Close monitoring - may need referral",
        "RED": "Urgent referral - immediate action required",
    }

    # MedGemma model candidates in preference order
    MEDGEMMA_MODEL_IDS = [
        "google/medgemma-1.5-4b-it",  # Newer, better performance
        "google/medgemma-4b-it",  # Original HAI-DEF model
    ]
|
| 56 |
+
|
| 57 |
+
def __init__(
|
| 58 |
+
self,
|
| 59 |
+
model_name: Optional[str] = None,
|
| 60 |
+
device: Optional[str] = None,
|
| 61 |
+
use_medgemma: bool = True,
|
| 62 |
+
use_4bit: bool = True,
|
| 63 |
+
):
|
| 64 |
+
"""
|
| 65 |
+
Initialize the Clinical Synthesizer with MedGemma.
|
| 66 |
+
|
| 67 |
+
Args:
|
| 68 |
+
model_name: HuggingFace model name for MedGemma (auto-selects if None)
|
| 69 |
+
device: Device to run model on
|
| 70 |
+
use_medgemma: Whether to use MedGemma (True) or rule-based (False)
|
| 71 |
+
use_4bit: Whether to use 4-bit quantization (reduces VRAM from ~8GB to ~2GB)
|
| 72 |
+
"""
|
| 73 |
+
self.device = device or ("cuda" if torch.cuda.is_available() else "cpu")
|
| 74 |
+
self._user_model_name = model_name # None if user didn't specify
|
| 75 |
+
self.model_name = model_name or self.MEDGEMMA_MODEL_IDS[-1]
|
| 76 |
+
self.model = None
|
| 77 |
+
self.tokenizer = None
|
| 78 |
+
self.use_medgemma = use_medgemma
|
| 79 |
+
self.use_4bit = use_4bit
|
| 80 |
+
self._medgemma_available = False
|
| 81 |
+
|
| 82 |
+
if use_medgemma and HAS_TRANSFORMERS:
|
| 83 |
+
self._load_medgemma()
|
| 84 |
+
else:
|
| 85 |
+
print("MedGemma not available. Using rule-based clinical synthesis.")
|
| 86 |
+
self.use_medgemma = False
|
| 87 |
+
|
| 88 |
+
print(f"Clinical Synthesizer (HAI-DEF MedGemma) initialized")
|
| 89 |
+
|
| 90 |
+
    def _load_medgemma(self) -> None:
        """Load MedGemma model from HuggingFace with 4-bit quantization.

        Tries model candidates in preference order:
        1. google/medgemma-1.5-4b-it (newer, better performance)
        2. google/medgemma-4b-it (original HAI-DEF model)

        Uses BitsAndBytes NF4 quantization to reduce VRAM from ~8GB to ~2GB,
        which fixes CUDA OOM errors on consumer GPUs.

        On total failure, clears model/tokenizer and flips use_medgemma off
        so synthesize() falls back to the rule-based path.
        """
        import os
        # MedGemma is gated on the Hub; token is read from the environment.
        hf_token = os.environ.get("HF_TOKEN")
        if not hf_token:
            print("Warning: HF_TOKEN not set. MedGemma is a gated model and requires authentication.")
            print("Set HF_TOKEN environment variable with your HuggingFace token.")

        # Determine models to try — if user explicitly passed a model_name,
        # only try that one; otherwise try all candidates in preference order.
        models_to_try = [self._user_model_name] if self._user_model_name else self.MEDGEMMA_MODEL_IDS

        # Build quantization config for 4-bit loading
        # (only meaningful on CUDA with bitsandbytes installed).
        bnb_config = None
        if self.use_4bit and self.device == "cuda" and HAS_BITSANDBYTES:
            try:
                bnb_config = BitsAndBytesConfig(
                    load_in_4bit=True,
                    bnb_4bit_quant_type="nf4",
                    bnb_4bit_use_double_quant=True,
                    bnb_4bit_compute_dtype=torch.float16,
                )
                print("4-bit quantization enabled (NF4 + double quant)")
            except Exception as e:
                print(f"Warning: Could not create BitsAndBytes config: {e}")
                bnb_config = None

        for candidate_model in models_to_try:
            try:
                print(f"Loading MedGemma model: {candidate_model}")
                self.tokenizer = AutoTokenizer.from_pretrained(
                    candidate_model, token=hf_token
                )

                load_kwargs = {
                    "token": hf_token,
                    # device_map="auto" lets accelerate shard across GPUs.
                    "device_map": "auto" if self.device == "cuda" else None,
                }

                if bnb_config is not None:
                    # 4-bit quantized loading (~2GB VRAM)
                    load_kwargs["quantization_config"] = bnb_config
                else:
                    # Standard loading with fp16/fp32
                    load_kwargs["torch_dtype"] = (
                        torch.float16 if self.device == "cuda" else torch.float32
                    )

                self.model = AutoModelForCausalLM.from_pretrained(
                    candidate_model, **load_kwargs
                )

                # Quantized models manage placement themselves; only move
                # a plain CPU load explicitly.
                if self.device == "cpu" and bnb_config is None:
                    self.model = self.model.to(self.device)

                self.model_name = candidate_model
                self._medgemma_available = True
                quant_status = "4-bit NF4" if bnb_config is not None else "fp16/fp32"
                print(f"MedGemma loaded successfully: {candidate_model} ({quant_status})")
                return

            except Exception as e:
                # Try the next candidate (e.g. gated access denied, OOM).
                print(f"Warning: Could not load {candidate_model}: {e}")
                continue

        print("Could not load any MedGemma model. Falling back to rule-based synthesis.")
        self.model = None
        self.tokenizer = None
        self.use_medgemma = False
        self._medgemma_available = False
|
| 168 |
+
|
| 169 |
+
@staticmethod
|
| 170 |
+
def _sanitize(value: object) -> str:
|
| 171 |
+
"""Sanitize a value for safe inclusion in a prompt.
|
| 172 |
+
|
| 173 |
+
Strips control characters and truncates excessively long strings to
|
| 174 |
+
prevent prompt injection via adversarial findings.
|
| 175 |
+
"""
|
| 176 |
+
text = str(value) if value is not None else "N/A"
|
| 177 |
+
# Remove characters that could break prompt structure
|
| 178 |
+
text = text.replace("\x00", "").replace("\r", "")
|
| 179 |
+
# Truncate overly long values
|
| 180 |
+
if len(text) > 500:
|
| 181 |
+
text = text[:500] + "..."
|
| 182 |
+
return text
|
| 183 |
+
|
| 184 |
+
    def _build_prompt(self, findings: Dict) -> str:
        """
        Build clinical synthesis prompt for MedGemma.

        Args:
            findings: Dictionary with anemia, jaundice, cry analysis results.
                      May include 'agent_context' and 'agent_reasoning_summary'
                      when called from the agentic workflow engine.

        Returns:
            Formatted prompt for MedGemma
        """
        # Extract findings with safe defaults; free-text fields are sanitized
        # to limit prompt injection from upstream components.
        anemia = findings.get("anemia", {})
        jaundice = findings.get("jaundice", {})
        cry = findings.get("cry", {})
        symptoms = self._sanitize(findings.get("symptoms", "None reported"))
        patient_info = findings.get("patient_info", {})
        agent_context = findings.get("agent_context", {})
        agent_reasoning = self._sanitize(findings.get("agent_reasoning_summary", ""))

        prompt = f"""You are a pediatric health assistant helping community health workers in low-resource settings.

PATIENT INFORMATION:
- Age: {patient_info.get("age", "Not specified")}
- Weight: {patient_info.get("weight", "Not specified")}
- Location: {patient_info.get("location", "Rural health post")}
- Patient Type: {patient_info.get("type", "Not specified")}

ASSESSMENT FINDINGS:

1. ANEMIA SCREENING (Conjunctiva Analysis):
- Result: {"Anemia detected" if anemia.get("is_anemic") else "No anemia detected"}
- Confidence: {anemia.get("confidence", "N/A")}
- Severity: {anemia.get("severity", anemia.get("risk_level", "N/A"))}
- Estimated Hemoglobin: {anemia.get("estimated_hemoglobin", "N/A")} g/dL

2. JAUNDICE SCREENING (Skin Analysis):
- Result: {"Jaundice detected" if jaundice.get("has_jaundice") else "No jaundice detected"}
- Confidence: {jaundice.get("confidence", "N/A")}
- Severity: {jaundice.get("severity", "N/A")}
- Estimated Bilirubin: {jaundice.get("estimated_bilirubin", "N/A")} mg/dL
- Needs Phototherapy: {jaundice.get("needs_phototherapy", "N/A")}

3. CRY ANALYSIS (Audio):
- Result: {"Abnormal cry pattern" if cry.get("is_abnormal") else "Normal cry pattern"}
- Asphyxia Risk: {cry.get("asphyxia_risk", "N/A")}
- Cry Type: {cry.get("cry_type", "N/A")}

4. REPORTED SYMPTOMS:
{symptoms}
"""

        # Add agentic workflow context if available
        if agent_context:
            prompt += f"""
5. MULTI-AGENT ASSESSMENT CONTEXT:
- Triage Score: {agent_context.get("triage_score", "N/A")} (Risk: {agent_context.get("triage_risk", "N/A")})
- Critical Danger Signs: {", ".join(agent_context.get("critical_signs", [])) or "None"}
- WHO IMNCI Classification: {agent_context.get("protocol_classification", "N/A")}
- Applicable Protocols: {", ".join(agent_context.get("applicable_protocols", [])) or "N/A"}
- Referral Decision: {"YES" if agent_context.get("referral_needed") else "NO"} (Urgency: {agent_context.get("referral_urgency", "N/A")})
"""

        if agent_reasoning:
            prompt += f"""
6. AGENT REASONING TRAIL:
{agent_reasoning}
"""

        # Closing instructions: request the structured IMNCI-style sections.
        prompt += """
Based on these findings, provide a clinical assessment following WHO IMNCI protocols:

1. ASSESSMENT SUMMARY (2-3 sentences in simple language)
2. SEVERITY LEVEL (GREEN = routine care, YELLOW = close monitoring, RED = urgent referral)
3. IMMEDIATE ACTIONS for the CHW (bullet points, simple steps)
4. REFERRAL RECOMMENDATION (Yes/No, and if yes, urgency level)
5. FOLLOW-UP PLAN (when to reassess)

Use simple language appropriate for a community health worker with basic training.
Focus on actionable steps they can take immediately.
"""
        return prompt
|
| 267 |
+
|
| 268 |
+
def synthesize(self, findings: Dict) -> Dict:
    """
    Synthesize all findings into clinical recommendations.

    Dispatches to the MedGemma-backed path when the model is enabled and
    loaded; otherwise falls back to deterministic rule-based synthesis.

    Args:
        findings: Dictionary with anemia, jaundice, cry analysis results

    Returns:
        Clinical summary and recommendations
    """
    llm_ready = self.use_medgemma and self.model is not None
    if not llm_ready:
        return self._synthesize_rule_based(findings)
    return self._synthesize_with_medgemma(findings)
|
| 282 |
+
|
| 283 |
+
def _synthesize_with_medgemma(self, findings: Dict) -> Dict:
    """Synthesize using MedGemma model.

    Falls back to rule-based synthesis if generation fails (e.g. CUDA OOM,
    device-side assertion, or any other runtime error).
    """
    try:
        prompt = self._build_prompt(findings)

        encoded = self.tokenizer(prompt, return_tensors="pt", truncation=True, max_length=2048)
        # Models sharded with device_map="auto" may place the embedding
        # layer on any device; route inputs there to avoid a CPU/CUDA
        # mismatch at generation time.
        try:
            target_device = self.model.get_input_embeddings().weight.device
        except Exception:
            target_device = self.device
        encoded = {name: tensor.to(target_device) for name, tensor in encoded.items()}

        n_prompt_tokens = encoded["input_ids"].shape[-1]

        with torch.no_grad():
            generated = self.model.generate(
                **encoded,
                max_new_tokens=500,
                temperature=0.7,
                do_sample=True,
                top_p=0.9,
            )

        # Keep only the newly generated continuation (tokens after the prompt).
        completion = self.tokenizer.decode(
            generated[0][n_prompt_tokens:], skip_special_tokens=True
        ).strip()

        # Guard against empty or degenerate (very short) responses.
        if len(completion) < 20:
            return self._synthesize_rule_based(findings)

        # Human-readable model name for the report header.
        display_name = "MedGemma 1.5 4B" if "1.5" in self.model_name else "MedGemma 4B"

        return {
            "summary": completion,
            "model": display_name,
            "model_id": self.model_name,
            "generated_at": datetime.now().isoformat(),
            "findings_used": list(findings.keys()),
        }
    except Exception as e:
        print(f"MedGemma generation failed: {e}. Falling back to rule-based synthesis.")
        # Disable MedGemma to avoid repeated CUDA errors that corrupt the
        # device context and break subsequent GPU operations.
        self.use_medgemma = False
        self._medgemma_available = False
        self.model = None
        try:
            torch.cuda.empty_cache()
        except Exception:
            pass
        return self._synthesize_rule_based(findings)
|
| 345 |
+
|
| 346 |
+
def _synthesize_rule_based(self, findings: Dict) -> Dict:
|
| 347 |
+
"""
|
| 348 |
+
Rule-based clinical synthesis (fallback when MedGemma unavailable).
|
| 349 |
+
|
| 350 |
+
Follows WHO IMNCI protocols for maternal and neonatal care.
|
| 351 |
+
"""
|
| 352 |
+
# Extract findings
|
| 353 |
+
anemia = findings.get("anemia", {})
|
| 354 |
+
jaundice = findings.get("jaundice", {})
|
| 355 |
+
cry = findings.get("cry", {})
|
| 356 |
+
|
| 357 |
+
# Determine overall severity
|
| 358 |
+
severity_score = 0
|
| 359 |
+
urgent_conditions = []
|
| 360 |
+
actions = []
|
| 361 |
+
referral_needed = False
|
| 362 |
+
referral_urgency = "none"
|
| 363 |
+
|
| 364 |
+
# Assess anemia
|
| 365 |
+
if anemia.get("is_anemic"):
|
| 366 |
+
if anemia.get("risk_level") == "high":
|
| 367 |
+
severity_score += 3
|
| 368 |
+
urgent_conditions.append("Severe anemia")
|
| 369 |
+
actions.append("Refer for blood transfusion if Hb < 7 g/dL")
|
| 370 |
+
referral_needed = True
|
| 371 |
+
referral_urgency = "urgent"
|
| 372 |
+
elif anemia.get("risk_level") == "medium":
|
| 373 |
+
severity_score += 2
|
| 374 |
+
urgent_conditions.append("Moderate anemia")
|
| 375 |
+
actions.append("Start iron supplementation")
|
| 376 |
+
actions.append("Schedule blood test within 48 hours")
|
| 377 |
+
else:
|
| 378 |
+
severity_score += 1
|
| 379 |
+
actions.append("Monitor hemoglobin levels")
|
| 380 |
+
actions.append("Encourage iron-rich foods")
|
| 381 |
+
|
| 382 |
+
# Assess jaundice
|
| 383 |
+
if jaundice.get("has_jaundice"):
|
| 384 |
+
if jaundice.get("needs_phototherapy"):
|
| 385 |
+
severity_score += 3
|
| 386 |
+
urgent_conditions.append("Severe jaundice requiring phototherapy")
|
| 387 |
+
actions.append("URGENT: Start phototherapy immediately")
|
| 388 |
+
actions.append("Refer to hospital if phototherapy unavailable")
|
| 389 |
+
referral_needed = True
|
| 390 |
+
referral_urgency = "immediate"
|
| 391 |
+
elif jaundice.get("severity") in ["moderate", "severe"]:
|
| 392 |
+
severity_score += 2
|
| 393 |
+
urgent_conditions.append("Moderate jaundice")
|
| 394 |
+
actions.append("Expose baby to indirect sunlight")
|
| 395 |
+
actions.append("Ensure frequent breastfeeding")
|
| 396 |
+
actions.append("Recheck in 12-24 hours")
|
| 397 |
+
else:
|
| 398 |
+
severity_score += 1
|
| 399 |
+
actions.append("Continue breastfeeding")
|
| 400 |
+
actions.append("Monitor skin color")
|
| 401 |
+
|
| 402 |
+
# Assess cry analysis
|
| 403 |
+
if cry.get("is_abnormal"):
|
| 404 |
+
if cry.get("asphyxia_risk", 0) > 0.6:
|
| 405 |
+
severity_score += 3
|
| 406 |
+
urgent_conditions.append("Signs of birth asphyxia")
|
| 407 |
+
actions.append("URGENT: Check airway, breathing, circulation")
|
| 408 |
+
actions.append("Provide warmth and stimulation")
|
| 409 |
+
actions.append("Immediate referral for evaluation")
|
| 410 |
+
referral_needed = True
|
| 411 |
+
referral_urgency = "immediate"
|
| 412 |
+
else:
|
| 413 |
+
severity_score += 1
|
| 414 |
+
actions.append("Monitor cry patterns")
|
| 415 |
+
actions.append("Assess feeding and alertness")
|
| 416 |
+
|
| 417 |
+
# Determine overall severity level
|
| 418 |
+
if severity_score >= 5 or referral_urgency == "immediate":
|
| 419 |
+
severity_level = "RED"
|
| 420 |
+
summary = f"URGENT ATTENTION NEEDED. {', '.join(urgent_conditions)}. Immediate medical intervention required."
|
| 421 |
+
elif severity_score >= 2:
|
| 422 |
+
severity_level = "YELLOW"
|
| 423 |
+
summary = f"Close monitoring required. {', '.join(urgent_conditions) if urgent_conditions else 'Some abnormal findings detected'}. Follow recommended actions."
|
| 424 |
+
else:
|
| 425 |
+
severity_level = "GREEN"
|
| 426 |
+
summary = "Routine care. No immediate concerns detected. Continue standard monitoring."
|
| 427 |
+
|
| 428 |
+
# Default actions if none specified
|
| 429 |
+
if not actions:
|
| 430 |
+
actions = [
|
| 431 |
+
"Continue routine care",
|
| 432 |
+
"Ensure adequate nutrition",
|
| 433 |
+
"Schedule follow-up in 1 week",
|
| 434 |
+
]
|
| 435 |
+
|
| 436 |
+
# Follow-up plan
|
| 437 |
+
if severity_level == "RED":
|
| 438 |
+
follow_up = "Immediate referral. Follow up after hospital evaluation."
|
| 439 |
+
elif severity_level == "YELLOW":
|
| 440 |
+
follow_up = "Reassess in 24-48 hours. Refer if condition worsens."
|
| 441 |
+
else:
|
| 442 |
+
follow_up = "Routine follow-up in 1-2 weeks."
|
| 443 |
+
|
| 444 |
+
return {
|
| 445 |
+
"summary": summary,
|
| 446 |
+
"severity_level": severity_level,
|
| 447 |
+
"severity_description": self.SEVERITY_LEVELS[severity_level],
|
| 448 |
+
"immediate_actions": actions,
|
| 449 |
+
"referral_needed": referral_needed,
|
| 450 |
+
"referral_urgency": referral_urgency,
|
| 451 |
+
"follow_up": follow_up,
|
| 452 |
+
"urgent_conditions": urgent_conditions,
|
| 453 |
+
"model": "Rule-based (WHO IMNCI)",
|
| 454 |
+
"generated_at": datetime.now().isoformat(),
|
| 455 |
+
}
|
| 456 |
+
|
| 457 |
+
def get_who_protocol(self, condition: str) -> Dict:
    """
    Get WHO IMNCI protocol for a specific condition.

    Args:
        condition: Condition name (anemia, jaundice, asphyxia);
            matched case-insensitively.

    Returns:
        Protocol details dict, or an error payload for unknown conditions.
    """
    anemia_protocol = {
        "name": "Maternal Anemia Management",
        "source": "WHO IMNCI Guidelines",
        "steps": [
            "Assess pallor of conjunctiva, palms, and nail beds",
            "If severe pallor: Urgent referral",
            "If some pallor: Iron supplementation + folic acid",
            "Counsel on iron-rich foods",
            "Follow up in 4 weeks",
        ],
        "referral_criteria": "Hb < 7 g/dL or severe pallor with symptoms",
    }
    jaundice_protocol = {
        "name": "Neonatal Jaundice Management",
        "source": "WHO IMNCI Guidelines",
        "steps": [
            "Check for yellow skin/eyes within first 24 hours",
            "If jaundice in first 24 hours: URGENT referral",
            "If moderate jaundice: Frequent breastfeeding, sun exposure",
            "If bilirubin > 15 mg/dL: Phototherapy",
            "If bilirubin > 25 mg/dL: Exchange transfusion",
        ],
        "referral_criteria": "Jaundice < 24 hours old, bilirubin > 20 mg/dL",
    }
    asphyxia_protocol = {
        "name": "Birth Asphyxia Management",
        "source": "WHO Neonatal Resuscitation Guidelines",
        "steps": [
            "Assess APGAR score at 1 and 5 minutes",
            "Clear airway if needed",
            "Provide warmth and stimulation",
            "If not breathing: Begin resuscitation",
            "Refer for evaluation if abnormal cry or poor feeding",
        ],
        "referral_criteria": "APGAR < 7, abnormal cry, seizures, poor feeding",
    }
    lookup = {
        "anemia": anemia_protocol,
        "jaundice": jaundice_protocol,
        "asphyxia": asphyxia_protocol,
    }
    return lookup.get(condition.lower(), {"error": "Protocol not found"})
|
| 506 |
+
|
| 507 |
+
|
| 508 |
+
def test_synthesizer():
    """Smoke-test the clinical synthesizer with the rule-based backend."""
    print("Testing Clinical Synthesizer...")

    # MedGemma disabled so the test is deterministic and runs offline.
    synthesizer = ClinicalSynthesizer(use_medgemma=False)  # Use rule-based for testing

    # A mixed case: moderate anemia + mild jaundice + normal cry.
    anemia_finding = {
        "is_anemic": True,
        "confidence": 0.85,
        "risk_level": "medium",
        "estimated_hemoglobin": 9.5,
    }
    jaundice_finding = {
        "has_jaundice": True,
        "confidence": 0.75,
        "severity": "mild",
        "estimated_bilirubin": 8.5,
        "needs_phototherapy": False,
    }
    cry_finding = {
        "is_abnormal": False,
        "asphyxia_risk": 0.2,
        "cry_type": "hunger",
    }
    findings = {
        "anemia": anemia_finding,
        "jaundice": jaundice_finding,
        "cry": cry_finding,
        "symptoms": "Mother reports baby seems tired after feeding",
    }

    result = synthesizer.synthesize(findings)

    print("\n=== Clinical Synthesis Result ===")
    print(f"Summary: {result['summary']}")
    print(f"Severity: {result.get('severity_level', 'N/A')}")
    print(f"Referral Needed: {result.get('referral_needed', 'N/A')}")
    print(f"Actions: {result.get('immediate_actions', [])}")
    print(f"Follow-up: {result.get('follow_up', 'N/A')}")


if __name__ == "__main__":
    test_synthesizer()
|
src/nexus/cry_analyzer.py
ADDED
|
@@ -0,0 +1,662 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
"""
|
| 2 |
+
Cry Analyzer Module
|
| 3 |
+
|
| 4 |
+
Uses HeAR from Google HAI-DEF for infant cry analysis and birth asphyxia detection.
|
| 5 |
+
Implements embedding extraction + linear classifier per NEXUS_MASTER_PLAN.md.
|
| 6 |
+
|
| 7 |
+
HAI-DEF Model: HeAR (Health Acoustic Representations)
|
| 8 |
+
Source: https://github.com/Google-Health/google-health/tree/master/health_acoustic_representations
|
| 9 |
+
"""
|
| 10 |
+
|
| 11 |
+
import torch
|
| 12 |
+
import torch.nn as nn
|
| 13 |
+
import numpy as np
|
| 14 |
+
from pathlib import Path
|
| 15 |
+
from typing import Dict, List, Optional, Tuple, Union
|
| 16 |
+
import warnings
|
| 17 |
+
import os
|
| 18 |
+
|
| 19 |
+
try:
|
| 20 |
+
import librosa
|
| 21 |
+
import soundfile as sf
|
| 22 |
+
HAS_AUDIO = True
|
| 23 |
+
except ImportError:
|
| 24 |
+
HAS_AUDIO = False
|
| 25 |
+
|
| 26 |
+
try:
|
| 27 |
+
from sklearn.linear_model import LogisticRegression
|
| 28 |
+
import joblib
|
| 29 |
+
HAS_SKLEARN = True
|
| 30 |
+
except ImportError:
|
| 31 |
+
HAS_SKLEARN = False
|
| 32 |
+
|
| 33 |
+
# HeAR PyTorch via HuggingFace
|
| 34 |
+
try:
|
| 35 |
+
from transformers import AutoModel as HearAutoModel
|
| 36 |
+
HAS_HEAR_PYTORCH = True
|
| 37 |
+
except ImportError:
|
| 38 |
+
HAS_HEAR_PYTORCH = False
|
| 39 |
+
|
| 40 |
+
|
| 41 |
+
class CryAnalyzer:
|
| 42 |
+
"""
|
| 43 |
+
Analyzes infant cry audio for birth asphyxia detection using HeAR.
|
| 44 |
+
|
| 45 |
+
HAI-DEF Model: HeAR (google/hear-pytorch)
|
| 46 |
+
Method: Embedding extraction + acoustic feature analysis
|
| 47 |
+
|
| 48 |
+
Process:
|
| 49 |
+
1. Split audio into 2-second chunks (HeAR requirement)
|
| 50 |
+
2. Extract HeAR embeddings (512-dim per chunk)
|
| 51 |
+
3. Aggregate embeddings (mean pooling)
|
| 52 |
+
4. Classify with trained linear model or rule-based fallback
|
| 53 |
+
"""
|
| 54 |
+
|
| 55 |
+
# HeAR model configuration
|
| 56 |
+
SAMPLE_RATE = 16000 # Hz - HeAR requires 16kHz
|
| 57 |
+
CHUNK_DURATION = 2.0 # seconds - HeAR chunk size
|
| 58 |
+
CHUNK_SIZE = 32000 # samples (2 seconds at 16kHz)
|
| 59 |
+
EMBEDDING_DIM = 512 # HeAR embedding dimension
|
| 60 |
+
|
| 61 |
+
# Acoustic feature thresholds (fallback if HeAR unavailable)
|
| 62 |
+
NORMAL_F0_RANGE = (250, 450) # Hz
|
| 63 |
+
ASPHYXIA_F0_THRESHOLD = 500 # Hz - higher F0 indicates distress
|
| 64 |
+
MIN_CRY_DURATION = 0.5 # seconds
|
| 65 |
+
|
| 66 |
+
# HeAR model ID on HuggingFace (PyTorch)
|
| 67 |
+
HEAR_MODEL_ID = "google/hear-pytorch"
|
| 68 |
+
|
| 69 |
+
# Default classifier path (relative to project root)
|
| 70 |
+
DEFAULT_CLASSIFIER_PATHS = [
|
| 71 |
+
Path(__file__).parent.parent.parent / "models" / "linear_probes" / "cry_classifier.joblib",
|
| 72 |
+
Path("models/linear_probes/cry_classifier.joblib"),
|
| 73 |
+
]
|
| 74 |
+
|
| 75 |
+
# Cry type labels from trained classifier
|
| 76 |
+
CRY_TYPE_LABELS = {
|
| 77 |
+
0: "belly_pain",
|
| 78 |
+
1: "burping",
|
| 79 |
+
2: "discomfort",
|
| 80 |
+
3: "hungry",
|
| 81 |
+
4: "tired",
|
| 82 |
+
}
|
| 83 |
+
|
| 84 |
+
def __init__(
    self,
    device: Optional[str] = None,
    classifier_path: Optional[str] = None,
    use_hear: bool = True,
):
    """
    Initialize the Cry Analyzer with HeAR.

    Args:
        device: Device to run model on
        classifier_path: Path to trained linear classifier (optional, auto-detected)
        use_hear: Whether to use HeAR embeddings (True) or acoustic features (False)

    Raises:
        ImportError: If the audio stack (librosa/soundfile) is missing.
    """
    if not HAS_AUDIO:
        raise ImportError("librosa and soundfile required. Install with: pip install librosa soundfile")

    # Prefer CUDA when available unless the caller pinned a device.
    if device is None:
        device = "cuda" if torch.cuda.is_available() else "cpu"
    self.device = device
    self.classifier_path = classifier_path
    self.classifier = None
    self.hear_model = None
    self.use_hear = use_hear
    self._hear_available = False

    # Attempt HeAR load only when requested; failures fall back silently
    # to acoustic-feature mode (flag stays False).
    if use_hear:
        self._load_hear_model()

    # Trained classifier: explicit path takes precedence, then defaults.
    self._load_classifier(classifier_path)

    mode = "HeAR" if self._hear_available else "Acoustic Features (HeAR unavailable)"
    classifier_status = "with trained classifier" if self.classifier else "heuristic scoring"
    print(f"Cry Analyzer (HAI-DEF {mode}, {classifier_status}) initialized on {self.device}")
|
| 118 |
+
|
| 119 |
+
def _load_classifier(self, classifier_path: Optional[str] = None) -> None:
    """Load trained cry classifier from file.

    Searches explicit path first, then default locations. Silently does
    nothing when scikit-learn/joblib are unavailable.
    """
    if not HAS_SKLEARN:
        return

    candidates = []
    if classifier_path:
        candidates.append(Path(classifier_path))
    candidates.extend(self.DEFAULT_CLASSIFIER_PATHS)

    for candidate in candidates:
        if not candidate.exists():
            continue
        try:
            self.classifier = joblib.load(candidate)
            self.classifier_path = str(candidate)
            print(f"Loaded cry classifier from {candidate}")
            return
        except Exception as e:
            # A corrupt file shouldn't abort the search; try the next path.
            print(f"Warning: Could not load classifier from {candidate}: {e}")
|
| 141 |
+
|
| 142 |
+
def _load_hear_model(self) -> None:
    """Load HeAR model from HuggingFace (PyTorch).

    HeAR (Health Acoustic Representations) is a Google HAI-DEF model
    for health-related audio analysis. It produces 512-dimensional
    embeddings from 2-second audio chunks at 16kHz.
    """
    if not HAS_HEAR_PYTORCH:
        print("Warning: transformers not available. Install with: pip install transformers")
        print("Falling back to acoustic feature extraction (deterministic)")
        self._hear_available = False
        return

    # Gated model: an HF token may be required for download.
    token = os.environ.get("HF_TOKEN")

    try:
        print(f"Loading HeAR model from HuggingFace: {self.HEAR_MODEL_ID}")
        model = HearAutoModel.from_pretrained(
            self.HEAR_MODEL_ID,
            token=token,
            trust_remote_code=True,
        )
        self.hear_model = model.to(self.device)
        self.hear_model.eval()
        self._hear_available = True
        print("HeAR model loaded successfully (PyTorch)")

    except Exception as e:
        # Network/auth/device failures all degrade to acoustic features.
        print(f"Warning: Could not load HeAR model: {e}")
        print("Falling back to acoustic feature extraction (deterministic)")
        self.hear_model = None
        self._hear_available = False
|
| 174 |
+
|
| 175 |
+
def _split_audio_chunks(self, audio: np.ndarray) -> List[np.ndarray]:
|
| 176 |
+
"""
|
| 177 |
+
Split audio into 2-second chunks for HeAR processing.
|
| 178 |
+
|
| 179 |
+
Args:
|
| 180 |
+
audio: Audio signal array (16kHz)
|
| 181 |
+
|
| 182 |
+
Returns:
|
| 183 |
+
List of audio chunks (each 2 seconds / 32000 samples)
|
| 184 |
+
"""
|
| 185 |
+
chunks = []
|
| 186 |
+
for i in range(0, len(audio), self.CHUNK_SIZE):
|
| 187 |
+
chunk = audio[i:i + self.CHUNK_SIZE]
|
| 188 |
+
if len(chunk) < self.CHUNK_SIZE:
|
| 189 |
+
# Pad with zeros if needed
|
| 190 |
+
chunk = np.pad(chunk, (0, self.CHUNK_SIZE - len(chunk)))
|
| 191 |
+
chunks.append(chunk)
|
| 192 |
+
return chunks
|
| 193 |
+
|
| 194 |
+
def extract_hear_embeddings(self, audio: np.ndarray) -> np.ndarray:
    """
    Extract HeAR embeddings from audio using PyTorch.

    HeAR is a ViT model that expects mel-PCEN spectrograms, not raw audio.
    Pipeline: raw audio (32000 samples) -> preprocess_audio() -> (1, 1, 192, 128)
    -> ViT forward pass -> pool last_hidden_state -> embedding

    Args:
        audio: Audio signal (16kHz)

    Returns:
        Aggregated embedding (HeAR hidden_size dim, or 8-dim fallback)
    """
    if not self._hear_available or self.hear_model is None:
        # Deterministic fallback: build a small pseudo-embedding from
        # classic acoustic features (same audio always gives same vector).
        acoustic = self.extract_features(audio, self.SAMPLE_RATE)
        return np.array([
            acoustic.get("f0_mean", 0),
            acoustic.get("f0_std", 0),
            acoustic.get("f0_range", 0),
            acoustic.get("voiced_ratio", 0),
            acoustic.get("spectral_centroid_mean", 0),
            acoustic.get("spectral_bandwidth_mean", 0),
            acoustic.get("zcr_mean", 0),
            acoustic.get("rms_mean", 0),
        ])

    from .hear_preprocessing import preprocess_audio

    chunk_embeddings = []
    with torch.no_grad():
        # Process each 2-second window independently through HeAR.
        for chunk in self._split_audio_chunks(audio):
            # Raw chunk -> (1, 32000) float tensor on the model device.
            wav = torch.tensor(chunk.astype(np.float32)).unsqueeze(0).to(self.device)

            # Raw audio -> mel-PCEN spectrogram (1, 1, 192, 128).
            pixels = preprocess_audio(wav)

            # HeAR is a ViT: feed the spectrogram as pixel_values.
            out = self.hear_model(pixel_values=pixels, return_dict=True)

            # Pull an embedding from whichever output the model exposes.
            if hasattr(out, 'pooler_output') and out.pooler_output is not None:
                emb = out.pooler_output
            elif hasattr(out, 'last_hidden_state'):
                # Mean pool over sequence dimension (skip CLS token)
                emb = out.last_hidden_state[:, 1:, :].mean(dim=1)
            elif isinstance(out, torch.Tensor):
                emb = out
            else:
                emb = list(out.values())[0] if hasattr(out, 'values') else out[0]

            chunk_embeddings.append(emb.cpu().numpy().squeeze())

    # Mean-pool chunk embeddings into a single clip-level vector.
    return np.mean(chunk_embeddings, axis=0)
|
| 264 |
+
|
| 265 |
+
def load_audio(
    self,
    audio_path: Union[str, Path],
    sr: Optional[int] = None,
) -> Tuple[np.ndarray, int]:
    """
    Load an audio file, resampled to the target rate.

    Args:
        audio_path: Path to audio file
        sr: Target sample rate; defaults to SAMPLE_RATE (16 kHz) when None.
            (Fix: annotation was ``int`` with a ``None`` default — now
            correctly ``Optional[int]``.)

    Returns:
        Tuple of (audio_array, sample_rate)
    """
    sr = sr or self.SAMPLE_RATE
    # librosa resamples to `sr` on load, so the effective rate is `sr`,
    # not the file's native rate.
    audio, _file_sr = librosa.load(audio_path, sr=sr)
    return audio, sr
|
| 283 |
+
|
| 284 |
+
def extract_features(self, audio: np.ndarray, sr: int) -> Dict:
    """
    Extract acoustic features from cry audio.

    Features based on cry analysis literature:
    - Fundamental frequency (F0)
    - MFCCs (mel-frequency cepstral coefficients)
    - Spectral features
    - Temporal features

    Args:
        audio: Audio signal array
        sr: Sample rate

    Returns:
        Dictionary of extracted features
    """
    features: Dict = {}

    # Pad clips shorter than the minimum analyzable cry duration.
    min_samples = int(sr * self.MIN_CRY_DURATION)
    if len(audio) < sr * self.MIN_CRY_DURATION:
        audio = np.pad(audio, (0, min_samples - len(audio)))

    features["duration"] = len(audio) / sr

    # Fundamental frequency (F0) via probabilistic YIN; librosa warns on
    # short/unvoiced signals, so suppress those warnings locally.
    with warnings.catch_warnings():
        warnings.simplefilter("ignore")
        f0, voiced_flag, voiced_probs = librosa.pyin(
            audio,
            fmin=librosa.note_to_hz('C2'),
            fmax=librosa.note_to_hz('C7'),
            sr=sr,
        )

    # F0 statistics over voiced frames only (pyin yields NaN when unvoiced).
    voiced_f0 = f0[~np.isnan(f0)]
    if voiced_f0.size > 0:
        features["f0_mean"] = float(np.mean(voiced_f0))
        features["f0_std"] = float(np.std(voiced_f0))
        features["f0_min"] = float(np.min(voiced_f0))
        features["f0_max"] = float(np.max(voiced_f0))
        features["f0_range"] = features["f0_max"] - features["f0_min"]
    else:
        for key in ("f0_mean", "f0_std", "f0_min", "f0_max", "f0_range"):
            features[key] = 0

    # Fraction of frames that are voiced (cry vs silence).
    features["voiced_ratio"] = float(np.mean(voiced_flag))

    # 13 MFCCs, summarized by per-coefficient mean and std.
    mfccs = librosa.feature.mfcc(y=audio, sr=sr, n_mfcc=13)
    for idx in range(13):
        features[f"mfcc_{idx}_mean"] = float(np.mean(mfccs[idx]))
        features[f"mfcc_{idx}_std"] = float(np.std(mfccs[idx]))

    # Spectral shape descriptors.
    centroid = librosa.feature.spectral_centroid(y=audio, sr=sr)
    bandwidth = librosa.feature.spectral_bandwidth(y=audio, sr=sr)
    rolloff = librosa.feature.spectral_rolloff(y=audio, sr=sr)
    features["spectral_centroid_mean"] = float(np.mean(centroid))
    features["spectral_bandwidth_mean"] = float(np.mean(bandwidth))
    features["spectral_rolloff_mean"] = float(np.mean(rolloff))

    # Zero crossing rate (higher in noisy/irregular cries).
    zcr = librosa.feature.zero_crossing_rate(audio)
    features["zcr_mean"] = float(np.mean(zcr))
    features["zcr_std"] = float(np.std(zcr))

    # RMS energy envelope statistics.
    rms = librosa.feature.rms(y=audio)
    features["rms_mean"] = float(np.mean(rms))
    features["rms_std"] = float(np.std(rms))

    # Tempo estimate from the onset envelope (cry rhythm).
    onset_env = librosa.onset.onset_strength(y=audio, sr=sr)
    tempo = librosa.feature.tempo(onset_envelope=onset_env, sr=sr)
    features["tempo"] = float(tempo[0]) if len(tempo) > 0 else 0

    return features
|
| 370 |
+
|
| 371 |
+
def analyze(self, audio_path: Union[str, Path]) -> Dict:
    """Analyze a cry recording for health indicators.

    Prefers HeAR embeddings + a trained classifier; falls back to
    rule-based acoustic analysis when HeAR is unavailable.

    Args:
        audio_path: Path to the cry audio file.

    Returns:
        Dictionary containing:
        - is_abnormal: Boolean indicating abnormal cry
        - asphyxia_risk: Risk score for birth asphyxia (0-1)
        - cry_type: Detected cry type
        - features: Selected acoustic features
        - risk_level: "low", "medium", "high"
        - recommendation: Clinical recommendation
    """
    audio, sr = self.load_audio(audio_path)

    # Acoustic features are always needed (cry_type fallback + reporting).
    features = self.extract_features(audio, sr)
    cry_type = self._classify_cry_type(features)

    # Prefer the HeAR pathway whenever it can run; its classifier label
    # (when present) overrides the rule-based cry type.
    hear_ready = self._hear_available or (self.classifier is not None and HAS_SKLEARN)
    if hear_ready:
        asphyxia_risk, model_used, classified_cry_type = self._analyze_with_hear(audio)
        if classified_cry_type is not None:
            cry_type = classified_cry_type
    else:
        asphyxia_risk, model_used = self._analyze_with_rules(features)

    # Map the continuous risk score onto a triage level + recommendation.
    if asphyxia_risk > 0.6:
        risk_level, is_abnormal = "high", True
        recommendation = "URGENT: High-pitched abnormal cry detected. Assess for birth asphyxia immediately. Check APGAR score and vital signs."
    elif asphyxia_risk > 0.3:
        risk_level, is_abnormal = "medium", True
        recommendation = "CAUTION: Some abnormal cry characteristics. Monitor closely and reassess in 30 minutes."
    else:
        risk_level, is_abnormal = "low", False
        recommendation = "Normal cry pattern. Continue routine care."

    return {
        "is_abnormal": is_abnormal,
        "asphyxia_risk": round(asphyxia_risk, 3),
        "cry_type": cry_type,
        "risk_level": risk_level,
        "recommendation": recommendation,
        "features": {
            "f0_mean": round(features["f0_mean"], 1),
            "f0_std": round(features["f0_std"], 1),
            "duration": round(features["duration"], 2),
            "voiced_ratio": round(features["voiced_ratio"], 2),
        },
        "model": model_used,
        "model_note": self._get_model_note(model_used),
    }
|
| 439 |
+
|
| 440 |
+
def _analyze_with_hear(self, audio: np.ndarray) -> Tuple[float, str, Optional[str]]:
    """
    Analyze cry using HeAR embeddings.

    Args:
        audio: Audio signal array (16kHz).

    Returns:
        Tuple of (asphyxia_risk, model_name, predicted_cry_type);
        predicted_cry_type is None when no trained classifier is available.
    """
    # Extract HeAR embeddings
    embeddings = self.extract_hear_embeddings(audio)

    # Use trained classifier if available
    if self.classifier is not None and HAS_SKLEARN:
        embeddings_2d = embeddings.reshape(1, -1)

        # Multi-class cry type classification
        prediction = int(self.classifier.predict(embeddings_2d)[0])
        predicted_type = self.CRY_TYPE_LABELS.get(prediction, "unknown")

        if hasattr(self.classifier, 'predict_proba'):
            proba = self.classifier.predict_proba(embeddings_2d)[0]

            # Derive asphyxia risk from cry type probabilities.
            # NOTE(review): the original comment said "pain and belly_pain"
            # but the hard-coded indices map belly_pain -> 0 and
            # discomfort -> 2; confirm they match CRY_TYPE_LABELS ordering.
            pain_classes = {"belly_pain": 0, "discomfort": 2}
            distress_prob = sum(
                proba[idx] for name, idx in pain_classes.items()
                if idx < len(proba)
            )
            # Scale distress probability to asphyxia risk
            asphyxia_risk = min(1.0, distress_prob * 0.8)
        else:
            # No probability support: coarse risk from the hard label only.
            # (Removed the unused `confidence` locals that were assigned in
            # both branches but never read or returned.)
            asphyxia_risk = 0.5 if predicted_type in ("belly_pain", "discomfort") else 0.2

        return asphyxia_risk, "HeAR + Classifier", predicted_type

    # No classifier: use embedding-based heuristic on gross embedding stats.
    embedding_mean = float(np.mean(embeddings))
    embedding_std = float(np.std(embeddings))
    embedding_max = float(np.max(np.abs(embeddings)))

    risk_score = 0.0
    if embedding_std > 0.5:
        risk_score += 0.3
    if embedding_max > 2.0:
        risk_score += 0.2
    if abs(embedding_mean) > 0.3:
        risk_score += 0.2

    return min(risk_score, 1.0), "HeAR (uncalibrated)", None
|
| 495 |
+
|
| 496 |
+
def _analyze_with_rules(self, features: Dict) -> Tuple[float, str]:
|
| 497 |
+
"""
|
| 498 |
+
Analyze cry using rule-based acoustic features.
|
| 499 |
+
|
| 500 |
+
Fallback when HeAR is unavailable.
|
| 501 |
+
|
| 502 |
+
Args:
|
| 503 |
+
features: Extracted acoustic features
|
| 504 |
+
|
| 505 |
+
Returns:
|
| 506 |
+
Tuple of (asphyxia_risk, model_name)
|
| 507 |
+
"""
|
| 508 |
+
# Rule-based asphyxia risk assessment
|
| 509 |
+
# Based on medical literature on cry acoustics
|
| 510 |
+
asphyxia_indicators = 0
|
| 511 |
+
max_indicators = 5
|
| 512 |
+
|
| 513 |
+
# High F0 (> 500 Hz) is associated with asphyxia
|
| 514 |
+
if features["f0_mean"] > self.ASPHYXIA_F0_THRESHOLD:
|
| 515 |
+
asphyxia_indicators += 1
|
| 516 |
+
|
| 517 |
+
# High F0 variability
|
| 518 |
+
if features["f0_std"] > 100:
|
| 519 |
+
asphyxia_indicators += 1
|
| 520 |
+
|
| 521 |
+
# Wide F0 range
|
| 522 |
+
if features["f0_range"] > 300:
|
| 523 |
+
asphyxia_indicators += 1
|
| 524 |
+
|
| 525 |
+
# Low voiced ratio (fragmented cry)
|
| 526 |
+
if features["voiced_ratio"] < 0.3:
|
| 527 |
+
asphyxia_indicators += 1
|
| 528 |
+
|
| 529 |
+
# High zero crossing rate (irregular)
|
| 530 |
+
if features["zcr_mean"] > 0.15:
|
| 531 |
+
asphyxia_indicators += 1
|
| 532 |
+
|
| 533 |
+
asphyxia_risk = asphyxia_indicators / max_indicators
|
| 534 |
+
return asphyxia_risk, "Acoustic Features"
|
| 535 |
+
|
| 536 |
+
def _get_model_note(self, model_used: str) -> str:
|
| 537 |
+
"""Get descriptive note for the model used."""
|
| 538 |
+
notes = {
|
| 539 |
+
"HeAR + Classifier": "HAI-DEF HeAR embeddings with trained linear classifier",
|
| 540 |
+
"HeAR (uncalibrated)": "HAI-DEF HeAR embeddings with heuristic scoring (no trained classifier)",
|
| 541 |
+
"Acoustic Features": "Deterministic acoustic feature extraction (HeAR unavailable)",
|
| 542 |
+
}
|
| 543 |
+
return notes.get(model_used, model_used)
|
| 544 |
+
|
| 545 |
+
def _classify_cry_type(self, features: Dict) -> str:
|
| 546 |
+
"""
|
| 547 |
+
Classify cry type based on acoustic features.
|
| 548 |
+
|
| 549 |
+
Categories based on donate-a-cry corpus:
|
| 550 |
+
- hunger: Regular rhythm, moderate pitch
|
| 551 |
+
- pain: High pitch, irregular
|
| 552 |
+
- discomfort: Variable pitch, whimpering
|
| 553 |
+
- tired: Low energy, fragmented
|
| 554 |
+
- belly_pain: High pitch, straining patterns
|
| 555 |
+
"""
|
| 556 |
+
f0_mean = features["f0_mean"]
|
| 557 |
+
f0_std = features["f0_std"]
|
| 558 |
+
rms_mean = features["rms_mean"]
|
| 559 |
+
voiced_ratio = features["voiced_ratio"]
|
| 560 |
+
|
| 561 |
+
# Simple rule-based classification
|
| 562 |
+
if f0_mean > 500 and f0_std > 80:
|
| 563 |
+
return "pain"
|
| 564 |
+
elif f0_mean > 450 and rms_mean > 0.1:
|
| 565 |
+
return "belly_pain"
|
| 566 |
+
elif voiced_ratio < 0.4 and rms_mean < 0.05:
|
| 567 |
+
return "tired"
|
| 568 |
+
elif f0_std < 50 and voiced_ratio > 0.5:
|
| 569 |
+
return "hunger"
|
| 570 |
+
else:
|
| 571 |
+
return "discomfort"
|
| 572 |
+
|
| 573 |
+
def analyze_batch(
    self,
    audio_paths: List[Union[str, Path]],
) -> List[Dict]:
    """
    Analyze several cry audio files, producing one result per input path.

    Files that fail to analyze yield an error entry instead of aborting
    the whole batch.

    Args:
        audio_paths: List of paths to audio files.

    Returns:
        List of analysis results, each tagged with its "file".
    """
    outcomes: List[Dict] = []
    for audio_path in audio_paths:
        try:
            analysis = self.analyze(audio_path)
        except Exception as exc:
            analysis = {
                "file": str(audio_path),
                "error": str(exc),
                "is_abnormal": None,
            }
        else:
            analysis["file"] = str(audio_path)
        outcomes.append(analysis)
    return outcomes
|
| 599 |
+
|
| 600 |
+
def get_spectrogram(
    self,
    audio_path: Union[str, Path],
    n_mels: int = 128,
) -> np.ndarray:
    """
    Compute a mel spectrogram (dB scale) for visualization.

    Args:
        audio_path: Path to the audio file.
        n_mels: Number of mel frequency bands.

    Returns:
        Mel spectrogram array in decibels, referenced to its peak power.
    """
    waveform, sample_rate = self.load_audio(audio_path)

    power_spec = librosa.feature.melspectrogram(
        y=waveform,
        sr=sample_rate,
        n_mels=n_mels,
    )
    # Convert power to dB relative to the loudest bin.
    return librosa.power_to_db(power_spec, ref=np.max)
|
| 625 |
+
|
| 626 |
+
|
| 627 |
+
def test_analyzer():
    """Smoke-test the cry analyzer against locally available datasets."""
    print("Testing Cry Analyzer...")

    analyzer = CryAnalyzer()

    # Candidate dataset locations relative to the repository root.
    repo_root = Path(__file__).parent.parent.parent
    candidate_dirs = [
        repo_root / "data" / "raw" / "cryceleb" / "audio",
        repo_root / "data" / "raw" / "donate-a-cry",
        repo_root / "data" / "raw" / "infant-cry-dataset" / "cry",
    ]

    # Take up to two WAV files from each directory that exists.
    sample_files = []
    for candidate in candidate_dirs:
        if candidate.exists():
            sample_files.extend(list(candidate.rglob("*.wav"))[:2])

    if not sample_files:
        print("No audio files found. Please download datasets first.")
        return

    for sample in sample_files[:5]:
        print(f"\nAnalyzing: {sample.name}")
        try:
            result = analyzer.analyze(sample)
            print(f" Abnormal cry: {result['is_abnormal']}")
            print(f" Asphyxia risk: {result['asphyxia_risk']:.1%}")
            print(f" Cry type: {result['cry_type']}")
            print(f" Risk level: {result['risk_level']}")
            print(f" F0 mean: {result['features']['f0_mean']} Hz")
        except Exception as e:
            print(f" Error: {e}")
|
| 659 |
+
|
| 660 |
+
|
| 661 |
+
# Allow running this module directly as a smoke test.
if __name__ == "__main__":
    test_analyzer()
|
src/nexus/hear_preprocessing.py
ADDED
|
@@ -0,0 +1,320 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
"""
|
| 2 |
+
HeAR Audio Preprocessing Module
|
| 3 |
+
|
| 4 |
+
Converts raw audio waveforms into mel-PCEN spectrograms required by the
|
| 5 |
+
HeAR (Health Acoustic Representations) ViT model.
|
| 6 |
+
|
| 7 |
+
Pipeline: raw audio (batch, 32000) → normalize → STFT → power spectrogram
|
| 8 |
+
→ mel filterbank (128 bins) → PCEN → resize → (batch, 1, 192, 128)
|
| 9 |
+
|
| 10 |
+
Adapted from Google's official HeAR preprocessing:
|
| 11 |
+
https://github.com/Google-Health/google-health/tree/master/health_acoustic_representations
|
| 12 |
+
|
| 13 |
+
Copyright 2025 Google LLC (original implementation)
|
| 14 |
+
Licensed under the Apache License, Version 2.0
|
| 15 |
+
"""
|
| 16 |
+
|
| 17 |
+
import math
|
| 18 |
+
from typing import Callable, Optional
|
| 19 |
+
|
| 20 |
+
import torch
|
| 21 |
+
import torch.nn.functional as F
|
| 22 |
+
|
| 23 |
+
|
| 24 |
+
def _enclosing_power_of_two(value: int) -> int:
|
| 25 |
+
"""Smallest power of 2 >= value."""
|
| 26 |
+
return int(2 ** math.ceil(math.log2(value))) if value > 0 else 1
|
| 27 |
+
|
| 28 |
+
|
| 29 |
+
def _compute_stft(
    signals: torch.Tensor,
    frame_length: int,
    frame_step: int,
    fft_length: Optional[int] = None,
    window_fn: Optional[Callable[[int], torch.Tensor]] = torch.hann_window,
    pad_end: bool = True,
) -> torch.Tensor:
    """Short-time Fourier Transform.

    Args:
        signals: [..., samples] real-valued tensor.
        frame_length: Window length in samples.
        frame_step: Step size in samples.
        fft_length: FFT size (defaults to smallest power of 2 >= frame_length).
        window_fn: Window function (default: Hann).
        pad_end: Pad signal end with zeros.

    Returns:
        [..., frames, fft_length//2 + 1] complex64 tensor.
    """
    if signals.ndim < 1:
        raise ValueError(f"Input signals must have rank >= 1, got {signals.ndim}")

    if fft_length is None:
        fft_length = _enclosing_power_of_two(frame_length)

    if pad_end:
        # Mirror TF's pad_end=True semantics: append just enough zeros so
        # every frame start (multiples of frame_step) has a full window.
        n_frames = (
            math.ceil(signals.shape[-1] / frame_step)
            if signals.shape[-1] > 0
            else 0
        )
        padded_length = (
            max(0, (n_frames - 1) * frame_step + frame_length)
            if n_frames > 0
            else frame_length
        )
        padding_needed = max(0, padded_length - signals.shape[-1])
        if padding_needed > 0:
            signals = F.pad(signals, (0, padding_needed))

    # Sliding windows over the sample axis: [..., frames, frame_length].
    framed_signals = signals.unfold(-1, frame_length, frame_step)

    if framed_signals.shape[-2] == 0:
        # No complete frames: return an empty spectrogram of the right shape.
        return torch.empty(
            *signals.shape[:-1],
            0,
            fft_length // 2 + 1,
            dtype=torch.complex64,
            device=signals.device,
        )

    if window_fn is not None:
        # Apply the analysis window per frame (broadcast over frames).
        window = (
            window_fn(frame_length)
            .to(framed_signals.device)
            .to(framed_signals.dtype)
        )
        framed_signals = framed_signals * window

    # One-sided FFT over the last axis gives fft_length//2 + 1 bins.
    return torch.fft.rfft(framed_signals, n=fft_length, dim=-1)
|
| 91 |
+
|
| 92 |
+
|
| 93 |
+
def _ema(
|
| 94 |
+
inputs: torch.Tensor,
|
| 95 |
+
num_channels: int,
|
| 96 |
+
smooth_coef: float,
|
| 97 |
+
initial_state: Optional[torch.Tensor] = None,
|
| 98 |
+
) -> torch.Tensor:
|
| 99 |
+
"""Exponential Moving Average for PCEN smoothing.
|
| 100 |
+
|
| 101 |
+
Args:
|
| 102 |
+
inputs: (batch, timesteps, channels) tensor.
|
| 103 |
+
num_channels: Number of channels.
|
| 104 |
+
smooth_coef: EMA smoothing coefficient.
|
| 105 |
+
initial_state: Optional (batch, channels) initial state.
|
| 106 |
+
|
| 107 |
+
Returns:
|
| 108 |
+
(batch, timesteps, channels) EMA output.
|
| 109 |
+
"""
|
| 110 |
+
batch_size, timesteps, _ = inputs.shape
|
| 111 |
+
|
| 112 |
+
if initial_state is None:
|
| 113 |
+
ema_state = torch.zeros(
|
| 114 |
+
(batch_size, num_channels), dtype=torch.float32, device=inputs.device
|
| 115 |
+
)
|
| 116 |
+
else:
|
| 117 |
+
ema_state = initial_state
|
| 118 |
+
|
| 119 |
+
identity_kernel = (
|
| 120 |
+
torch.eye(num_channels, dtype=torch.float32, device=inputs.device)
|
| 121 |
+
* smooth_coef
|
| 122 |
+
)
|
| 123 |
+
identity_recurrent_kernel = (
|
| 124 |
+
torch.eye(num_channels, dtype=torch.float32, device=inputs.device)
|
| 125 |
+
* (1.0 - smooth_coef)
|
| 126 |
+
)
|
| 127 |
+
|
| 128 |
+
output_sequence = []
|
| 129 |
+
start = initial_state is not None
|
| 130 |
+
if start:
|
| 131 |
+
output_sequence.append(ema_state)
|
| 132 |
+
|
| 133 |
+
for t in range(start, timesteps):
|
| 134 |
+
current_input = inputs[:, t, :]
|
| 135 |
+
output = torch.matmul(current_input, identity_kernel) + torch.matmul(
|
| 136 |
+
ema_state, identity_recurrent_kernel
|
| 137 |
+
)
|
| 138 |
+
ema_state = output
|
| 139 |
+
output_sequence.append(output)
|
| 140 |
+
|
| 141 |
+
return torch.stack(output_sequence, dim=1)
|
| 142 |
+
|
| 143 |
+
|
| 144 |
+
def _pcen_function(
    inputs: torch.Tensor,
    num_channels: int = 128,
    alpha: float = 0.8,
    smooth_coef: float = 0.04,
    delta: float = 2.0,
    root: float = 2.0,
    floor: float = 1e-8,
) -> torch.Tensor:
    """Per-Channel Energy Normalization.

    See https://arxiv.org/abs/1607.05666
    """
    # Per-channel parameter vectors; alpha is clamped to <= 1 and root to
    # >= 1, as in the reference implementation.
    ones = torch.ones(num_channels).to(inputs.device).to(inputs.dtype)
    alpha_param = torch.minimum(ones * alpha, ones)
    delta_param = ones * delta
    root_param = torch.maximum(ones * root, ones)

    # Smoothed energy estimate, seeded with the first frame when available.
    ema_smoother = _ema(
        inputs,
        num_channels=num_channels,
        smooth_coef=smooth_coef,
        initial_state=inputs[:, 0] if inputs.ndim > 1 else None,
    ).to(inputs.device)

    one_over_root = 1.0 / root_param
    normalized = inputs / (floor + ema_smoother) ** alpha_param + delta_param
    return normalized ** one_over_root - delta_param ** one_over_root
|
| 179 |
+
|
| 180 |
+
|
| 181 |
+
def _hertz_to_mel(frequencies_hertz: torch.Tensor) -> torch.Tensor:
|
| 182 |
+
"""Convert Hz to mel scale."""
|
| 183 |
+
return 2595.0 * torch.log10(1.0 + frequencies_hertz / 700.0)
|
| 184 |
+
|
| 185 |
+
|
| 186 |
+
def _linear_to_mel_weight_matrix(
    device: torch.device,
    num_mel_bins: int = 128,
    num_spectrogram_bins: int = 201,
    sample_rate: float = 16000,
    lower_edge_hertz: float = 0.0,
    upper_edge_hertz: float = 8000.0,
    dtype: torch.dtype = torch.float32,
) -> torch.Tensor:
    """Build a triangular mel filterbank matrix.

    Returns:
        [num_spectrogram_bins, num_mel_bins] weight matrix; multiplying a
        linear power spectrogram by it yields mel-band energies.
    """
    zero = torch.tensor(0.0, dtype=dtype, device=device)
    # Fix: create the Nyquist scalar on the target device like every other
    # scalar here (previously it defaulted to CPU, which breaks linspace
    # when a CUDA/MPS device is requested).
    nyquist_hertz = torch.tensor(sample_rate, dtype=dtype, device=device) / 2.0
    lower_edge = torch.tensor(lower_edge_hertz, dtype=dtype, device=device)
    upper_edge = torch.tensor(upper_edge_hertz, dtype=dtype, device=device)

    # Drop the DC bin; it is re-added as an all-zero row at the end.
    bands_to_zero = 1
    linear_frequencies = torch.linspace(
        zero, nyquist_hertz, num_spectrogram_bins, dtype=dtype, device=device
    )[bands_to_zero:]
    spectrogram_bins_mel = _hertz_to_mel(linear_frequencies).unsqueeze(1)

    # num_mel_bins + 2 edges -> overlapping (lower, center, upper) triples.
    band_edges_mel = torch.linspace(
        _hertz_to_mel(lower_edge),
        _hertz_to_mel(upper_edge),
        num_mel_bins + 2,
        dtype=dtype,
        device=device,
    )
    band_edges_mel = band_edges_mel.unfold(0, 3, 1)

    lower_edge_mel = band_edges_mel[:, 0].unsqueeze(0)
    center_mel = band_edges_mel[:, 1].unsqueeze(0)
    upper_edge_mel = band_edges_mel[:, 2].unsqueeze(0)

    # Rising and falling slopes of each triangular filter.
    lower_slopes = (spectrogram_bins_mel - lower_edge_mel) / (
        center_mel - lower_edge_mel
    )
    upper_slopes = (upper_edge_mel - spectrogram_bins_mel) / (
        upper_edge_mel - center_mel
    )

    mel_weights_matrix = torch.maximum(
        zero, torch.minimum(lower_slopes, upper_slopes)
    )

    # Re-insert the zeroed DC row.
    return F.pad(
        mel_weights_matrix, (0, 0, bands_to_zero, 0), mode="constant", value=0.0
    )
|
| 234 |
+
|
| 235 |
+
|
| 236 |
+
def _torch_resize_bilinear_tf_compat(
|
| 237 |
+
images: torch.Tensor,
|
| 238 |
+
size: tuple,
|
| 239 |
+
) -> torch.Tensor:
|
| 240 |
+
"""Bilinear resize matching TF's tf.image.resize behavior.
|
| 241 |
+
|
| 242 |
+
Args:
|
| 243 |
+
images: [C, H, W] or [B, C, H, W] float tensor.
|
| 244 |
+
size: (new_height, new_width).
|
| 245 |
+
|
| 246 |
+
Returns:
|
| 247 |
+
Resized tensor with same rank as input.
|
| 248 |
+
"""
|
| 249 |
+
new_height, new_width = size
|
| 250 |
+
images = images.to(torch.float32)
|
| 251 |
+
|
| 252 |
+
was_3d = False
|
| 253 |
+
if images.dim() == 3:
|
| 254 |
+
images = images.unsqueeze(0)
|
| 255 |
+
was_3d = True
|
| 256 |
+
|
| 257 |
+
resized = F.interpolate(
|
| 258 |
+
images,
|
| 259 |
+
size=(new_height, new_width),
|
| 260 |
+
mode="bilinear",
|
| 261 |
+
align_corners=False,
|
| 262 |
+
antialias=False,
|
| 263 |
+
)
|
| 264 |
+
|
| 265 |
+
if was_3d:
|
| 266 |
+
resized = resized.squeeze(0)
|
| 267 |
+
|
| 268 |
+
return resized
|
| 269 |
+
|
| 270 |
+
|
| 271 |
+
def _mel_pcen(x: torch.Tensor) -> torch.Tensor:
    """Mel spectrogram + PCEN normalization.

    Normalizes the waveform to [-1, 1], computes a power STFT with a 25 ms
    Hann window and 10 ms hop (at 16 kHz), projects onto 128 mel bands, and
    applies PCEN.
    """
    x = x.float()
    # Scale to [-1, 1] using out-of-place ops. Fix: ``.float()`` returns the
    # SAME tensor when the input is already float32, so the previous
    # in-place ``x -= torch.min(x)`` silently mutated the caller's waveform.
    x = x - torch.min(x)
    x = x / (torch.max(x) + 1e-8)
    x = (x * 2) - 1

    frame_length = 16 * 25  # 400 samples = 25 ms at 16 kHz
    frame_step = 160  # 10 ms hop

    stft = _compute_stft(
        x,
        frame_length=frame_length,
        fft_length=frame_length,
        frame_step=frame_step,
        window_fn=torch.hann_window,
        pad_end=True,
    )
    spectrograms = torch.square(torch.abs(stft))

    mel_transform = _linear_to_mel_weight_matrix(x.device)
    mel_spectrograms = torch.matmul(spectrograms, mel_transform)
    return _pcen_function(mel_spectrograms)
|
| 295 |
+
|
| 296 |
+
|
| 297 |
+
def preprocess_audio(audio: torch.Tensor) -> torch.Tensor:
    """Convert raw audio waveform to mel-PCEN spectrogram for HeAR.

    Args:
        audio: [batch, samples] tensor. 2-second clips at 16kHz (32000
            samples). Shorter clips are zero-padded; longer clips are
            rejected.

    Returns:
        [batch, 1, 192, 128] mel-PCEN spectrogram tensor.

    Raises:
        ValueError: If the input is not rank 2 or exceeds 32000 samples.
    """
    if audio.ndim != 2:
        raise ValueError(f"Input audio must have rank 2, got rank {audio.ndim}")

    num_samples = audio.shape[1]
    if num_samples > 32000:
        raise ValueError(
            f"Input audio must have <= 32000 samples, got {audio.shape[1]}"
        )
    if num_samples < 32000:
        # Zero-pad short clips out to exactly 2 seconds.
        audio = F.pad(audio, pad=(0, 32000 - num_samples), mode="constant", value=0)

    spectrogram = _mel_pcen(audio)
    # Add channel dimension: [B, H, W] → [B, 1, H, W]
    spectrogram = torch.unsqueeze(spectrogram, dim=1)
    return _torch_resize_bilinear_tf_compat(spectrogram, size=(192, 128))
|
src/nexus/jaundice_detector.py
ADDED
|
@@ -0,0 +1,716 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
"""
|
| 2 |
+
Jaundice Detector Module
|
| 3 |
+
|
| 4 |
+
Uses MedSigLIP from Google HAI-DEF for jaundice detection from neonatal skin images.
|
| 5 |
+
Implements zero-shot classification with medical text prompts per NEXUS_MASTER_PLAN.md.
|
| 6 |
+
|
| 7 |
+
HAI-DEF Model: google/medsiglip-448 (MedSigLIP)
|
| 8 |
+
Documentation: https://developers.google.com/health-ai-developer-foundations/medsiglip
|
| 9 |
+
"""
|
| 10 |
+
|
| 11 |
+
import os
|
| 12 |
+
import torch
|
| 13 |
+
import torch.nn as nn
|
| 14 |
+
from PIL import Image
|
| 15 |
+
from pathlib import Path
|
| 16 |
+
from typing import Dict, List, Optional, Tuple, Union
|
| 17 |
+
import numpy as np
|
| 18 |
+
|
| 19 |
+
# Optional dependency: transformers is only required for the MedSigLIP
# zero-shot path; the flag lets the detector degrade gracefully without it.
try:
    from transformers import AutoProcessor, AutoModel
    HAS_TRANSFORMERS = True
except ImportError:
    HAS_TRANSFORMERS = False

# HAI-DEF MedSigLIP model IDs to try in order of preference
MEDSIGLIP_MODEL_IDS = [
    "google/medsiglip-448",  # MedSigLIP - official HAI-DEF model
    "google/siglip-base-patch16-224",  # SigLIP 224 - fallback
]
|
| 31 |
+
|
| 32 |
+
class _BilirubinRegressor(nn.Module):
|
| 33 |
+
"""3-layer MLP regression head with BatchNorm for bilirubin prediction (mg/dL).
|
| 34 |
+
|
| 35 |
+
Must match the architecture in scripts/training/finetune_bilirubin_regression.py
|
| 36 |
+
so that saved state_dict keys align.
|
| 37 |
+
"""
|
| 38 |
+
|
| 39 |
+
def __init__(self, input_dim: int = 1152, hidden_dim: int = 256):
|
| 40 |
+
super().__init__()
|
| 41 |
+
mid_dim = hidden_dim * 2 # 512
|
| 42 |
+
self.net = nn.Sequential(
|
| 43 |
+
nn.Linear(input_dim, mid_dim),
|
| 44 |
+
nn.BatchNorm1d(mid_dim),
|
| 45 |
+
nn.ReLU(),
|
| 46 |
+
nn.Dropout(0.3),
|
| 47 |
+
nn.Linear(mid_dim, hidden_dim),
|
| 48 |
+
nn.BatchNorm1d(hidden_dim),
|
| 49 |
+
nn.ReLU(),
|
| 50 |
+
nn.Dropout(0.15),
|
| 51 |
+
nn.Linear(hidden_dim, 1),
|
| 52 |
+
)
|
| 53 |
+
|
| 54 |
+
def forward(self, x: torch.Tensor) -> torch.Tensor:
|
| 55 |
+
return self.net(x).squeeze(-1)
|
| 56 |
+
|
| 57 |
+
|
| 58 |
+
class _BilirubinRegressorV1(nn.Module):
|
| 59 |
+
"""Original 2-layer MLP for backwards compatibility with older checkpoints."""
|
| 60 |
+
|
| 61 |
+
def __init__(self, input_dim: int = 1152, hidden_dim: int = 256):
|
| 62 |
+
super().__init__()
|
| 63 |
+
self.net = nn.Sequential(
|
| 64 |
+
nn.Linear(input_dim, hidden_dim),
|
| 65 |
+
nn.ReLU(),
|
| 66 |
+
nn.Dropout(0.3),
|
| 67 |
+
nn.Linear(hidden_dim, 1),
|
| 68 |
+
)
|
| 69 |
+
|
| 70 |
+
def forward(self, x: torch.Tensor) -> torch.Tensor:
|
| 71 |
+
return self.net(x).squeeze(-1)
|
| 72 |
+
|
| 73 |
+
|
| 74 |
+
class JaundiceDetector:
    """
    Detects neonatal jaundice from skin/sclera images using MedSigLIP.

    Uses zero-shot classification with medical prompts and
    color analysis for bilirubin estimation.

    HAI-DEF Model: google/medsiglip-448 (MedSigLIP)
    Fallback: siglip-base-patch16-224
    """

    # Medical text prompts for zero-shot classification (optimized for MedSigLIP)
    # Expanded with Kramer zone references, skin-tone context, severity gradation.
    # Positive ("jaundice") class; embedded once by _precompute_text_embeddings().
    JAUNDICE_PROMPTS = [
        "newborn with visible yellow discoloration of skin indicating jaundice",
        "neonatal skin showing yellow-orange pigmentation from hyperbilirubinemia",
        "jaundiced infant with icteric sclera and yellow skin tone",
        "baby with yellow skin extending to trunk and limbs Kramer zone 3",
        "neonatal jaundice with deep yellow skin requiring phototherapy",
        "newborn showing yellow staining of skin and conjunctiva from bilirubin",
        "infant with moderate to severe jaundice visible on face and chest",
        "yellow discoloration of neonatal skin consistent with elevated bilirubin",
    ]

    # Negative ("normal") class prompts describing healthy, non-icteric skin.
    NORMAL_PROMPTS = [
        "healthy newborn with normal pink skin color without jaundice",
        "infant with normal skin pigmentation and no yellow discoloration",
        "newborn baby with clear healthy skin and no icterus",
        "normal neonatal skin showing pink to brown coloration without yellowing",
        "healthy baby skin with no signs of hyperbilirubinemia",
        "newborn with well-perfused normal colored skin and clear sclera",
        "infant with healthy natural skin tone and no bilirubin staining",
        "normal newborn skin without yellow or orange discoloration",
    ]

    # Bilirubin risk thresholds (mg/dL). detect()/detect_batch() bucket the
    # estimate with strict "<" comparisons: none < low, mild < moderate,
    # moderate < high, severe < critical, else critical.
    # NOTE(review): the "exchange" threshold is defined but not used by the
    # severity logic in this file — confirm whether it is consumed elsewhere.
    BILIRUBIN_THRESHOLDS = {
        "low": 5.0,  # Normal range
        "moderate": 12.0,  # Monitor closely
        "high": 15.0,  # Consider phototherapy
        "critical": 20.0,  # Urgent phototherapy
        "exchange": 25.0,  # Exchange transfusion territory
    }
|
| 117 |
+
|
| 118 |
+
    def __init__(
        self,
        model_name: Optional[str] = None,  # Auto-select MedSigLIP
        device: Optional[str] = None,
        threshold: float = 0.5,
    ):
        """
        Initialize the Jaundice Detector with MedSigLIP.

        Args:
            model_name: HuggingFace model name (auto-selects HAI-DEF MedSigLIP if None)
            device: Device to run model on (auto-detected if None)
            threshold: Classification threshold for jaundice detection

        Raises:
            ImportError: If the transformers library is not installed.
            RuntimeError: If none of the candidate models could be loaded.
        """
        if not HAS_TRANSFORMERS:
            raise ImportError("transformers library required. Install with: pip install transformers")

        self.device = device or ("cuda" if torch.cuda.is_available() else "cpu")
        self.threshold = threshold
        self._model_loaded = False
        self.classifier = None  # Can be set by pipeline for trained classification
        self.regressor = None  # Bilirubin regression head (MedSigLIP embeddings -> mg/dL)

        # Determine which models to try: an explicit model_name wins,
        # otherwise fall back to the preference-ordered HAI-DEF list.
        models_to_try = [model_name] if model_name else MEDSIGLIP_MODEL_IDS

        # HuggingFace token for gated models (MedSigLIP requires access approval)
        hf_token = os.environ.get("HF_TOKEN")

        # Try loading models in order of preference; first success wins.
        for candidate_model in models_to_try:
            print(f"Loading HAI-DEF model: {candidate_model}")
            try:
                self.processor = AutoProcessor.from_pretrained(
                    candidate_model, token=hf_token
                )
                self.model = AutoModel.from_pretrained(
                    candidate_model, token=hf_token
                ).to(self.device)
                self.model_name = candidate_model
                self._model_loaded = True
                print(f"Successfully loaded: {candidate_model}")
                break
            except Exception as e:
                # Gated/missing/offline models fall through to the next candidate.
                print(f"Warning: Could not load {candidate_model}: {e}")
                continue

        if not self._model_loaded:
            raise RuntimeError(
                f"Could not load any MedSigLIP model. Tried: {models_to_try}. "
                "Install transformers and ensure internet access."
            )

        self.model.eval()

        # Pre-compute text embeddings (one-time cost; reused for every detect())
        self._precompute_text_embeddings()

        # Try to auto-load trained classifier from the default probe paths
        self._auto_load_classifier()

        # Try to load bilirubin regression model (optional; None when absent)
        self._load_regressor()

        # Indicate which model variant is being used
        is_medsiglip = "medsiglip" in self.model_name
        model_type = "MedSigLIP" if is_medsiglip else "SigLIP (fallback)"
        classifier_status = "trained classifier" if self.classifier else "zero-shot"
        regressor_status = "with regressor" if self.regressor else "color-based only"
        print(f"Jaundice Detector (HAI-DEF {model_type}, {classifier_status}, {regressor_status}) initialized on {self.device}")
|
| 188 |
+
|
| 189 |
+
def _auto_load_classifier(self) -> None:
|
| 190 |
+
"""Auto-load trained jaundice classifier if available."""
|
| 191 |
+
if self.classifier is not None:
|
| 192 |
+
return
|
| 193 |
+
|
| 194 |
+
try:
|
| 195 |
+
import joblib
|
| 196 |
+
except ImportError:
|
| 197 |
+
return
|
| 198 |
+
|
| 199 |
+
default_paths = [
|
| 200 |
+
Path(__file__).parent.parent.parent / "models" / "linear_probes" / "jaundice_classifier.joblib",
|
| 201 |
+
Path("models/linear_probes/jaundice_classifier.joblib"),
|
| 202 |
+
]
|
| 203 |
+
|
| 204 |
+
for path in default_paths:
|
| 205 |
+
if path.exists():
|
| 206 |
+
try:
|
| 207 |
+
self.classifier = joblib.load(path)
|
| 208 |
+
print(f"Auto-loaded jaundice classifier from {path}")
|
| 209 |
+
return
|
| 210 |
+
except Exception as e:
|
| 211 |
+
print(f"Warning: Could not load classifier from {path}: {e}")
|
| 212 |
+
|
| 213 |
+
    # Logit temperature for softmax conversion in _classify_zero_shot():
    # cosine similarities (~[-1, 1]) are multiplied by this before softmax.
    # NOTE(review): class attribute defined mid-class, between methods — legal
    # but unconventional; consider grouping with BILIRUBIN_THRESHOLDS above.
    LOGIT_SCALE = 30.0
|
| 215 |
+
|
| 216 |
+
    def _precompute_text_embeddings(self) -> None:
        """Pre-compute text embeddings for zero-shot classification using SigLIP.

        Stores individual prompt embeddings for max-similarity scoring, plus
        re-normalized per-class mean embeddings as a fallback representation.
        """
        # Jaundice prompts first, then normal prompts; split again below.
        all_prompts = self.JAUNDICE_PROMPTS + self.NORMAL_PROMPTS

        with torch.no_grad():
            # padding="max_length" — presumably matches the text encoder's
            # fixed-length training convention; confirm for non-SigLIP models.
            inputs = self.processor(
                text=all_prompts,
                return_tensors="pt",
                padding="max_length",
                truncation=True,
            ).to(self.device)

            # Get text embeddings - support multiple output APIs:
            # CLIP-style get_text_features, *ModelOutput attributes, or the
            # raw text submodule's pooler output, in that order.
            if hasattr(self.model, 'get_text_features'):
                text_embeddings = self.model.get_text_features(**inputs)
            else:
                outputs = self.model(**inputs)
                if hasattr(outputs, 'text_embeds'):
                    text_embeddings = outputs.text_embeds
                elif hasattr(outputs, 'text_model_output'):
                    text_embeddings = outputs.text_model_output.pooler_output
                else:
                    text_outputs = self.model.text_model(**inputs)
                    text_embeddings = text_outputs.pooler_output

            # L2-normalize so dot products downstream are cosine similarities.
            text_embeddings = text_embeddings / text_embeddings.norm(dim=-1, keepdim=True)

            # Store individual embeddings for max-similarity scoring
            n_jaundice = len(self.JAUNDICE_PROMPTS)
            self.jaundice_embeddings_all = text_embeddings[:n_jaundice]  # (N, D)
            self.normal_embeddings_all = text_embeddings[n_jaundice:]  # (M, D)

            # Also keep mean embeddings as fallback; re-normalized because the
            # mean of unit vectors is generally not itself unit-length.
            self.jaundice_embeddings = self.jaundice_embeddings_all.mean(dim=0, keepdim=True)
            self.normal_embeddings = self.normal_embeddings_all.mean(dim=0, keepdim=True)
            self.jaundice_embeddings = self.jaundice_embeddings / self.jaundice_embeddings.norm(dim=-1, keepdim=True)
            self.normal_embeddings = self.normal_embeddings / self.normal_embeddings.norm(dim=-1, keepdim=True)
|
| 256 |
+
|
| 257 |
+
    def _load_regressor(self) -> None:
        """Load trained bilirubin regression head if available.

        Tries the new 3-layer architecture first, falls back to V1 (2-layer).
        Leaves self.regressor as None when no compatible checkpoint is found.
        """
        model_paths = [
            Path(__file__).parent.parent.parent / "models" / "linear_probes" / "bilirubin_regressor.pt",
            Path("models/linear_probes/bilirubin_regressor.pt"),
        ]

        for model_path in model_paths:
            if model_path.exists():
                try:
                    # weights_only=True avoids unpickling arbitrary objects
                    # from an untrusted checkpoint file.
                    checkpoint = torch.load(model_path, map_location=self.device, weights_only=True)
                    input_dim = checkpoint.get("input_dim", 1152)
                    hidden_dim = checkpoint.get("hidden_dim", 256)

                    # Try new 3-layer architecture first, then fall back to V1.
                    # An architecture mismatch surfaces as RuntimeError (shape/key
                    # mismatch) or KeyError (missing "model_state_dict").
                    for RegClass in [_BilirubinRegressor, _BilirubinRegressorV1]:
                        try:
                            regressor = RegClass(input_dim, hidden_dim)
                            regressor.load_state_dict(checkpoint["model_state_dict"])
                            regressor.to(self.device)
                            regressor.eval()
                            self.regressor = regressor
                            arch = "v2 (3-layer)" if RegClass is _BilirubinRegressor else "v1 (2-layer)"
                            print(f"Bilirubin regressor ({arch}) loaded from {model_path}")
                            return
                        except (RuntimeError, KeyError):
                            continue

                    print(f"Warning: Regressor checkpoint incompatible at {model_path}")
                except Exception as e:
                    print(f"Warning: Could not load regressor from {model_path}: {e}")
        # Redundant with __init__'s default, but makes the no-checkpoint path explicit.
        self.regressor = None
|
| 292 |
+
|
| 293 |
+
def preprocess_image(self, image: Union[str, Path, Image.Image]) -> Image.Image:
|
| 294 |
+
"""Preprocess image for analysis.
|
| 295 |
+
|
| 296 |
+
Args:
|
| 297 |
+
image: Path to image file or PIL Image object.
|
| 298 |
+
|
| 299 |
+
Returns:
|
| 300 |
+
PIL Image in RGB mode.
|
| 301 |
+
|
| 302 |
+
Raises:
|
| 303 |
+
ValueError: If the input type is unsupported.
|
| 304 |
+
FileNotFoundError: If the image file does not exist.
|
| 305 |
+
"""
|
| 306 |
+
if isinstance(image, (str, Path)):
|
| 307 |
+
path = Path(image)
|
| 308 |
+
if not path.exists():
|
| 309 |
+
raise FileNotFoundError(f"Image file not found: {path}")
|
| 310 |
+
image = Image.open(path).convert("RGB")
|
| 311 |
+
elif isinstance(image, Image.Image):
|
| 312 |
+
image = image.convert("RGB")
|
| 313 |
+
else:
|
| 314 |
+
raise ValueError(f"Expected str, Path, or PIL Image, got {type(image)}")
|
| 315 |
+
return image
|
| 316 |
+
|
| 317 |
+
def estimate_bilirubin(self, image: Union[str, Path, Image.Image]) -> float:
|
| 318 |
+
"""
|
| 319 |
+
Estimate bilirubin level from image color analysis.
|
| 320 |
+
|
| 321 |
+
This uses the yellow-blue ratio which correlates with
|
| 322 |
+
transcutaneous bilirubin measurements.
|
| 323 |
+
|
| 324 |
+
Args:
|
| 325 |
+
image: Neonatal skin/sclera image
|
| 326 |
+
|
| 327 |
+
Returns:
|
| 328 |
+
Estimated bilirubin in mg/dL
|
| 329 |
+
"""
|
| 330 |
+
pil_image = self.preprocess_image(image)
|
| 331 |
+
img_array = np.array(pil_image).astype(float)
|
| 332 |
+
|
| 333 |
+
# Ensure 3-channel RGB
|
| 334 |
+
if img_array.ndim == 2:
|
| 335 |
+
img_array = np.stack([img_array, img_array, img_array], axis=-1)
|
| 336 |
+
elif img_array.shape[-1] == 1:
|
| 337 |
+
img_array = np.concatenate([img_array] * 3, axis=-1)
|
| 338 |
+
|
| 339 |
+
# Extract color channels
|
| 340 |
+
r = img_array[:, :, 0]
|
| 341 |
+
g = img_array[:, :, 1]
|
| 342 |
+
b = img_array[:, :, 2]
|
| 343 |
+
|
| 344 |
+
# Calculate yellow index (R+G-B correlation with bilirubin)
|
| 345 |
+
# Higher values indicate more yellow (jaundiced)
|
| 346 |
+
yellow_index = (r + g - b) / (r + g + b + 1e-6)
|
| 347 |
+
mean_yellow = np.mean(yellow_index)
|
| 348 |
+
|
| 349 |
+
# Convert to bilirubin estimate
|
| 350 |
+
# Calibrated based on medical literature
|
| 351 |
+
# Normal yellow_index ~ 0.2-0.3, jaundiced ~ 0.4-0.6
|
| 352 |
+
bilirubin_estimate = max(0, (mean_yellow - 0.2) * 50)
|
| 353 |
+
|
| 354 |
+
return round(bilirubin_estimate, 1)
|
| 355 |
+
|
| 356 |
+
    def detect(self, image: Union[str, Path, Image.Image]) -> Dict:
        """
        Detect jaundice from neonatal image.

        Uses trained classifier if available, otherwise falls back to
        zero-shot classification with MedSigLIP.

        Args:
            image: Neonatal skin/sclera image

        Returns:
            Dictionary containing:
            - has_jaundice: Boolean indicating jaundice detection
            - confidence: Confidence score
            - jaundice_score: Raw jaundice probability
            - estimated_bilirubin: Color-analysis estimate (mg/dL)
            - severity: "none", "mild", "moderate", "severe", "critical"
            - needs_phototherapy: Boolean
            - recommendation: Clinical recommendation
            - model / model_type / bilirubin_method: provenance metadata
            - estimated_bilirubin_ml: only present when a regressor is loaded
        """
        pil_image = self.preprocess_image(image)

        # Get image embedding using SigLIP
        with torch.no_grad():
            inputs = self.processor(images=pil_image, return_tensors="pt").to(self.device)

            # Get image embeddings - support multiple output APIs:
            # CLIP-style get_image_features, *ModelOutput attributes, or the
            # raw vision submodule's pooler output, in that order.
            if hasattr(self.model, 'get_image_features'):
                image_embedding = self.model.get_image_features(**inputs)
            else:
                outputs = self.model(**inputs)
                if hasattr(outputs, 'image_embeds'):
                    image_embedding = outputs.image_embeds
                elif hasattr(outputs, 'vision_model_output'):
                    image_embedding = outputs.vision_model_output.pooler_output
                else:
                    vision_outputs = self.model.vision_model(**inputs)
                    image_embedding = vision_outputs.pooler_output

            # L2-normalize so downstream dot products are cosine similarities.
            image_embedding = image_embedding / image_embedding.norm(dim=-1, keepdim=True)

        # Use trained classifier if available, otherwise zero-shot
        if self.classifier is not None:
            jaundice_prob, model_method = self._classify_with_trained_model(image_embedding)
        else:
            jaundice_prob, model_method = self._classify_zero_shot(image_embedding)

        # Color-based bilirubin estimate (always available)
        estimated_bilirubin = self.estimate_bilirubin(pil_image)

        # ML-based bilirubin estimate from trained regressor on MedSigLIP embeddings
        estimated_bilirubin_ml = None
        if self.regressor is not None:
            with torch.no_grad():
                bilirubin_pred = self.regressor(image_embedding)
                raw_value = float(bilirubin_pred.item())
                # Clamp to physiologically valid range (0-35 mg/dL)
                clamped_value = max(0.0, min(35.0, raw_value))
                estimated_bilirubin_ml = round(clamped_value, 1)

        # Use ML estimate for severity when available, otherwise color-based
        bilirubin_for_severity = estimated_bilirubin_ml if estimated_bilirubin_ml is not None else estimated_bilirubin

        # Determine severity based on bilirubin level; buckets come from
        # BILIRUBIN_THRESHOLDS (note: the "exchange" threshold is not used here).
        if bilirubin_for_severity < self.BILIRUBIN_THRESHOLDS["low"]:
            severity = "none"
            needs_phototherapy = False
            recommendation = "No jaundice detected. Continue routine care."
        elif bilirubin_for_severity < self.BILIRUBIN_THRESHOLDS["moderate"]:
            severity = "mild"
            needs_phototherapy = False
            recommendation = "Mild jaundice. Monitor closely and ensure adequate feeding."
        elif bilirubin_for_severity < self.BILIRUBIN_THRESHOLDS["high"]:
            severity = "moderate"
            needs_phototherapy = False
            recommendation = "Moderate jaundice. Recheck in 12-24 hours. Consider phototherapy if rising."
        elif bilirubin_for_severity < self.BILIRUBIN_THRESHOLDS["critical"]:
            severity = "severe"
            needs_phototherapy = True
            recommendation = "URGENT: Start phototherapy. Refer for serum bilirubin confirmation."
        else:
            severity = "critical"
            needs_phototherapy = True
            recommendation = "CRITICAL: Immediate phototherapy required. Consider exchange transfusion."

        is_medsiglip = "medsiglip" in self.model_name
        base_model = "MedSigLIP (HAI-DEF)" if is_medsiglip else "SigLIP (fallback)"

        result = {
            "has_jaundice": jaundice_prob > self.threshold,
            "confidence": max(jaundice_prob, 1 - jaundice_prob),
            "jaundice_score": jaundice_prob,
            "estimated_bilirubin": estimated_bilirubin,
            "severity": severity,
            "needs_phototherapy": needs_phototherapy,
            "recommendation": recommendation,
            "model": self.model_name,
            "model_type": f"{base_model} + {model_method}",
        }

        if estimated_bilirubin_ml is not None:
            result["estimated_bilirubin_ml"] = estimated_bilirubin_ml
            result["bilirubin_method"] = "MedSigLIP Regressor"
        else:
            result["bilirubin_method"] = "Color Analysis"

        return result
|
| 463 |
+
|
| 464 |
+
    def _classify_with_trained_model(self, image_embedding: torch.Tensor) -> Tuple[float, str]:
        """
        Classify using trained classifier on embeddings.

        Supports three classifier flavors, checked in order: sklearn-style
        with predict_proba, sklearn-style with predict only, and a PyTorch
        nn.Module. Anything else falls back to zero-shot classification.

        Args:
            image_embedding: Normalized image embedding from MedSigLIP

        Returns:
            Tuple of (jaundice_prob, method_name)
        """
        # Convert embedding to numpy for sklearn classifiers
        embedding_np = image_embedding.cpu().numpy().reshape(1, -1)

        # Handle different classifier types
        if hasattr(self.classifier, 'predict_proba'):
            # Sklearn classifier with probability support
            proba = self.classifier.predict_proba(embedding_np)
            # Assume binary: [normal, jaundice] or [jaundice, normal]
            if proba.shape[1] >= 2:
                # Check classifier classes to determine column order; the
                # column for label 1 is treated as jaundice-positive.
                if hasattr(self.classifier, 'classes_'):
                    classes = list(self.classifier.classes_)
                    if 1 in classes:
                        jaundice_idx = classes.index(1)
                    else:
                        jaundice_idx = 1  # Default assumption
                else:
                    jaundice_idx = 1
                jaundice_prob = float(proba[0, jaundice_idx])
            else:
                # Single-column probability output
                jaundice_prob = float(proba[0, 0])
            return jaundice_prob, "Trained Classifier"

        elif hasattr(self.classifier, 'predict'):
            # Classifier without probability - use binary prediction.
            # NOTE(review): the hard 0/1 label is reused directly as a
            # pseudo-probability, so "confidence" downstream is always 1.0.
            prediction = self.classifier.predict(embedding_np)
            jaundice_prob = float(prediction[0])
            return jaundice_prob, "Trained Classifier (binary)"

        elif isinstance(self.classifier, nn.Module):
            # PyTorch classifier: softmax over logits, column 1 = jaundice
            # when two or more outputs, otherwise the single column.
            self.classifier.eval()
            with torch.no_grad():
                logits = self.classifier(image_embedding)
                probs = torch.softmax(logits, dim=-1)
                if probs.shape[-1] >= 2:
                    jaundice_prob = probs[0, 1].item()
                else:
                    jaundice_prob = probs[0, 0].item()
            return jaundice_prob, "Trained Classifier (PyTorch)"

        else:
            # Unknown classifier type - fall back to zero-shot
            print(f"Warning: Unknown classifier type {type(self.classifier)}, using zero-shot")
            return self._classify_zero_shot(image_embedding)
|
| 519 |
+
|
| 520 |
+
def _classify_zero_shot(self, image_embedding: torch.Tensor) -> Tuple[float, str]:
|
| 521 |
+
"""
|
| 522 |
+
Classify using zero-shot with max-similarity scoring.
|
| 523 |
+
|
| 524 |
+
Uses the maximum cosine similarity across all prompts per class
|
| 525 |
+
for better discrimination.
|
| 526 |
+
|
| 527 |
+
Args:
|
| 528 |
+
image_embedding: Normalized image embedding from MedSigLIP
|
| 529 |
+
|
| 530 |
+
Returns:
|
| 531 |
+
Tuple of (jaundice_prob, method_name)
|
| 532 |
+
"""
|
| 533 |
+
# Max-similarity: best-matching prompt per class
|
| 534 |
+
jaundice_sims = (image_embedding @ self.jaundice_embeddings_all.T).squeeze(0)
|
| 535 |
+
normal_sims = (image_embedding @ self.normal_embeddings_all.T).squeeze(0)
|
| 536 |
+
|
| 537 |
+
# Ensure at least 1-D for .max() to work on single-image inputs
|
| 538 |
+
if jaundice_sims.dim() == 0:
|
| 539 |
+
jaundice_sims = jaundice_sims.unsqueeze(0)
|
| 540 |
+
if normal_sims.dim() == 0:
|
| 541 |
+
normal_sims = normal_sims.unsqueeze(0)
|
| 542 |
+
|
| 543 |
+
jaundice_sim = jaundice_sims.max().item()
|
| 544 |
+
normal_sim = normal_sims.max().item()
|
| 545 |
+
|
| 546 |
+
# Convert to probabilities with tuned temperature
|
| 547 |
+
logits = torch.tensor([jaundice_sim, normal_sim]) * self.LOGIT_SCALE
|
| 548 |
+
probs = torch.softmax(logits, dim=0)
|
| 549 |
+
jaundice_prob = probs[0].item()
|
| 550 |
+
|
| 551 |
+
return jaundice_prob, "Zero-Shot"
|
| 552 |
+
|
| 553 |
+
    def detect_batch(
        self,
        images: List[Union[str, Path, Image.Image]],
        batch_size: int = 8,
    ) -> List[Dict]:
        """Detect jaundice from multiple images.

        Mirrors detect(), embedding images batch_size at a time; the returned
        per-image dicts omit the recommendation/model-metadata fields that
        detect() includes.

        Args:
            images: Sequence of image paths or PIL Images.
            batch_size: Number of images embedded per forward pass.

        Returns:
            One result dict per input image, in input order.
        """
        results = []

        for i in range(0, len(images), batch_size):
            batch = images[i:i + batch_size]
            pil_images = [self.preprocess_image(img) for img in batch]

            with torch.no_grad():
                inputs = self.processor(images=pil_images, return_tensors="pt", padding=True).to(self.device)

                # Get image embeddings from SigLIP vision encoder.
                # NOTE(review): this fallback chain is shorter than detect()'s
                # (no image_embeds / vision_model_output branches) — confirm
                # both paths produce the same embeddings for the loaded model.
                if hasattr(self.model, 'get_image_features'):
                    image_embeddings = self.model.get_image_features(**inputs)
                else:
                    vision_outputs = self.model.vision_model(**inputs)
                    image_embeddings = vision_outputs.pooler_output

                # L2-normalize for cosine-similarity scoring downstream.
                image_embeddings = image_embeddings / image_embeddings.norm(dim=-1, keepdim=True)

            for j, (img_emb, pil_img) in enumerate(zip(image_embeddings, pil_images)):
                # Restore the batch dimension the classifiers expect.
                img_emb = img_emb.unsqueeze(0)

                # Use trained classifier if available, otherwise zero-shot
                if self.classifier is not None:
                    jaundice_prob, model_method = self._classify_with_trained_model(img_emb)
                else:
                    jaundice_prob, model_method = self._classify_zero_shot(img_emb)

                # Color-based bilirubin
                estimated_bilirubin = self.estimate_bilirubin(pil_img)

                # ML bilirubin from regressor (consistent with detect())
                estimated_bilirubin_ml = None
                if self.regressor is not None:
                    with torch.no_grad():
                        bilirubin_pred = self.regressor(img_emb)
                        raw_value = float(bilirubin_pred.item())
                        estimated_bilirubin_ml = round(max(0.0, min(35.0, raw_value)), 1)

                bilirubin_for_severity = estimated_bilirubin_ml if estimated_bilirubin_ml is not None else estimated_bilirubin

                # Same severity buckets as detect(), without recommendation text.
                if bilirubin_for_severity < self.BILIRUBIN_THRESHOLDS["low"]:
                    severity, needs_phototherapy = "none", False
                elif bilirubin_for_severity < self.BILIRUBIN_THRESHOLDS["moderate"]:
                    severity, needs_phototherapy = "mild", False
                elif bilirubin_for_severity < self.BILIRUBIN_THRESHOLDS["high"]:
                    severity, needs_phototherapy = "moderate", False
                elif bilirubin_for_severity < self.BILIRUBIN_THRESHOLDS["critical"]:
                    severity, needs_phototherapy = "severe", True
                else:
                    severity, needs_phototherapy = "critical", True

                result_item = {
                    "has_jaundice": jaundice_prob > self.threshold,
                    "confidence": max(jaundice_prob, 1 - jaundice_prob),
                    "jaundice_score": jaundice_prob,
                    "estimated_bilirubin": estimated_bilirubin,
                    "severity": severity,
                    "needs_phototherapy": needs_phototherapy,
                }
                if estimated_bilirubin_ml is not None:
                    result_item["estimated_bilirubin_ml"] = estimated_bilirubin_ml
                results.append(result_item)

        return results
|
| 623 |
+
|
| 624 |
+
def analyze_kramer_zones(self, image: Union[str, Path, Image.Image]) -> Dict:
|
| 625 |
+
"""
|
| 626 |
+
Analyze jaundice using Kramer's zones concept.
|
| 627 |
+
|
| 628 |
+
Kramer's zones estimate bilirubin based on cephalocaudal progression:
|
| 629 |
+
- Zone 1 (face): ~5-6 mg/dL
|
| 630 |
+
- Zone 2 (chest): ~9 mg/dL
|
| 631 |
+
- Zone 3 (abdomen): ~12 mg/dL
|
| 632 |
+
- Zone 4 (arms/legs): ~15 mg/dL
|
| 633 |
+
- Zone 5 (hands/feet): ~20+ mg/dL
|
| 634 |
+
|
| 635 |
+
Args:
|
| 636 |
+
image: Full body or partial neonatal image
|
| 637 |
+
|
| 638 |
+
Returns:
|
| 639 |
+
Dictionary with zone analysis
|
| 640 |
+
"""
|
| 641 |
+
pil_image = self.preprocess_image(image)
|
| 642 |
+
img_array = np.array(pil_image).astype(float)
|
| 643 |
+
|
| 644 |
+
# Simple color-based zone estimation
|
| 645 |
+
r = img_array[:, :, 0]
|
| 646 |
+
g = img_array[:, :, 1]
|
| 647 |
+
b = img_array[:, :, 2]
|
| 648 |
+
|
| 649 |
+
yellow_index = np.mean((r + g - b) / (r + g + b + 1e-6))
|
| 650 |
+
|
| 651 |
+
# Map yellow index to Kramer zone
|
| 652 |
+
if yellow_index < 0.25:
|
| 653 |
+
zone = 0
|
| 654 |
+
zone_bilirubin = 3
|
| 655 |
+
elif yellow_index < 0.30:
|
| 656 |
+
zone = 1
|
| 657 |
+
zone_bilirubin = 6
|
| 658 |
+
elif yellow_index < 0.35:
|
| 659 |
+
zone = 2
|
| 660 |
+
zone_bilirubin = 9
|
| 661 |
+
elif yellow_index < 0.40:
|
| 662 |
+
zone = 3
|
| 663 |
+
zone_bilirubin = 12
|
| 664 |
+
elif yellow_index < 0.45:
|
| 665 |
+
zone = 4
|
| 666 |
+
zone_bilirubin = 15
|
| 667 |
+
else:
|
| 668 |
+
zone = 5
|
| 669 |
+
zone_bilirubin = 20
|
| 670 |
+
|
| 671 |
+
return {
|
| 672 |
+
"kramer_zone": zone,
|
| 673 |
+
"zone_description": self._get_zone_description(zone),
|
| 674 |
+
"estimated_bilirubin_by_zone": zone_bilirubin,
|
| 675 |
+
"yellow_index": round(yellow_index, 3),
|
| 676 |
+
}
|
| 677 |
+
|
| 678 |
+
def _get_zone_description(self, zone: int) -> str:
|
| 679 |
+
"""Get description for Kramer zone."""
|
| 680 |
+
descriptions = {
|
| 681 |
+
0: "No visible jaundice",
|
| 682 |
+
1: "Face and neck (Zone 1)",
|
| 683 |
+
2: "Upper trunk (Zone 2)",
|
| 684 |
+
3: "Lower trunk and thighs (Zone 3)",
|
| 685 |
+
4: "Arms and lower legs (Zone 4)",
|
| 686 |
+
5: "Hands and feet (Zone 5) - Severe",
|
| 687 |
+
}
|
| 688 |
+
return descriptions.get(zone, "Unknown")
|
| 689 |
+
|
| 690 |
+
|
| 691 |
+
def test_detector():
    """Test the jaundice detector with sample images."""
    print("Testing Jaundice Detector...")

    detector = JaundiceDetector()

    data_dir = Path(__file__).parent.parent.parent / "data" / "raw" / "neojaundice" / "images"

    # Guard clause: bail out early when the sample dataset is absent.
    if not data_dir.exists():
        print(f"Dataset not found at {data_dir}")
        return

    # Analyze up to three sample images and print each report.
    for img_path in list(data_dir.glob("*.jpg"))[:3]:
        print(f"\nAnalyzing: {img_path.name}")
        report = detector.detect(img_path)
        print(f"  Jaundice detected: {report['has_jaundice']}")
        print(f"  Confidence: {report['confidence']:.2%}")
        print(f"  Estimated bilirubin: {report['estimated_bilirubin']} mg/dL")
        print(f"  Severity: {report['severity']}")
        print(f"  Needs phototherapy: {report['needs_phototherapy']}")
        print(f"  Recommendation: {report['recommendation']}")


if __name__ == "__main__":
    test_detector()
|
src/nexus/pipeline.py
ADDED
|
@@ -0,0 +1,663 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
"""
|
| 2 |
+
NEXUS Pipeline Module
|
| 3 |
+
|
| 4 |
+
Integrates all detection modules into a unified diagnostic pipeline
|
| 5 |
+
for maternal-neonatal care.
|
| 6 |
+
"""
|
| 7 |
+
|
| 8 |
+
import json
from dataclasses import dataclass, field
from datetime import datetime
from pathlib import Path
from typing import Dict, List, Optional, Union
|
| 13 |
+
|
| 14 |
+
|
| 15 |
+
@dataclass
class PatientInfo:
    """Patient demographics and context for one assessment.

    Used for both mothers and neonates; the neonate-specific fields
    (age_days, gestational_age, birth_weight) are left as None for
    maternal patients.
    """
    patient_id: str
    age_days: Optional[int] = None  # Age in days (neonates only)
    gestational_age: Optional[int] = None  # Gestational age in weeks
    birth_weight: Optional[int] = None  # Birth weight in grams
    gender: Optional[str] = None
    is_maternal: bool = False  # True for mother, False for neonate
|
| 24 |
+
|
| 25 |
+
|
| 26 |
+
@dataclass
class AssessmentResult:
    """Complete assessment result for a single patient encounter.

    The per-modality results hold the raw detector output dicts and
    stay None when that modality was not run.
    """
    patient: PatientInfo
    timestamp: str  # ISO-8601 time the assessment was produced
    anemia_result: Optional[Dict] = None
    jaundice_result: Optional[Dict] = None
    cry_result: Optional[Dict] = None
    overall_risk: str = "unknown"  # "low" | "medium" | "high" | "unknown"
    # BUG FIX: the default was None, contradicting the List[str] annotation
    # and crashing any `.append()` on a default-constructed instance; a
    # default_factory gives each instance its own fresh list (never use a
    # shared mutable default).
    priority_actions: List[str] = field(default_factory=list)
    referral_needed: bool = False
|
| 37 |
+
|
| 38 |
+
|
| 39 |
+
class NEXUSPipeline:
    """
    NEXUS Integrated Diagnostic Pipeline

    Combines anemia, jaundice, and cry analysis into a unified
    assessment workflow for maternal-neonatal care.
    """

    # Default paths for trained model checkpoints, resolved relative to
    # this file (i.e. <repo_root>/models/...).
    DEFAULT_CHECKPOINT_DIR = Path(__file__).parent.parent.parent / "models" / "checkpoints"
    DEFAULT_LINEAR_PROBE_DIR = Path(__file__).parent.parent.parent / "models" / "linear_probes"

    def __init__(
        self,
        device: Optional[str] = None,
        lazy_load: bool = True,
        anemia_checkpoint: Optional[Union[str, Path]] = None,
        jaundice_checkpoint: Optional[Union[str, Path]] = None,
        cry_checkpoint: Optional[Union[str, Path]] = None,
        use_linear_probes: bool = True,
    ):
        """
        Initialize NEXUS Pipeline.

        Args:
            device: Device for model inference
            lazy_load: If True, load models only when needed (detectors are
                created on first use instead of here)
            anemia_checkpoint: Path to trained anemia classifier checkpoint
            jaundice_checkpoint: Path to trained jaundice classifier checkpoint
            cry_checkpoint: Path to trained cry classifier checkpoint
            use_linear_probes: If True, auto-load linear probes from default dir
        """
        self.device = device
        self.lazy_load = lazy_load

        # Store checkpoint paths. Explicit arguments take precedence: the
        # auto-detection below only fills in paths that are still None.
        self.anemia_checkpoint = anemia_checkpoint
        self.jaundice_checkpoint = jaundice_checkpoint
        self.cry_checkpoint = cry_checkpoint

        # Auto-detect checkpoints from default locations
        if use_linear_probes:
            self._auto_detect_checkpoints()

        # Detector instances, created lazily and cached by the _get_* helpers.
        self._anemia_detector = None
        self._jaundice_detector = None
        self._cry_analyzer = None

        if not lazy_load:
            self._load_all_models()

        print("NEXUS Pipeline initialized")
|
| 91 |
+
|
| 92 |
+
def verify_hai_def_compliance(self) -> Dict:
|
| 93 |
+
"""
|
| 94 |
+
Verify which HAI-DEF models are loaded and report compliance.
|
| 95 |
+
|
| 96 |
+
Returns:
|
| 97 |
+
Dictionary with model status and compliance flag.
|
| 98 |
+
"""
|
| 99 |
+
from .anemia_detector import MEDSIGLIP_MODEL_IDS
|
| 100 |
+
from .cry_analyzer import CryAnalyzer
|
| 101 |
+
|
| 102 |
+
status = {
|
| 103 |
+
"medsiglip": {
|
| 104 |
+
"expected": "google/medsiglip-448",
|
| 105 |
+
"configured_models": MEDSIGLIP_MODEL_IDS,
|
| 106 |
+
"anemia_loaded": self._anemia_detector is not None,
|
| 107 |
+
"jaundice_loaded": self._jaundice_detector is not None,
|
| 108 |
+
},
|
| 109 |
+
"hear": {
|
| 110 |
+
"expected": CryAnalyzer.HEAR_MODEL_ID,
|
| 111 |
+
"cry_loaded": self._cry_analyzer is not None,
|
| 112 |
+
"hear_active": getattr(self._cry_analyzer, '_hear_available', False) if self._cry_analyzer else False,
|
| 113 |
+
},
|
| 114 |
+
"medgemma": {
|
| 115 |
+
"expected": "google/medgemma-4b-it",
|
| 116 |
+
},
|
| 117 |
+
}
|
| 118 |
+
|
| 119 |
+
# Check loaded model names
|
| 120 |
+
if self._anemia_detector:
|
| 121 |
+
status["medsiglip"]["anemia_model"] = getattr(self._anemia_detector, 'model_name', 'unknown')
|
| 122 |
+
if self._jaundice_detector:
|
| 123 |
+
status["medsiglip"]["jaundice_model"] = getattr(self._jaundice_detector, 'model_name', 'unknown')
|
| 124 |
+
|
| 125 |
+
# Overall compliance
|
| 126 |
+
anemia_ok = "medsiglip" in status["medsiglip"].get("anemia_model", "")
|
| 127 |
+
jaundice_ok = "medsiglip" in status["medsiglip"].get("jaundice_model", "")
|
| 128 |
+
hear_ok = status["hear"]["hear_active"]
|
| 129 |
+
|
| 130 |
+
status["compliant"] = anemia_ok or jaundice_ok or hear_ok
|
| 131 |
+
status["all_hai_def"] = anemia_ok and jaundice_ok and hear_ok
|
| 132 |
+
|
| 133 |
+
return status
|
| 134 |
+
|
| 135 |
+
def _auto_detect_checkpoints(self) -> None:
|
| 136 |
+
"""Auto-detect trained checkpoints from default directories."""
|
| 137 |
+
# Check for linear probes (.joblib sklearn models)
|
| 138 |
+
if self.anemia_checkpoint is None:
|
| 139 |
+
anemia_probe = self.DEFAULT_LINEAR_PROBE_DIR / "anemia_linear_probe.joblib"
|
| 140 |
+
if anemia_probe.exists():
|
| 141 |
+
self.anemia_checkpoint = anemia_probe
|
| 142 |
+
print(f"Auto-detected anemia probe: {anemia_probe}")
|
| 143 |
+
|
| 144 |
+
if self.jaundice_checkpoint is None:
|
| 145 |
+
jaundice_probe = self.DEFAULT_LINEAR_PROBE_DIR / "jaundice_linear_probe.joblib"
|
| 146 |
+
if jaundice_probe.exists():
|
| 147 |
+
self.jaundice_checkpoint = jaundice_probe
|
| 148 |
+
print(f"Auto-detected jaundice probe: {jaundice_probe}")
|
| 149 |
+
|
| 150 |
+
if self.cry_checkpoint is None:
|
| 151 |
+
cry_probe = self.DEFAULT_LINEAR_PROBE_DIR / "cry_linear_probe.joblib"
|
| 152 |
+
if cry_probe.exists():
|
| 153 |
+
self.cry_checkpoint = cry_probe
|
| 154 |
+
print(f"Auto-detected cry probe: {cry_probe}")
|
| 155 |
+
|
| 156 |
+
# Also check checkpoint dir for full fine-tuned models
|
| 157 |
+
if self.anemia_checkpoint is None:
|
| 158 |
+
anemia_best = self.DEFAULT_CHECKPOINT_DIR / "anemia_best.pt"
|
| 159 |
+
if anemia_best.exists():
|
| 160 |
+
self.anemia_checkpoint = anemia_best
|
| 161 |
+
print(f"Auto-detected anemia checkpoint: {anemia_best}")
|
| 162 |
+
|
| 163 |
+
def _load_all_models(self) -> None:
|
| 164 |
+
"""Load all detection models."""
|
| 165 |
+
self._get_anemia_detector()
|
| 166 |
+
self._get_jaundice_detector()
|
| 167 |
+
self._get_cry_analyzer()
|
| 168 |
+
|
| 169 |
+
def _get_anemia_detector(self):
|
| 170 |
+
"""Get or create anemia detector with optional trained classifier."""
|
| 171 |
+
if self._anemia_detector is None:
|
| 172 |
+
from .anemia_detector import AnemiaDetector
|
| 173 |
+
|
| 174 |
+
# Initialize detector
|
| 175 |
+
self._anemia_detector = AnemiaDetector(device=self.device)
|
| 176 |
+
|
| 177 |
+
# Load trained classifier if available
|
| 178 |
+
if self.anemia_checkpoint:
|
| 179 |
+
self._load_classifier_checkpoint(
|
| 180 |
+
self._anemia_detector,
|
| 181 |
+
self.anemia_checkpoint,
|
| 182 |
+
"anemia"
|
| 183 |
+
)
|
| 184 |
+
|
| 185 |
+
return self._anemia_detector
|
| 186 |
+
|
| 187 |
+
def _get_jaundice_detector(self):
|
| 188 |
+
"""Get or create jaundice detector with optional trained classifier."""
|
| 189 |
+
if self._jaundice_detector is None:
|
| 190 |
+
from .jaundice_detector import JaundiceDetector
|
| 191 |
+
|
| 192 |
+
self._jaundice_detector = JaundiceDetector(device=self.device)
|
| 193 |
+
|
| 194 |
+
# Load trained classifier if available
|
| 195 |
+
if self.jaundice_checkpoint:
|
| 196 |
+
self._load_classifier_checkpoint(
|
| 197 |
+
self._jaundice_detector,
|
| 198 |
+
self.jaundice_checkpoint,
|
| 199 |
+
"jaundice"
|
| 200 |
+
)
|
| 201 |
+
|
| 202 |
+
return self._jaundice_detector
|
| 203 |
+
|
| 204 |
+
def _get_cry_analyzer(self):
|
| 205 |
+
"""Get or create cry analyzer with optional trained classifier."""
|
| 206 |
+
if self._cry_analyzer is None:
|
| 207 |
+
from .cry_analyzer import CryAnalyzer
|
| 208 |
+
|
| 209 |
+
# Cry analyzer supports classifier_path directly
|
| 210 |
+
classifier_path = str(self.cry_checkpoint) if self.cry_checkpoint else None
|
| 211 |
+
self._cry_analyzer = CryAnalyzer(
|
| 212 |
+
device=self.device,
|
| 213 |
+
classifier_path=classifier_path
|
| 214 |
+
)
|
| 215 |
+
|
| 216 |
+
return self._cry_analyzer
|
| 217 |
+
|
| 218 |
+
def _load_classifier_checkpoint(
|
| 219 |
+
self,
|
| 220 |
+
detector,
|
| 221 |
+
checkpoint_path: Union[str, Path],
|
| 222 |
+
model_type: str
|
| 223 |
+
) -> None:
|
| 224 |
+
"""
|
| 225 |
+
Load a trained classifier checkpoint into a detector.
|
| 226 |
+
|
| 227 |
+
Supports both linear probes (sklearn) and PyTorch checkpoints.
|
| 228 |
+
"""
|
| 229 |
+
import torch
|
| 230 |
+
|
| 231 |
+
checkpoint_path = Path(checkpoint_path)
|
| 232 |
+
if not checkpoint_path.exists():
|
| 233 |
+
print(f"Warning: {model_type} checkpoint not found: {checkpoint_path}")
|
| 234 |
+
return
|
| 235 |
+
|
| 236 |
+
try:
|
| 237 |
+
# Check if it's a sklearn model (joblib)
|
| 238 |
+
if checkpoint_path.suffix in ['.pkl', '.joblib']:
|
| 239 |
+
import joblib
|
| 240 |
+
classifier = joblib.load(checkpoint_path)
|
| 241 |
+
detector.classifier = classifier
|
| 242 |
+
print(f"Loaded sklearn classifier for {model_type}")
|
| 243 |
+
|
| 244 |
+
# Check if it's a PyTorch model
|
| 245 |
+
elif checkpoint_path.suffix == '.pt':
|
| 246 |
+
checkpoint = torch.load(checkpoint_path, map_location=self.device or 'cpu')
|
| 247 |
+
|
| 248 |
+
# Handle different checkpoint formats
|
| 249 |
+
if 'classifier' in checkpoint:
|
| 250 |
+
# Linear probe format
|
| 251 |
+
detector.classifier = checkpoint['classifier']
|
| 252 |
+
print(f"Loaded linear probe for {model_type}")
|
| 253 |
+
elif 'model_state_dict' in checkpoint:
|
| 254 |
+
# Full model checkpoint - would need separate handling
|
| 255 |
+
print(f"Note: Full model checkpoint for {model_type} - using zero-shot")
|
| 256 |
+
else:
|
| 257 |
+
print(f"Unknown checkpoint format for {model_type}")
|
| 258 |
+
|
| 259 |
+
except Exception as e:
|
| 260 |
+
print(f"Warning: Could not load {model_type} checkpoint: {e}")
|
| 261 |
+
|
| 262 |
+
def assess_maternal(
|
| 263 |
+
self,
|
| 264 |
+
patient: PatientInfo,
|
| 265 |
+
conjunctiva_image: Optional[Union[str, Path]] = None,
|
| 266 |
+
) -> AssessmentResult:
|
| 267 |
+
"""
|
| 268 |
+
Perform maternal health assessment.
|
| 269 |
+
|
| 270 |
+
Currently focuses on anemia detection via conjunctiva imaging.
|
| 271 |
+
|
| 272 |
+
Args:
|
| 273 |
+
patient: Patient information
|
| 274 |
+
conjunctiva_image: Path to conjunctiva image
|
| 275 |
+
|
| 276 |
+
Returns:
|
| 277 |
+
AssessmentResult with findings
|
| 278 |
+
"""
|
| 279 |
+
result = AssessmentResult(
|
| 280 |
+
patient=patient,
|
| 281 |
+
timestamp=datetime.now().isoformat(),
|
| 282 |
+
priority_actions=[],
|
| 283 |
+
)
|
| 284 |
+
|
| 285 |
+
# Anemia detection
|
| 286 |
+
if conjunctiva_image:
|
| 287 |
+
detector = self._get_anemia_detector()
|
| 288 |
+
result.anemia_result = detector.detect(conjunctiva_image)
|
| 289 |
+
|
| 290 |
+
# Add color analysis
|
| 291 |
+
color_info = detector.analyze_color_features(conjunctiva_image)
|
| 292 |
+
result.anemia_result["color_analysis"] = color_info
|
| 293 |
+
|
| 294 |
+
# Determine actions
|
| 295 |
+
if result.anemia_result["risk_level"] == "high":
|
| 296 |
+
result.priority_actions.append("URGENT: Refer for blood test - suspected severe anemia")
|
| 297 |
+
result.referral_needed = True
|
| 298 |
+
result.overall_risk = "high"
|
| 299 |
+
elif result.anemia_result["risk_level"] == "medium":
|
| 300 |
+
result.priority_actions.append("Schedule blood test within 48 hours")
|
| 301 |
+
result.overall_risk = "medium"
|
| 302 |
+
else:
|
| 303 |
+
result.overall_risk = "low"
|
| 304 |
+
|
| 305 |
+
return result
|
| 306 |
+
|
| 307 |
+
    def assess_neonate(
        self,
        patient: PatientInfo,
        skin_image: Optional[Union[str, Path]] = None,
        cry_audio: Optional[Union[str, Path]] = None,
    ) -> AssessmentResult:
        """
        Perform neonatal health assessment.

        Includes jaundice detection and cry analysis. Overall risk is the
        worst (maximum) of the per-modality risk scores; it stays
        "unknown" when neither input is provided.

        Args:
            patient: Patient information
            skin_image: Path to skin/sclera image for jaundice
            cry_audio: Path to cry audio file

        Returns:
            AssessmentResult with findings
        """
        result = AssessmentResult(
            patient=patient,
            timestamp=datetime.now().isoformat(),
            priority_actions=[],
        )

        # One score per modality in [0, 1]; aggregated with max() below.
        risk_scores = []

        # Jaundice detection
        if skin_image:
            detector = self._get_jaundice_detector()
            result.jaundice_result = detector.detect(skin_image)

            # Add Kramer zone analysis to the raw detection output
            zone_info = detector.analyze_kramer_zones(skin_image)
            result.jaundice_result["zone_analysis"] = zone_info

            # Life-threatening findings are insert(0)'d so they appear
            # first in the priority-action list; lesser findings append.
            if result.jaundice_result["severity"] == "critical":
                result.priority_actions.insert(0, "CRITICAL: Immediate phototherapy required")
                result.referral_needed = True
                risk_scores.append(1.0)
            elif result.jaundice_result["severity"] == "severe":
                result.priority_actions.append("URGENT: Start phototherapy")
                result.referral_needed = True
                risk_scores.append(0.8)
            elif result.jaundice_result["severity"] == "moderate":
                result.priority_actions.append("Monitor closely, recheck in 12-24 hours")
                risk_scores.append(0.5)
            else:
                risk_scores.append(0.2)

        # Cry analysis
        if cry_audio:
            analyzer = self._get_cry_analyzer()
            result.cry_result = analyzer.analyze(cry_audio)

            if result.cry_result["risk_level"] == "high":
                result.priority_actions.insert(0, "URGENT: Abnormal cry - assess for birth asphyxia")
                result.referral_needed = True
                risk_scores.append(1.0)
            elif result.cry_result["risk_level"] == "medium":
                result.priority_actions.append("Monitor cry patterns, reassess in 30 minutes")
                risk_scores.append(0.5)
            else:
                risk_scores.append(0.2)

        # Determine overall risk from the worst modality score.
        if risk_scores:
            max_risk = max(risk_scores)
            if max_risk >= 0.8:
                result.overall_risk = "high"
            elif max_risk >= 0.5:
                result.overall_risk = "medium"
            else:
                result.overall_risk = "low"

        return result
|
| 383 |
+
|
| 384 |
+
    def agentic_assessment(
        self,
        patient_type: str = "newborn",
        conjunctiva_image: Optional[Union[str, Path]] = None,
        skin_image: Optional[Union[str, Path]] = None,
        cry_audio: Optional[Union[str, Path]] = None,
        danger_signs: Optional[List[Dict]] = None,
        patient_info: Optional[Dict] = None,
    ) -> Dict:
        """
        Run the full agentic clinical workflow with 6 specialized agents.

        This provides richer output than full_assessment() — each agent emits
        step-by-step reasoning traces forming a complete audit trail.

        Args:
            patient_type: "pregnant" or "newborn"
            conjunctiva_image: Path to conjunctiva image for anemia screening
            skin_image: Path to skin image for jaundice detection
            cry_audio: Path to cry audio for asphyxia detection
            danger_signs: List of danger sign dicts with keys: id, label, severity, present
            patient_info: Patient information dict

        Returns:
            Dict with workflow result including agent_traces list
        """
        from .agentic_workflow import (
            AgenticWorkflowEngine,
            AgentPatientInfo,
            DangerSign,
            WorkflowInput,
        )

        # Build patient info (missing keys simply stay at their defaults)
        info = AgentPatientInfo(patient_type=patient_type)
        if patient_info:
            info.patient_id = patient_info.get("patient_id", "")
            info.gestational_weeks = patient_info.get("gestational_weeks")
            info.birth_weight = patient_info.get("birth_weight")
            info.apgar_score = patient_info.get("apgar_score")
            info.age_hours = patient_info.get("age_hours")

        # Build danger signs; entries default to present=True, severity="medium"
        signs = []
        if danger_signs:
            for s in danger_signs:
                signs.append(DangerSign(
                    id=s.get("id", ""),
                    label=s.get("label", ""),
                    severity=s.get("severity", "medium"),
                    present=s.get("present", True),
                ))

        workflow_input = WorkflowInput(
            patient_type=patient_type,
            patient_info=info,
            danger_signs=signs,
            conjunctiva_image=conjunctiva_image,
            skin_image=skin_image,
            cry_audio=cry_audio,
        )

        # Create engine with existing model instances to avoid reloading.
        # NOTE(review): with lazy_load these may still be None here —
        # presumably the engine creates its own detectors in that case;
        # confirm against agentic_workflow.
        engine = AgenticWorkflowEngine(
            anemia_detector=self._anemia_detector,
            jaundice_detector=self._jaundice_detector,
            cry_analyzer=self._cry_analyzer,
        )

        result = engine.execute(workflow_input)

        # Serialize to dict; optional sub-results map to None when absent
        # so the payload is JSON-friendly end to end.
        return {
            "success": result.success,
            "patient_type": result.patient_type,
            "who_classification": result.who_classification,
            "clinical_synthesis": result.clinical_synthesis,
            "recommendation": result.recommendation,
            "immediate_actions": result.immediate_actions,
            "processing_time_ms": result.processing_time_ms,
            "timestamp": result.timestamp,
            "triage": {
                "risk_level": result.triage_result.risk_level,
                "score": result.triage_result.score,
                "critical_signs": result.triage_result.critical_signs,
                "immediate_referral": result.triage_result.immediate_referral_needed,
            } if result.triage_result else None,
            "referral": {
                "referral_needed": result.referral_result.referral_needed,
                "urgency": result.referral_result.urgency,
                "facility_level": result.referral_result.facility_level,
                "reason": result.referral_result.reason,
                "timeframe": result.referral_result.timeframe,
            } if result.referral_result else None,
            "protocol": {
                "classification": result.protocol_result.classification,
                "applicable_protocols": result.protocol_result.applicable_protocols,
                "treatment_recommendations": result.protocol_result.treatment_recommendations,
                "follow_up_schedule": result.protocol_result.follow_up_schedule,
            } if result.protocol_result else None,
            "agent_traces": [
                {
                    "agent_name": t.agent_name,
                    "status": t.status,
                    "reasoning": t.reasoning,
                    "findings": t.findings,
                    "confidence": t.confidence,
                    "processing_time_ms": t.processing_time_ms,
                }
                for t in result.agent_traces
            ],
        }
|
| 496 |
+
|
| 497 |
+
def full_assessment(
|
| 498 |
+
self,
|
| 499 |
+
patient: PatientInfo,
|
| 500 |
+
conjunctiva_image: Optional[Union[str, Path]] = None,
|
| 501 |
+
skin_image: Optional[Union[str, Path]] = None,
|
| 502 |
+
cry_audio: Optional[Union[str, Path]] = None,
|
| 503 |
+
) -> AssessmentResult:
|
| 504 |
+
"""
|
| 505 |
+
Perform full assessment (maternal or neonatal based on patient info).
|
| 506 |
+
|
| 507 |
+
Args:
|
| 508 |
+
patient: Patient information
|
| 509 |
+
conjunctiva_image: For maternal anemia screening
|
| 510 |
+
skin_image: For neonatal jaundice detection
|
| 511 |
+
cry_audio: For neonatal cry analysis
|
| 512 |
+
|
| 513 |
+
Returns:
|
| 514 |
+
Complete AssessmentResult
|
| 515 |
+
"""
|
| 516 |
+
if patient.is_maternal:
|
| 517 |
+
return self.assess_maternal(patient, conjunctiva_image)
|
| 518 |
+
else:
|
| 519 |
+
return self.assess_neonate(patient, skin_image, cry_audio)
|
| 520 |
+
|
| 521 |
+
def generate_report(self, result: AssessmentResult) -> str:
|
| 522 |
+
"""
|
| 523 |
+
Generate a text report from assessment result.
|
| 524 |
+
|
| 525 |
+
Args:
|
| 526 |
+
result: AssessmentResult from assessment
|
| 527 |
+
|
| 528 |
+
Returns:
|
| 529 |
+
Formatted report string
|
| 530 |
+
"""
|
| 531 |
+
lines = [
|
| 532 |
+
"=" * 60,
|
| 533 |
+
"NEXUS HEALTH ASSESSMENT REPORT",
|
| 534 |
+
"=" * 60,
|
| 535 |
+
"",
|
| 536 |
+
f"Patient ID: {result.patient.patient_id}",
|
| 537 |
+
f"Assessment Time: {result.timestamp}",
|
| 538 |
+
f"Patient Type: {'Maternal' if result.patient.is_maternal else 'Neonatal'}",
|
| 539 |
+
"",
|
| 540 |
+
]
|
| 541 |
+
|
| 542 |
+
if result.patient.age_days is not None:
|
| 543 |
+
lines.append(f"Age: {result.patient.age_days} days")
|
| 544 |
+
if result.patient.gestational_age is not None:
|
| 545 |
+
lines.append(f"Gestational Age: {result.patient.gestational_age} weeks")
|
| 546 |
+
if result.patient.birth_weight is not None:
|
| 547 |
+
lines.append(f"Birth Weight: {result.patient.birth_weight} grams")
|
| 548 |
+
|
| 549 |
+
lines.extend(["", "-" * 60, "FINDINGS", "-" * 60, ""])
|
| 550 |
+
|
| 551 |
+
# Anemia findings
|
| 552 |
+
if result.anemia_result:
|
| 553 |
+
lines.extend([
|
| 554 |
+
"ANEMIA SCREENING:",
|
| 555 |
+
f" Status: {'ANEMIC' if result.anemia_result['is_anemic'] else 'Normal'}",
|
| 556 |
+
f" Confidence: {result.anemia_result['confidence']:.1%}",
|
| 557 |
+
f" Risk Level: {result.anemia_result['risk_level'].upper()}",
|
| 558 |
+
"",
|
| 559 |
+
])
|
| 560 |
+
|
| 561 |
+
# Jaundice findings
|
| 562 |
+
if result.jaundice_result:
|
| 563 |
+
lines.extend([
|
| 564 |
+
"JAUNDICE ASSESSMENT:",
|
| 565 |
+
f" Status: {'JAUNDICE DETECTED' if result.jaundice_result['has_jaundice'] else 'Normal'}",
|
| 566 |
+
f" Estimated Bilirubin: {result.jaundice_result['estimated_bilirubin']} mg/dL",
|
| 567 |
+
f" Severity: {result.jaundice_result['severity'].upper()}",
|
| 568 |
+
f" Phototherapy Needed: {'YES' if result.jaundice_result['needs_phototherapy'] else 'No'}",
|
| 569 |
+
"",
|
| 570 |
+
])
|
| 571 |
+
|
| 572 |
+
# Cry analysis findings
|
| 573 |
+
if result.cry_result:
|
| 574 |
+
lines.extend([
|
| 575 |
+
"CRY ANALYSIS:",
|
| 576 |
+
f" Status: {'ABNORMAL' if result.cry_result['is_abnormal'] else 'Normal'}",
|
| 577 |
+
f" Asphyxia Risk: {result.cry_result['asphyxia_risk']:.1%}",
|
| 578 |
+
f" Cry Type: {result.cry_result['cry_type']}",
|
| 579 |
+
f" Risk Level: {result.cry_result['risk_level'].upper()}",
|
| 580 |
+
"",
|
| 581 |
+
])
|
| 582 |
+
|
| 583 |
+
lines.extend(["-" * 60, "OVERALL ASSESSMENT", "-" * 60, ""])
|
| 584 |
+
lines.append(f"Overall Risk Level: {result.overall_risk.upper()}")
|
| 585 |
+
lines.append(f"Referral Needed: {'YES' if result.referral_needed else 'No'}")
|
| 586 |
+
|
| 587 |
+
if result.priority_actions:
|
| 588 |
+
lines.extend(["", "PRIORITY ACTIONS:"])
|
| 589 |
+
for i, action in enumerate(result.priority_actions, 1):
|
| 590 |
+
lines.append(f" {i}. {action}")
|
| 591 |
+
|
| 592 |
+
lines.extend(["", "=" * 60])
|
| 593 |
+
|
| 594 |
+
return "\n".join(lines)
|
| 595 |
+
|
| 596 |
+
def to_json(self, result: AssessmentResult) -> str:
|
| 597 |
+
"""Convert assessment result to JSON string."""
|
| 598 |
+
data = {
|
| 599 |
+
"patient": {
|
| 600 |
+
"patient_id": result.patient.patient_id,
|
| 601 |
+
"age_days": result.patient.age_days,
|
| 602 |
+
"gestational_age": result.patient.gestational_age,
|
| 603 |
+
"birth_weight": result.patient.birth_weight,
|
| 604 |
+
"gender": result.patient.gender,
|
| 605 |
+
"is_maternal": result.patient.is_maternal,
|
| 606 |
+
},
|
| 607 |
+
"timestamp": result.timestamp,
|
| 608 |
+
"anemia_result": result.anemia_result,
|
| 609 |
+
"jaundice_result": result.jaundice_result,
|
| 610 |
+
"cry_result": result.cry_result,
|
| 611 |
+
"overall_risk": result.overall_risk,
|
| 612 |
+
"priority_actions": result.priority_actions,
|
| 613 |
+
"referral_needed": result.referral_needed,
|
| 614 |
+
}
|
| 615 |
+
return json.dumps(data, indent=2)
|
| 616 |
+
|
| 617 |
+
|
| 618 |
+
def demo():
    """Exercise the NEXUS pipeline end-to-end on sample data, if present."""
    print("NEXUS Pipeline Demo")
    print("=" * 60)

    pipeline = NEXUSPipeline(lazy_load=True)

    # --- Maternal assessment on the first available conjunctiva image ---
    print("\n--- Maternal Assessment Demo ---")
    mother = PatientInfo(
        patient_id="M001",
        is_maternal=True,
    )

    data_dir = Path(__file__).parent.parent.parent / "data" / "raw"
    conjunctiva_samples = list((data_dir / "eyes-defy-anemia").rglob("*.jpg"))[:1]
    if conjunctiva_samples:
        maternal_result = pipeline.assess_maternal(mother, conjunctiva_samples[0])
        print(pipeline.generate_report(maternal_result))

    # --- Neonatal assessment on sample jaundice image and cry audio ---
    print("\n--- Neonatal Assessment Demo ---")
    newborn = PatientInfo(
        patient_id="N001",
        age_days=3,
        gestational_age=38,
        birth_weight=3200,
        gender="M",
        is_maternal=False,
    )

    skin_samples = list((data_dir / "neojaundice" / "images").glob("*.jpg"))[:1]
    cry_samples = list((data_dir / "donate-a-cry").rglob("*.wav"))[:1]
    skin_image = skin_samples[0] if skin_samples else None
    cry_audio = cry_samples[0] if cry_samples else None

    if skin_image or cry_audio:
        neonatal_result = pipeline.assess_neonate(newborn, skin_image, cry_audio)
        print(pipeline.generate_report(neonatal_result))
|
| 660 |
+
|
| 661 |
+
|
| 662 |
+
if __name__ == "__main__":
    # Allow running this module directly as a smoke test.
    demo()
|