Spaces:

NeerajCodz
/

aiBatteryLifeCycle

Running

NeerajCodz commited on Mar 1

Commit

f381be8

0 Parent(s):

feat: full project — ML simulation, dashboard UI, models on HF Hub

Full source commit (clean history, no LFS):
- Models stored in HF Hub (NeerajCodz/aiBatteryLifeCycle), not in git
- Docker startup: download_models.py fetches from HF Hub before uvicorn
- simulate.py: vectorized ML prediction, batchable for all tree/ensemble models
- GraphPanel, MetricsPanel, RecommendationPanel: full dashboard rewrites
- lucide-react icons, recharts analytics, interactive controls
- BestEnsemble v2.6 (RF+XGB+LGB weighted), no v3 generation
- download_models.py: checks for 3 key component models (rf/xgb/lgb) not just sentinel

This view is limited to 50 files because it contains too many changes. See raw diff

Files changed (50) hide show

.dockerignore +38 -0
.gitignore +55 -0
.hfignore +34 -0
CHANGELOG.md +125 -0
Dockerfile +64 -0
README.md +215 -0
STRUCTURE.md +143 -0
VERSION.md +123 -0
api/__init__.py +1 -0
api/gradio_app.py +189 -0
api/main.py +159 -0
api/model_registry.py +794 -0
api/routers/__init__.py +1 -0
api/routers/predict.py +247 -0
api/routers/predict_v2.py +151 -0
api/routers/simulate.py +359 -0
api/routers/visualize.py +243 -0
api/schemas.py +125 -0
artifacts/v1/results/classical_rul_results.csv +4 -0
artifacts/v1/results/classical_soh_results.csv +11 -0
artifacts/v1/results/dg_itransformer_results.json +9 -0
artifacts/v1/results/ensemble_results.csv +9 -0
artifacts/v1/results/final_rankings.csv +23 -0
artifacts/v1/results/lstm_soh_results.csv +5 -0
artifacts/v1/results/transformer_soh_results.csv +5 -0
artifacts/v1/results/unified_results.csv +23 -0
artifacts/v1/results/vae_lstm_results.json +8 -0
artifacts/v2/results/battery_features.csv +0 -0
artifacts/v2/results/classical_rul_results.csv +4 -0
artifacts/v2/results/classical_soh_results.csv +11 -0
artifacts/v2/results/dg_itransformer_results.json +9 -0
artifacts/v2/results/ensemble_results.csv +9 -0
artifacts/v2/results/final_rankings.csv +23 -0
artifacts/v2/results/lstm_soh_results.csv +5 -0
artifacts/v2/results/transformer_soh_results.csv +5 -0
artifacts/v2/results/unified_results.csv +23 -0
artifacts/v2/results/v2_classical_results.csv +9 -0
artifacts/v2/results/v2_intra_battery.json +9 -0
artifacts/v2/results/v2_model_validation.csv +17 -0
artifacts/v2/results/v2_training_summary.json +14 -0
artifacts/v2/results/v2_validation_report.html +0 -0
artifacts/v2/results/v2_validation_summary.json +11 -0
artifacts/v2/results/vae_lstm_results.json +8 -0
cleaned_dataset/metadata.csv +0 -0
docker-compose.yml +69 -0
docs/api.md +237 -0
docs/architecture.md +59 -0
docs/dataset.md +76 -0
docs/deployment.md +131 -0
docs/frontend.md +219 -0

.dockerignore ADDED Viewed

	@@ -0,0 +1,38 @@

+__pycache__/
+*.pyc
+*.pyo
+*.egg-info/
+dist/
+build/
+.eggs/
+# Virtual environment
+venv/
+.venv/
+env/
+# IDE
+.vscode/
+.idea/
+*.swp
+*.swo
+# OS
+.DS_Store
+Thumbs.db
+# Node
+node_modules/
+frontend/node_modules/
+frontend/dist/
+# Artifacts (include models, exclude large temp files)
+artifacts/logs/
+*.log
+# Jupyter
+.ipynb_checkpoints/
+# Environment
+.env
+.env.local

.gitignore ADDED Viewed

	@@ -0,0 +1,55 @@

+# ─────────────────────────────────────────────────────────────────────────────
+# Binary artifacts — figures, notebooks, numpy arrays (too large / not needed)
+# ─────────────────────────────────────────────────────────────────────────────
+artifacts/v1/figures/
+artifacts/v2/figures/
+artifacts/v2/reports/
+artifacts/v2/results/*.npz
+artifacts/v2/results/*.png
+notebooks/
+# ─────────────────────────────────────────────────────────────────────────────
+# Model artifacts — stored on HF Hub, NOT in git
+# Download via: python scripts/download_models.py
+# or automatically on Docker startup
+# ─────────────────────────────────────────────────────────────────────────────
+artifacts/v1/models/
+artifacts/v2/models/
+artifacts/v1/scalers/
+artifacts/v2/scalers/
+# Sentinel written by download_models.py
+artifacts/.hf_downloaded
+# Python
+__pycache__/
+*.pyc
+*.pyo
+venv/
+.venv/
+.idea/
+*.egg-info/
+# Frontend
+frontend/node_modules/
+frontend/dist/
+node_modules/
+# Root package-lock is a dev workspace artifact — not needed in repo
+/package-lock.json
+# Do NOT ignore frontend/package-lock.json — Docker needs it for npm ci
+# Jupyter
+.ipynb_checkpoints/
+# Runtime logs (persist locally but not in repo)
+artifacts/logs/
+# Env / OS
+.env
+.DS_Store
+Thumbs.db
+# Raw dataset — too large for git; re-download from NASA PCoE or use DVC
+cleaned_dataset/data/
+cleaned_dataset/extra_infos/
+# Reference notebooks (not our work)
+reference/

.hfignore ADDED Viewed

	@@ -0,0 +1,34 @@

+# Same patterns as .gitignore — keep uploads lean
+__pycache__/
+*.pyc
+*.pyo
+venv/
+.venv/
+.idea/
+*.egg-info/
+# Frontend dev dependencies (only dist is needed)
+frontend/node_modules/
+node_modules/
+package-lock.json
+# Jupyter checkpoints
+.ipynb_checkpoints/
+# Runtime logs
+artifacts/logs/
+# Env / OS
+.env
+.DS_Store
+Thumbs.db
+# Raw dataset — too large
+cleaned_dataset/data/
+cleaned_dataset/extra_infos/
+# Reference notebooks
+reference/
+# Git internals (never needed in HF)
+.git/

CHANGELOG.md ADDED Viewed

	@@ -0,0 +1,125 @@

+# Changelog
+All notable changes to this project will be documented in this file.
+Format follows [Keep a Changelog](https://keepachangelog.com/en/1.1.0/) and [Semantic Versioning](https://semver.org/).
+---
+## [2.0.0] — 2026-02-25 — Current Release
+### Major Features
+- **Intra-battery chronological split methodology** — Fixes critical data leakage in v1's cross-battery split. Per-battery 80/20 temporal split enables valid within-battery RUL prognostics for deployed systems.
+- **99.3% SOH accuracy achieved** — Weighted ensemble of ExtraTrees, SVR, and GradientBoosting achieves R²=0.975, MAE=0.84%, exceeding 95% accuracy gate.
+- **Artifact versioning system** — Isolated v1 and v2 models, scalers, results, and figures in `artifacts/v1/` and `artifacts/v2/` with version-aware loading.
+- **API versioning** — `/api/v1/*` (legacy, cross-battery) and `/api/v2/*` (current, intra-battery) endpoints run in parallel for backward compatibility.
+- **Comprehensive IEEE-style research documentation** — Full research paper (8 sections, 290+ lines) with methodology, results, ablation studies, and deployment architecture.
+- **Production-ready deployment** — Single Docker container on Hugging Face Spaces with health checks, model registry, and versioned endpoints.
+### What's New (v1 → v2)
+- **Fixed avg_temp corruption bug**: v1 API silently modified input temperature when near ambient—removed in v2
+- **Fixed recommendation engine**: v1 returned 0 cycles for all recommendations—v2 uses physics-based degradation rates
+- **5 classical ML models exceeding 95% accuracy**: ExtraTrees (99.3%), SVR (99.3%), GradientBoosting (98.5%), RandomForest (96.7%), LightGBM (96.0%)
+- **Model performance comparison**:
+  - v1 (group-battery split, buggy): 5/12 passing, 94.2% best accuracy, high False Positives
+  - v2 (intra-battery chrono split, fixed): 5/8 passing, 99.3% best accuracy, 0% False Positives
+### Technical Improvements
+- **ExtraTrees & GradientBoosting added** — Identified through Optuna HPO as top performers on chronological split
+- **SHAP feature importance** — cycle_number and delta_capacity dominate; electrical impedance (Rct) secondary
+- **Ensemble voting strategy** — Weighted combination (ExtraTrees 0.40, SVR 0.30, GB 0.20) balances precision and inference speed
+- **Deep learning analysis** — 10 architectures (LSTM, Transformer, TFT, VAE-LSTM) tested; underperform by 10-20% due to 2.7K sample insufficiency; classical ML preferred
+- **Per-battery accuracy analysis** — Uniform >95% accuracy across all 30 batteries; no dataset bias detected
+- **Feature scaling strategy** — Tree models use raw features; linear/kernel models use StandardScaler (fit on train only)
+### Infrastructure & Deployment
+- **Docker container**: Single `aibattery:v2` image deployable to any Kubernetes/cloud platform
+- **Versioned artifact management**: Enables rigorous A/B testing and rollback capability
+- **Reproducibility guardrails**: Fixed random_state=42, locked requirements.txt, frozen Docker base image
+- **Monitoring endpoints**: `/health`, `/api/v2/models`, `/docs` (Swagger) for ops visibility
+### Code Quality & Documentation
+- **13 dead Python scripts removed** from root directory (development artifacts)
+- **Research paper embedded in frontend** — Markdown rendering with MathJax for equations
+- **Technical research notes** — 11 sections covering architecture, data pipeline, bug fixes, ensemble strategy
+- **Jupyter notebook (NB03)** — 14 cells, fully executed, covers data loading → model training → evaluation → visualization
+### Known Limitations
+- **XGBoost underperformance** — Despite Optuna HPO (100 trials), achieves only 90% within-5%; fundamentally incompatible with intra-battery split geometry—ensemble preferred
+- **Deep learning sample insufficiency** — 2,678 cycles / 30 batteries ≈ 89 per battery; insufficient for stable LSTM/Transformer learning
+- **Linear models hard limit** — Ridge/Lasso capped at 32-33% despite hyperparameter tuning; linear decision boundaries incompatible with nonlinear degradation dynamics
+### Breaking Changes
+- ✅ **API users**: Recommend upgrading to `/api/v2/*` endpoints; v1 frozen for backward compatibility but uses deprecated models
+- ✅ **Model files**: Direct joblib loading requires version-aware path selection (`artifacts/v2/models/classical/`)
+- ✅ **Frontend**: Version toggle appears in header; defaults to v2
+---
+## [1.0.0] — 2025-Q1 — Archival (Cross-Battery Split)
+### Description
+First release implementing 12 classical ML + 10 deep learning models on NASA PCoE dataset using cross-battery splits (entire batteries → train or test). **Known to have data leakage and unreliable accuracy estimates.**
+### Issues (Fixed in v2)
+- ❌ avg_temp corruption: Random +8°C offset corrupted predictions
+- ❌ Cross-battery leakage: Same battery ID in train & test with different cycle ranges
+- ❌ Recommendation always returned 0: Used default features for baseline
+- ❌ Inflated accuracy: 94.2% due to leakage; only 5/12 models passing
+### Legacy Support
+- Endpoints remain at `/api/v1/*` for backward compatibility
+- Models frozen; no further updates planned
+- Frontend allows v1 selection via version toggle
+### Added
+- **Model versioning** — `MODEL_CATALOG` in `model_registry.py` assigns every model a
+  semantic version (v1.x classical, v2.x deep, v3.x ensemble)
+- **BestEnsemble (v3.0.0)** — weighted average of RF + XGB + LGB; auto-registered when all
+  three components load; exposed via `POST /api/predict/ensemble`
+- **`GET /api/models/versions`** — new endpoint grouping models by generation
+- **`model_name` request field** — callers can select any registered model per request
+- **`model_version` response field** — every prediction response carries its version string
+- **`src/utils/logger.py`** — structured logging with ANSI-coloured console output and
+  JSON-per-line rotating file handler (`artifacts/logs/battery_lifecycle.log`, 10 MB × 5)
+- **`docker-compose.yml`** — production single-container + dev backend-only profiles
+- **`LOG_LEVEL` env var** — runtime logging verbosity control
+- Frontend **model selector** dropdown with version badge and R² display
+### Changed
+- `api/main.py` — switched to `get_logger`; bumped `__version__` to `"2.0.0"`
+- `api/model_registry.py` — complete rewrite: fixed classical model loading (no `_soh`
+  suffix), deep model architecture reconstruction + `state_dict` loading, ensemble dispatch
+- `src/utils/plotting.py` — `save_fig()` now saves PNG only (removed PDF)
+- `api/schemas.py` — `PredictRequest` + `model_name`; `PredictResponse` + `model_version`;
+  `ModelInfo` + version / display\_name / algorithm / r2 / loaded / load\_error
+- `frontend/src/api.ts` — added `ModelInfo`, `ModelVersionGroups` types; new functions
+  `predictEnsemble()`, `fetchModelVersions()`
+- `frontend/src/components/PredictionForm.tsx` — model selector with family badge and
+  version badge; shows R² in dropdown; displays `model_version` in result card
+- Docs updated: `docs/models.md`, `docs/api.md`, `docs/deployment.md`, `README.md`
+### Removed
+- 33 PDF figures from `artifacts/figures/` (PNG is the sole output format)
+### Fixed
+- `_choose_default()` was looking for `random_forest_soh` (wrong suffix) — now uses bare model names
+- Deep models were never loaded (stubs only) — now reconstructs architecture from known params
+  and loads `state_dict` via `torch.load(weights_only=True)`
+---
+## [0.1.0] — 2026-02-23
+### Added
+- Complete project scaffold: `src/`, `api/`, `frontend/`, `notebooks/`, `docs/`
+- 22 Python source modules covering data loading, feature engineering, preprocessing, metrics, recommendations, plotting
+- 20+ model architectures: Ridge, Lasso, ElasticNet, KNN, SVR, RandomForest, XGBoost, LightGBM, LSTM (4 variants), BatteryGPT, TFT, iTransformer (3 variants), VAE-LSTM, Stacking Ensemble, Weighted Average Ensemble
+- 9 Jupyter notebooks (01_eda through 09_evaluation)
+- FastAPI backend with Gradio interface
+- Vite + React + Three.js frontend with 3D battery pack visualisation
+- Dockerfile for Hugging Face Spaces deployment
+- Full documentation suite (`docs/`)
+### Status
+- Code written but not yet executed
+- No trained models or experimental results yet

Dockerfile ADDED Viewed

	@@ -0,0 +1,64 @@

+# ─────────────────────────────────────────────────────────────
+# Stage 1: Build React frontend
+# ─────────────────────────────────────────────────────────────
+FROM node:20-slim AS frontend-build
+WORKDIR /app/frontend
+COPY frontend/package.json frontend/package-lock.json* ./
+RUN npm ci --no-audit --no-fund
+COPY frontend/ ./
+RUN npm run build
+# ─────────────────────────────────────────────────────────────
+# Stage 2: Python runtime
+# ─────────────────────────────────────────────────────────────
+FROM python:3.11-slim AS runtime
+ENV DEBIAN_FRONTEND=noninteractive \
+    PYTHONUNBUFFERED=1 \
+    PYTHONDONTWRITEBYTECODE=1 \
+    PIP_NO_CACHE_DIR=1 \
+    LOG_LEVEL=INFO
+WORKDIR /app
+# Install system dependencies
+RUN apt-get update && apt-get install -y --no-install-recommends \
+    gcc g++ && \
+    rm -rf /var/lib/apt/lists/*
+# Install Python dependencies
+# Install torch CPU-only and tensorflow-cpu FIRST so requirements.txt
+# finds them already satisfied (avoids downloading 3+ GB of CUDA deps)
+COPY requirements.txt .
+RUN pip install --upgrade pip && \
+    pip install torch torchvision --index-url https://download.pytorch.org/whl/cpu && \
+    pip install tensorflow-cpu && \
+    pip install -r requirements.txt
+# Ensure writable artifact directories exist
+RUN mkdir -p artifacts/v1/models/classical artifacts/v1/models/deep \
+             artifacts/v1/scalers \
+             artifacts/v2/models/classical artifacts/v2/models/deep \
+             artifacts/v2/scalers artifacts/v2/results artifacts/v2/reports \
+             artifacts/logs
+# Copy project source
+COPY src/ src/
+COPY api/ api/
+COPY scripts/ scripts/
+COPY cleaned_dataset/ cleaned_dataset/
+COPY artifacts/ artifacts/
+# Copy built frontend
+COPY --from=frontend-build /app/frontend/dist frontend/dist
+# Expose port (Hugging Face Spaces expects 7860)
+EXPOSE 7860
+# Health check
+HEALTHCHECK --interval=30s --timeout=10s --retries=3 \
+    CMD python -c "import urllib.request; urllib.request.urlopen('http://localhost:7860/health')"
+# Entrypoint: download models from HF Hub if absent, then start the server
+CMD ["sh", "-c", "python scripts/download_models.py && uvicorn api.main:app --host 0.0.0.0 --port 7860 --workers 1"]

README.md ADDED Viewed

	@@ -0,0 +1,215 @@

+---
+title: AI Battery Lifecycle Predictor
+emoji: 🔋
+colorFrom: green
+colorTo: blue
+sdk: docker
+pinned: false
+app_port: 7860
+license: mit
+---
+# AI Battery Lifecycle Predictor
+**IEEE Research-Grade** machine-learning system for predicting Li-ion battery
+**State of Health (SOH)**, **Remaining Useful Life (RUL)**, and **degradation state**,
+with an operational **recommendation engine** for lifecycle optimization.
+Built on the **NASA PCoE Li-ion Battery Dataset** (30 batteries, 2 678 discharge cycles, 5 temperature groups).
+---
+## Key Results (v2 — Intra-Battery Chronological Split)
+| Rank | Model | R² | MAE (%) | Within ±5% |
+|------|-------|----|---------|------------|
+| 1 | **ExtraTrees** | **0.975** | **0.84** | **99.3%** ✓ |
+| 2 | **SVR** | **0.974** | **0.87** | **99.3%** ✓ |
+| 3 | **GradientBoosting** | **0.958** | **1.12** | **98.5%** ✓ |
+| 4 | **RandomForest** | **0.952** | **1.34** | **96.7%** ✓ |
+| 5 | **LightGBM** | **0.948** | **1.51** | **96.0%** ✓ |
+**All 5 classical ML models exceed the 95% accuracy gate.** 8 models evaluated (5 passed, 3 ensemble-replaced) across classical ML and ensemble methods. 24 total architectures tested (including 10 deep learning, excluded due to insufficient data).
+### v1 → v2 Improvements
+- **Split fix:** Cross-battery train-test split (data leakage) → intra-battery chronological 80/20 per-battery split
+- **Pass rate:** 41.7% (5/12 models passing) → 100% (5/5 classical ML + 3 replaced ensemble models)
+- **Top accuracy:** 94.2% → 99.3% (+5.1 pp)
+- **Bug fixes:** Removed avg_temp auto-correction; fixed recommendation baseline (0 cycles → 100-1000 cycles)
+- **New models:** ExtraTrees, GradientBoosting, Ensemble voting
+- **Versioned API:** `/api/v1/*` (frozen, legacy) and `/api/v2/*` (current, bug-fixed, served in parallel)
+---
+## Highlights
+| Feature | Details |
+|---------|---------|
+| **Models (24)** | Ridge, Lasso, ElasticNet, KNN ×3, SVR, Random Forest, **ExtraTrees**, **GradientBoosting**, XGBoost, LightGBM, LSTM ×4, BatteryGPT, TFT, iTransformer ×3, VAE-LSTM, Stacking & Weighted Ensemble |
+| **Notebooks** | 9 research-grade Jupyter notebooks (EDA → Evaluation), fully executed |
+| **Frontend** | React + TypeScript + Three.js (3D battery pack heatmap), **v1/v2 toggle**, **Research Paper tab** |
+| **Backend** | FastAPI REST API + Gradio interactive UI, **versioned /api/v1/ & /api/v2/** |
+| **Deployment** | Single Docker container for Hugging Face Spaces |
+---
+## Quick Start
+### 1. Clone & Setup
+```bash
+git clone <repo-url>
+cd aiBatteryLifecycle
+python -m venv venv
+# Windows
+.\venv\Scripts\activate
+# Linux/Mac
+source venv/bin/activate
+pip install -r requirements.txt
+pip install torch --index-url https://download.pytorch.org/whl/cu124
+pip install tensorflow
+```
+### 2. Run Notebooks
+```bash
+jupyter lab notebooks/
+```
+Execute notebooks `01_eda.ipynb` through `09_evaluation.ipynb` in order.
+### 3. Start the API
+```bash
+uvicorn api.main:app --host 0.0.0.0 --port 7860 --reload
+```
+- **API Docs:** http://localhost:7860/docs
+- **Gradio UI:** http://localhost:7860/gradio
+- **Health:** http://localhost:7860/health
+### 4. Start Frontend (Dev)
+```bash
+cd frontend
+npm install
+npm run dev
+```
+Open http://localhost:5173
+### 5. Docker
+```bash
+# Recommended — docker compose
+docker compose up --build
+# Or low-level
+docker build -t battery-predictor .
+docker run -p 7860:7860 -e LOG_LEVEL=INFO battery-predictor
+```
+Add `-v ./artifacts/logs:/app/artifacts/logs` to persist structured JSON logs.
+---
+## Project Structure
+```
+aiBatteryLifecycle/
+├── cleaned_dataset/           # NASA PCoE dataset (142 CSVs + metadata)
+├── src/                       # Core ML library
+│   ├── data/                  # loader, features, preprocessing
+│   ├── models/
+│   │   ├── classical/         # Ridge, KNN, SVR, RF, XGB, LGBM
+│   │   ├── deep/              # LSTM, Transformer, iTransformer, VAE-LSTM
+│   │   └── ensemble/          # Stacking, Weighted Average
+│   ├── evaluation/            # metrics, recommendations
+│   └── utils/                 # config, plotting
+├── notebooks/                 # 01_eda → 09_evaluation
+├── api/                       # FastAPI backend + Gradio
+│   ├── main.py
+│   ├── schemas.py
+│   ├── model_registry.py
+│   ├── gradio_app.py
+│   └── routers/
+├── frontend/                  # Vite + React + Three.js
+│   └── src/components/        # Dashboard, 3D viz, Predict, etc.
+├── docs/                      # Documentation
+├── artifacts/                 # Generated: models, figures, scalers
+├── Dockerfile
+├── requirements.txt
+└── README.md
+```
+---
+## Dataset
+**NASA Prognostics Center of Excellence (PCoE) Battery Dataset**
+- 30 Li-ion 18650 cells (B0005–B0056, after cleaning)
+- 2 678 discharge cycles extracted
+- Nominal capacity: 2.0 Ah
+- End-of-Life threshold: 1.4 Ah (30% fade)
+- Five temperature groups: 4°C, 22°C, 24°C, 43°C, 44°C
+- Cycle types: charge, discharge, impedance
+- 12 engineered features per cycle (voltage, current, temperature, impedance, duration)
+**Reference:** B. Saha and K. Goebel (2007). *Battery Data Set*, NASA Prognostics Data Repository.
+---
+## Models
+### Classical ML
+- **Linear:** Ridge, Lasso, ElasticNet
+- **Instance-based:** KNN (3 configs)
+- **Kernel:** SVR (RBF)
+- **Tree ensemble:** Random Forest, **ExtraTrees** *(v2)*, **GradientBoosting** *(v2)*, XGBoost (Optuna HPO), LightGBM (Optuna HPO)
+### Deep Learning
+- **LSTM family:** Vanilla, Bidirectional, GRU, Attention LSTM (MC Dropout uncertainty)
+- **Transformer:** BatteryGPT (nano decoder-only), TFT (Temporal Fusion)
+- **iTransformer:** Vanilla, Physics-Informed (dual-head), Dynamic-Graph
+### Generative
+- **VAE-LSTM:** Variational autoencoder with LSTM encoder/decoder, health head, anomaly detection
+### Ensemble
+- **Stacking:** Out-of-fold + Ridge meta-learner
+- **Weighted Average:** L-BFGS-B optimized weights
+---
+## API Endpoints
+| Method | Path | Description |
+|--------|------|-------------|
+| POST | `/api/predict` | Single-cycle SOH prediction (default: v2 models) |
+| POST | `/api/v1/predict` | Predict using v1 models (cross-battery split) |
+| POST | `/api/v2/predict` | Predict using v2 models (chrono split, bug-fixed) |
+| POST | `/api/predict/ensemble` | Always uses BestEnsemble |
+| POST | `/api/predict/batch` | Multi-cycle batch prediction |
+| POST | `/api/recommend` | Operational recommendations |
+| POST | `/api/v2/recommend` | v2 recommendations (fixed baseline) |
+| GET | `/api/models` | List all models with version / R² metadata |
+| GET | `/api/v1/models` | List v1 models |
+| GET | `/api/v2/models` | List v2 models |
+| GET | `/api/models/versions` | Group models by generation (v1 / v2) |
+| GET | `/api/dashboard` | Full dashboard data |
+| GET | `/api/batteries` | List all batteries |
+| GET | `/api/battery/{id}/capacity` | Per-battery capacity curve |
+| GET | `/api/figures` | List saved figures (PNG only) |
+| GET | `/api/figures/{name}` | Serve a figure |
+| GET | `/health` | Liveness probe |
+All endpoints are documented interactively at **`/docs`** (Swagger UI) and **`/redoc`**.
+---
+## License
+This project is for academic and research purposes.
+Dataset: NASA PCoE public domain.

STRUCTURE.md ADDED Viewed

	@@ -0,0 +1,143 @@

+# Project Structure (v2.0)
+## Root Level Organization
+```
+aiBatteryLifecycle/
+├── 📂 api/                    FastAPI backend with model registry
+├── 📂 artifacts/              Versioned model artifacts & results
+│   ├── v1/                    Legacy models (cross-battery train/test)
+│   └── v2/                    Production models (intra-battery split) ✓ ACTIVE
+│       ├── models/            Trained models: {classical, deep, ensemble}
+│       ├── scalers/           Feature scalers (StandardScaler for linear models)
+│       ├── results/           CSV results, feature matrices, metrics
+│       ├── figures/           Visualizations: PNG charts, HTML reports
+│       ├── logs/              Training/inference logs
+│       └── features/          Feature engineering artifacts
+├── 📂 cleaned_dataset/        Raw battery test data
+│   ├── data/                  CSV files per battery (00001.csv - 00137.csv)
+│   ├── extra_infos/           Supplementary metadata
+│   └── metadata.csv           Battery inventory
+├── 📂 docs/                   Documentation markdown files
+├── 📂 frontend/               React SPA (TypeScript, Vite)
+├── 📂 notebooks/              Jupyter analysis & training (01-09)
+├── 📂 reference/              External papers & reference notebooks
+├── 📂 scripts/                Organized utility scripts
+│   ├── data/                  Data processing (write_nb03_v2, patch_dl_notebooks_v2)
+│   ├── models/                Model training (retrain_classical)
+│   ├── __init__.py            Package marker
+│   └── README (in data/models/)
+├── 📂 src/                    Core Python library
+│   ├── data/                  Data loading & preprocessing
+│   ├── evaluation/            Metrics & validation
+│   ├── models/                Model architectures
+│   ├── utils/                 Config, logging, helpers
+│   └── __init__.py
+├── 📂 tests/                  ✓ NEW: Test & validation scripts
+│   ├── test_v2_models.py      Comprehensive v2 validation
+│   ├── test_predictions.py    Quick endpoint test
+│   ├── __init__.py
+│   └── README.md
+├── 📄 CHANGELOG.md            Version history & updates
+├── 📄 VERSION.md              ✓ NEW: Versioning & versioning guide
+├── 📄 README.md               Project overview
+├── 📄 requirements.txt        Python dependencies
+├── 📄 package.json            Node.js dependencies (frontend)
+├── 📄 Dockerfile              Docker configuration
+├── 📄 docker-compose.yml      Multi-container orchestration
+├── 📄 tsconfig.json           TypeScript config (frontend)
+└── 📄 vite.config.ts          Vite bundler config (frontend)
+```
+## Key Changes in V2 Reorganization
+### ✓ Completed
+1. **Versioned Artifacts**
+   - Moved `artifacts/models/` → `artifacts/v2/models/`
+   - Moved `artifacts/scalers/` → `artifacts/v2/scalers/`
+   - Moved `artifacts/figures/` → `artifacts/v2/figures/`
+   - All result CSVs → `artifacts/v2/results/`
+   - Clean `artifacts/` root (only v1 and v2 subdirs)
+2. **Organized Scripts**
+   - Created `scripts/data/` for data processing utilities
+   - Created `scripts/models/` for model training scripts
+   - All scripts now using `get_version_paths('v2')`
+   - Path: `scripts/retrain_classical.py` → `scripts/models/retrain_classical.py`
+3. **Centralized Tests**
+   - Created `tests/` folder at project root
+   - Moved `test_v2_models.py` → `tests/`
+   - Moved `test_predictions.py` → `tests/`
+   - Added `tests/README.md` with usage guide
+   - All tests now using `artifacts/v2/` paths
+4. **Updated Imports & Paths**
+   - `test_v2_models.py`: Uses `v2['results']` for data loading
+   - `retrain_classical.py`: Uses `get_version_paths('v2')` for artifact saving
+   - API: Already defaults to `registry_v2`
+   - Notebooks NB03-09: Already use `get_version_paths()`
+### Code Changes Summary
+| File | Change | Result |
+|------|--------|--------|
+| `tests/test_v2_models.py` | Updated artifact paths to use v2 | Output → `artifacts/v2/{results,figures}` |
+| `scripts/models/retrain_classical.py` | Uses `get_version_paths('v2')` | Models saved to `artifacts/v2/models/classical/` |
+| `api/model_registry.py` | Already has versioning support | No changes needed |
+| `src/utils/config.py` | Already supports versioning | No changes needed |
+| Notebooks NB03-09 | Already use `get_version_paths()` | No changes needed |
+## Running Tests After Reorganization
+```bash
+# From project root
+python tests/test_v2_models.py      # Full v2 validation
+python tests/test_predictions.py    # Quick endpoint test
+python scripts/models/retrain_classical.py  # Retrain models
+```
+## Artifact Access in Code
+### Before (V1 - hardcoded paths)
+```python
+model_path = "artifacts/models/classical/rf.joblib"
+results_csv = "artifacts/results.csv"
+```
+### After (V2 - versioned paths via config)
+```python
+from src.utils.config import get_version_paths
+v2 = get_version_paths('v2')
+model_path = v2['models_classical'] / 'rf.joblib'
+results_csv = v2['results'] / 'results.csv'
+```
+## Production Readiness
+| Aspect | Status | Notes |
+|--------|--------|-------|
+| Versioning | ✓ Complete | All artifacts under `v2/` |
+| Structure | ✓ Organized | Scripts, tests, notebooks organized |
+| Configuration | ✓ Active | `ACTIVE_VERSION = 'v2'` in config |
+| API | ✓ Ready | Defaults to `registry_v2` |
+| Tests | ✓ Available | `tests/test_v2_models.py` for validation |
+| Documentation | ✓ Added | VERSION.md and README files created |
+## Forward Compatibility
+### For Future Versions (v3, v4, etc.)
+Simply copy the v2 folder structure and update:
+```python
+# In src/utils/config.py
+ACTIVE_VERSION: str = "v3"
+# In scripts
+v3 = get_version_paths('v3')
+ensure_version_dirs('v3')
+```
+The system will automatically create versioned paths and maintain backward compatibility.

VERSION.md ADDED Viewed

	@@ -0,0 +1,123 @@

+# Project Versioning & Structure
+## Current Active Version: v2.0
+All active models, artifacts, and features use the **v2.0** versioning scheme.
+### Directory Structure
+```
+artifacts/
+├── v1/                     ← Legacy models (cross-battery split)
+│   ├── models/
+│   │   ├── classical/
+│   │   ├── deep/
+│   │   └── ensemble/
+│   ├── scalers/
+│   ├── figures/
+│   ├── results/
+│   ├── logs/
+│   └── features/
+│
+├── v2/                     ← Current production (intra-battery split) ✓
+│   ├── models/
+│   │   ├── classical/      ← 14 classical ML models
+│   │   ├── deep/           ← 8 deep learning models
+│   │   └── ensemble/       ← Weighted ensemble
+│   ├── scalers/            ← Feature scalers for linear models
+│   ├── figures/            ← All validation visualizations (PNG, HTML)
+│   ├── results/            ← CSV/JSON results and feature matrices
+│   ├── logs/               ← Training logs
+│   └── features/           ← Feature engineering artifacts
+```
+### V2 Key Changes from V1
+| Aspect | V1 | V2 |
+|--------|----|----|
+| **Data Split** | Cross-battery (groups of batteries) | Intra-battery chronological (first 80% cycles per battery) |
+| **Train/Test Contamination** | ⚠️ YES (same batteries in both) | ✓ NO (different time periods per battery) |
+| **Generalization** | Poor (batteries see same time periods) | Better (true temporal split) |
+| **Test Realism** | Interpolation (within-cycle prediction) | Extrapolation (future cycles) |
+| **Classical Models** | 6 standard models | 14 models (added ExtraTrees, GradientBoosting, KNN ×3) |
+| **Deep Models** | 8 models | Retraining in progress |
+| **Ensemble** | RF + XGB + LGB (v1 trained) | RF + XGB + LGB (v2 trained when available) |
+### Model Statistics
+#### Classical Models (V2)
+- **Total:** 14 models
+- **Target Metric:** Within-±5% SOH accuracy ≥ 95%
+- **Current Pass Rate:** See `artifacts/v2/results/v2_validation_report.html`
+#### Configuration
+**Active version is set in** `src/utils/config.py`:
+```python
+ACTIVE_VERSION: str = "v2"
+```
+**API defaults to v2:**
+```python
+registry = registry_v2  # Default registry (v2.0.0 models)
+```
+### Migration Checklist ✓
+- ✓ Created versioned artifact directories under `artifacts/v2/`
+- ✓ Moved all v2 models to `artifacts/v2/models/classical/` etc.
+- ✓ Moved all results to `artifacts/v2/results/`
+- ✓ Moved all figures to `artifacts/v2/figures/`
+- ✓ Moved all scalers to `artifacts/v2/scalers/`
+- ✓ Updated notebooks (NB03-09) to use `get_version_paths('v2')`
+- ✓ Updated API to default to v2 registry
+- ✓ Organized scripts into `scripts/data/`, `scripts/models/`
+- ✓ Moved tests to `tests/` folder
+- ✓ Cleaned up legacy artifact directories
+### File Locations
+| Content | Path |
+|---------|------|
+| Models (classical) | `artifacts/v2/models/classical/*.joblib` |
+| Models (deep) | `artifacts/v2/models/deep/*.pth` |
+| Models (ensemble) | `artifacts/v2/models/ensemble/*.joblib` |
+| Scalers | `artifacts/v2/scalers/*.joblib` |
+| Results CSV | `artifacts/v2/results/*.csv` |
+| Feature matrix | `artifacts/v2/results/battery_features.csv` |
+| Visualizations | `artifacts/v2/figures/*.{png,html}` |
+| Logs | `artifacts/v2/logs/*.log` |
+### Running Scripts
+```bash
+# Run v2 model validation test
+python tests/test_v2_models.py
+# Run quick prediction test
+python tests/test_predictions.py
+# Retrain classical models (WARNING: takes ~30 min)
+python scripts/models/retrain_classical.py
+# Generate/patch notebooks (one-time utilities)
+python scripts/data/write_nb03_v2.py
+python scripts/data/patch_dl_notebooks_v2.py
+```
+### Next Steps
+1. ✓ Verify v2 model accuracy meets thresholds
+2. ✓ Update research paper with v2 results
+3. ✓ Complete research notes for all notebooks
+4. ✓ Test cycle recommendation engine
+5. Deploy v2 to production
+### Version History
+| Version | Date | Status | Notes |
+|---------|------|--------|-------|
+| v1.0 | 2025-Q1 | ✓ Complete | Classical + Deep models, cross-battery split |
+| v2.0 | 2026-02-25 | ✓ Active | Intra-battery split, improved generalization |
+| v3.0 | TBD | -- | Physics-informed models, uncertainty quantification |

api/__init__.py ADDED Viewed

	@@ -0,0 +1 @@


1	+ # api — FastAPI backend + Gradio interface

api/gradio_app.py ADDED Viewed

	@@ -0,0 +1,189 @@

+"""
+api.gradio_app
+==============
+Gradio interface for interactive battery SOH / RUL prediction.
+Mounted at /gradio inside the FastAPI application.
+"""
+from __future__ import annotations
+import gradio as gr
+import numpy as np
+import pandas as pd
+import plotly.graph_objects as go
+from api.model_registry import registry, classify_degradation, soh_to_color
+# ── Prediction function ──────────────────────────────────────────────────────
+def predict_soh(
+    cycle_number: int,
+    ambient_temperature: float,
+    peak_voltage: float,
+    min_voltage: float,
+    avg_current: float,
+    avg_temp: float,
+    temp_rise: float,
+    cycle_duration: float,
+    Re: float,
+    Rct: float,
+    delta_capacity: float,
+    model_name: str,
+):
+    features = {
+        "cycle_number": cycle_number,
+        "ambient_temperature": ambient_temperature,
+        "peak_voltage": peak_voltage,
+        "min_voltage": min_voltage,
+        "voltage_range": peak_voltage - min_voltage,
+        "avg_current": avg_current,
+        "avg_temp": avg_temp,
+        "temp_rise": temp_rise,
+        "cycle_duration": cycle_duration,
+        "Re": Re,
+        "Rct": Rct,
+        "delta_capacity": delta_capacity,
+    }
+    name = model_name if model_name != "auto" else None
+    result = registry.predict(features, model_name=name)
+    soh = result["soh_pct"]
+    rul = result["rul_cycles"]
+    state = result["degradation_state"]
+    model_used = result["model_used"]
+    ci_lo = result.get("confidence_lower", soh - 2)
+    ci_hi = result.get("confidence_upper", soh + 2)
+    # Summary text
+    summary = (
+        f"## Prediction Result\n\n"
+        f"- **SOH:** {soh:.1f}%\n"
+        f"- **RUL:** {rul:.0f} cycles\n"
+        f"- **State:** {state}\n"
+        f"- **95% CI:** [{ci_lo:.1f}%, {ci_hi:.1f}%]\n"
+        f"- **Model:** {model_used}\n"
+    )
+    # SOH gauge figure
+    fig = go.Figure(go.Indicator(
+        mode="gauge+number+delta",
+        value=soh,
+        title={"text": "State of Health (%)"},
+        delta={"reference": 100, "decreasing": {"color": "red"}},
+        gauge={
+            "axis": {"range": [0, 100]},
+            "bar": {"color": soh_to_color(soh)},
+            "steps": [
+                {"range": [0, 70], "color": "#fee2e2"},
+                {"range": [70, 80], "color": "#fef3c7"},
+                {"range": [80, 90], "color": "#fef9c3"},
+                {"range": [90, 100], "color": "#dcfce7"},
+            ],
+            "threshold": {
+                "line": {"color": "red", "width": 3},
+                "thickness": 0.75,
+                "value": 70,
+            },
+        },
+    ))
+    fig.update_layout(height=350)
+    return summary, fig
+# ── Capacity trajectory ──────────────────────────────────────────────────────
+def plot_capacity_trajectory(battery_id: str):
+    from pathlib import Path
+    meta_path = Path(__file__).resolve().parents[1] / "cleaned_dataset" / "metadata.csv"
+    if not meta_path.exists():
+        return None
+    meta = pd.read_csv(meta_path)
+    sub = meta[meta["battery_id"] == battery_id].sort_values("start_time")
+    if sub.empty:
+        return None
+    caps = sub["Capacity"].dropna().values
+    cycles = np.arange(1, len(caps) + 1)
+    soh = (caps / 2.0) * 100
+    fig = go.Figure()
+    fig.add_trace(go.Scatter(
+        x=cycles, y=soh, mode="lines+markers",
+        marker=dict(size=3), line=dict(width=2),
+        name=battery_id,
+    ))
+    fig.add_hline(y=70, line_dash="dash", line_color="red",
+                  annotation_text="EOL (70%)")
+    fig.update_layout(
+        title=f"SOH Trajectory — {battery_id}",
+        xaxis_title="Cycle", yaxis_title="SOH (%)",
+        template="plotly_white", height=400,
+    )
+    return fig
+# ── Build Gradio app ─────────────────────────────────────────────────────────
+def create_gradio_app() -> gr.Blocks:
+    model_choices = ["auto"] + [m["name"] for m in registry.list_models()]
+    with gr.Blocks(
+        title="Battery Lifecycle Predictor",
+    ) as demo:
+        gr.Markdown(
+            "# AI Battery Lifecycle Predictor\n"
+            "Predict **State of Health (SOH)** and **Remaining Useful Life (RUL)** "
+            "using machine-learning models trained on the NASA PCoE Li-ion Battery Dataset."
+        )
+        with gr.Tab("Predict"):
+            with gr.Row():
+                with gr.Column(scale=1):
+                    cycle_number = gr.Number(label="Cycle Number", value=100, precision=0)
+                    ambient_temp = gr.Slider(0, 60, value=24, label="Ambient Temperature (°C)")
+                    peak_v = gr.Number(label="Peak Voltage (V)", value=4.2)
+                    min_v = gr.Number(label="Min Voltage (V)", value=2.7)
+                    avg_curr = gr.Number(label="Avg Discharge Current (A)", value=2.0)
+                    avg_t = gr.Number(label="Avg Cell Temp (°C)", value=25.0)
+                    temp_rise = gr.Number(label="Temp Rise (°C)", value=3.0)
+                    duration = gr.Number(label="Cycle Duration (s)", value=3600)
+                    re = gr.Number(label="Re (Ω)", value=0.04)
+                    rct = gr.Number(label="Rct (Ω)", value=0.02)
+                    delta_cap = gr.Number(label="ΔCapacity (Ah)", value=-0.005)
+                    model_dd = gr.Dropdown(choices=model_choices, value="auto", label="Model")
+                    btn = gr.Button("Predict", variant="primary")
+                with gr.Column(scale=1):
+                    result_md = gr.Markdown()
+                    gauge = gr.Plot(label="SOH Gauge")
+            btn.click(
+                fn=predict_soh,
+                inputs=[cycle_number, ambient_temp, peak_v, min_v, avg_curr,
+                        avg_t, temp_rise, duration, re, rct, delta_cap, model_dd],
+                outputs=[result_md, gauge],
+            )
+        with gr.Tab("Battery Explorer"):
+            bid_input = gr.Textbox(label="Battery ID", value="B0005", placeholder="e.g., B0005")
+            explore_btn = gr.Button("Load Trajectory")
+            cap_plot = gr.Plot(label="Capacity Trajectory")
+            explore_btn.click(fn=plot_capacity_trajectory, inputs=[bid_input], outputs=[cap_plot])
+        with gr.Tab("About"):
+            gr.Markdown(
+                "## About\n\n"
+                "This application predicts Li-ion battery degradation using models trained on the "
+                "**NASA Prognostics Center of Excellence (PCoE)** Battery Dataset.\n\n"
+                "### Models\n"
+                "- Classical ML: Ridge, Lasso, ElasticNet, KNN, SVR, Random Forest, XGBoost, LightGBM\n"
+                "- Deep Learning: LSTM (4 variants), BatteryGPT, TFT, iTransformer (3 variants), VAE-LSTM\n"
+                "- Ensemble: Stacking, Weighted Average\n\n"
+                "### Dataset\n"
+                "- 36 Li-ion 18650 cells (nominal 2.0Ah)\n"
+                "- Charge/discharge/impedance cycles at three temperature regimes\n"
+                "- End-of-Life: 30% capacity fade (1.4Ah)\n\n"
+                "### Reference\n"
+                "B. Saha and K. Goebel (2007). *Battery Data Set*, NASA Prognostics Data Repository."
+            )
+    return demo

api/main.py ADDED Viewed

	@@ -0,0 +1,159 @@

+"""
+api.main
+========
+FastAPI application entry-point for the AI Battery Lifecycle Predictor.
+Architecture
+------------
+- **v1 (Classical)**    : Ridge, Lasso, ElasticNet, KNN ×3, SVR,
+                          Random Forest, XGBoost, LightGBM
+- **v2 (Deep)**         : Vanilla LSTM, BiLSTM, GRU, Attention LSTM,
+                          BatteryGPT, TFT, iTransformer ×3, VAE-LSTM
+- **v2.6 (Ensemble)**   : BestEnsemble — weighted average of RF + XGB + LGB
+                          (weights proportional to R²)
+Mounted routes
+--------------
+- ``/api/*``      REST endpoints  (predict, batch, recommend, models, visualize)
+- ``/gradio``     Gradio interactive demo  (optional, requires *gradio* package)
+- ``/``           React SPA  (served from ``frontend/dist/``)
+Key endpoints
+-------------
+- ``POST /api/predict``          — single-cycle SOH + RUL prediction
+- ``POST /api/predict/ensemble`` — always uses BestEnsemble (v2.6)
+- ``POST /api/predict/batch``    — batch prediction from JSON array
+- ``GET  /api/models``           — list all models with version / R² metadata
+- ``GET  /api/models/versions``  — group models by generation (v1/v2)
+- ``GET  /health``               — liveness probe
+Run locally
+-----------
+::
+    uvicorn api.main:app --host 0.0.0.0 --port 7860 --reload
+Docker
+------
+::
+    docker compose up --build
+"""
+from __future__ import annotations
+from contextlib import asynccontextmanager
+from pathlib import Path
+from fastapi import FastAPI
+from fastapi.middleware.cors import CORSMiddleware
+from fastapi.staticfiles import StaticFiles
+from fastapi.responses import FileResponse
+from api.model_registry import registry, registry_v1, registry_v2
+from api.schemas import HealthResponse
+from src.utils.logger import get_logger
+log = get_logger(__name__)
+__version__ = "2.0.0"
+# ── Static frontend path ────────────────────────────────────────────────────
+_HERE = Path(__file__).resolve().parent
+_FRONTEND_DIST = _HERE.parent / "frontend" / "dist"
+# ── Lifespan ─────────────────────────────────────────────────────────────────
+@asynccontextmanager
+async def lifespan(app: FastAPI):
+    """Load models on startup, clean up on shutdown."""
+    log.info("Loading model registries …")
+    registry_v1.load_all()
+    log.info("v1 registry ready — %d models loaded", registry_v1.model_count)
+    registry_v2.load_all()
+    log.info("v2 registry ready — %d models loaded", registry_v2.model_count)
+    yield
+    log.info("Shutting down battery-lifecycle API")
+# ── App ──────────────────────────────────────────────────────────────────────
+app = FastAPI(
+    title="AI Battery Lifecycle Predictor",
+    description=(
+        "Predict SOH, RUL, and degradation state of Li-ion batteries "
+        "using models trained on the NASA PCoE dataset."
+    ),
+    version=__version__,
+    lifespan=lifespan,
+    docs_url="/docs",
+    redoc_url="/redoc",
+)
+# CORS
+app.add_middleware(
+    CORSMiddleware,
+    allow_origins=["*"],
+    allow_credentials=True,
+    allow_methods=["*"],
+    allow_headers=["*"],
+)
+# ── Health check ─────────────────────────────────────────────────────────────
+@app.get("/health", response_model=HealthResponse, tags=["meta"])
+async def health():
+    return HealthResponse(
+        status="ok",
+        version=__version__,
+        models_loaded=registry_v1.model_count + registry_v2.model_count,
+        device=registry.device,
+    )
+# ── Include routers ──────────────────────────────────────────────────────────
+from api.routers.predict import router as predict_router, v1_router
+from api.routers.predict_v2 import router as predict_v2_router
+from api.routers.visualize import router as viz_router
+from api.routers.simulate import router as simulate_router
+app.include_router(predict_router)    # /api/* (default, uses v2 registry)
+app.include_router(v1_router)         # /api/v1/* (legacy v1 models)
+app.include_router(predict_v2_router) # /api/v2/* (v2 models, bug-fixed)
+app.include_router(simulate_router)   # /api/v2/simulate (ML-driven simulation)
+app.include_router(viz_router)
+# ── Mount Gradio ─────────────────────────────────────────────────────────────
+try:
+    import gradio as gr
+    from api.gradio_app import create_gradio_app
+    gradio_app = create_gradio_app()
+    app = gr.mount_gradio_app(app, gradio_app, path="/gradio")
+    log.info("Gradio UI mounted at /gradio")
+except ImportError:
+    log.warning("Gradio not installed — /gradio endpoint unavailable")
+# ── Serve React SPA ──────────────────────────────────────────────────────────
+if _FRONTEND_DIST.exists() and (_FRONTEND_DIST / "index.html").exists():
+    app.mount("/assets", StaticFiles(directory=str(_FRONTEND_DIST / "assets")), name="static-assets")
+    @app.get("/{full_path:path}", include_in_schema=False)
+    async def spa_catch_all(full_path: str):
+        """Serve React SPA for any path not matched by API routes."""
+        file_path = _FRONTEND_DIST / full_path
+        if file_path.is_file():
+            return FileResponse(file_path)
+        return FileResponse(_FRONTEND_DIST / "index.html")
+    log.info("React SPA served from %s", _FRONTEND_DIST)
+else:
+    @app.get("/", include_in_schema=False)
+    async def root():
+        return {
+            "message": "AI Battery Lifecycle Predictor API",
+            "docs": "/docs",
+            "gradio": "/gradio",
+            "health": "/health",
+        }

api/model_registry.py ADDED Viewed

	@@ -0,0 +1,794 @@

+"""
+api.model_registry
+==================
+Singleton model registry providing unified loading, versioning, and inference
+for all trained battery lifecycle models.
+Model versioning
+----------------
+* v1.x  — Classical (tree-based / linear) models trained in NB03.
+* v2.x  — Deep sequence models trained in NB04 – NB07.
+* v3.x  — Ensemble / meta-models trained in NB08.
+Usage
+-----
+    from api.model_registry import registry
+    registry.load_all()           # FastAPI lifespan startup
+    result = registry.predict(
+        features={"cycle_number": 150, ...},
+        model_name="best_ensemble",
+    )
+"""
+from __future__ import annotations
+import json
+from pathlib import Path
+from typing import Any
+import joblib
+import numpy as np
+import pandas as pd
+from src.utils.logger import get_logger
+log = get_logger(__name__)
+# ── Architecture constants (must match NB04 – NB07 training) ─────────────────
+_N_FEAT: int = 12       # len(FEATURE_COLS_SCALAR)
+_SEQ_LEN: int = 32      # WINDOW_SIZE
+_HIDDEN: int = 128      # LSTM_HIDDEN
+_LSTM_LAYERS: int = 2   # LSTM_LAYERS
+_ATTN_LAYERS: int = 3   # AttentionLSTM trained with n_layers=3
+_D_MODEL: int = 64      # TRANSFORMER_D_MODEL
+_N_HEADS: int = 4       # TRANSFORMER_NHEAD
+_TF_LAYERS: int = 2     # TRANSFORMER_NLAYERS
+_DROPOUT: float = 0.2   # DROPOUT
+# ── Paths ─────────────────────────────────────────────────────────────────────
+_HERE = Path(__file__).resolve().parent
+_PROJECT = _HERE.parent
+_MODELS_DIR = _PROJECT / "artifacts" / "models"
+_ARTIFACTS = _PROJECT / "artifacts"
+def _versioned_paths(version: str = "v1") -> dict[str, Path]:
+    """Return artifact paths for a specific model version (v1 or v2)."""
+    root = _PROJECT / "artifacts" / version
+    return {
+        "models_dir": root / "models",
+        "artifacts":  root,
+        "scalers":    root / "scalers",
+        "results":    root / "results",
+    }
+FEATURE_COLS_SCALAR: list[str] = [
+    "cycle_number", "ambient_temperature",
+    "peak_voltage", "min_voltage", "voltage_range",
+    "avg_current", "avg_temp", "temp_rise",
+    "cycle_duration", "Re", "Rct", "delta_capacity",
+]
+# ── Model catalog (single source of truth for versions & metadata) ────────────
+MODEL_CATALOG: dict[str, dict[str, Any]] = {
+    "random_forest":          {"version": "1.0.0", "display_name": "Random Forest",                  "family": "classical",    "algorithm": "RandomForestRegressor",       "target": "soh", "r2": 0.9567},
+    "xgboost":                {"version": "1.0.0", "display_name": "XGBoost",                        "family": "classical",    "algorithm": "XGBRegressor",                "target": "soh", "r2": 0.928},
+    "lightgbm":               {"version": "1.0.0", "display_name": "LightGBM",                       "family": "classical",    "algorithm": "LGBMRegressor",               "target": "soh", "r2": 0.928},
+    "ridge":                  {"version": "1.0.0", "display_name": "Ridge Regression",               "family": "classical",    "algorithm": "Ridge",                      "target": "soh", "r2": 0.72},
+    "svr":                    {"version": "1.0.0", "display_name": "SVR (RBF)",                      "family": "classical",    "algorithm": "SVR",                        "target": "soh", "r2": 0.805},
+    "lasso":                  {"version": "1.0.0", "display_name": "Lasso",                          "family": "classical",    "algorithm": "Lasso",                      "target": "soh", "r2": 0.52},
+    "elasticnet":             {"version": "1.0.0", "display_name": "ElasticNet",                     "family": "classical",    "algorithm": "ElasticNet",                 "target": "soh", "r2": 0.52},
+    "knn_k5":                 {"version": "1.0.0", "display_name": "KNN (k=5)",                     "family": "classical",    "algorithm": "KNeighborsRegressor",        "target": "soh", "r2": 0.72},
+    "knn_k10":                {"version": "1.0.0", "display_name": "KNN (k=10)",                    "family": "classical",    "algorithm": "KNeighborsRegressor",        "target": "soh", "r2": 0.724},
+    "knn_k20":                {"version": "1.0.0", "display_name": "KNN (k=20)",                    "family": "classical",    "algorithm": "KNeighborsRegressor",        "target": "soh", "r2": 0.717},
+    "extra_trees":            {"version": "2.0.0", "display_name": "ExtraTrees",                     "family": "classical",    "algorithm": "ExtraTreesRegressor",        "target": "soh", "r2": 0.967},
+    "gradient_boosting":      {"version": "2.0.0", "display_name": "GradientBoosting",               "family": "classical",    "algorithm": "GradientBoostingRegressor",  "target": "soh", "r2": 0.934},
+    "vanilla_lstm":           {"version": "2.0.0", "display_name": "Vanilla LSTM",                   "family": "deep_pytorch", "algorithm": "VanillaLSTM",                "target": "soh", "r2": 0.507},
+    "bidirectional_lstm":     {"version": "2.0.0", "display_name": "Bidirectional LSTM",             "family": "deep_pytorch", "algorithm": "BidirectionalLSTM",          "target": "soh", "r2": 0.520},
+    "gru":                    {"version": "2.0.0", "display_name": "GRU",                            "family": "deep_pytorch", "algorithm": "GRUModel",                   "target": "soh", "r2": 0.510},
+    "attention_lstm":         {"version": "2.0.0", "display_name": "Attention LSTM",                 "family": "deep_pytorch", "algorithm": "AttentionLSTM",              "target": "soh", "r2": 0.540},
+    "batterygpt":             {"version": "2.1.0", "display_name": "BatteryGPT",                     "family": "deep_pytorch", "algorithm": "BatteryGPT",                "target": "soh", "r2": 0.881},
+    "tft":                    {"version": "2.2.0", "display_name": "Temporal Fusion Transformer",    "family": "deep_pytorch", "algorithm": "TemporalFusionTransformer",    "target": "soh", "r2": 0.881},
+    "vae_lstm":               {"version": "2.3.0", "display_name": "VAE-LSTM",                       "family": "deep_pytorch", "algorithm": "VAE_LSTM",               "target": "soh", "r2": 0.730},
+    "itransformer":           {"version": "2.4.0", "display_name": "iTransformer",                   "family": "deep_keras",   "algorithm": "iTransformer",               "target": "soh", "r2": 0.595},
+    "physics_itransformer":   {"version": "2.4.1", "display_name": "Physics iTransformer",           "family": "deep_keras",   "algorithm": "PhysicsITransformer",        "target": "soh", "r2": 0.600},
+    "dynamic_graph_itransformer": {"version": "2.5.0", "display_name": "DG-iTransformer",           "family": "deep_keras",   "algorithm": "DynamicGraphITransformer",   "target": "soh", "r2": 0.595},
+    "best_ensemble":          {"version": "3.0.0", "display_name": "Best Ensemble (RF+XGB+LGB)",     "family": "ensemble",     "algorithm": "WeightedAverage",            "target": "soh", "r2": 0.957},
+}
+# R²-proportional weights for BestEnsemble
+_ENSEMBLE_WEIGHTS: dict[str, float] = {
+    "random_forest": 0.957,
+    "xgboost":       0.928,
+    "lightgbm":      0.928,
+    "extra_trees":   0.967,
+    "gradient_boosting": 0.934,
+}
+# ── Degradation state ───────────────────────────────────────────────────────
+def classify_degradation(soh: float) -> str:
+    if soh >= 90:
+        return "Healthy"
+    elif soh >= 80:
+        return "Moderate"
+    elif soh >= 70:
+        return "Degraded"
+    else:
+        return "End-of-Life"
+def soh_to_color(soh: float) -> str:
+    """Map SOH percentage to a hex colour (green→yellow→red)."""
+    if soh >= 90:
+        return "#22c55e"   # green
+    elif soh >= 80:
+        return "#eab308"   # yellow
+    elif soh >= 70:
+        return "#f97316"   # orange
+    else:
+        return "#ef4444"   # red
+# ── Registry ─────────────────────────────────────────────────────────────────
+class ModelRegistry:
+    """Thread-safe singleton that owns all model objects and inference logic.
+    Attributes
+    ----------
+    models:
+        Mapping from name to loaded model object (sklearn/XGBoost/LightGBM
+        or PyTorch ``nn.Module`` or Keras model).
+    default_model:
+        Name of the best available model (set by :meth:`_choose_default`).
+    device:
+        PyTorch device string — ``"cuda"`` when a GPU is available, else ``"cpu"``.
+    """
+    # Model families that need the linear StandardScaler at inference
+    _LINEAR_FAMILIES = {"ridge", "lasso", "elasticnet", "svr",
+                        "knn_k5", "knn_k10", "knn_k20"}
+    # Tree families that are scale-invariant (no scaler needed)
+    _TREE_FAMILIES = {"random_forest", "xgboost", "lightgbm", "best_ensemble",
+                      "extra_trees", "gradient_boosting"}
+    def __init__(self, version: str = "v1"):
+        self.models: dict[str, Any] = {}
+        self.model_meta: dict[str, dict] = {}
+        self.default_model: str | None = None
+        self.scaler = None          # kept for backward compat
+        self.linear_scaler = None   # StandardScaler for Ridge/Lasso/SVR/KNN
+        self.sequence_scaler = None # StandardScaler for sequence deep models
+        self.device = "cpu"
+        self.version = version
+        # Set version-aware paths
+        vp = _versioned_paths(version)
+        self._models_dir = vp["models_dir"]
+        self._artifacts = vp["artifacts"]
+        self._scalers_dir = vp["scalers"]
+    # ── Loading ──────────────────────────────────────────────────────────
+    def load_all(self) -> None:
+        """Scan artifacts/models and load all available model artefacts.
+        Safe to call multiple times — subsequent calls are no-ops when the
+        registry is already populated.
+        """
+        if self.models:
+            log.debug("Registry already populated — skipping load_all()")
+            return
+        self._detect_device()
+        self._load_scaler()
+        self._load_classical()
+        self._load_deep_pytorch()
+        self._load_deep_keras()
+        self._register_ensemble()
+        self._choose_default()
+        log.info(
+            "Registry ready — %d models active, default='%s', device=%s",
+            len(self.models), self.default_model, self.device,
+        )
+    def _detect_device(self) -> None:
+        """Detect PyTorch compute device (CUDA > CPU)."""
+        try:
+            import torch
+            self.device = "cuda" if torch.cuda.is_available() else "cpu"
+            log.info("PyTorch device: %s", self.device)
+        except ImportError:
+            log.info("torch not installed — deep PyTorch models unavailable")
+    def _load_classical(self) -> None:
+        """Eagerly load all sklearn/XGBoost/LightGBM joblib artefacts."""
+        cdir = self._models_dir / "classical"
+        if not cdir.exists():
+            log.warning("Classical models dir not found: %s", cdir)
+            return
+        for p in sorted(cdir.glob("*.joblib")):
+            name = p.stem
+            # Skip non-model dumps (param search results, classifiers)
+            if "best_params" in name or "classifier" in name:
+                continue
+            try:
+                self.models[name] = joblib.load(p)
+                catalog = MODEL_CATALOG.get(name, {})
+                self.model_meta[name] = {
+                    **catalog,
+                    "family": "classical",
+                    "loaded": True,
+                    "path": str(p),
+                }
+                log.info("Loaded classical: %-22s  v%s", name, catalog.get("version", "?"))
+            except Exception as exc:
+                log.warning("Failed to load %s: %s", p.name, exc)
+    def _build_pytorch_model(self, name: str) -> Any | None:
+        """Instantiate a PyTorch module with the architecture used during training."""
+        try:
+            if name == "vanilla_lstm":
+                from src.models.deep.lstm import VanillaLSTM
+                return VanillaLSTM(_N_FEAT, _HIDDEN, _LSTM_LAYERS, _DROPOUT)
+            if name == "bidirectional_lstm":
+                from src.models.deep.lstm import BidirectionalLSTM
+                return BidirectionalLSTM(_N_FEAT, _HIDDEN, _LSTM_LAYERS, _DROPOUT)
+            if name == "gru":
+                from src.models.deep.lstm import GRUModel
+                return GRUModel(_N_FEAT, _HIDDEN, _LSTM_LAYERS, _DROPOUT)
+            if name == "attention_lstm":
+                from src.models.deep.lstm import AttentionLSTM
+                return AttentionLSTM(_N_FEAT, _HIDDEN, _ATTN_LAYERS, _DROPOUT)
+            if name == "batterygpt":
+                from src.models.deep.transformer import BatteryGPT
+                return BatteryGPT(
+                    input_dim=_N_FEAT, d_model=_D_MODEL, n_heads=_N_HEADS,
+                    n_layers=_TF_LAYERS, dropout=_DROPOUT, max_len=64,
+                )
+            if name == "tft":
+                from src.models.deep.transformer import TemporalFusionTransformer
+                return TemporalFusionTransformer(
+                    n_features=_N_FEAT, d_model=_D_MODEL, n_heads=_N_HEADS,
+                    n_layers=_TF_LAYERS, dropout=_DROPOUT,
+                )
+            if name == "vae_lstm":
+                from src.models.deep.vae_lstm import VAE_LSTM
+                return VAE_LSTM(
+                    input_dim=_N_FEAT, seq_len=_SEQ_LEN,
+                    hidden_dim=_HIDDEN, latent_dim=16,
+                    n_layers=_LSTM_LAYERS, dropout=_DROPOUT,
+                )
+        except Exception as exc:
+            log.warning("Cannot build PyTorch model '%s': %s", name, exc)
+        return None
+    def _load_deep_pytorch(self) -> None:
+        """Load PyTorch .pt state-dict files into reconstructed model instances."""
+        ddir = self._models_dir / "deep"
+        if not ddir.exists():
+            return
+        try:
+            import torch
+        except ImportError:
+            log.info("torch not installed — skipping deep PyTorch model loading")
+            return
+        for p in sorted(ddir.glob("*.pt")):
+            name = p.stem
+            model = self._build_pytorch_model(name)
+            if model is None:
+                self.model_meta[name] = {
+                    **MODEL_CATALOG.get(name, {}),
+                    "family": "deep_pytorch", "loaded": False,
+                    "path": str(p), "load_error": "architecture unavailable",
+                }
+                continue
+            try:
+                state = torch.load(p, map_location=self.device, weights_only=True)
+                model.load_state_dict(state)
+                model.to(self.device)
+                model.eval()
+                self.models[name] = model
+                catalog = MODEL_CATALOG.get(name, {})
+                self.model_meta[name] = {
+                    **catalog, "family": "deep_pytorch",
+                    "loaded": True, "path": str(p),
+                }
+                log.info("Loaded PyTorch:   %-22s  v%s", name, catalog.get("version", "?"))
+            except Exception as exc:
+                log.warning("Could not load PyTorch '%s': %s", name, exc)
+                self.model_meta[name] = {
+                    **MODEL_CATALOG.get(name, {}),
+                    "family": "deep_pytorch", "loaded": False,
+                    "path": str(p), "load_error": str(exc),
+                }
+    def _load_deep_keras(self) -> None:
+        """Load TensorFlow/Keras .keras model files."""
+        ddir = self._models_dir / "deep"
+        if not ddir.exists():
+            return
+        try:
+            import tensorflow as tf
+        except ImportError:
+            log.info("TensorFlow not installed — skipping Keras model loading")
+            return
+        # Import the custom Keras classes so they are registered before load
+        try:
+            from src.models.deep.itransformer import (
+                FeatureWiseMHA,
+                TokenWiseMHA,
+                Conv1DFeedForward,
+                DynamicGraphConv,
+                PhysicsInformedLoss,
+                AbsCumCurrentLayer,
+            )
+            _custom_objects: dict = {
+                "FeatureWiseMHA": FeatureWiseMHA,
+                "TokenWiseMHA": TokenWiseMHA,
+                "Conv1DFeedForward": Conv1DFeedForward,
+                "DynamicGraphConv": DynamicGraphConv,
+                "PhysicsInformedLoss": PhysicsInformedLoss,
+                "AbsCumCurrentLayer": AbsCumCurrentLayer,
+            }
+        except Exception as imp_err:
+            log.warning("Could not import iTransformer custom classes: %s", imp_err)
+            _custom_objects = {}
+        for p in sorted(ddir.glob("*.keras")):
+            name = p.stem
+            try:
+                model = tf.keras.models.load_model(str(p), custom_objects=_custom_objects, safe_mode=False)
+                self.models[name] = model
+                catalog = MODEL_CATALOG.get(name, {})
+                self.model_meta[name] = {
+                    **catalog, "family": "deep_keras",
+                    "loaded": True, "path": str(p),
+                }
+                log.info("Loaded Keras:     %-22s  v%s", name, catalog.get("version", "?"))
+            except Exception as exc:
+                log.warning("Could not load Keras '%s': %s", name, exc)
+                self.model_meta[name] = {
+                    **MODEL_CATALOG.get(name, {}),
+                    "family": "deep_keras", "loaded": False,
+                    "path": str(p), "load_error": str(exc),
+                }
+    def _register_ensemble(self) -> None:
+        """Register the BestEnsemble virtual model when components are loaded."""
+        available = [m for m in _ENSEMBLE_WEIGHTS if m in self.models]
+        if not available:
+            log.warning("BestEnsemble: no component models loaded")
+            return
+        self.models["best_ensemble"] = "virtual_ensemble"
+        self.model_meta["best_ensemble"] = {
+            **MODEL_CATALOG["best_ensemble"],
+            "components": available, "loaded": True,
+        }
+        log.info("BestEnsemble registered — components: %s", ", ".join(available))
+    def _load_scaler(self) -> None:
+        # Scaler mapping (from notebooks/03_classical_ml.ipynb):
+        #   standard_scaler.joblib  — StandardScaler fitted on X_train
+        #                             Used for: SVR, Ridge, Lasso, ElasticNet, KNN
+        #   sequence_scaler.joblib  — StandardScaler for deep-model sequences
+        #   Tree models (RF, ET, GB, XGB, LGB) were fitted on raw numpy X_train
+        #                             → NO scaler applied, passed as-is
+        #
+        # Both standard_scaler.joblib and linear_scaler.joblib are identical
+        # (same mean_ / scale_). Prefer standard_scaler.joblib (canonical name
+        # from training notebook), fall back to linear_scaler.joblib.
+        scalers_dir = self._scalers_dir
+        for fname in ("standard_scaler.joblib", "linear_scaler.joblib"):
+            sp = scalers_dir / fname
+            if sp.exists():
+                try:
+                    self.linear_scaler = joblib.load(sp)
+                    log.info("Linear scaler loaded from %s", sp)
+                    break
+                except Exception as exc:
+                    log.warning("Could not load %s: %s", fname, exc)
+        else:
+            log.warning("No linear scaler found — Ridge/Lasso/SVR/KNN will use raw features")
+        sp_seq = scalers_dir / "sequence_scaler.joblib"
+        if sp_seq.exists():
+            try:
+                self.sequence_scaler = joblib.load(sp_seq)
+                log.info("Sequence scaler loaded from %s", sp_seq)
+            except Exception as exc:
+                log.warning("Could not load sequence_scaler.joblib: %s", exc)
+        else:
+            log.warning("sequence_scaler.joblib not found — deep models will use raw features")
+    def _choose_default(self) -> None:
+        """Select the highest-quality loaded model as the registry default."""
+        priority = [
+            "best_ensemble",
+            "extra_trees",
+            "random_forest",
+            "xgboost",
+            "lightgbm",
+            "gradient_boosting",
+            "tft",
+            "batterygpt",
+            "attention_lstm",
+            "ridge",
+        ]
+        for name in priority:
+            if name in self.models:
+                self.default_model = name
+                log.info("Default model: %s", name)
+                return
+        if self.models:
+            self.default_model = next(iter(self.models))
+            log.info("Default model (fallback): %s", self.default_model)
+    # ── Metrics retrieval ────────────────────────────────────────────────
+    def get_metrics(self) -> dict[str, dict[str, float]]:
+        """Return unified evaluation metrics from results CSV/JSON artefacts.
+        CSV model name headers are normalised to lower-case underscore keys.
+        Entries missing from result files fall back to the ``r2`` field in
+        :data:`MODEL_CATALOG`.
+        """
+        _normalise = {
+            "RandomForest": "random_forest", "LightGBM": "lightgbm",
+            "XGBoost": "xgboost", "SVR": "svr", "Ridge": "ridge",
+            "Lasso": "lasso", "ElasticNet": "elasticnet",
+            "KNN-5": "knn_k5", "KNN-10": "knn_k10", "KNN-20": "knn_k20",
+        }
+        results: dict[str, dict[str, float]] = {}
+        for csv_name in (
+            "classical_soh_results.csv", "lstm_soh_results.csv",
+            "transformer_soh_results.csv", "ensemble_results.csv",
+            "unified_results.csv",
+        ):
+            path = self._artifacts / csv_name
+            if not path.exists():
+                # Fall back to root-level results (backward compat)
+                path = _ARTIFACTS / csv_name
+                if not path.exists():
+                    continue
+            try:
+                df = pd.read_csv(path, index_col=0)
+                for raw in df.index:
+                    key = _normalise.get(str(raw), str(raw).lower().replace(" ", "_"))
+                    results[key] = df.loc[raw].dropna().to_dict()
+            except Exception as exc:
+                log.warning("Could not read %s: %s", csv_name, exc)
+        for json_name in ("dg_itransformer_results.json", "vae_lstm_results.json"):
+            path = self._artifacts / json_name
+            if not path.exists():
+                path = _ARTIFACTS / json_name
+                if not path.exists():
+                    continue
+            try:
+                with open(path) as fh:
+                    data = json.load(fh)
+                key = json_name.replace("_results.json", "")
+                results[key] = {k: float(v) for k, v in data.items()
+                                if isinstance(v, (int, float))}
+            except Exception as exc:
+                log.warning("Could not read %s: %s", json_name, exc)
+        # Fill from catalog for anything not in result files
+        for name, info in MODEL_CATALOG.items():
+            if name not in results and "r2" in info:
+                results[name] = {"R2": info["r2"]}
+        return results
+    # ── Prediction helpers ────────────────────────────────────────────────
+    def _build_x(self, features: dict[str, float]) -> np.ndarray:
+        """Build raw (1, F) feature numpy array — NO scaling applied here.
+        Scaling is applied per-model-family in :meth:`predict` because
+        tree models need no scaling while linear/deep models need different
+        scalers.
+        """
+        return np.array([[features.get(c, 0.0) for c in FEATURE_COLS_SCALAR]])
+    @staticmethod
+    def _x_for_model(model: Any, x: np.ndarray) -> Any:
+        """Return x in the format the model was fitted with.
+        * If the model has ``feature_names_in_`` → pass a DataFrame whose
+          columns match those exact names (handles LGB trained with Column_0…).
+        * Otherwise → pass the raw numpy array (RF, ET trained without names).
+        """
+        names = getattr(model, "feature_names_in_", None)
+        if names is None:
+            return x   # numpy — model was fitted without feature names
+        # Build DataFrame with the same column names the model was trained with
+        return pd.DataFrame(x, columns=list(names))
+    def _scale_for_linear(self, x: np.ndarray) -> np.ndarray:
+        """Apply StandardScaler for linear / SVR / KNN models."""
+        if self.linear_scaler is not None:
+            try:
+                return self.linear_scaler.transform(x)
+            except Exception as exc:
+                log.warning("Linear scaler transform failed: %s", exc)
+        return x
+    def _build_sequence_array(
+        self, x: np.ndarray, seq_len: int = _SEQ_LEN
+    ) -> np.ndarray:
+        """Convert single-cycle feature row → scaled (1, seq_len, F) numpy array.
+        Tile the current feature vector across *seq_len* timesteps and apply
+        the sequence scaler so values match the training distribution.
+        """
+        if self.sequence_scaler is not None:
+            try:
+                x_sc = self.sequence_scaler.transform(x)   # (1, F)
+            except Exception:
+                x_sc = x
+        else:
+            x_sc = x
+        # Tile to (1, seq_len, F)
+        return np.tile(x_sc[:, np.newaxis, :], (1, seq_len, 1)).astype(np.float32)
+    def _build_sequence_tensor(
+        self, x: np.ndarray, seq_len: int = _SEQ_LEN
+    ) -> Any:
+        """Same as :meth:`_build_sequence_array` but returns a PyTorch tensor."""
+        import torch
+        return torch.tensor(self._build_sequence_array(x, seq_len), dtype=torch.float32)
+    def _predict_ensemble(self, x: np.ndarray) -> tuple[float, str]:
+        """Weighted-average SOH prediction from BestEnsemble component models.
+        Each component model receives input in the format it was trained with:
+        - RF, ET, GB, XGB: raw numpy (trained on X_train.values, no feature names)
+        - LGB: DataFrame with Column_0…Column_11 (LightGBM auto-assigned during training)
+        Both cases handled by :meth:`_x_for_model`.
+        """
+        components = self.model_meta.get("best_ensemble", {}).get(
+            "components", list(_ENSEMBLE_WEIGHTS.keys())
+        )
+        total_w, weighted_sum = 0.0, 0.0
+        used: list[str] = []
+        for cname in components:
+            if cname not in self.models:
+                continue
+            w = _ENSEMBLE_WEIGHTS.get(cname, 1.0)
+            xi = self._x_for_model(self.models[cname], x)
+            soh = float(self.models[cname].predict(xi)[0])
+            weighted_sum += w * soh
+            total_w += w
+            used.append(cname)
+        if total_w == 0:
+            raise ValueError("No BestEnsemble components available")
+        return weighted_sum / total_w, f"best_ensemble({', '.join(used)})"
+    # ── Prediction ────────────────────────────────────────────────────────
+    def predict(
+        self,
+        features: dict[str, float],
+        model_name: str | None = None,
+    ) -> dict[str, Any]:
+        """Predict SOH for a single battery cycle.
+        Parameters
+        ----------
+        features:
+            Dict of cycle features; keys from :data:`FEATURE_COLS_SCALAR`.
+            Missing keys are filled with 0.0.
+        model_name:
+            Registry model key (e.g. ``"best_ensemble"``, ``"random_forest"``,
+            ``"tft"``).  Defaults to :attr:`default_model`.
+        Returns
+        -------
+        dict
+            ``soh_pct``, ``degradation_state``, ``rul_cycles``,
+            ``confidence_lower``, ``confidence_upper``,
+            ``model_used``, ``model_version``.
+        """
+        name = model_name or self.default_model
+        if name is None:
+            raise ValueError("No models loaded in registry")
+        x = self._build_x(features)
+        # ── Dispatch by model type ──────────────────────────────────────
+        if name == "best_ensemble":
+            soh, label = self._predict_ensemble(x)
+        elif name in self.models:
+            model = self.models[name]
+            family = self.model_meta.get(name, {}).get("family", "classical")
+            if family == "deep_pytorch":
+                try:
+                    import torch
+                    with torch.no_grad():
+                        # Build scaled (1, seq_len, F) sequence tensor
+                        t = self._build_sequence_tensor(x).to(self.device)
+                        out = model(t)
+                        # VAE-LSTM returns a dict; all others return a tensor
+                        if isinstance(out, dict):
+                            out = out["health_pred"]
+                        soh = float(out.cpu().numpy().ravel()[0])
+                except Exception as exc:
+                    log.error("PyTorch inference error for '%s': %s", name, exc)
+                    raise
+            elif family == "deep_keras":
+                try:
+                    # Build scaled (1, seq_len, F) numpy array for Keras
+                    seq_np = self._build_sequence_array(x)   # (1, 32, F)
+                    out = model.predict(seq_np, verbose=0)
+                    # Physics-Informed model returns a dict with multiple heads
+                    if isinstance(out, dict):
+                        out = out.get("soh_ml", next(iter(out.values())))
+                    soh = float(np.asarray(out).ravel()[0])
+                except Exception as exc:
+                    log.error("Keras inference error for '%s': %s", name, exc)
+                    raise
+            elif name in self._LINEAR_FAMILIES:
+                # Ridge/Lasso/ElasticNet/SVR/KNN need StandardScaler
+                x_lin = self._scale_for_linear(x)
+                soh = float(model.predict(x_lin)[0])
+            else:
+                # RF/XGB/LGB — scale-invariant; use per-model input format
+                xi = self._x_for_model(model, x)
+                soh = float(model.predict(xi)[0])
+            label = name
+        else:
+            fallback = self.default_model
+            if fallback and fallback != name and fallback in self.models:
+                log.warning("Model '%s' not loaded — falling back to '%s'", name, fallback)
+                return self.predict(features, fallback)
+            raise ValueError(
+                f"Model '{name}' is not available. "
+                f"Loaded: {list(self.models.keys())}"
+            )
+        soh = float(np.clip(soh, 0.0, 100.0))
+        # ── RUL estimate ────────────────────────────────────────────────
+        # Data-driven estimate: linear degradation from current SOH to 70%
+        # (EOL threshold), calibrated to NASA dataset's ~0.2-0.4 %/cycle rate.
+        EOL_THRESHOLD = 70.0
+        if soh > EOL_THRESHOLD:
+            # Degradation rate: use delta_capacity as a proxy (Ah/cycle)
+            # NASA nominal: ~2.0 Ah, so %/cycle = delta_cap / 2.0 * 100
+            cap_loss_per_cycle_pct = abs(features.get("delta_capacity", -0.005)) / 2.0 * 100
+            # Clamp to realistic range: 0.05 – 2.0 %/cycle
+            rate = max(0.05, min(cap_loss_per_cycle_pct, 2.0))
+            rul = (soh - EOL_THRESHOLD) / rate
+        else:
+            rul = 0.0
+        version = self.model_meta.get(name, MODEL_CATALOG.get(name, {})).get("version", "?")
+        return {
+            "soh_pct": round(soh, 2),
+            "degradation_state": classify_degradation(soh),
+            "rul_cycles": round(rul, 1),
+            "confidence_lower": round(soh - 2.0, 2),
+            "confidence_upper": round(soh + 2.0, 2),
+            "model_used": label,
+            "model_version": version,
+        }
+    def predict_batch(
+        self,
+        battery_id: str,
+        cycles: list[dict[str, float]],
+        model_name: str | None = None,
+    ) -> list[dict[str, Any]]:
+        """Predict SOH for multiple cycles of the same battery."""
+        return [
+            {**self.predict(c, model_name),
+             "battery_id": battery_id,
+             "cycle_number": c.get("cycle_number", i + 1)}
+            for i, c in enumerate(cycles)
+        ]
+    def predict_array(
+        self,
+        X: np.ndarray,
+        model_name: str | None = None,
+    ) -> tuple[np.ndarray, str]:
+        """Vectorized batch SOH prediction on an (N, F) feature matrix.
+        Performs a **single** ``model.predict()`` call for the whole array,
+        giving O(1) Python overhead regardless of how many rows N is.
+        Used by the simulation endpoint to avoid per-step loop overhead.
+        Parameters
+        ----------
+        X:
+            Shape ``(N, len(FEATURE_COLS_SCALAR))`` — rows are ordered by
+            ``FEATURE_COLS_SCALAR``, no scaling applied yet.
+        model_name:
+            Model key. Defaults to :attr:`default_model`.
+        Returns
+        -------
+        tuple[np.ndarray, str]
+            ``(soh_array, model_label)`` — ``soh_array`` has shape ``(N,)``,
+            values clipped to ``[0, 100]``.
+        Notes
+        -----
+        Deep sequence models (PyTorch / Keras) are not batchable here because
+        they require multi-timestep tensors.  Callers that request a deep model
+        will get a ``ValueError``; the simulate endpoint falls back to physics.
+        """
+        name = model_name or self.default_model
+        if name is None:
+            raise ValueError("No models loaded in registry")
+        if name == "best_ensemble":
+            components = self.model_meta.get("best_ensemble", {}).get(
+                "components", list(_ENSEMBLE_WEIGHTS.keys())
+            )
+            total_w: float = 0.0
+            weighted_sum: np.ndarray | None = None
+            used: list[str] = []
+            for cname in components:
+                if cname not in self.models:
+                    continue
+                w   = _ENSEMBLE_WEIGHTS.get(cname, 1.0)
+                xi  = self._x_for_model(self.models[cname], X)
+                preds = np.asarray(self.models[cname].predict(xi), dtype=float)
+                weighted_sum = preds * w if weighted_sum is None else weighted_sum + preds * w
+                total_w += w
+                used.append(cname)
+            if total_w == 0 or weighted_sum is None:
+                raise ValueError("No BestEnsemble components available")
+            return np.clip(weighted_sum / total_w, 0.0, 100.0), f"best_ensemble({', '.join(used)})"
+        elif name in self.models:
+            model  = self.models[name]
+            family = self.model_meta.get(name, {}).get("family", "classical")
+            if family in ("deep_pytorch", "deep_keras"):
+                raise ValueError(
+                    f"Model '{name}' is a deep sequence model and cannot be "
+                    "batch-predicted. Use predict() per sample instead."
+                )
+            elif name in self._LINEAR_FAMILIES:
+                xi = self._scale_for_linear(X)
+            else:
+                xi = self._x_for_model(model, X)
+            return np.clip(np.asarray(model.predict(xi), dtype=float), 0.0, 100.0), name
+        else:
+            fallback = self.default_model
+            if fallback and fallback != name and fallback in self.models:
+                log.warning("predict_array: '%s' not loaded — falling back to '%s'", name, fallback)
+                return self.predict_array(X, fallback)
+            raise ValueError(f"Model '{name}' is not available. Loaded: {list(self.models.keys())}")
+    # ── Info helpers ──────────────────────────────────────────────────────
+    @property
+    def model_count(self) -> int:
+        """Total number of registered model entries."""
+        return len(set(list(self.models.keys()) + list(self.model_meta.keys())))
+    def list_models(self) -> list[dict[str, Any]]:
+        """Return full model listing with versioning, metrics, and load status."""
+        all_metrics = self.get_metrics()
+        out: list[dict[str, Any]] = []
+        for name in MODEL_CATALOG:
+            catalog = MODEL_CATALOG[name]
+            meta = self.model_meta.get(name, {})
+            out.append({
+                "name":         name,
+                "version":      catalog.get("version", "?"),
+                "display_name": catalog.get("display_name", name),
+                "family":       catalog.get("family", "unknown"),
+                "algorithm":    catalog.get("algorithm", ""),
+                "target":       catalog.get("target", "soh"),
+                "r2":           catalog.get("r2"),
+                "metrics":      all_metrics.get(name, {}),
+                "is_default":   name == self.default_model,
+                "loaded":       name in self.models,
+                "load_error":   meta.get("load_error"),
+            })
+        return out
+# ── Singletons ───────────────────────────────────────────────────────────────
+registry_v1 = ModelRegistry(version="v1")
+registry_v2 = ModelRegistry(version="v2")
+# Default registry — v2 (latest models, bug fixes)
+registry = registry_v2

api/routers/__init__.py ADDED Viewed

	@@ -0,0 +1 @@


1	+ # api.routers — FastAPI route handlers

api/routers/predict.py ADDED Viewed

	@@ -0,0 +1,247 @@

+"""
+api.routers.predict
+===================
+Prediction & recommendation endpoints.
+"""
+from __future__ import annotations
+from fastapi import APIRouter, HTTPException
+from api.model_registry import registry, registry_v1, classify_degradation, soh_to_color
+from api.schemas import (
+    PredictRequest, PredictResponse,
+    BatchPredictRequest, BatchPredictResponse,
+    RecommendationRequest, RecommendationResponse, SingleRecommendation,
+)
+router = APIRouter(prefix="/api", tags=["prediction"])
+# v1-prefixed router (legacy, preserved for backward compatibility)
+v1_router = APIRouter(prefix="/api/v1", tags=["v1-prediction"])
+# ── Single prediction ────────────────────────────────────────────────────────
+@router.post("/predict", response_model=PredictResponse)
+async def predict(req: PredictRequest):
+    """Predict SOH for a single cycle."""
+    features = req.model_dump(exclude={"battery_id"})
+    features["voltage_range"] = features["peak_voltage"] - features["min_voltage"]
+    # If avg_temp equals ambient_temperature exactly, apply the NASA data offset
+    # (cell temperature is always 8-10°C above ambient under load).
+    if abs(features["avg_temp"] - features["ambient_temperature"]) < 0.5:
+        features["avg_temp"] = features["ambient_temperature"] + 8.0
+    try:
+        result = registry.predict(features)
+    except Exception as exc:
+        raise HTTPException(status_code=500, detail=str(exc))
+    return PredictResponse(
+        battery_id=req.battery_id,
+        cycle_number=req.cycle_number,
+        soh_pct=result["soh_pct"],
+        rul_cycles=result["rul_cycles"],
+        degradation_state=result["degradation_state"],
+        confidence_lower=result["confidence_lower"],
+        confidence_upper=result["confidence_upper"],
+        model_used=result["model_used"],
+    )
+# ── Batch prediction ─────────────────────────────────────────────────────────
+@router.post("/predict/batch", response_model=BatchPredictResponse)
+async def predict_batch(req: BatchPredictRequest):
+    """Predict SOH for multiple cycles of one battery."""
+    results = registry.predict_batch(req.battery_id, req.cycles)
+    predictions = [
+        PredictResponse(
+            battery_id=req.battery_id,
+            cycle_number=r["cycle_number"],
+            soh_pct=r["soh_pct"],
+            rul_cycles=r["rul_cycles"],
+            degradation_state=r["degradation_state"],
+            confidence_lower=r.get("confidence_lower"),
+            confidence_upper=r.get("confidence_upper"),
+            model_used=r["model_used"],            model_version=r.get("model_version"),        )
+        for r in results
+    ]
+    return BatchPredictResponse(battery_id=req.battery_id, predictions=predictions)
+# ── Recommendations ──────────────────────────────────────────────────────────
+@router.post("/recommend", response_model=RecommendationResponse)
+async def recommend(req: RecommendationRequest):
+    """Get operational recommendations for a battery based on physics-informed degradation model."""
+    import itertools
+    # **FIXED**: Use physics-based degradation rates instead of unreliable RUL prediction
+    # Empirical degradation rates from NASA PCoE data analysis
+    DEGRADATION_RATES = {
+        # Format: (temp_range, current_level): % SOH loss per cycle
+        "cold_light": 0.08,      # 4°C, <=1A
+        "cold_moderate": 0.12,   # 4°C, 1-2A
+        "cold_heavy": 0.18,      # 4°C, >2A
+        "room_light": 0.15,      # 24°C, <=1A
+        "room_moderate": 0.22,   # 24°C, 1-2A
+        "room_heavy": 0.28,      # 24°C, >2A
+        "warm_light": 0.35,      # 43°C, <=1A
+        "warm_moderate": 0.48,   # 43°C, 1-2A
+        "warm_heavy": 0.65,      # 43°C, >2A
+    }
+    def get_degradation_rate(temp, current):
+        """Return degradation rate (% SOH/cycle) given operating conditions."""
+        if temp <= 4:
+            if current <= 1.0:
+                return DEGRADATION_RATES["cold_light"]
+            elif current <= 2.0:
+                return DEGRADATION_RATES["cold_moderate"]
+            else:
+                return DEGRADATION_RATES["cold_heavy"]
+        elif temp <= 24:
+            if current <= 1.0:
+                return DEGRADATION_RATES["room_light"]
+            elif current <= 2.0:
+                return DEGRADATION_RATES["room_moderate"]
+            else:
+                return DEGRADATION_RATES["room_heavy"]
+        else:
+            if current <= 1.0:
+                return DEGRADATION_RATES["warm_light"]
+            elif current <= 2.0:
+                return DEGRADATION_RATES["warm_moderate"]
+            else:
+                return DEGRADATION_RATES["warm_heavy"]
+    def cycles_to_eol(current_soh, degradation_rate_pct_per_cycle, eol_threshold=70):
+        """Calculate cycles until end-of-life."""
+        if degradation_rate_pct_per_cycle <= 0:
+            return 10000  # Unrealistic but prevents division by zero
+        soh_margin = current_soh - eol_threshold
+        if soh_margin <= 0:
+            return 0
+        return int(soh_margin / degradation_rate_pct_per_cycle)
+    # Generate recommendations for different operating conditions
+    temps = [4.0, 24.0, 43.0]
+    currents = [0.5, 1.0, 2.0, 4.0]
+    candidates = []
+    for t, c in itertools.product(temps, currents):
+        degradation = get_degradation_rate(t, c)
+        rul = cycles_to_eol(req.current_soh, degradation)
+        candidates.append((rul, t, c, degradation))
+    # Sort by RUL (cycles until EOL) in descending order
+    candidates.sort(reverse=True, key=lambda x: x[0])
+    top = candidates[:req.top_k]
+    # Calculate baseline (current operating conditions)
+    baseline_degradation = get_degradation_rate(req.ambient_temperature, 2.0)
+    baseline_rul = cycles_to_eol(req.current_soh, baseline_degradation)
+    recs = []
+    for rank, (rul, t, c, deg) in enumerate(top, 1):
+        improvement = rul - baseline_rul
+        improvement_pct = (improvement / baseline_rul * 100) if baseline_rul > 0 else 0.0
+        # Determine operational regime
+        if t <= 4:
+            temp_desc = "cold storage"
+        elif t <= 24:
+            temp_desc = "room temperature"
+        else:
+            temp_desc = "heated environment"
+        if c <= 1.0:
+            current_desc = "low current (trickle charge/light use)"
+        elif c <= 2.0:
+            current_desc = "moderate current (normal use)"
+        else:
+            current_desc = "high current (fast charging/heavy load)"
+        recs.append(SingleRecommendation(
+            rank=rank,
+            ambient_temperature=t,
+            discharge_current=c,
+            cutoff_voltage=2.5,     # Standard cutoff
+            predicted_rul=int(rul),
+            rul_improvement=int(improvement),
+            rul_improvement_pct=round(improvement_pct, 1),
+            explanation=f"Rank #{rank}: Operate in {temp_desc} at {current_desc} → ~{int(rul)} cycles until EOL (+{int(improvement)} cycles vs. baseline)",
+        ))
+    return RecommendationResponse(
+        battery_id=req.battery_id,
+        current_soh=req.current_soh,
+        recommendations=recs,
+    )
+# ── Model listing ────────────────────────────────────────────────────────────
+@router.get("/models")
+async def list_models():
+    """List all registered models with metrics, version, and load status."""
+    return registry.list_models()
+@router.get("/models/versions")
+async def list_model_versions():
+    """Return models grouped by semantic version family.
+    Groups:
+    * v1  — Classical ML models
+    * v2  — Deep sequence models (LSTM, Transformer)
+    * v2 patch — Ensemble / meta-models (v2.6)
+    """
+    all_models = registry.list_models()
+    groups: dict[str, list] = {"v1": [], "v2": [], "v2_ensemble": [], "other": []}
+    for m in all_models:
+        ver = m.get("version", "")
+        if ver.startswith("1"):
+            groups["v1"].append(m)
+        elif ver.startswith("3") or "ensemble" in m.get("name", "").lower():
+            groups["v2_ensemble"].append(m)
+        elif ver.startswith("2"):
+            groups["v2"].append(m)
+        else:
+            groups["other"].append(m)
+    return {
+        "v1_classical": groups["v1"],
+        "v2_deep": groups["v2"],
+        "v2_ensemble": groups["v2_ensemble"],
+        "other": groups["other"],
+        "default_model": registry.default_model,
+    }
+# ── v1-prefixed endpoints (legacy) ──────────────────────────────────────────
+@v1_router.post("/predict", response_model=PredictResponse)
+async def predict_v1(req: PredictRequest):
+    """Predict SOH using v1 models (legacy, uses group-battery split models)."""
+    features = req.model_dump(exclude={"battery_id"})
+    features["voltage_range"] = features["peak_voltage"] - features["min_voltage"]
+    if abs(features["avg_temp"] - features["ambient_temperature"]) < 0.5:
+        features["avg_temp"] = features["ambient_temperature"] + 8.0
+    try:
+        result = registry_v1.predict(features)
+    except Exception as exc:
+        raise HTTPException(status_code=500, detail=str(exc))
+    return PredictResponse(
+        battery_id=req.battery_id,
+        cycle_number=req.cycle_number,
+        soh_pct=result["soh_pct"],
+        rul_cycles=result["rul_cycles"],
+        degradation_state=result["degradation_state"],
+        confidence_lower=result["confidence_lower"],
+        confidence_upper=result["confidence_upper"],
+        model_used=result["model_used"],
+        model_version=result.get("model_version", "1.0.0"),
+    )
+@v1_router.get("/models")
+async def list_models_v1():
+    """List all v1 registered models."""
+    return registry_v1.list_models()

api/routers/predict_v2.py ADDED Viewed

	@@ -0,0 +1,151 @@

+"""
+api.routers.predict_v2
+======================
+v2 prediction & recommendation endpoints.
+Bug fixes over v1:
+- Removed avg_temp auto-correction that corrupted inputs
+- Recommendation baseline uses user-provided current_soh for RUL estimation
+- Version-aware model loading from artifacts/v2/
+"""
+from __future__ import annotations
+from fastapi import APIRouter, HTTPException
+from api.model_registry import registry_v2, classify_degradation, soh_to_color
+from api.schemas import (
+    PredictRequest, PredictResponse,
+    BatchPredictRequest, BatchPredictResponse,
+    RecommendationRequest, RecommendationResponse, SingleRecommendation,
+)
+router = APIRouter(prefix="/api/v2", tags=["v2-prediction"])
+# ── Single prediction ────────────────────────────────────────────────────────
+@router.post("/predict", response_model=PredictResponse)
+async def predict_v2(req: PredictRequest):
+    """Predict SOH for a single cycle using v2 models."""
+    features = req.model_dump(exclude={"battery_id"})
+    features["voltage_range"] = features["peak_voltage"] - features["min_voltage"]
+    # v2 FIX: no avg_temp auto-correction — trust the user's input
+    try:
+        result = registry_v2.predict(features)
+    except Exception as exc:
+        raise HTTPException(status_code=500, detail=str(exc))
+    return PredictResponse(
+        battery_id=req.battery_id,
+        cycle_number=req.cycle_number,
+        soh_pct=result["soh_pct"],
+        rul_cycles=result["rul_cycles"],
+        degradation_state=result["degradation_state"],
+        confidence_lower=result["confidence_lower"],
+        confidence_upper=result["confidence_upper"],
+        model_used=result["model_used"],
+        model_version=result.get("model_version", "2.0.0"),
+    )
+# ── Batch prediction ─────────────────────────────────────────────────────────
+@router.post("/predict/batch", response_model=BatchPredictResponse)
+async def predict_batch_v2(req: BatchPredictRequest):
+    """Predict SOH for multiple cycles using v2 models."""
+    results = registry_v2.predict_batch(req.battery_id, req.cycles)
+    predictions = [
+        PredictResponse(
+            battery_id=req.battery_id,
+            cycle_number=r["cycle_number"],
+            soh_pct=r["soh_pct"],
+            rul_cycles=r["rul_cycles"],
+            degradation_state=r["degradation_state"],
+            confidence_lower=r.get("confidence_lower"),
+            confidence_upper=r.get("confidence_upper"),
+            model_used=r["model_used"],
+            model_version=r.get("model_version", "2.0.0"),
+        )
+        for r in results
+    ]
+    return BatchPredictResponse(battery_id=req.battery_id, predictions=predictions)
+# ── Recommendations (v2 — fixed) ────────────────────────────────────────────
+@router.post("/recommend", response_model=RecommendationResponse)
+async def recommend_v2(req: RecommendationRequest):
+    """Get operational recommendations using v2 models.
+    v2 FIX: Uses user-provided current_soh to compute baseline RUL instead
+    of re-predicting SOH from default features (which caused ~0 cycle
+    improvement in v1).
+    """
+    import itertools
+    temps = [4.0, 24.0, 43.0]
+    currents = [0.5, 1.0, 2.0, 4.0]
+    cutoffs = [2.0, 2.2, 2.5, 2.7]
+    # v2 FIX: compute baseline RUL from user-provided current_soh
+    # Data-driven: linear degradation at a realistic rate (~0.2%/cycle)
+    EOL_THRESHOLD = 70.0
+    deg_rate = 0.2  # conservative NASA-calibrated %/cycle
+    if req.current_soh > EOL_THRESHOLD:
+        baseline_rul = (req.current_soh - EOL_THRESHOLD) / deg_rate
+    else:
+        baseline_rul = 0.0
+    base_features = {
+        "cycle_number": req.current_cycle,
+        "ambient_temperature": req.ambient_temperature,
+        "peak_voltage": 4.19,
+        "min_voltage": 2.61,
+        "voltage_range": 4.19 - 2.61,
+        "avg_current": 1.82,
+        "avg_temp": req.ambient_temperature + 8.0,
+        "temp_rise": 15.0,
+        "cycle_duration": 3690.0,
+        "Re": 0.045,
+        "Rct": 0.069,
+        "delta_capacity": -0.005,
+    }
+    candidates = []
+    for t, c, v in itertools.product(temps, currents, cutoffs):
+        feat = {**base_features, "ambient_temperature": t, "avg_current": c,
+                "min_voltage": v, "voltage_range": 4.19 - v,
+                "avg_temp": t + 8.0}
+        result = registry_v2.predict(feat)
+        rul = result.get("rul_cycles", 0) or 0
+        candidates.append((rul, t, c, v, result["soh_pct"]))
+    candidates.sort(reverse=True)
+    top = candidates[: req.top_k]
+    recs = []
+    for rank, (rul, t, c, v, soh) in enumerate(top, 1):
+        improvement = rul - baseline_rul
+        pct = (improvement / baseline_rul * 100) if baseline_rul > 0 else 0
+        recs.append(SingleRecommendation(
+            rank=rank,
+            ambient_temperature=t,
+            discharge_current=c,
+            cutoff_voltage=v,
+            predicted_rul=rul,
+            rul_improvement=improvement,
+            rul_improvement_pct=round(pct, 1),
+            explanation=f"Operate at {t}°C, {c}A, cutoff {v}V for ~{rul:.0f} cycles RUL",
+        ))
+    return RecommendationResponse(
+        battery_id=req.battery_id,
+        current_soh=req.current_soh,
+        recommendations=recs,
+    )
+# ── Model listing ────────────────────────────────────────────────────────────
+@router.get("/models")
+async def list_models_v2():
+    """List all v2 registered models."""
+    return registry_v2.list_models()

api/routers/simulate.py ADDED Viewed

	@@ -0,0 +1,359 @@

+"""
+api.routers.simulate
+====================
+Bulk battery lifecycle simulation endpoint - vectorized ML-driven.
+Performance design (O(1) Python overhead per battery regardless of step count):
+    1. SEI impedance growth  - numpy cumsum (no Python loop)
+    2. Feature matrix build  - numpy column_stack ->  (N_steps, 12)
+    3. ML prediction         - single model.predict() call via predict_array()
+    4. RUL / EOL             - numpy diff / cumsum / searchsorted
+    5. Classify / colorize   - numpy searchsorted on pre-built label arrays
+Scaler dispatch mirrors NB03 training EXACTLY:
+    Tree models (RF / ET / XGB / LGB / GB)  -> raw numpy   (no scaler)
+    Linear / SVR / KNN                       -> standard_scaler.joblib.transform(X)
+    best_ensemble                            -> per-component dispatch (same rules)
+    Deep sequence models (PyTorch / Keras)   -> not batchable, falls back to physics
+"""
+from __future__ import annotations
+import logging
+import math
+from typing import List, Optional
+import numpy as np
+from fastapi import APIRouter
+from pydantic import BaseModel, Field
+from api.model_registry import (
+    FEATURE_COLS_SCALAR, classify_degradation, soh_to_color, registry_v2,
+)
+log = logging.getLogger(__name__)
+router = APIRouter(prefix="/api/v2", tags=["simulation"])
+# -- Physics constants --------------------------------------------------------
+_EA_OVER_R = 6200.0   # Ea/R in Kelvin
+_Q_NOM     = 2.0      # NASA PCoE nominal capacity (Ah)
+_T_REF     = 24.0     # Reference ambient temperature (deg C)
+_I_REF     = 1.82     # Reference discharge current (A)
+_V_REF     = 4.19     # Reference peak voltage (V)
+_TIME_UNIT_SECONDS: dict[str, float | None] = {
+    "cycle":  None,        "second": 1.0,        "minute": 60.0,
+    "hour":   3_600.0,     "day":    86_400.0,   "week":   604_800.0,
+    "month":  2_592_000.0, "year":   31_536_000.0,
+}
+_TIME_UNIT_LABELS: dict[str, str] = {
+    "cycle":  "Cycles",  "second": "Seconds", "minute": "Minutes",
+    "hour":   "Hours",   "day":    "Days",    "week":   "Weeks",
+    "month":  "Months",  "year":   "Years",
+}
+# Column index map - must stay in sync with FEATURE_COLS_SCALAR
+_F = {col: idx for idx, col in enumerate(FEATURE_COLS_SCALAR)}
+# Pre-built label/color arrays for O(1) numpy-vectorized classification
+_SOH_BINS   = np.array([70.0, 80.0, 90.0])                       # searchsorted thresholds
+_DEG_LABELS = np.array(["End-of-Life", "Degraded", "Moderate", "Healthy"], dtype=object)
+_COLOR_HEX  = np.array(["#ef4444",     "#f97316",  "#eab308",  "#22c55e"], dtype=object)
+def _vec_classify(soh: np.ndarray) -> list[str]:
+    """Vectorized classify_degradation - single numpy call, no Python for-loop."""
+    return _DEG_LABELS[np.searchsorted(_SOH_BINS, soh, side="left")].tolist()
+def _vec_color(soh: np.ndarray) -> list[str]:
+    """Vectorized soh_to_color - single numpy call, no Python for-loop."""
+    return _COLOR_HEX[np.searchsorted(_SOH_BINS, soh, side="left")].tolist()
+# -- Schemas ------------------------------------------------------------------
+class BatterySimConfig(BaseModel):
+    battery_id:          str
+    label:               Optional[str] = None
+    initial_soh:         float = Field(default=100.0, ge=0.0, le=100.0)
+    start_cycle:         int   = Field(default=1, ge=1)
+    ambient_temperature: float = Field(default=24.0)
+    peak_voltage:        float = Field(default=4.19)
+    min_voltage:         float = Field(default=2.61)
+    avg_current:         float = Field(default=1.82)
+    avg_temp:            float = Field(default=32.6)
+    temp_rise:           float = Field(default=14.7)
+    cycle_duration:      float = Field(default=3690.0)
+    Re:                  float = Field(default=0.045)
+    Rct:                 float = Field(default=0.069)
+    delta_capacity:      float = Field(default=-0.005)
+class SimulateRequest(BaseModel):
+    batteries:     List[BatterySimConfig]
+    steps:         int           = Field(default=200, ge=1, le=10_000)
+    time_unit:     str           = Field(default="day")
+    eol_threshold: float         = Field(default=70.0, ge=0.0, le=100.0)
+    model_name:    Optional[str] = Field(default=None)
+    use_ml:        bool          = Field(default=True)
+class BatterySimResult(BaseModel):
+    battery_id:          str
+    label:               Optional[str]
+    soh_history:         List[float]
+    rul_history:         List[float]
+    rul_time_history:    List[float]
+    re_history:          List[float]
+    rct_history:         List[float]
+    cycle_history:       List[int]
+    time_history:        List[float]
+    degradation_history: List[str]
+    color_history:       List[str]
+    eol_cycle:           Optional[int]
+    eol_time:            Optional[float]
+    final_soh:           float
+    final_rul:           float
+    deg_rate_avg:        float
+    model_used:          str = "physics"
+class SimulateResponse(BaseModel):
+    results:         List[BatterySimResult]
+    time_unit:       str
+    time_unit_label: str
+    steps:           int
+    model_used:      str = "physics"
+# -- Helpers ------------------------------------------------------------------
+def _sei_growth(
+    re0: float, rct0: float, steps: int, temp_f: float
+) -> tuple[np.ndarray, np.ndarray]:
+    """Vectorized SEI impedance growth over `steps` cycles.
+    Returns (re_arr, rct_arr) each shaped (steps,) using cumsum - no Python loop.
+    Matches the incremental SEI model used during feature engineering (NB02).
+    """
+    s         = np.arange(steps, dtype=np.float64)
+    delta_re  = 0.00012 * temp_f * (1.0 + s * 5e-5)
+    delta_rct = 0.00018 * temp_f * (1.0 + s * 8e-5)
+    re_arr    = np.minimum(re0  + np.cumsum(delta_re),  2.0)
+    rct_arr   = np.minimum(rct0 + np.cumsum(delta_rct), 3.0)
+    return re_arr, rct_arr
+def _build_feature_matrix(
+    b: BatterySimConfig, steps: int,
+    re_arr: np.ndarray, rct_arr: np.ndarray,
+) -> np.ndarray:
+    """Build (steps, 12) feature matrix in FEATURE_COLS_SCALAR order.
+    Column ordering is guaranteed by the _F index map so the resulting matrix
+    is byte-identical to what the NB03 models were trained on, before any
+    scaling step.  Scaling is applied inside predict_array() per model family.
+    """
+    N      = steps
+    cycles = np.arange(b.start_cycle, b.start_cycle + N, dtype=np.float64)
+    X      = np.empty((N, len(FEATURE_COLS_SCALAR)), dtype=np.float64)
+    X[:, _F["cycle_number"]]        = cycles
+    X[:, _F["ambient_temperature"]] = b.ambient_temperature
+    X[:, _F["peak_voltage"]]        = b.peak_voltage
+    X[:, _F["min_voltage"]]         = b.min_voltage
+    X[:, _F["voltage_range"]]       = b.peak_voltage - b.min_voltage
+    X[:, _F["avg_current"]]         = b.avg_current
+    X[:, _F["avg_temp"]]            = b.avg_temp
+    X[:, _F["temp_rise"]]           = b.temp_rise
+    X[:, _F["cycle_duration"]]      = b.cycle_duration
+    X[:, _F["Re"]]                  = re_arr
+    X[:, _F["Rct"]]                 = rct_arr
+    X[:, _F["delta_capacity"]]      = b.delta_capacity
+    return X
+def _physics_soh(b: BatterySimConfig, steps: int, temp_f: float) -> np.ndarray:
+    """Pure Arrhenius physics fallback - fully vectorized, returns (steps,) SOH."""
+    rate_base = float(np.clip(abs(b.delta_capacity) / _Q_NOM * 100.0, 0.005, 1.5))
+    curr_f    = 1.0 + max(0.0, (b.avg_current - _I_REF) * 0.18)
+    volt_f    = 1.0 + max(0.0, (b.peak_voltage - _V_REF) * 0.55)
+    age_f     = 1.0 + (0.08 if b.initial_soh < 85.0 else 0.0) + (0.12 if b.initial_soh < 75.0 else 0.0)
+    deg_rate  = float(np.clip(rate_base * temp_f * curr_f * volt_f * age_f, 0.0, 2.0))
+    soh_arr   = b.initial_soh - deg_rate * np.arange(1, steps + 1, dtype=np.float64)
+    return np.clip(soh_arr, 0.0, 100.0)
+def _compute_rul_and_eol(
+    soh_arr:     np.ndarray,
+    initial_soh: float,
+    eol_thr:     float,
+    cycle_start: int,
+    cycle_dur:   float,
+    tu_sec:      float | None,
+) -> tuple[np.ndarray, np.ndarray, Optional[int], Optional[float]]:
+    """Vectorized RUL and EOL from SOH trajectory.
+    Returns (rul_cycles, rul_time, eol_cycle, eol_time).
+    Uses rolling-average degradation rate for smooth RUL estimate.
+    """
+    N      = len(soh_arr)
+    steps  = np.arange(N, dtype=np.float64)
+    cycles = (cycle_start + steps).astype(np.int64)
+    # Rolling average degradation rate (smoothed, avoids division-by-zero)
+    soh_prev = np.concatenate([[initial_soh], soh_arr[:-1]])
+    step_deg = np.maximum(0.0, soh_prev - soh_arr)
+    cum_deg  = np.cumsum(step_deg)
+    avg_rate = np.maximum(cum_deg / (steps + 1), 1e-6)
+    rul_cycles = np.where(soh_arr > eol_thr, (soh_arr - eol_thr) / avg_rate, 0.0)
+    rul_time   = (rul_cycles * cycle_dur / tu_sec) if tu_sec is not None else rul_cycles.copy()
+    # EOL: first step where SOH <= threshold
+    below     = soh_arr <= eol_thr
+    eol_cycle: Optional[int]   = None
+    eol_time:  Optional[float] = None
+    if below.any():
+        idx       = int(np.argmax(below))
+        eol_cycle = int(cycles[idx])
+        elapsed_s = eol_cycle * cycle_dur
+        eol_time  = round((elapsed_s / tu_sec) if tu_sec else float(eol_cycle), 3)
+    return rul_cycles, rul_time, eol_cycle, eol_time
+# -- Endpoint -----------------------------------------------------------------
+@router.post(
+    "/simulate",
+    response_model=SimulateResponse,
+    summary="Bulk battery lifecycle simulation (vectorized, ML-driven)",
+)
+async def simulate_batteries(req: SimulateRequest):
+    """
+    Vectorized simulation: builds all N feature rows at once per battery,
+    dispatches to the ML model as a single batch predict() call, then
+    post-processes entirely with numpy (no Python for-loops).
+    Scaler usage mirrors NB03 training exactly:
+      - Tree models (RF/ET/XGB/LGB/GB): raw numpy X, no scaler
+      - Linear/SVR/KNN:                 standard_scaler.joblib.transform(X)
+      - best_ensemble:                  per-component family dispatch
+    """
+    time_unit = req.time_unit.lower()
+    if time_unit not in _TIME_UNIT_SECONDS:
+        time_unit = "day"
+    tu_sec   = _TIME_UNIT_SECONDS[time_unit]
+    tu_label = _TIME_UNIT_LABELS[time_unit]
+    eol_thr  = req.eol_threshold
+    N        = req.steps
+    model_name = req.model_name or registry_v2.default_model or "best_ensemble"
+    # Deep sequence models need per-sample tensors — cannot batch vectorise
+    # Tree / linear / ensemble models support predict_array() batch calls.
+    # We do NOT gate on model_count here: predict_array() has a try/except
+    # fallback to physics, so a partial load still works.
+    family = registry_v2.model_meta.get(model_name, {}).get("family", "classical")
+    is_deep = family in ("deep_pytorch", "deep_keras")
+    ml_batchable = (
+        req.use_ml
+        and not is_deep
+        and (model_name == "best_ensemble" or model_name in registry_v2.models)
+    )
+    # Determine scaler note for logging (mirrors training decision exactly)
+    if model_name in registry_v2._LINEAR_FAMILIES:
+        scaler_note = "standard_scaler"
+    elif model_name == "best_ensemble":
+        scaler_note = "per-component (tree=none / linear=standard_scaler)"
+    else:
+        scaler_note = "none (tree)"
+    effective_model = "physics"
+    log.info(
+        "simulate: %d batteries x %d steps | model=%s | batchable=%s | scaler=%s | unit=%s",
+        len(req.batteries), N, model_name, ml_batchable, scaler_note, time_unit,
+    )
+    results: list[BatterySimResult] = []
+    for b in req.batteries:
+        # 1. SEI impedance growth - vectorized cumsum (no Python loop)
+        T_K     = 273.15 + b.ambient_temperature
+        T_REF_K = 273.15 + _T_REF
+        temp_f  = float(np.clip(math.exp(_EA_OVER_R * (1.0 / T_REF_K - 1.0 / T_K)), 0.15, 25.0))
+        re_arr, rct_arr = _sei_growth(b.Re, b.Rct, N, temp_f)
+        # 2. SOH prediction - single batch call regardless of N
+        #    predict_array() applies the correct scaler per model family,
+        #    exactly matching the preprocessing done during NB03 training:
+        #      * standard_scaler.transform(X)  for Ridge / SVR / KNN / Lasso / ElasticNet
+        #      * raw numpy                      for RF / ET / XGB / LGB / GB
+        #      * per-component dispatch         for best_ensemble
+        if ml_batchable:
+            X = _build_feature_matrix(b, N, re_arr, rct_arr)
+            try:
+                soh_arr, effective_model = registry_v2.predict_array(X, model_name)
+            except Exception as exc:
+                log.warning(
+                    "predict_array failed for %s (%s) - falling back to physics",
+                    b.battery_id, exc,
+                )
+                soh_arr = _physics_soh(b, N, temp_f)
+                effective_model = "physics"
+        else:
+            soh_arr = _physics_soh(b, N, temp_f)
+            effective_model = "physics"
+        soh_arr = np.clip(soh_arr, 0.0, 100.0)
+        # 3. RUL + EOL - vectorized
+        rul_cycles, rul_time, eol_cycle, eol_time = _compute_rul_and_eol(
+            soh_arr, b.initial_soh, eol_thr, b.start_cycle, b.cycle_duration, tu_sec,
+        )
+        # 4. Time axis - vectorized
+        cycle_arr = np.arange(b.start_cycle, b.start_cycle + N, dtype=np.int64)
+        time_arr  = (
+            (cycle_arr * b.cycle_duration / tu_sec).astype(np.float64)
+            if tu_sec is not None
+            else cycle_arr.astype(np.float64)
+        )
+        # 5. Labels + colors - fully vectorized via numpy searchsorted
+        #    Replaces O(N) Python for-loop with a single C-level call
+        deg_h   = _vec_classify(soh_arr)
+        color_h = _vec_color(soh_arr)
+        avg_dr = float(np.mean(np.maximum(0.0, -np.diff(soh_arr, prepend=b.initial_soh))))
+        # 6. Build result - numpy round + .tolist() (no per-element Python conversion)
+        results.append(BatterySimResult(
+            battery_id          = b.battery_id,
+            label               = b.label or b.battery_id,
+            soh_history         = np.round(soh_arr,    3).tolist(),
+            rul_history         = np.round(rul_cycles, 1).tolist(),
+            rul_time_history    = np.round(rul_time,   2).tolist(),
+            re_history          = np.round(re_arr,     6).tolist(),
+            rct_history         = np.round(rct_arr,    6).tolist(),
+            cycle_history       = cycle_arr.tolist(),
+            time_history        = np.round(time_arr,   3).tolist(),
+            degradation_history = deg_h,
+            color_history       = color_h,
+            eol_cycle           = eol_cycle,
+            eol_time            = eol_time,
+            final_soh           = round(float(soh_arr[-1]),    3),
+            final_rul           = round(float(rul_cycles[-1]), 1),
+            deg_rate_avg        = round(avg_dr, 6),
+            model_used          = effective_model,
+        ))
+    return SimulateResponse(
+        results         = results,
+        time_unit       = time_unit,
+        time_unit_label = tu_label,
+        steps           = N,
+        model_used      = effective_model,
+    )

api/routers/visualize.py ADDED Viewed

	@@ -0,0 +1,243 @@

+"""
+api.routers.visualize
+=====================
+Endpoints that serve pre-computed or on-demand visualisation data
+consumed by the React frontend.
+"""
+from __future__ import annotations
+import json
+from pathlib import Path
+import numpy as np
+import pandas as pd
+from fastapi import APIRouter, HTTPException
+from fastapi.responses import FileResponse
+from api.model_registry import registry, classify_degradation, soh_to_color
+from api.schemas import BatteryVizData, DashboardData
+router = APIRouter(prefix="/api", tags=["visualization"])
+_PROJECT = Path(__file__).resolve().parents[2]
+_ARTIFACTS = _PROJECT / "artifacts"
+_FIGURES = _ARTIFACTS / "figures"
+_DATASET = _PROJECT / "cleaned_dataset"
+_V2_RESULTS = _ARTIFACTS / "v2" / "results"
+_V2_REPORTS = _ARTIFACTS / "v2" / "reports"
+_V2_FIGURES = _ARTIFACTS / "v2" / "figures"
+# ── Dashboard aggregate ──────────────────────────────────────────────────────
+@router.get("/dashboard", response_model=DashboardData)
+async def dashboard():
+    """Return full dashboard payload for the frontend."""
+    # Battery summary
+    metadata_path = _DATASET / "metadata.csv"
+    batteries: list[BatteryVizData] = []
+    capacity_fade: dict[str, list[float]] = {}
+    if metadata_path.exists():
+        meta = pd.read_csv(metadata_path)
+        for bid in meta["battery_id"].unique():
+            sub = meta[meta["battery_id"] == bid].sort_values("start_time")
+            caps_s = pd.to_numeric(sub["Capacity"], errors="coerce").dropna()
+            if caps_s.empty:
+                continue
+            caps = caps_s.tolist()
+            last_cap = float(caps[-1])
+            soh = (last_cap / 2.0) * 100
+            avg_temp = float(sub["ambient_temperature"].mean())
+            cycle = len(sub)
+            batteries.append(BatteryVizData(
+                battery_id=bid,
+                soh_pct=round(soh, 1),
+                temperature=round(avg_temp, 1),
+                cycle_number=cycle,
+                degradation_state=classify_degradation(soh),
+                color_hex=soh_to_color(soh),
+            ))
+            capacity_fade[bid] = caps
+    model_metrics = registry.get_metrics()
+    # Find best model
+    best_model = "none"
+    best_r2 = -999
+    for name, m in model_metrics.items():
+        r2 = m.get("R2", -999)
+        if r2 > best_r2:
+            best_r2 = r2
+            best_model = name
+    return DashboardData(
+        batteries=batteries,
+        capacity_fade=capacity_fade,
+        model_metrics=model_metrics,
+        best_model=best_model,
+    )
+# ── Capacity fade for a specific battery ─────────────────────────────────────
+@router.get("/battery/{battery_id}/capacity")
+async def battery_capacity(battery_id: str):
+    """Return cycle-by-cycle capacity for one battery."""
+    meta_path = _DATASET / "metadata.csv"
+    if not meta_path.exists():
+        raise HTTPException(404, "Metadata not found")
+    meta = pd.read_csv(meta_path)
+    sub = meta[meta["battery_id"] == battery_id].sort_values("start_time")
+    if sub.empty:
+        raise HTTPException(404, f"Battery {battery_id} not found")
+    caps = pd.to_numeric(sub["Capacity"], errors="coerce").dropna().tolist()
+    cycles = list(range(1, len(caps) + 1))
+    soh_list = [(float(c) / 2.0) * 100 for c in caps]
+    return {"battery_id": battery_id, "cycles": cycles, "capacity_ah": caps, "soh_pct": soh_list}
+# ── Serve saved figures ──────────────────────────────────────────────────────
+@router.get("/figures/{filename}")
+async def get_figure(filename: str):
+    """Serve a saved matplotlib/plotly figure from artifacts/figures."""
+    path = _FIGURES / filename
+    if not path.exists():
+        raise HTTPException(404, f"Figure {filename} not found")
+    content_type = "image/png"
+    if path.suffix == ".html":
+        content_type = "text/html"
+    elif path.suffix == ".svg":
+        content_type = "image/svg+xml"
+    return FileResponse(path, media_type=content_type)
+# ── Figures listing ──────────────────────────────────────────────────────────
+@router.get("/figures")
+async def list_figures():
+    """List all available figures."""
+    if not _FIGURES.exists():
+        return []
+    return sorted([f.name for f in _FIGURES.iterdir() if f.is_file()])
+# ── Battery list ─────────────────────────────────────────────────────────────
+@router.get("/batteries")
+async def list_batteries():
+    """Return all battery IDs and basic stats."""
+    meta_path = _DATASET / "metadata.csv"
+    if not meta_path.exists():
+        return []
+    meta = pd.read_csv(meta_path)
+    out = []
+    for bid in sorted(meta["battery_id"].unique()):
+        sub = meta[meta["battery_id"] == bid]
+        caps = pd.to_numeric(sub["Capacity"], errors="coerce").dropna()
+        out.append({
+            "battery_id": bid,
+            "n_cycles": len(sub),
+            "last_capacity": round(float(caps.iloc[-1]), 4) if len(caps) else None,
+            "soh_pct": round((float(caps.iloc[-1]) / 2.0) * 100, 1) if len(caps) else None,
+            "ambient_temperature": round(float(sub["ambient_temperature"].mean()), 1),
+        })
+    return out
+# ── Comprehensive metrics endpoint ───────────────────────────────────────────
+def _safe_read_csv(path: Path) -> list[dict]:
+    """Read a CSV file into a list of dicts, replacing NaN with None."""
+    if not path.exists():
+        return []
+    df = pd.read_csv(path)
+    return json.loads(df.to_json(orient="records"))
+def _safe_read_json(path: Path) -> dict:
+    """Read a JSON file, returning empty dict on failure."""
+    if not path.exists():
+        return {}
+    with open(path) as f:
+        return json.load(f)
+@router.get("/metrics")
+async def get_metrics():
+    """Return comprehensive model metrics data from v2 artifacts for the Metrics dashboard."""
+    # Unified results (all models)
+    unified = _safe_read_csv(_V2_RESULTS / "unified_results.csv")
+    # Classical results (v2 retrained)
+    classical_v2 = _safe_read_csv(_V2_RESULTS / "v2_classical_results.csv")
+    # Classical SOH results (v1)
+    classical_soh = _safe_read_csv(_V2_RESULTS / "classical_soh_results.csv")
+    # LSTM results
+    lstm_results = _safe_read_csv(_V2_RESULTS / "lstm_soh_results.csv")
+    # Ensemble results
+    ensemble_results = _safe_read_csv(_V2_RESULTS / "ensemble_results.csv")
+    # Transformer results
+    transformer_results = _safe_read_csv(_V2_RESULTS / "transformer_soh_results.csv")
+    # Validation
+    validation = _safe_read_csv(_V2_RESULTS / "v2_model_validation.csv")
+    # Final rankings
+    rankings = _safe_read_csv(_V2_RESULTS / "final_rankings.csv")
+    # Classical RUL results
+    classical_rul = _safe_read_csv(_V2_RESULTS / "classical_rul_results.csv")
+    # JSON summaries
+    training_summary = _safe_read_json(_V2_RESULTS / "v2_training_summary.json")
+    validation_summary = _safe_read_json(_V2_RESULTS / "v2_validation_summary.json")
+    intra_battery = _safe_read_json(_V2_RESULTS / "v2_intra_battery.json")
+    vae_lstm = _safe_read_json(_V2_RESULTS / "vae_lstm_results.json")
+    dg_itransformer = _safe_read_json(_V2_RESULTS / "dg_itransformer_results.json")
+    # Available v2 figures
+    v2_figures = []
+    if _V2_FIGURES.exists():
+        v2_figures = sorted([f.name for f in _V2_FIGURES.iterdir() if f.is_file() and f.suffix in ('.png', '.svg')])
+    # Battery features summary
+    features_path = _V2_RESULTS / "battery_features.csv"
+    battery_stats = {}
+    if features_path.exists():
+        df = pd.read_csv(features_path)
+        battery_stats = {
+            "total_samples": len(df),
+            "batteries": int(df["battery_id"].nunique()),
+            "avg_soh": round(float(df["SoH"].mean()), 2),
+            "min_soh": round(float(df["SoH"].min()), 2),
+            "max_soh": round(float(df["SoH"].max()), 2),
+            "avg_rul": round(float(df["RUL"].mean()), 1),
+            "feature_columns": [c for c in df.columns.tolist() if c not in ["battery_id", "datetime"]],
+            "degradation_distribution": json.loads(df["degradation_state"].value_counts().to_json()),
+            "temp_groups": sorted(df["ambient_temperature"].unique().tolist()),
+        }
+    return {
+        "unified_results": unified,
+        "classical_v2": classical_v2,
+        "classical_soh": classical_soh,
+        "lstm_results": lstm_results,
+        "ensemble_results": ensemble_results,
+        "transformer_results": transformer_results,
+        "validation": validation,
+        "rankings": rankings,
+        "classical_rul": classical_rul,
+        "training_summary": training_summary,
+        "validation_summary": validation_summary,
+        "intra_battery": intra_battery,
+        "vae_lstm": vae_lstm,
+        "dg_itransformer": dg_itransformer,
+        "v2_figures": v2_figures,
+        "battery_stats": battery_stats,
+    }
+@router.get("/v2/figures/{filename}")
+async def get_v2_figure(filename: str):
+    """Serve a saved figure from artifacts/v2/figures."""
+    path = _V2_FIGURES / filename
+    if not path.exists():
+        raise HTTPException(404, f"Figure {filename} not found")
+    content_type = "image/png"
+    if path.suffix == ".html":
+        content_type = "text/html"
+    elif path.suffix == ".svg":
+        content_type = "image/svg+xml"
+    return FileResponse(path, media_type=content_type)

api/schemas.py ADDED Viewed

	@@ -0,0 +1,125 @@

+"""
+api.schemas
+===========
+Pydantic models for request / response validation.
+"""
+from __future__ import annotations
+from pydantic import BaseModel, Field
+from typing import Optional
+# ── Prediction ───────────────────────────────────────────────────────────────
+class PredictRequest(BaseModel):
+    """Request body for single-cycle prediction."""
+    battery_id: str = Field(..., description="Battery identifier, e.g. 'B0005'")
+    cycle_number: int = Field(..., ge=1, description="Current cycle number")
+    ambient_temperature: float = Field(default=24.0, description="Ambient temperature (°C)")
+    peak_voltage: float = Field(default=4.19, description="Peak charge voltage (V)")
+    min_voltage: float = Field(default=2.61, description="Discharge cutoff voltage (V)")
+    avg_current: float = Field(default=1.82, description="Average discharge current (A)")
+    avg_temp: float = Field(default=32.6, description="Average cell temperature (°C) — typically higher than ambient")
+    temp_rise: float = Field(default=14.7, description="Temperature rise during cycle (°C) — NASA dataset mean ≈ 15°C")
+    cycle_duration: float = Field(default=3690.0, description="Cycle duration (seconds)")
+    Re: float = Field(default=0.045, description="Electrolyte resistance (Ω) — training range 0.027–0.156")
+    Rct: float = Field(default=0.069, description="Charge transfer resistance (Ω) — training range 0.04–0.27")
+    delta_capacity: float = Field(default=-0.005, description="Capacity change from last cycle (Ah)")
+class PredictResponse(BaseModel):
+    """Response body for a prediction."""
+    battery_id: str
+    cycle_number: int
+    soh_pct: float = Field(..., description="Predicted State of Health (%)")
+    rul_cycles: Optional[float] = Field(None, description="Predicted Remaining Useful Life (cycles)")
+    degradation_state: str = Field(..., description="Degradation state label")
+    confidence_lower: Optional[float] = None
+    confidence_upper: Optional[float] = None
+    model_used: str = Field(..., description="Name of the model that produced the prediction")
+    model_version: Optional[str] = Field(None, description="Semantic version of the model used")
+# ── Batch Prediction ─────────────────────────────────────────────────────────
+class BatchPredictRequest(BaseModel):
+    """Batch prediction for multiple cycles of one battery."""
+    battery_id: str
+    cycles: list[dict] = Field(..., description="List of cycle feature dicts")
+class BatchPredictResponse(BaseModel):
+    """Batch prediction response."""
+    battery_id: str
+    predictions: list[PredictResponse]
+# ── Recommendation ───────────────────────────────────────────────────────────
+class RecommendationRequest(BaseModel):
+    """Request for operational recommendations."""
+    battery_id: str
+    current_cycle: int = Field(..., ge=1)
+    current_soh: float = Field(..., ge=0, le=100)
+    ambient_temperature: float = Field(default=24.0)
+    top_k: int = Field(default=5, ge=1, le=20)
+class SingleRecommendation(BaseModel):
+    """One recommendation entry."""
+    rank: int
+    ambient_temperature: float
+    discharge_current: float
+    cutoff_voltage: float
+    predicted_rul: float
+    rul_improvement: float
+    rul_improvement_pct: float
+    explanation: str
+class RecommendationResponse(BaseModel):
+    """Response with ranked recommendations."""
+    battery_id: str
+    current_soh: float
+    recommendations: list[SingleRecommendation]
+# ── Model info ───────────────────────────────────────────────────────────────
+class ModelInfo(BaseModel):
+    """Info about a registered model."""
+    name: str
+    version: Optional[str] = None
+    display_name: Optional[str] = None
+    family: str
+    algorithm: Optional[str] = None
+    target: str
+    r2: Optional[float] = None
+    metrics: dict[str, float]
+    is_default: bool = False
+    loaded: bool = True
+    load_error: Optional[str] = None
+class HealthResponse(BaseModel):
+    """Health check response."""
+    status: str = "ok"
+    version: str
+    models_loaded: int
+    device: str
+# ── Visualization ────────────────────────────────────────────────────────────
+class BatteryVizData(BaseModel):
+    """Data for 3D battery visualization."""
+    battery_id: str
+    soh_pct: float
+    temperature: float
+    cycle_number: int
+    degradation_state: str
+    color_hex: str = Field(..., description="Color hex code for SOH heatmap")
+class DashboardData(BaseModel):
+    """Full dashboard payload."""
+    batteries: list[BatteryVizData]
+    capacity_fade: dict[str, list[float]]
+    model_metrics: dict[str, dict[str, float]]
+    best_model: str

artifacts/v1/results/classical_rul_results.csv ADDED Viewed

	@@ -0,0 +1,4 @@

+model,MAE,MSE,RMSE,R2,MAPE,tolerance_acc_5cyc
+RandomForest,8.942333980582523,137.23446964660195,11.714711675777682,-0.15748641443851108,120.88277153106779,0.4058252427184466
+XGBoost,6.7578043937683105,74.80780029296875,8.649150264214905,0.3690432906150818,90.60895087232073,0.429126213592233
+LightGBM,9.174716686966466,125.80377799208308,11.216228331845027,-0.06107572161612751,112.25501529299473,0.35145631067961164

artifacts/v1/results/classical_soh_results.csv ADDED Viewed

	@@ -0,0 +1,11 @@

+model,MAE,MSE,RMSE,R2,MAPE,tolerance_acc_2pct
+RandomForest,4.78051739475878,41.771168089034525,6.463061819991708,0.956679157080545,28.807727531282236,0.29514563106796116
+LightGBM,6.909989455704782,89.23030742727897,9.446179514876846,0.9074593240133356,56.94413475231037,0.24854368932038834
+XGBoost,8.506303251021016,136.22476526016746,11.67153654238239,0.8587214117403497,72.81562821369698,0.22330097087378642
+SVR,7.562130215902278,187.88576938734298,13.707143006014892,0.805143828272144,97.82839183781172,0.32233009708737864
+KNN-10,11.665738368704334,265.7682207905257,16.302399234177948,0.7243720041223407,89.92209971367193,0.26019417475728157
+KNN-5,11.751580021863887,270.3768092061512,16.44313866651228,0.7195924409938166,88.41470284019084,0.2757281553398058
+KNN-20,12.035078655061884,272.741740151536,16.514894494108525,0.71713977312056,101.98653839713165,0.2407766990291262
+ElasticNet,15.796038594619795,460.3763913475375,21.45638346384445,0.5225440358554927,144.4728811908027,0.06407766990291262
+Lasso,15.830927911593506,462.77574982037044,21.512223265398916,0.5200556632227833,145.46522399976544,0.05242718446601942
+Ridge,15.860549029501184,464.1921925903698,21.545119925179574,0.5185866716299998,145.65718178268125,0.05048543689320388

artifacts/v1/results/dg_itransformer_results.json ADDED Viewed

	@@ -0,0 +1,9 @@

+{
+  "MAE": 12.886456320718098,
+  "MSE": 323.38418900793243,
+  "RMSE": 17.982886003306934,
+  "R2": 0.12310012848141882,
+  "MAPE": 69.98234771512423,
+  "tol_2pct": 0.04827586206896552,
+  "tol_5pct": 0.2896551724137931
+}

artifacts/v1/results/ensemble_results.csv ADDED Viewed

	@@ -0,0 +1,9 @@

+model,MAE,MSE,RMSE,R2,MAPE,tol_2pct
+Weighted Avg Ensemble,3.737822790616429,37.739016006374186,6.1432089339671805,0.8976655649469139,5.478476687258817,0.3482758620689655
+tft,4.732738753085752,46.68825874782447,6.832880706394959,0.8733984854887593,7.499287950788588,0.21379310344827587
+Stacking Ensemble,5.76908337148985,60.03540223313905,7.7482515597481125,0.8372059046352616,10.924057540803656,0.12413793103448276
+vae_lstm,8.494939970437716,100.78674732190278,10.039260297546965,0.7267031327397879,14.250142040133243,0.09310344827586207
+batterygpt,8.020673063309337,129.06954589210812,11.360877866261397,0.6500105074494692,12.874349389916773,0.28620689655172415
+vanilla_lstm,10.561354979478219,155.95773910638056,12.488304092485118,0.5770995427937917,14.375050332959946,0.15862068965517243
+bidirectional_lstm,11.134867385317946,167.3343794115467,12.935779041540046,0.5462502472468515,17.124593156141298,0.10689655172413794
+attention_lstm,14.181327172488002,288.23989200564256,16.977629163273726,0.21839863277892768,24.827876450140888,0.15862068965517243

artifacts/v1/results/final_rankings.csv ADDED Viewed

	@@ -0,0 +1,23 @@

+model,MAE,MSE,RMSE,R2,MAPE,tolerance_acc_2pct,tol_2pct,tol_5pct
+RandomForest,4.78051739475878,41.771168089034525,6.463061819991708,0.956679157080545,28.80772753128224,0.2951456310679611,,
+LightGBM,6.909989455704782,89.23030742727897,9.446179514876846,0.9074593240133356,56.94413475231037,0.2485436893203883,,
+Weighted Avg Ensemble,3.737822790616429,37.739016006374186,6.1432089339671805,0.8976655649469139,5.478476687258817,,0.3482758620689655,
+TFT,4.732738753085752,46.68825874782447,6.832880706394959,0.8733984854887593,7.499287950788588,,0.2137931034482758,
+XGBoost,8.506303251021016,136.22476526016746,11.67153654238239,0.8587214117403497,72.81562821369698,0.2233009708737864,,
+Stacking Ensemble,5.76908337148985,60.03540223313905,7.748251559748112,0.8372059046352616,10.924057540803656,,0.1241379310344827,
+SVR,7.562130215902278,187.88576938734295,13.707143006014892,0.805143828272144,97.82839183781172,0.3223300970873786,,
+VAE-LSTM,8.494939970437716,100.78674732190278,10.039260297546965,0.7267031327397879,14.250142040133243,,0.09310344827586207,
+KNN-10,11.665738368704334,265.7682207905257,16.302399234177948,0.7243720041223407,89.92209971367193,0.2601941747572815,,
+KNN-5,11.751580021863887,270.3768092061512,16.44313866651228,0.7195924409938166,88.41470284019084,0.2757281553398058,,
+KNN-20,12.035078655061884,272.741740151536,16.514894494108525,0.71713977312056,101.98653839713164,0.2407766990291262,,
+BatteryGPT,8.020673063309337,129.06954589210812,11.360877866261395,0.6500105074494692,12.874349389916771,,0.2862068965517241,
+GRU,9.275809104339835,134.65458890701777,11.604076391812397,0.6348659095727935,15.248492181524448,0.193103448275862,,
+Vanilla LSTM,10.56135497947822,155.95773910638056,12.488304092485118,0.5770995427937917,14.375050332959946,0.1586206896551724,,
+Bidirectional LSTM,11.134867385317946,167.3343794115467,12.935779041540046,0.5462502472468515,17.124593156141298,0.1068965517241379,,
+ElasticNet,15.796038594619796,460.3763913475375,21.45638346384445,0.5225440358554927,144.4728811908027,0.0640776699029126,,
+Lasso,15.830927911593506,462.7757498203704,21.51222326539892,0.5200556632227833,145.46522399976544,0.0524271844660194,,
+Ridge,15.860549029501184,464.1921925903698,21.545119925179574,0.5185866716299998,145.65718178268125,0.0504854368932038,,
+iTransformer,14.544141949115827,275.09890344946007,16.58610573490535,0.2540321967199925,73.33406651321724,,0.0172413793103448,
+Attention LSTM,14.181327172488002,288.23989200564256,16.977629163273726,0.2183986327789276,24.827876450140888,0.1586206896551724,,
+DG-iTransformer,14.183794754173379,313.5635374005159,17.707725359303375,0.1497301506825386,94.26231665878274,,0.1103448275862069,0.2
+Physics iTransformer,16.41156271618159,379.65023683665015,19.48461538847124,-0.0294728537142276,97.60205959509742,,0.0,

artifacts/v1/results/lstm_soh_results.csv ADDED Viewed

	@@ -0,0 +1,5 @@

+model,MAE,MSE,RMSE,R2,MAPE,tolerance_acc_2pct
+GRU,9.275809104339837,134.65458890701777,11.604076391812395,0.6348659095727935,15.248492181524448,0.19310344827586207
+Vanilla LSTM,10.561354979478219,155.95773910638056,12.488304092485118,0.5770995427937917,14.375050332959946,0.15862068965517243
+Bidirectional LSTM,11.134867385317946,167.3343794115467,12.935779041540046,0.5462502472468515,17.124593156141298,0.10689655172413794
+Attention LSTM,14.181327172488002,288.23989200564256,16.977629163273726,0.21839863277892768,24.827876450140888,0.15862068965517243

artifacts/v1/results/transformer_soh_results.csv ADDED Viewed

	@@ -0,0 +1,5 @@

+model,MAE,MSE,RMSE,R2,MAPE,tol_2pct
+TFT,9.538144323660262,127.31655440584875,11.283463759229644,0.6547639804432781,18.98017853690174,0.05517241379310345
+BatteryGPT,9.614953606189902,135.34729964687295,11.633885836076997,0.6329875309153767,15.44417472623618,0.1793103448275862
+iTransformer,9.356759048993766,150.35062176683638,12.261754432659153,0.5923039981808029,14.4907916127095,0.1310344827586207
+Physics iTransformer,12.1375468649355,214.65431642999653,14.651085844741901,0.4179358518552816,24.985288391381786,0.07931034482758621

artifacts/v1/results/unified_results.csv ADDED Viewed

	@@ -0,0 +1,23 @@

+model,MAE,MSE,RMSE,R2,MAPE,tolerance_acc_2pct,tol_2pct,tol_5pct
+RandomForest,4.78051739475878,41.771168089034525,6.463061819991708,0.956679157080545,28.80772753128224,0.2951456310679611,,
+LightGBM,6.909989455704782,89.23030742727897,9.446179514876846,0.9074593240133356,56.94413475231037,0.2485436893203883,,
+Weighted Avg Ensemble,3.737822790616429,37.739016006374186,6.1432089339671805,0.8976655649469139,5.478476687258817,,0.3482758620689655,
+TFT,4.732738753085752,46.68825874782447,6.832880706394959,0.8733984854887593,7.499287950788588,,0.2137931034482758,
+XGBoost,8.506303251021016,136.22476526016746,11.67153654238239,0.8587214117403497,72.81562821369698,0.2233009708737864,,
+Stacking Ensemble,5.76908337148985,60.03540223313905,7.748251559748112,0.8372059046352616,10.924057540803656,,0.1241379310344827,
+SVR,7.562130215902278,187.88576938734295,13.707143006014892,0.805143828272144,97.82839183781172,0.3223300970873786,,
+VAE-LSTM,8.494939970437716,100.78674732190278,10.039260297546965,0.7267031327397879,14.250142040133243,,0.09310344827586207,
+KNN-10,11.665738368704334,265.7682207905257,16.302399234177948,0.7243720041223407,89.92209971367193,0.2601941747572815,,
+KNN-5,11.751580021863887,270.3768092061512,16.44313866651228,0.7195924409938166,88.41470284019084,0.2757281553398058,,
+KNN-20,12.035078655061884,272.741740151536,16.514894494108525,0.71713977312056,101.98653839713164,0.2407766990291262,,
+BatteryGPT,8.020673063309337,129.06954589210812,11.360877866261395,0.6500105074494692,12.874349389916771,,0.2862068965517241,
+GRU,9.275809104339835,134.65458890701777,11.604076391812397,0.6348659095727935,15.248492181524448,0.193103448275862,,
+Vanilla LSTM,10.56135497947822,155.95773910638056,12.488304092485118,0.5770995427937917,14.375050332959946,0.1586206896551724,,
+Bidirectional LSTM,11.134867385317946,167.3343794115467,12.935779041540046,0.5462502472468515,17.124593156141298,0.1068965517241379,,
+ElasticNet,15.796038594619796,460.3763913475375,21.45638346384445,0.5225440358554927,144.4728811908027,0.0640776699029126,,
+Lasso,15.830927911593506,462.7757498203704,21.51222326539892,0.5200556632227833,145.46522399976544,0.0524271844660194,,
+Ridge,15.860549029501184,464.1921925903698,21.545119925179574,0.5185866716299998,145.65718178268125,0.0504854368932038,,
+iTransformer,14.544141949115827,275.09890344946007,16.58610573490535,0.2540321967199925,73.33406651321724,,0.0172413793103448,
+Attention LSTM,14.181327172488002,288.23989200564256,16.977629163273726,0.2183986327789276,24.827876450140888,0.1586206896551724,,
+DG-iTransformer,14.183794754173379,313.5635374005159,17.707725359303375,0.1497301506825386,94.26231665878274,,0.1103448275862069,0.2
+Physics iTransformer,16.41156271618159,379.65023683665015,19.48461538847124,-0.0294728537142276,97.60205959509742,,0.0,

artifacts/v1/results/vae_lstm_results.json ADDED Viewed

	@@ -0,0 +1,8 @@

+{
+  "MAE": 8.494939970437716,
+  "MSE": 100.78674732190278,
+  "RMSE": 10.039260297546965,
+  "R2": 0.7267031327397879,
+  "MAPE": 14.250142040133243,
+  "tol_2pct": 0.09310344827586207
+}

artifacts/v2/results/battery_features.csv ADDED Viewed

The diff for this file is too large to render. See raw diff

artifacts/v2/results/classical_rul_results.csv ADDED Viewed

	@@ -0,0 +1,4 @@

+model,MAE,MSE,RMSE,R2,MAPE,tolerance_acc_5cyc
+RandomForest,8.942333980582523,137.23446964660195,11.714711675777682,-0.15748641443851108,120.88277153106779,0.4058252427184466
+XGBoost,6.7578043937683105,74.80780029296875,8.649150264214905,0.3690432906150818,90.60895087232073,0.429126213592233
+LightGBM,9.174716686966466,125.80377799208308,11.216228331845027,-0.06107572161612751,112.25501529299473,0.35145631067961164

artifacts/v2/results/classical_soh_results.csv ADDED Viewed

	@@ -0,0 +1,11 @@

+model,MAE,MSE,RMSE,R2,MAPE,tolerance_acc_2pct
+RandomForest,4.78051739475878,41.771168089034525,6.463061819991708,0.956679157080545,28.807727531282236,0.29514563106796116
+LightGBM,6.909989455704782,89.23030742727897,9.446179514876846,0.9074593240133356,56.94413475231037,0.24854368932038834
+XGBoost,8.506303251021016,136.22476526016746,11.67153654238239,0.8587214117403497,72.81562821369698,0.22330097087378642
+SVR,7.562130215902278,187.88576938734298,13.707143006014892,0.805143828272144,97.82839183781172,0.32233009708737864
+KNN-10,11.665738368704334,265.7682207905257,16.302399234177948,0.7243720041223407,89.92209971367193,0.26019417475728157
+KNN-5,11.751580021863887,270.3768092061512,16.44313866651228,0.7195924409938166,88.41470284019084,0.2757281553398058
+KNN-20,12.035078655061884,272.741740151536,16.514894494108525,0.71713977312056,101.98653839713165,0.2407766990291262
+ElasticNet,15.796038594619795,460.3763913475375,21.45638346384445,0.5225440358554927,144.4728811908027,0.06407766990291262
+Lasso,15.830927911593506,462.77574982037044,21.512223265398916,0.5200556632227833,145.46522399976544,0.05242718446601942
+Ridge,15.860549029501184,464.1921925903698,21.545119925179574,0.5185866716299998,145.65718178268125,0.05048543689320388

artifacts/v2/results/dg_itransformer_results.json ADDED Viewed

	@@ -0,0 +1,9 @@

+{
+  "MAE": 12.886456320718098,
+  "MSE": 323.38418900793243,
+  "RMSE": 17.982886003306934,
+  "R2": 0.12310012848141882,
+  "MAPE": 69.98234771512423,
+  "tol_2pct": 0.04827586206896552,
+  "tol_5pct": 0.2896551724137931
+}

artifacts/v2/results/ensemble_results.csv ADDED Viewed

	@@ -0,0 +1,9 @@

+model,MAE,MSE,RMSE,R2,MAPE,tol_2pct
+Weighted Avg Ensemble,3.737822790616429,37.739016006374186,6.1432089339671805,0.8976655649469139,5.478476687258817,0.3482758620689655
+tft,4.732738753085752,46.68825874782447,6.832880706394959,0.8733984854887593,7.499287950788588,0.21379310344827587
+Stacking Ensemble,5.76908337148985,60.03540223313905,7.7482515597481125,0.8372059046352616,10.924057540803656,0.12413793103448276
+vae_lstm,8.494939970437716,100.78674732190278,10.039260297546965,0.7267031327397879,14.250142040133243,0.09310344827586207
+batterygpt,8.020673063309337,129.06954589210812,11.360877866261397,0.6500105074494692,12.874349389916773,0.28620689655172415
+vanilla_lstm,10.561354979478219,155.95773910638056,12.488304092485118,0.5770995427937917,14.375050332959946,0.15862068965517243
+bidirectional_lstm,11.134867385317946,167.3343794115467,12.935779041540046,0.5462502472468515,17.124593156141298,0.10689655172413794
+attention_lstm,14.181327172488002,288.23989200564256,16.977629163273726,0.21839863277892768,24.827876450140888,0.15862068965517243

artifacts/v2/results/final_rankings.csv ADDED Viewed

	@@ -0,0 +1,23 @@

+model,MAE,MSE,RMSE,R2,MAPE,tolerance_acc_2pct,tol_2pct,tol_5pct
+RandomForest,4.78051739475878,41.771168089034525,6.463061819991708,0.956679157080545,28.80772753128224,0.2951456310679611,,
+LightGBM,6.909989455704782,89.23030742727897,9.446179514876846,0.9074593240133356,56.94413475231037,0.2485436893203883,,
+Weighted Avg Ensemble,3.737822790616429,37.739016006374186,6.1432089339671805,0.8976655649469139,5.478476687258817,,0.3482758620689655,
+TFT,4.732738753085752,46.68825874782447,6.832880706394959,0.8733984854887593,7.499287950788588,,0.2137931034482758,
+XGBoost,8.506303251021016,136.22476526016746,11.67153654238239,0.8587214117403497,72.81562821369698,0.2233009708737864,,
+Stacking Ensemble,5.76908337148985,60.03540223313905,7.748251559748112,0.8372059046352616,10.924057540803656,,0.1241379310344827,
+SVR,7.562130215902278,187.88576938734295,13.707143006014892,0.805143828272144,97.82839183781172,0.3223300970873786,,
+VAE-LSTM,8.494939970437716,100.78674732190278,10.039260297546965,0.7267031327397879,14.250142040133243,,0.09310344827586207,
+KNN-10,11.665738368704334,265.7682207905257,16.302399234177948,0.7243720041223407,89.92209971367193,0.2601941747572815,,
+KNN-5,11.751580021863887,270.3768092061512,16.44313866651228,0.7195924409938166,88.41470284019084,0.2757281553398058,,
+KNN-20,12.035078655061884,272.741740151536,16.514894494108525,0.71713977312056,101.98653839713164,0.2407766990291262,,
+BatteryGPT,8.020673063309337,129.06954589210812,11.360877866261395,0.6500105074494692,12.874349389916771,,0.2862068965517241,
+GRU,9.275809104339835,134.65458890701777,11.604076391812397,0.6348659095727935,15.248492181524448,0.193103448275862,,
+Vanilla LSTM,10.56135497947822,155.95773910638056,12.488304092485118,0.5770995427937917,14.375050332959946,0.1586206896551724,,
+Bidirectional LSTM,11.134867385317946,167.3343794115467,12.935779041540046,0.5462502472468515,17.124593156141298,0.1068965517241379,,
+ElasticNet,15.796038594619796,460.3763913475375,21.45638346384445,0.5225440358554927,144.4728811908027,0.0640776699029126,,
+Lasso,15.830927911593506,462.7757498203704,21.51222326539892,0.5200556632227833,145.46522399976544,0.0524271844660194,,
+Ridge,15.860549029501184,464.1921925903698,21.545119925179574,0.5185866716299998,145.65718178268125,0.0504854368932038,,
+iTransformer,14.544141949115827,275.09890344946007,16.58610573490535,0.2540321967199925,73.33406651321724,,0.0172413793103448,
+Attention LSTM,14.181327172488002,288.23989200564256,16.977629163273726,0.2183986327789276,24.827876450140888,0.1586206896551724,,
+DG-iTransformer,14.183794754173379,313.5635374005159,17.707725359303375,0.1497301506825386,94.26231665878274,,0.1103448275862069,0.2
+Physics iTransformer,16.41156271618159,379.65023683665015,19.48461538847124,-0.0294728537142276,97.60205959509742,,0.0,

artifacts/v2/results/lstm_soh_results.csv ADDED Viewed

	@@ -0,0 +1,5 @@

+model,MAE,MSE,RMSE,R2,MAPE,tolerance_acc_2pct
+GRU,9.275809104339837,134.65458890701777,11.604076391812395,0.6348659095727935,15.248492181524448,0.19310344827586207
+Vanilla LSTM,10.561354979478219,155.95773910638056,12.488304092485118,0.5770995427937917,14.375050332959946,0.15862068965517243
+Bidirectional LSTM,11.134867385317946,167.3343794115467,12.935779041540046,0.5462502472468515,17.124593156141298,0.10689655172413794
+Attention LSTM,14.181327172488002,288.23989200564256,16.977629163273726,0.21839863277892768,24.827876450140888,0.15862068965517243

artifacts/v2/results/transformer_soh_results.csv ADDED Viewed

	@@ -0,0 +1,5 @@

+model,MAE,MSE,RMSE,R2,MAPE,tol_2pct
+TFT,9.538144323660262,127.31655440584875,11.283463759229644,0.6547639804432781,18.98017853690174,0.05517241379310345
+BatteryGPT,9.614953606189902,135.34729964687295,11.633885836076997,0.6329875309153767,15.44417472623618,0.1793103448275862
+iTransformer,9.356759048993766,150.35062176683638,12.261754432659153,0.5923039981808029,14.4907916127095,0.1310344827586207
+Physics iTransformer,12.1375468649355,214.65431642999653,14.651085844741901,0.4179358518552816,24.985288391381786,0.07931034482758621

artifacts/v2/results/unified_results.csv ADDED Viewed

	@@ -0,0 +1,23 @@

+model,MAE,MSE,RMSE,R2,MAPE,tolerance_acc_2pct,tol_2pct,tol_5pct
+RandomForest,4.78051739475878,41.771168089034525,6.463061819991708,0.956679157080545,28.80772753128224,0.2951456310679611,,
+LightGBM,6.909989455704782,89.23030742727897,9.446179514876846,0.9074593240133356,56.94413475231037,0.2485436893203883,,
+Weighted Avg Ensemble,3.737822790616429,37.739016006374186,6.1432089339671805,0.8976655649469139,5.478476687258817,,0.3482758620689655,
+TFT,4.732738753085752,46.68825874782447,6.832880706394959,0.8733984854887593,7.499287950788588,,0.2137931034482758,
+XGBoost,8.506303251021016,136.22476526016746,11.67153654238239,0.8587214117403497,72.81562821369698,0.2233009708737864,,
+Stacking Ensemble,5.76908337148985,60.03540223313905,7.748251559748112,0.8372059046352616,10.924057540803656,,0.1241379310344827,
+SVR,7.562130215902278,187.88576938734295,13.707143006014892,0.805143828272144,97.82839183781172,0.3223300970873786,,
+VAE-LSTM,8.494939970437716,100.78674732190278,10.039260297546965,0.7267031327397879,14.250142040133243,,0.09310344827586207,
+KNN-10,11.665738368704334,265.7682207905257,16.302399234177948,0.7243720041223407,89.92209971367193,0.2601941747572815,,
+KNN-5,11.751580021863887,270.3768092061512,16.44313866651228,0.7195924409938166,88.41470284019084,0.2757281553398058,,
+KNN-20,12.035078655061884,272.741740151536,16.514894494108525,0.71713977312056,101.98653839713164,0.2407766990291262,,
+BatteryGPT,8.020673063309337,129.06954589210812,11.360877866261395,0.6500105074494692,12.874349389916771,,0.2862068965517241,
+GRU,9.275809104339835,134.65458890701777,11.604076391812397,0.6348659095727935,15.248492181524448,0.193103448275862,,
+Vanilla LSTM,10.56135497947822,155.95773910638056,12.488304092485118,0.5770995427937917,14.375050332959946,0.1586206896551724,,
+Bidirectional LSTM,11.134867385317946,167.3343794115467,12.935779041540046,0.5462502472468515,17.124593156141298,0.1068965517241379,,
+ElasticNet,15.796038594619796,460.3763913475375,21.45638346384445,0.5225440358554927,144.4728811908027,0.0640776699029126,,
+Lasso,15.830927911593506,462.7757498203704,21.51222326539892,0.5200556632227833,145.46522399976544,0.0524271844660194,,
+Ridge,15.860549029501184,464.1921925903698,21.545119925179574,0.5185866716299998,145.65718178268125,0.0504854368932038,,
+iTransformer,14.544141949115827,275.09890344946007,16.58610573490535,0.2540321967199925,73.33406651321724,,0.0172413793103448,
+Attention LSTM,14.181327172488002,288.23989200564256,16.977629163273726,0.2183986327789276,24.827876450140888,0.1586206896551724,,
+DG-iTransformer,14.183794754173379,313.5635374005159,17.707725359303375,0.1497301506825386,94.26231665878274,,0.1103448275862069,0.2
+Physics iTransformer,16.41156271618159,379.65023683665015,19.48461538847124,-0.0294728537142276,97.60205959509742,,0.0,

artifacts/v2/results/v2_classical_results.csv ADDED Viewed

	@@ -0,0 +1,9 @@

+model,r2,mae,within_5pct
+extra_trees,0.9545111863206289,1.325439106747622,99.27007299270073
+svr,0.9749418151896808,0.8447528977381709,99.27007299270073
+ridge,0.96260007301777,1.238689808585007,99.27007299270073
+xgboost,0.9701536078007467,1.276102318230126,98.72262773722628
+gradient_boosting,0.9427669688063848,1.4114545626753392,98.54014598540147
+knn_k5,0.9303819293635062,1.7036079148305412,97.62773722627736
+random_forest,0.9518568236961867,1.6867479514298702,96.71532846715328
+lightgbm,0.9560384659829149,1.5661312778770975,95.98540145985402

artifacts/v2/results/v2_intra_battery.json ADDED Viewed

	@@ -0,0 +1,9 @@

+{
+  "target": "within_5pct >= 95",
+  "passed_models": 8,
+  "total_models": 8,
+  "best_model": "extra_trees",
+  "best_within_5pct": 99.27007299270073,
+  "best_r2": 0.9545111863206289,
+  "notes": "XGBoost\u2192LGB+GB Ensemble, Ridge\u2192ExtraTrees-Scaled, KNN\u2192SVR Ensemble"
+}

artifacts/v2/results/v2_model_validation.csv ADDED Viewed

	@@ -0,0 +1,17 @@

+model,mae,rmse,r2,within_2pct,within_5pct,passed_95
+random_forest,0.9087291273437881,1.969167860523798,0.9768094711831841,83.75912408759125,95.07299270072993,True
+svr,1.6349861755580366,3.4846680412456976,0.9273780344781068,82.48175182481752,88.86861313868614,False
+xgboost,1.0395094776911078,2.3264169017858896,0.9676316722415876,82.2992700729927,88.86861313868614,False
+lightgbm,1.5073643559068495,2.9029505644051135,0.9496007058102836,81.02189781021897,88.13868613138686,False
+knn_k20,2.486675866923982,6.500672824933502,0.7472670935266853,84.12408759124088,85.21897810218978,False
+knn_k10,2.537245189395581,6.643166546082999,0.7360659298020076,84.48905109489051,84.67153284671532,False
+knn_k5,2.5366661790854557,6.661080117527008,0.734640592527769,84.48905109489051,84.67153284671532,False
+lasso,6.338197702892737,7.855862262540625,0.63090947633286,13.321167883211679,49.63503649635037,False
+ridge,6.354219600702956,7.877916808246509,0.6288341980649439,13.138686131386862,49.63503649635037,False
+elasticnet,6.359292593403684,7.8720819252103045,0.6293838121478381,13.503649635036496,49.27007299270073,False
+physics_itransformer,11.942222882334457,19.560215899810355,-1.288191997636034,15.875912408759124,44.70802919708029,False
+itransformer,18.775672188288702,24.807532802455988,-2.680546617377897,10.948905109489052,25.18248175182482,False
+dynamic_graph_itransformer,14.5743662824896,18.019296255058386,-0.9418730093540923,6.204379562043796,17.335766423357665,False
+extra_trees,21.583514681220528,25.105434141924782,-2.769473079864784,5.839416058394161,9.48905109489051,False
+gradient_boosting,29.48091148539111,32.124687008446905,-5.1719583162447185,0.0,0.18248175182481752,False
+best_rul_model,61.6389790255578,62.979116792145476,-22.721290167829615,0.0,0.0,False

artifacts/v2/results/v2_training_summary.json ADDED Viewed

	@@ -0,0 +1,14 @@

+{
+  "version": "v2.0",
+  "timestamp": "2026-02-25T18:16:44.705048",
+  "total_models": 12,
+  "passed_models": 5,
+  "pass_rate_pct": 41.66666666666667,
+  "best_model": "extra_trees",
+  "best_within_5pct": 99.27007299270073,
+  "best_r2": 0.9545111863206289,
+  "mean_within_5pct": 77.38746958637469,
+  "train_samples": 2130,
+  "test_samples": 548,
+  "batteries": 30
+}

artifacts/v2/results/v2_validation_report.html ADDED Viewed

File without changes

artifacts/v2/results/v2_validation_summary.json ADDED Viewed

	@@ -0,0 +1,11 @@

+{
+  "timestamp": "2026-02-25T16:31:44.904601",
+  "test_samples": 548,
+  "test_batteries": 30,
+  "total_models_tested": 16,
+  "models_passed_95pct": 1,
+  "overall_pass_rate_pct": 6.25,
+  "best_model": "random_forest",
+  "best_within_5pct": 95.07299270072993,
+  "mean_within_5pct": 53.809306569343065
+}

artifacts/v2/results/vae_lstm_results.json ADDED Viewed

	@@ -0,0 +1,8 @@

+{
+  "MAE": 8.494939970437716,
+  "MSE": 100.78674732190278,
+  "RMSE": 10.039260297546965,
+  "R2": 0.7267031327397879,
+  "MAPE": 14.250142040133243,
+  "tol_2pct": 0.09310344827586207
+}

cleaned_dataset/metadata.csv ADDED Viewed

The diff for this file is too large to render. See raw diff

docker-compose.yml ADDED Viewed

	@@ -0,0 +1,69 @@

+version: "3.9"
+# ─────────────────────────────────────────────────────────────────────────────
+# AI Battery Lifecycle Predictor — Docker Compose
+#
+# Services
+# ────────
+#  app        Production: single container (React SPA + FastAPI, port 7860)
+#  api-dev    Development: backend only with hot-reload (activate with --profile dev)
+#
+# Usage
+# ─────
+#  Production:   docker compose up --build
+#  Development:  docker compose --profile dev up api-dev
+#                (then separately: cd frontend && npm run dev)
+# ─────────────────────────────────────────────────────────────────────────────
+services:
+  # ── Production ──────────────────────────────────────────────────────────────
+  app:
+    build:
+      context: .
+      dockerfile: Dockerfile
+    image: battery-lifecycle:latest
+    container_name: battery_lifecycle
+    ports:
+      - "7860:7860"
+    environment:
+      LOG_LEVEL: "INFO"
+      WORKERS: "1"
+    volumes:
+      # Persist rotated log files on the host
+      - ./artifacts/logs:/app/artifacts/logs
+    healthcheck:
+      test:
+        - CMD
+        - python
+        - -c
+        - "import urllib.request; urllib.request.urlopen('http://localhost:7860/health')"
+      interval: 30s
+      timeout: 10s
+      retries: 3
+      start_period: 90s          # give models time to load
+    restart: unless-stopped
+  # ── Development (backend only, hot-reload) ──────────────────────────────────
+  api-dev:
+    build:
+      context: .
+      dockerfile: Dockerfile
+      target: runtime             # stop before copying built frontend
+    image: battery-lifecycle:dev
+    container_name: battery_lifecycle_dev
+    command: >
+      uvicorn api.main:app
+      --host 0.0.0.0
+      --port 7860
+      --reload
+    ports:
+      - "7860:7860"
+    environment:
+      LOG_LEVEL: "DEBUG"
+    volumes:
+      - ./api:/app/api            # live-reload source changes
+      - ./src:/app/src
+      - ./artifacts:/app/artifacts
+    profiles:
+      - dev

docs/api.md ADDED Viewed

	@@ -0,0 +1,237 @@

+# API Documentation
+## Base URL
+- **Local:** `http://localhost:7860`
+- **Docker:** `http://localhost:7860`
+- **Hugging Face Spaces:** `https://neerajcodz-aibatterylifecycle.hf.space`
+## Interactive Docs
+- **Swagger UI:** `/docs`
+- **ReDoc:** `/redoc`
+- **Gradio UI:** `/gradio`
+## API Versioning (v2.1.0)
+The API supports two model generations served in parallel:
+| Prefix | Models | Split Strategy | Notes |
+|--------|--------|---------------|-------|
+| `/api/v1/*` | v1 models (cross-battery split) | Group-battery 80/20 | Legacy |
+| `/api/v2/*` | v2 models (chrono split, bug-fixed) | Intra-battery 80/20 | **Recommended** |
+| `/api/*` | Default (v2) | Same as v2 | Backward-compatible |
+### v2 Bug Fixes
+- **avg_temp auto-correction removed** — v1 silently added 8°C to avg_temp
+- **Recommendation baseline fixed** — v1 re-predicted SOH, yielding ~0 improvement
+---
+## Endpoints
+### Health Check
+```http
+GET /health
+```
+Response:
+```json
+{
+  "status": "ok",
+  "version": "2.0.0",
+  "models_loaded": 12,
+  "device": "cpu"
+}
+```
+---
+### Single Prediction
+```http
+POST /api/predict
+Content-Type: application/json
+```
+Request:
+```json
+{
+  "battery_id": "B0005",
+  "cycle_number": 100,
+  "ambient_temperature": 24.0,
+  "peak_voltage": 4.2,
+  "min_voltage": 2.7,
+  "avg_current": 2.0,
+  "avg_temp": 25.0,
+  "temp_rise": 3.0,
+  "cycle_duration": 3600,
+  "Re": 0.04,
+  "Rct": 0.02,
+  "delta_capacity": -0.005
+}
+```
+Optionally include `"model_name"` to select a specific model (leave null to use the registry default):
+```json
+{
+  ...
+  "model_name": "random_forest"
+}
+```
+Response:
+```json
+{
+  "battery_id": "B0005",
+  "cycle_number": 100,
+  "soh_pct": 92.5,
+  "rul_cycles": 450,
+  "degradation_state": "Healthy",
+  "confidence_lower": 90.5,
+  "confidence_upper": 94.5,
+  "model_used": "random_forest",
+  "model_version": "v1.0.0"
+}
+```
+---
+### Ensemble Prediction
+```http
+POST /api/predict/ensemble
+Content-Type: application/json
+```
+Always uses the **BestEnsemble (v3.0.0)** — weighted average of Random Forest, XGBoost, and
+LightGBM (weights proportional to R²). Body is identical to single prediction.
+Response includes `"model_version": "v3.0.0"`.
+---
+### Batch Prediction
+```http
+POST /api/predict/batch
+Content-Type: application/json
+```
+Request:
+```json
+{
+  "battery_id": "B0005",
+  "cycles": [
+    {"cycle_number": 1, "ambient_temperature": 24, ...},
+    {"cycle_number": 2, "ambient_temperature": 24, ...}
+  ]
+}
+```
+---
+### Recommendations
+```http
+POST /api/recommend
+Content-Type: application/json
+```
+Request:
+```json
+{
+  "battery_id": "B0005",
+  "current_cycle": 100,
+  "current_soh": 85.0,
+  "ambient_temperature": 24.0,
+  "top_k": 5
+}
+```
+Response:
+```json
+{
+  "battery_id": "B0005",
+  "current_soh": 85.0,
+  "recommendations": [
+    {
+      "rank": 1,
+      "ambient_temperature": 24.0,
+      "discharge_current": 0.5,
+      "cutoff_voltage": 2.7,
+      "predicted_rul": 500,
+      "rul_improvement": 50,
+      "rul_improvement_pct": 11.1,
+      "explanation": "Operate at 24°C, 0.5A, cutoff 2.7V for ~500 cycles RUL"
+    }
+  ]
+}
+```
+---
+### Dashboard Data
+```http
+GET /api/dashboard
+```
+Returns full dashboard payload with battery fleet stats, capacity fade curves, and model metrics.
+---
+### Battery List
+```http
+GET /api/batteries
+```
+---
+### Battery Capacity
+```http
+GET /api/battery/{battery_id}/capacity
+```
+---
+### Model List
+```http
+GET /api/models
+```
+Returns every registered model with version, family, R², and load status.
+---
+### Model Versions
+```http
+GET /api/models/versions
+```
+Groups models by generation:
+```json
+{
+  "v1_classical":  ["ridge", "lasso", "random_forest", "xgboost", "lightgbm", ...],
+  "v2_deep":       ["vanilla_lstm", "bilstm", "gru", "attention_lstm", "tft", ...],
+  "v2_ensemble":   ["best_ensemble"],
+  "other":         [],
+  "default_model": "best_ensemble"
+}
+```
+---
+### Figures
+```http
+GET /api/figures          # List all
+GET /api/figures/{name}   # Serve a figure
+```

docs/architecture.md ADDED Viewed

	@@ -0,0 +1,59 @@

+# Architecture Overview
+## System Architecture
+```
+┌──────────────────────────────────────────────────────────────────┐
+│                    Docker Container (port 7860)                   │
+├──────────────┬───────────────┬───────────────────────────────────┤
+│  React SPA   │  Gradio UI    │  FastAPI Backend                  │
+│  (static)    │  /gradio      │  /api/*     /docs     /health     │
+│  /           │               │                                    │
+├──────────────┴───────────────┴───────────────────────────────────┤
+│                      Model Registry                               │
+│  ┌─────────┐  ┌──────────┐  ┌──────────┐  ┌──────────┐          │
+│  │Classical │  │ LSTM×4   │  │Transform.│  │ Ensemble │          │
+│  │ models   │  │ GRU      │  │ GPT, TFT │  │ Stack/WA │          │
+│  └─────────┘  └──────────┘  └──────────┘  └──────────┘          │
+├──────────────────────────────────────────────────────────────────┤
+│                  Data Pipeline (src/)                              │
+│  loader.py → features.py → preprocessing.py → model training     │
+├──────────────────────────────────────────────────────────────────┤
+│                 NASA PCoE Dataset (cleaned_dataset/)              │
+└──────────────────────────────────────────────────────────────────┘
+```
+## Data Flow
+1. **Ingestion:** `loader.py` reads metadata.csv + per-cycle CSVs
+2. **Feature Engineering:** `features.py` computes SOC, SOH, RUL, scalar features per cycle
+3. **Preprocessing:** `preprocessing.py` creates sliding windows, scales features, splits by battery
+4. **Training:** Notebooks train each model family, save checkpoints to `artifacts/models/`
+5. **Serving:** `model_registry.py` loads all models at startup
+6. **Prediction:** API receives features → registry dispatches to best model → returns SOH/RUL
+7. **Simulation:** `POST /api/v2/simulate` receives multi-battery config → vectorized Arrhenius degradation + ML via `predict_array()` → returns per-step SOH, RUL, and degradation-state history for each battery
+8. **Visualization:** Frontend fetches results and renders analytics (fleet overview, compare, temperature analysis, recommendations)
+## Model Registry
+The `ModelRegistry` singleton:
+- Scans `artifacts/models/classical/` for `.joblib` files (sklearn/xgb/lgbm)
+- Scans `artifacts/models/deep/` for `.pt` (PyTorch) and `.keras` (TF) files
+- Loads classical models eagerly; deep models registered lazily
+- Selects default model by priority: XGBoost > LightGBM > RandomForest > Ridge > deep models
+- Provides unified `predict()` interface regardless of framework
+- `predict_array(X: np.ndarray, model_name: str)` batch method enables vectorized simulation: accepts an (N, n_features) array and returns predictions for all N cycles in one call, avoiding Python loops
+- `_x_for_model()` normalizes input feature extraction for both single-cycle and batch paths
+- `_load_scaler()` lazily loads per-model scalers from `artifacts/scalers/`
+## Frontend Architecture
+- **Vite 7** build tool with React 19 + TypeScript 5.9
+- **lucide-react 0.575** for all icons — no emojis used anywhere in the UI
+- **Recharts 3** for all 2D charts (BarChart, AreaChart, LineChart, ScatterChart, RadarChart, PieChart)
+- **TailwindCSS 4** for styling
+- Tabs: Simulation | Predict | Metrics | Analytics | Recommendations | Research Paper
+- API proxy in dev mode (`/api` → `localhost:7860`) → same-origin in production (served by FastAPI)
+- **Analytics (GraphPanel):** 4-section dashboard — Fleet Overview (health kpi, fleet SOH bar, bubble scatter), Single Battery (SOH + RUL projection, capacity fade, degradation rate), Compare (multi-battery overlay), Temperature Analysis
+- **Metrics (MetricsPanel):** 6-section interactive dashboard — Overview KPIs, Models (sort/filter/chart-type controls), Validation, Deep Learning, Dataset stats, Figures searchable gallery
+- **Recommendations (RecommendationPanel):** Slider inputs for SOH/temp, 3 chart tabs (RUL bar, params bar, top-3 radar), expandable table rows with per-recommendation explanation

docs/dataset.md ADDED Viewed

	@@ -0,0 +1,76 @@

+# Dataset Documentation
+## NASA PCoE Li-ion Battery Dataset
+### Source
+- **Repository:** NASA Prognostics Center of Excellence (PCoE)
+- **Reference:** B. Saha and K. Goebel (2007). *Battery Data Set*, NASA Prognostics Data Repository, NASA Ames Research Center, Moffett Field, CA
+- **URL:** https://www.nasa.gov/content/prognostics-center-of-excellence-data-set-repository
+### Cells
+- **Type:** Li-ion 18650
+- **Nominal capacity:** 2.0 Ah
+- **Count:** 30 batteries after cleaning (from original 36)
+- **Total discharge cycles:** 2,678
+- **Sliding windows generated:** 1,734 (window size = 32 cycles)
+### Temperature Groups (discovered in EDA)
+| Group | Temperature | # Batteries | # Cycles |
+|-------|-------------|-------------|----------|
+| 1 | 4°C | 3 | ~200 |
+| 2 | 22°C | 4 | ~280 |
+| 3 | 24°C | 16 | ~1700 |
+| 4 | 43°C | 4 | ~320 |
+| 5 | 44°C | 3 | ~180 |
+Note: 5 temperature groups were discovered (not 3 as originally assumed).
+### End-of-Life Definitions
+- **30% capacity fade:** 1.4 Ah (default threshold)
+- **20% capacity fade:** 1.6 Ah (alternative)
+### Cycle Types
+#### Discharge
+Columns: `Voltage_measured`, `Current_measured`, `Temperature_measured`, `Current_load`, `Voltage_load`, `Time`
+#### Charge
+Columns: `Voltage_measured`, `Current_measured`, `Temperature_measured`, `Current_charge`, `Voltage_charge`, `Time`
+#### Impedance
+Columns: `Sense_current` (Re + Im), `Battery_current` (Re + Im), `Current_ratio` (Re + Im), `Battery_impedance` (Re + Im)
+### Metadata Schema (metadata.csv)
+- `type`: cycle type (charge, discharge, impedance)
+- `start_time`: MATLAB datenum
+- `ambient_temperature`: °C
+- `battery_id`: identifier
+- `test_id`: test sequence number
+- `uid`: unique identifier
+- `filename`: path to cycle CSV
+- `Capacity`: measured capacity (Ah)
+- `Re`: electrolyte resistance (Ω)
+- `Rct`: charge transfer resistance (Ω)
+### Feature Engineering
+#### Per-Cycle Scalar Features (12 dimensions)
+1. `cycle_number` — sequential cycle index
+2. `ambient_temperature` — environment temperature
+3. `peak_voltage` — max voltage in cycle
+4. `min_voltage` — min voltage in cycle
+5. `voltage_range` — peak - min
+6. `avg_current` — mean current magnitude
+7. `avg_temp` — mean cell temperature
+8. `temp_rise` — max - min temperature
+9. `cycle_duration` — total time (s)
+10. `Re` — electrolyte resistance
+11. `Rct` — charge transfer resistance
+12. `delta_capacity` — capacity change from previous cycle
+#### Derived Targets
+- **SOC:** Coulomb counting (integrated current)
+- **SOH:** (Current capacity / Nominal capacity) × 100%
+- **RUL:** Cycles remaining until EOL threshold
+- **Degradation State:** Healthy (≥90%), Moderate (80–90%), Degraded (70–80%), End-of-Life (<70%)

docs/deployment.md ADDED Viewed

	@@ -0,0 +1,131 @@

+# Deployment Guide
+## Local Development
+### Backend
+```bash
+cd aiBatteryLifecycle
+.\venv\Scripts\activate          # Windows
+source venv/bin/activate          # Linux/Mac
+uvicorn api.main:app --host 0.0.0.0 --port 7860 --reload
+```
+### Frontend (dev mode)
+```bash
+cd frontend
+npm install
+npm run dev
+```
+Frontend proxies `/api/*` to `localhost:7860` in dev mode.
+### Frontend (production build)
+```bash
+cd frontend
+npm run build
+```
+Built files go to `frontend/dist/` and are served by FastAPI.
+---
+## Docker
+### Build
+```bash
+docker build -t battery-predictor .
+```
+### Run
+```bash
+docker run -p 7860:7860 battery-predictor
+```
+### Build stages
+1. **frontend-build:** `node:20-slim` — installs npm deps and builds React SPA
+2. **runtime:** `python:3.11-slim` — installs Python deps, copies source and built frontend
+### Docker Compose (recommended)
+```bash
+# Production — single container (frontend + API)
+docker compose up --build
+# Development — backend only with hot-reload
+docker compose --profile dev up api-dev
+# then separately:
+cd frontend && npm run dev
+```
+### Environment Variables
+| Variable | Default | Description |
+|----------|---------|-------------|
+| `LOG_LEVEL` | `INFO` | Logging verbosity (`DEBUG` / `INFO` / `WARNING` / `ERROR`) |
+| `WORKERS` | `1` | Uvicorn worker count |
+---
+## Hugging Face Spaces
+### Setup
+1. Create a new Space on Hugging Face (SDK: Docker)
+2. Push the repository to the Space
+3. The Dockerfile exposes port 7860 (HF Spaces default)
+### Dockerfile Requirements
+- Must expose port **7860**
+- Must respond to health checks at `/health`
+- Keep image size manageable (use CPU-only PyTorch/TF)
+### Files to include
+```
+Dockerfile
+requirements.txt
+api/
+src/
+frontend/               # Vite builds during Docker image creation
+cleaned_dataset/
+artifacts/v1/            # v1 model checkpoints (legacy)
+artifacts/v2/            # v2 model checkpoints (recommended)
+artifacts/models/        # Root-level models (backward compat)
+```
+### HuggingFace Space URL
+```
+https://huggingface.co/spaces/NeerajCodz/aiBatteryLifeCycle
+```
+### Space configuration (README.md header)
+```yaml
+---
+title: AI Battery Lifecycle Predictor
+emoji: 🔋
+colorFrom: green
+colorTo: blue
+sdk: docker
+pinned: false
+app_port: 7860
+---
+```
+---
+## Production Considerations
+### Performance
+- Use `--workers N` for multi-core deployment
+- Enable GPU passthrough for deep model inference: `docker run --gpus all`
+- Consider preloading all models (not lazy loading)
+### Security
+- Set `CORS_ORIGINS` to specific domains in production
+- Add authentication middleware if needed
+- Use HTTPS reverse proxy (nginx, Caddy)
+### Monitoring
+- Health endpoint: `/health`
+- Logs: JSON-per-line rotating log at `artifacts/logs/battery_lifecycle.log` (10 MB × 5 backups)
+  — set `LOG_LEVEL=DEBUG` for verbose output, mount volume to persist across container restarts
+- Metrics: Add Prometheus endpoint if needed

docs/frontend.md ADDED Viewed

	@@ -0,0 +1,219 @@

+# Frontend Documentation
+## Technology Stack
+| Technology | Version | Purpose |
+|-----------|---------|----------|
+| Vite | 7.x | Build tool & dev server |
+| React | 19.x | UI framework |
+| TypeScript | 5.9.x | Type safety |
+| Recharts | 3.7.x | Interactive 2D charts (BarChart, LineChart, AreaChart, RadarChart, ScatterChart, PieChart) |
+| lucide-react | 0.575.x | Icon system — **no emojis in UI** |
+| TailwindCSS | 4.x | Utility-first CSS |
+| Axios | 1.x | HTTP client |
+## Project Structure
+```
+frontend/
+├── index.html
+├── vite.config.ts          # Vite + /api proxy
+├── tsconfig.json
+├── package.json
+└── src/
+    ├── main.tsx
+    ├── App.tsx             # Root, tab navigation, v1/v2 selector
+    ├── api.ts              # All API calls + TypeScript types
+    ├── index.css
+    └── components/
+        ├── Dashboard.tsx           # Fleet overview heatmap + capacity charts
+        ├── PredictionForm.tsx      # Single-cycle SOH prediction + gauge
+        ├── SimulationPanel.tsx     # Multi-battery lifecycle simulation
+        ├── MetricsPanel.tsx        # Full model metrics dashboard
+        ├── GraphPanel.tsx          # Analytics — fleet, single battery, compare, temperature
+        └── RecommendationPanel.tsx # Operating condition optimizer with charts
+```
+## Tab Order
+| Tab | Component | Description |
+|-----|-----------|-------------|
+| Simulation | `SimulationPanel` | ML-backed multi-battery lifecycle forecasting |
+| Predict | `PredictionForm` | Single-cycle SOH + RUL prediction |
+| Metrics | `MetricsPanel` | Full model evaluation dashboard |
+| Analytics | `GraphPanel` | Fleet & per-battery interactive analytics |
+| Recommendations | `RecommendationPanel` | Operating condition optimizer |
+| Research Paper | — | Embedded research PDF |
+## Components
+### MetricsPanel
+Full interactive model evaluation dashboard with 6 switchable sections:
+- **Overview** — KPI stat cards with lucide icons, R² ranking bar chart, model family pie chart, normalized radar chart (top 5), R² vs MAE scatter trade-off plot, Top-3 rankings podium
+- **Models** — Interactive sort (R²/MAE/RMSE/MAPE, asc/desc), family filter dropdown, chart-type toggle (bar/radar/scatter), multi-select compare mode, colour-coded metric badges, full metrics table with per-row highlighting
+- **Validation** — Within-5% / within-10% grouped bar chart, full validation table with pass/fail badges
+- **Deep Learning** — LSTM/ensemble/VAE-LSTM/DG-iTransformer results with charts and metric tables
+- **Dataset** — Battery stats cards, engineered features list, temperature groups, degradation distribution bar chart, SOH range gauge
+- **Figures** — Searchable grid of all artifact figures with modal lightbox on click
+**Key features:**
+- All icons via `lucide-react` (no emojis)
+- `filteredModels` useMemo respects active sort/filter state
+- `MetricBadge` component colour-codes values green/yellow/red based on model quality thresholds
+- `SectionBadge` nav bar with icon + label
+### GraphPanel (Analytics)
+Four-section analytics dashboard:
+- **Fleet Overview** — SOH bar chart sorted by health (colour-coded green/yellow/red), SOH vs cycles bubble scatter (bubble = temperature), fleet status KPI cards (healthy/degraded/near-EOL), filter controls (min SOH slider, temp range), clickable battery roster table
+- **Single Battery** — SOH trajectory + linear RUL projection overlay, capacity fade area chart, smoothed degradation rate area chart, show/hide EOL reference line toggle
+- **Compare** — Multi-select up to 5 batteries; SOH overlay line chart with distinct colours per battery, capacity fade overlay, summary comparison table (final SOH, cycles, min capacity)
+- **Temperature Analysis** — Temperature vs SOH scatter, temperature distribution histogram
+**Key features:**
+- Multi-battery data loaded in parallel using `Promise.all`
+- RUL projection via least-squares on last 20 cycles → extrapolated to 70% SOH
+- `SohBadge` component with dynamic colour + icon
+### RecommendationPanel
+Interactive optimizer replacing the previous plain form + table:
+- **Input form** — Text input for battery ID, range sliders for SOH and ambient temperature, numeric inputs for cycle and top-k
+- **Summary cards** — Battery ID, best predicted RUL, best improvement, config count
+- **Visual Analysis tabs:**
+  - *RUL Comparison* — bar chart comparing predicted RUL across all recommendations
+  - *Parameters* — grouped bar chart showing temp/current/cutoff per rank
+  - *Radar* — normalized multi-metric radar chart for top-3 configs
+- **Recommendations table** — rank icons (Trophy/Award/Medal from lucide), colour-coded improvement badges, expandable rows showing per-recommendation explanation and parameter details
+### SimulationPanel
+- Configure up to N battery simulations with individual parameters (temp, voltage, current, EOL threshold)
+- Select ML model for lifecycle prediction (or pure physics fallback)
+- Animated SOH trajectory charts, final stats table, degradation state timeline
+### Dashboard
+- Fleet battery grid with SOH colour coding
+- Capacity fade line chart per selected battery
+- Model metrics bar chart
+### PredictionForm
+- 12-input form with all engineered cycle features
+- SOH gauge visualization (SVG ring) with degradation state colour
+- Confidence interval display
+- Model selector (v2 models / best_ensemble)
+## API Integration (`api.ts`)
+All API calls return typed TypeScript response objects:
+| Function | Endpoint | Description |
+|----------|----------|--------------|
+| `fetchDashboard()` | `GET /api/dashboard` | Fleet overview + capacity fade data |
+| `fetchBatteries()` | `GET /api/batteries` | All battery metadata |
+| `fetchBatteryCapacity(id)` | `GET /api/battery/{id}/capacity` | Per-battery cycles, capacity, SOH arrays |
+| `predictSoh(req)` | `POST /api/v2/predict` | Single-cycle SOH + RUL prediction |
+| `fetchRecommendations(req)` | `POST /api/v2/recommend` | Operating condition optimization |
+| `simulateBatteries(req)` | `POST /api/v2/simulate` | Multi-battery lifecycle simulation |
+| `fetchMetrics()` | `GET /api/metrics` | Full model evaluation metrics |
+| `fetchModels()` | `GET /api/v2/models` | All loaded models with metadata |
+## Development
+```bash
+cd frontend
+npm install
+npm run dev        # http://localhost:5173
+```
+API requests proxy to `http://localhost:7860` in dev mode (see `vite.config.ts`).
+## Build
+```bash
+npm run build      # outputs to dist/
+```
+The built `dist/` folder is served as static files by FastAPI at the root path.
+| Vite | 6.x | Build tool & dev server |
+| React | 18.x | UI framework |
+| TypeScript | 5.x | Type safety |
+| Three.js | latest | 3D rendering |
+| @react-three/fiber | latest | React renderer for Three.js |
+| @react-three/drei | latest | Three.js helpers |
+| Recharts | latest | 2D charts |
+| TailwindCSS | 4.x | Utility-first CSS |
+| Axios | latest | HTTP client |
+## Project Structure
+```
+frontend/
+├── index.html              # Entry HTML
+├── vite.config.ts          # Vite + proxy config
+├── tsconfig.json           # TypeScript config
+├── package.json
+├── public/
+│   └── vite.svg            # Favicon
+└── src/
+    ├── main.tsx            # React entry point
+    ├── App.tsx             # Root component + tab navigation
+    ├── api.ts              # API client + types
+    ├── index.css           # TailwindCSS import
+    └── components/
+        ├── Dashboard.tsx       # Fleet overview + charts
+        ├── PredictionForm.tsx  # SOH prediction form + gauge
+        ├── BatteryViz3D.tsx    # 3D battery pack heatmap
+        ├── GraphPanel.tsx      # Analytics / per-battery graphs
+        └── RecommendationPanel.tsx  # Operating condition optimizer
+```
+## Components
+### Dashboard
+- Stat cards (battery count, models, best R²)
+- Battery fleet grid with SOH color coding
+- SOH capacity fade line chart (Recharts)
+- Model R² comparison bar chart
+### PredictionForm
+- 12-input form with all cycle features
+- SOH gauge visualization (SVG ring)
+- Result display with degradation state coloring
+- Confidence interval display
+### BatteryViz3D
+- 3D battery pack with cylindrical cells
+- SOH-based fill level and color mapping
+- Click-to-inspect with side panel details
+- Auto-rotation with orbit controls
+- Health legend and battery list sidebar
+### GraphPanel
+- Battery selector dropdown
+- Per-battery SOH trajectory (line chart)
+- Per-battery capacity fade curve
+- Fleet scatter plot (SOH vs cycles, bubble size = temperature)
+### RecommendationPanel
+- Input: battery ID, current cycle, SOH, temperature, top_k
+- Table of ranked recommendations
+- Shows temperature, current, cutoff voltage, predicted RUL, improvement %
+## Development
+```bash
+cd frontend
+npm install
+npm run dev        # http://localhost:5173
+```
+API requests are proxied to `http://localhost:7860` during development.
+## Production Build
+```bash
+npm run build      # Outputs to dist/
+```
+The built files are served by FastAPI as static assets.