Spaces:

shadowsilence
/

MediSim

Sleeping

App Files Files Community

shadowsilence commited on 22 days ago

Commit

b17407a

verified ·

1 Parent(s): 5454cc5

Fix README config and remove merge conflict markers

Browse files

Files changed (1) hide show

README.md +54 -163

README.md CHANGED Viewed

@@ -1,4 +1,3 @@
-<<<<<<< HEAD
 ---
 title: MediSim
 emoji: "🩺"
@@ -11,192 +10,84 @@ pinned: false
 # MediSim: Multimodal Diagnostic and Agentic Triage System
-MediSim is an AI-powered medical assistant web application designed to safely process health inputs. It serves as our core NLP research project, targeting the reduction of clinical hallucination in generative healthcare applications using hybrid learning pipelines and multi-agent orchestration.
-=======
-# MediSim: Multimodal Diagnostic and Agentic Triage System
-**MediSim** is an AI-powered medical assistant web application designed to safely process complex health inputs. It serves as our core NLP research project, specifically targeting the reduction of clinical hallucination in generative healthcare applications using hybrid learning pipelines.
->>>>>>> origin/main
 ## Core Features
-MediSim offers two distinct standalone features addressing different triage and diagnostic modalities.
-<<<<<<< HEAD
 ### 1. Multimodal Diagnostic Assistant
-- **Purpose**: Provides preliminary diagnostic assessments by combining image data and clinical context.
-- **Input**: Medical scans (e.g., Chest X-ray) + Symptom descriptions.
-- **Architecture**: A vision-language fusion approach.
-  - **Vision**: ResNet-18 Image Encoder.
-  - **Text**: biLSTM Text Encoder.
-  - **Fusion**: Late-fusion layer with softmax classification.
-- **Advantage**: Higher reliability and lower compute requirements than standard large multimodal models in specialized domains.
-### 2. Agentic Triage & Consultation
-- **Purpose**: Interactively gathers patient symptoms and provides verified clinical guidance.
-- **Processing**: A three-agent coordination loop:
-  - **Triage Nurse**: Empathetic intake and symptom gathering.
-  - **Specialist Doctor**: Constructing differential hypotheses and clinical steps.
-  - **Fact-Checker**: Cross-verifying responses against clinical safety guidelines to prevent hallucinations.
-- **Advantage**: Drastically mitigates clinical AI hallucination through collaborative verification.
-## Project Architecture
-The project has transitioned to a professional distributed architecture:
-- **Frontend**: React (TypeScript) + Vite with a Premium Glassmorphism UI.
-- **Backend**: FastAPI (Python) serving our diagnostic models and agent orchestration.
-- **Database/Auth**: Firebase (Auth & Firestore) for secure Google sign-in and persistent user history.
-### Directory Structure
-```
 MediSim/
-├── web_app_pro/           # Professional Web Application Suite
-│   ├── frontend/          # React + Vite + Tailwind (Glassmorphism UI)
-│   └── backend/           # FastAPI + PyTorch + LangChain
-├── data/                  # Trained model weights and vocabulary
-├── notebooks/             # Training pipelines (ResNet18-biLSTM)
-├── reports/               # ACL-formatted project reports
-└── README.md              # Project documentation
 ```
-## Setup and Installation
-### Backend (FastAPI)
-1. Navigate to the backend directory:
-   ```bash
-   cd web_app_pro/backend
-   ```
-2. Install dependencies:
-   ```bash
-   pip install -r requirements.txt
-   ```
-3. Run the development server:
-   ```bash
-   python main.py
-   ```
-### Frontend (React)
-1. Navigate to the frontend directory:
-   ```bash
-   cd web_app_pro/frontend
-   ```
-2. Install dependencies:
-   ```bash
-   npm install
-   ```
-3. Run the development server:
-   ```bash
-   npm run dev
-   ```
-## Deployment
-The project includes a Dockerfile for easy deployment to platforms like Hugging Face Spaces. It serves the React application via FastAPI static mounting.
-## Team Members
-=======
-### Feature 1: Multimodal Diagnostic Assistant
-- **Purpose**: To provide preliminary diagnostic assessments by combining image data and clinical test inputs.
-- **Input**: A patient medical scan (e.g., Chest X-ray) accompanied by their symptom descriptions.
-- **Processing**: A deterministic vision-language fusion approach.
-  - Images are processed using a Convolutional Neural Network (CNN).
-  - Textual symptoms are processed using a Bidirectional LSTM (biLSTM).
-  - Features are aligned via a multimodal fusion layer to output structured diagnoses.
-- **Advantage**: Bypasses the high compute requirements of monolithic Large Multimodal Models (LMMs) and provides distinct interpretability limits.
-### Feature 2: Multi-Agent Triage & Consultation
-- **Purpose**: To interactively gather patient symptom data and propose verified clinical next steps.
-- **Processing**: A highly structured interactions loop involving three distinct Large Language Model (LLM) agents powered locally or via fast-inference APIs.
-  - **Triage Nurse Agent**: Engages patients to gather unstructured symptom descriptions and medical histories.
-  - **Specialist Doctor Agent**: Constructs possible differential hypotheses and clinical steps.
-  - **Medical Fact-Checker Agent**: Evaluates the specialist's outputs against clinical safety guidelines to actively block generative hallucination or unsafe recommendations.
-## Project Architecture (Phase 2 Focus)
-During **Phase 2**, we established the core hypotheses of our system:
-1. Multimodal baseline fusions can compete effectively with heavy LMMs in constrained environments.
-2. A Multi-Agent debate structure drastically mitigates clinical AI hallucination compared to standard single-prompt systems.
-### Directory Structure
 ```
-MediSim/
-├── data/                  # Standardized clinical datasets (e.g., IU X-Ray extracts)
-├── notebooks/             # Jupyter notebooks containing baseline training pipelines
-├── reports/
-│   └── Phase2/            # ACL-formatted PDF Proposal, presentation deck, and LaTeX sources
-├── web_app/               # The upcoming user-facing interface (Streamlit application)
-└── README.md              # This file
 ```
-<<<<<<< HEAD
-## Setup and Installation
-*Note: MediSim is currently in active development. Complete integration targets Phase 3.*
-=======
-## Project Status
-- **Phase 1: Literature Review** - [Complete] (Summarized in report)
-- **Phase 2: Project Proposal** - [Finalized] (See `reports/Phase2/` for report and presentation)
-- **Phase 3: Implementation** - [In Project] (Baseline data processing in `notebooks/`)
-- **Phase 4: Final Evaluation** - [Pending Phase 3]
-## Setup and Installation
-*Note: MediSim is currently in the proposal-to-implementation transition phase.*
->>>>>>> e2fd362 (Finalize Phase 2: Refined report, resized Figure 1, updated tables, and synced deliverables)
-**Requirements:**
-- Python 3.10+
-- PyTorch (for the Multimodal CNN/biLSTM baselines)
-- LangChain / LlamaIndex (for Multi-Agent orchestration)
-- Streamlit (for the Web Interface)
-1. **Clone the repository**
-   ```bash
-   git clone https://github.com/shadowsilence94/MediSim.git
-   cd MediSim
-   ```
-2. **Install Dependencies** (Placeholder for the final requirements file)
-   ```bash
-   pip install -r requirements.txt
-   ```
-3. **Running the Web App** (Scheduled for Phase 3)
-   ```bash
-   streamlit run web_app/app.py
-   ```
-<<<<<<< HEAD
-=======
-## Compiling the Phase 2 Report
-The Phase 2 report is written in LaTeX using the ACL template. To compile the raw source:
-1. Ensure you have a TeX distribution (e.g., TeX Live or MiKTeX) installed.
-2. Navigate to `reports/Phase2/source/`.
-3. Run the following sequence from your terminal:
-   ```bash
-   pdflatex report.tex
-   bibtex report
-   pdflatex report.tex
-   pdflatex report.tex
-   ```
-This will generate the final `report.pdf`.
->>>>>>> e2fd362 (Finalize Phase 2: Refined report, resized Figure 1, updated tables, and synced deliverables)
 ## Team Members
->>>>>>> origin/main
 - Htut Ko Ko (st126010)
 - Imtiaz Ahmad (st126685)
 - Michael R. Lacar (st126161)
 - Aashutosh Raut (st126438)
-<<<<<<< HEAD
 ## References
-Refer to reports/Phase2/report.pdf for the full methodology and literature review.
-=======
-## References & Readings
-The architectural choices for MediSim are modeled after state-of-the-art papers exclusively retrieved from the ACL Anthology, emphasizing safe conversation generation and lightweight clinical representation learning. Refer to `reports/Phase2/report.pdf` for the full methodology and literature review.
->>>>>>> origin/main

 ---
 title: MediSim
 emoji: "🩺"
 # MediSim: Multimodal Diagnostic and Agentic Triage System
+MediSim is an AI-powered medical assistant web application designed to safely process health inputs. It is developed as an NLP research project focused on reducing clinical hallucination in generative healthcare applications using hybrid learning pipelines and multi-agent orchestration.
 ## Core Features
 ### 1. Multimodal Diagnostic Assistant
+- Purpose: Provides preliminary diagnostic assessments by combining medical image data and symptom descriptions.
+- Input: Medical scans (for example, chest X-ray) plus symptom text.
+- Architecture:
+  - Vision encoder: ResNet-18.
+  - Text encoder: biLSTM.
+  - Fusion head: late-fusion classifier.
+- Advantage: Better reliability and lower compute demands than large generic multimodal models in this domain.
+### 2. Agentic Triage and Consultation
+- Purpose: Interactively gathers symptoms and provides verified clinical guidance.
+- Processing: Three-agent collaboration loop:
+  - Triage Nurse: empathic intake and symptom collection.
+  - Specialist Doctor: differential reasoning and next-step planning.
+  - Fact Checker: verifies outputs against safety constraints.
+- Advantage: Reduces hallucination risk through explicit multi-agent verification.
+## Architecture
+- Frontend: React + TypeScript + Vite.
+- Backend: FastAPI + PyTorch + LangChain orchestration.
+- Authentication and Storage: Firebase Auth + Firestore.
+- Deployment target: Hugging Face Space (Docker).
+## Directory Layout
+```text
 MediSim/
+|- web_app_pro/           # Production web application
+|  |- frontend/           # React + Vite app
+|  |- backend/            # FastAPI service and model logic
+|- web_app/               # Legacy app entrypoint used for HF runtime
+|- data/                  # Trained weights and supporting assets
+|- notebooks/             # Training and experimentation notebooks
+|- reports/               # Project reports and writeups
+|- scripts/               # Deployment and utility scripts
+`- README.md
 ```
+## Local Development
+### Backend
+```bash
+cd web_app_pro/backend
+pip install -r requirements.txt
+python main.py
 ```
+### Frontend
+```bash
+cd web_app_pro/frontend
+npm install
+npm run dev
 ```
+## Hugging Face Deployment Notes
+This repository includes:
+- A Docker-based Space configuration.
+- Space runtime entrypoint through `web_app/app.py`.
+- Environment-driven Firebase and backend configuration.
 ## Team Members
 - Htut Ko Ko (st126010)
 - Imtiaz Ahmad (st126685)
 - Michael R. Lacar (st126161)
 - Aashutosh Raut (st126438)
 ## References
+See project reports under `reports/` for methodology, literature review, and evaluation details.