Spaces:

twissamodi
/

MedQuery-Assist

Sleeping

App Files Files Community

MedQuery-Assist / README.md

twissamodi

switch to docker SDK with Python 3.12 to fix audioop issue

1a49a6a about 1 month ago

preview code

raw

history blame contribute delete

4.51 kB

	---
	title: MediQuery-Assist
	emoji: 🏥
	colorFrom: blue
	colorTo: green
	sdk: docker
	app_file: app.py
	pinned: false
	---

	# MediQuery-Assist - Medical Assistant Chatbot

	A conversational AI medical assistant that supports text, voice, and document-based interactions. Built with LangGraph, RAG, and Gradio.

	## Live at: https://huggingface.co/spaces/twissamodi/MedQuery-Assist

	## Features

	- Multi-modal Input: Text, voice (Whisper), and PDF document upload
	- Document Classification: Automatic classification of medical documents (lab reports, prescriptions, etc.)
	- RAG System: Store and retrieve patient medical records from PDF documents with page-level accuracy
	- Web Search: Access latest medical information via Google Serper API
	- Conversational Memory: Maintains context across conversation using LangGraph checkpointing
	- ReAct Framework: Step-by-step reasoning with tool usage
	- Auto-transcription: Voice messages automatically transcribed and sent

	## Architecture

	```
	├── rag_setup.py # Document processing and vector store
	├── document_classifier.py # Page-based document classification
	├── tools.py # Medical history search and web search tools
	├── graph_setup.py # LangGraph workflow configuration
	├── prompts.py # System prompts
	├── chat_handler.py # Chat logic and session management
	├── audio_handler.py # Audio transcription
	├── app.py # Gradio interface
	└── data/
	├── patient_record_db/ # Chroma vector store
	└── long_term_memory.db # SQLite conversation checkpoints
	```

	## Installation

	```bash
	pip install -r requirements.txt
	```

	## Environment Setup

	Create a `.env` file:

	```env
	HUGGINGFACEHUB_API_TOKEN=your_hf_token
	SERPER_API_KEY=your_serper_key
	```

	## Usage

	```bash
	python app.py
	```

	Access the interface at `http://127.0.0.1:7860`

	## How It Works

	### 1. Document Upload
	- Upload PDF medical records
	- Documents are chunked, embedded, and stored in Chroma vector database
	- Duplicate detection via file hashing

	### 2. Query Processing
	- User queries are processed through LangGraph workflow
	- LLM decides which tools to use (medical history search or web search)
	- Multi-step reasoning follows ReAct pattern

	### 3. Voice Input
	- Record audio via microphone
	- Automatic transcription using Whisper-small
	- Auto-send to chat after transcription

	### 4. Response Generation
	- DeepSeek-V3 model generates responses
	- Can make multiple tool calls per query
	- Maintains conversation context via SQLite checkpointing

	## Components

	### RAG_Setup
	- Embeddings: `sentence-transformers/all-mpnet-base-v2`
	- Vector Store: Chroma with persistence
	- Chunk size: 1000 characters
	- Similarity search returns top 5 results

	### GraphSetup
	- LLM: DeepSeek-V3 via HuggingFace Inference
	- Max tokens: 1024
	- Recursion limit: 25
	- Memory: SQLite checkpointing

	### Tools
	- `check_medical_history`: Searches patient records
	- `web_search`: Google Serper API for medical information

	### AudioHandler
	- Model: `openai/whisper-small`
	- Auto-send after transcription
	- Clears audio input after processing

	## Session Management

	- Each application instance generates a unique session ID
	- All users in the same instance share conversation history
	- Restart application to create new session

	## File Structure

	```
	data/
	├── patient_record_db/ # Vector embeddings
	│ └── chroma.sqlite3
	└── long_term_memory.db # Conversation checkpoints
	```

	## Limitations

	- Single global session (all users share history)
	- SQLite connection with `check_same_thread=False` (thread safety concern)
	- No user authentication
	- File uploads not validated beyond extension
	- No cleanup of uploaded temporary files

	## Example Queries

	Simple Query:
	```
	What medications am I taking?
	```

	Complex Query:
	```
	Can I take ibuprofen with my current medications?
	```

	Upload Flow:
	1. Upload PDF medical record
	2. System confirms upload success
	3. Ask questions about the uploaded document

	## Dependencies

	- langgraph
	- langchain-huggingface
	- langchain-community
	- langchain-chroma
	- gradio
	- transformers
	- sentence-transformers
	- pypdf
	- google-serper-api
	- python-dotenv

	## Notes

	- Requires active internet for HuggingFace Inference API
	- Requires Serper API key for web search
	- First run downloads embedding model (~400MB)
	- Whisper model downloads on first audio transcription (~500MB)