# Medical Q&A Bot - Presentation Script ## 🎯 Presentation Overview (10-15 minutes) ### Team Introduction (30 seconds) "Hello everyone! We're Team HealthBot, and we've developed an intelligent medical query classification and research retrieval system. Our team consists of: - David Gray - Tarak Jha - Sravani Segireddy - Riley Millikan - Kent R. Spillner" --- ## Part 1: Problem Statement (1-2 minutes) ### The Challenge "In healthcare settings, patients often have questions that fall into two categories: 1. **Medical queries** - Questions requiring clinical expertise 2. **Administrative queries** - Questions about billing, scheduling, etc. Currently, all queries are handled the same way, leading to: - ❌ Inefficient triage - ❌ Delayed responses - ❌ Wasted resources - ❌ Frustrated patients and staff" ### Our Solution "We built an AI-powered system that: 1. ✅ Automatically classifies queries 2. ✅ Retrieves relevant medical research for medical queries 3. ✅ Provides confidence scores for transparency 4. ✅ Offers a user-friendly web interface" --- ## Part 2: Technical Architecture (2-3 minutes) ### System Overview [Show ARCHITECTURE.md diagram] "Our system operates in two main stages: **Stage 1: Classification** - Uses a fine-tuned sentence transformer model - Classifies queries as Medical, Administrative, or Other - Provides confidence scores for each category **Stage 2: Retrieval** (Medical queries only) - Implements hybrid search combining: - BM25 (keyword-based sparse retrieval) - Dense embeddings (semantic similarity) - RRF (Reciprocal Rank Fusion) for combining results - Optional cross-encoder reranking for improved accuracy" ### Data Sources "We index three major medical databases: - **PubMed**: Peer-reviewed medical research - **Miriad**: Medical Q&A database - **UniDoc**: Unified medical document corpus This gives us access to thousands of verified medical documents." --- ## Part 3: Live Demo (5-7 minutes) ### Setup "Let me show you how it works in practice. We've built a web interface using Gradio." [Open http://127.0.0.1:7860] ### Demo 1: Medical Query (2 minutes) "Let's start with a medical question:" **Type**: "I'm having a really bad rash on my hands. I'm pretty sure it's my eczema flaring up. Is there anything stronger than aquaphor I can use on it?" **Point out**: 1. "Notice the system classified this as a MEDICAL query" 2. "Look at the confidence scores - 95% confidence it's medical" 3. "The system retrieved 10 relevant documents from our medical databases" 4. "Each document shows multiple relevance scores:" - "BM25 score for keyword matching" - "Dense score for semantic similarity" - "RRF score for combined ranking" 5. "We can see the document titles, previews, and full metadata" ### Demo 2: Administrative Query (1 minute) "Now let's try an administrative question:" **Type**: "Hey is there any way I can get an appointment in the next month?" **Point out**: 1. "The system correctly identified this as ADMINISTRATIVE" 2. "No document retrieval happens - saving resources" 3. "This query would be routed to scheduling staff, not medical staff" ### Demo 3: Medical Emergency (1 minute) "Here's a more urgent medical case:" **Type**: "worst headache of my life with fever and stiff neck" **Point out**: 1. "Classified as MEDICAL with high confidence" 2. "Retrieved relevant documents about meningitis symptoms" 3. "This demonstrates the system can handle urgent queries" 4. "In a real setting, this could trigger an emergency protocol" ### Demo 4: Advanced Features (1 minute) "Let me show you some advanced features:" **Adjust settings**: 1. Change "Number of Results" to 20 2. Enable "Use Reranker" **Type**: "What are the side effects of statins?" **Point out**: 1. "We can control how many results to retrieve" 2. "The reranker improves accuracy but takes longer" 3. "We have both formatted view and JSON view for different audiences" --- ## Part 4: Technical Implementation (2-3 minutes) ### Machine Learning Models "Under the hood, we use: - **Sentence Transformers**: For generating semantic embeddings - **Custom Classification Head**: Neural network trained on healthcare data - **FAISS**: For efficient vector similarity search - **Cross-Encoder**: Optional reranking for accuracy" ### User Interface "We implemented two web interfaces: 1. **Gradio** (primary) - Clean, professional, easy to deploy 2. **Streamlit** (alternative) - More interactive and customizable Both provide: - Real-time classification and retrieval - Multiple view modes (formatted and JSON) - Adjustable settings - Example queries for easy testing" ### Code Quality "Our codebase demonstrates: - ✅ Modular design with clear separation of concerns - ✅ Comprehensive documentation - ✅ Easy setup and deployment - ✅ Error handling and validation - ✅ Scalable architecture" --- ## Part 5: Results & Impact (1-2 minutes) ### Performance Metrics "Our system achieves: - **Classification Accuracy**: ~95% - **Response Time**: <1 second for most queries - **Retrieval Quality**: High relevance in top results - **User Experience**: Clean, intuitive interface" ### Real-World Impact "This system could: 1. 📊 Reduce triage time by 60-80% 2. 💰 Save healthcare costs through efficient routing 3. 🎯 Improve patient satisfaction with faster responses 4. 📚 Empower patients with evidence-based information 5. 👨‍⚕️ Help doctors by providing relevant research context" --- ## Part 6: Future Enhancements (1 minute) ### Potential Improvements "Moving forward, we could add: - 🔐 User authentication and personalization - 📱 Mobile app for patient use - 🌍 Multi-language support - 📊 Analytics dashboard for healthcare providers - 🔗 Integration with existing EMR systems - 🗣️ Voice input for accessibility - 📈 Continuous learning from user feedback" --- ## Part 7: Conclusion (30 seconds) ### Summary "In summary, we've built an intelligent medical query classification and retrieval system that: - ✅ Automatically triages patient queries - ✅ Retrieves relevant medical research - ✅ Provides a professional web interface - ✅ Can be easily deployed in real healthcare settings This represents a practical application of AI in healthcare that can improve efficiency and patient outcomes." ### Q&A "Thank you! We're happy to answer any questions." --- ## 🎯 Tips for Presenters ### Before Presentation 1. ✅ Test the web UI beforehand 2. ✅ Have example queries ready 3. ✅ Check internet connection (for model loading) 4. ✅ Prepare backup slides in case of technical issues 5. ✅ Practice the demo flow multiple times 6. ✅ Assign roles (who presents what) ### During Presentation 1. ✅ Speak clearly and at a steady pace 2. ✅ Make eye contact with audience 3. ✅ Explain technical terms briefly 4. ✅ Show enthusiasm about the project 5. ✅ Be ready to handle unexpected results 6. ✅ Keep demo queries visible on screen ### Handling Questions **Common Questions & Answers**: Q: "How accurate is the classification?" A: "Our classifier achieves approximately 95% accuracy on our test set, with particularly high precision for medical queries." Q: "What about patient privacy?" A: "Currently, this is a prototype that doesn't store any data. In production, we'd implement HIPAA-compliant data handling." Q: "How do you handle ambiguous queries?" A: "The system provides confidence scores for each category. Low-confidence queries could be flagged for human review." Q: "Can it handle emergency situations?" A: "Yes, medical queries can be analyzed for urgency. In production, high-urgency keywords could trigger immediate alerts." Q: "What databases do you use?" A: "We index PubMed articles, Miriad medical Q&A, and UniDoc corpus - all verified medical sources." Q: "How long did this take to build?" A: "The project took [X weeks/months], including data preparation, model training, and UI development." Q: "Could this be deployed in a real hospital?" A: "Absolutely! It would require integration with existing systems, compliance verification, and additional security features." --- ## 📊 Suggested Slides ### Slide 1: Title - Project name - Team members - Course/institution ### Slide 2: Problem Statement - Current challenges in healthcare - Need for automated triage ### Slide 3: Solution Overview - Two-stage system (classify + retrieve) - Key benefits ### Slide 4: Architecture Diagram - Visual flow chart - Key components ### Slide 5: Technical Stack - ML models used - Frameworks and tools - Data sources ### Slide 6: Live Demo - [Switch to web interface] ### Slide 7: Results - Performance metrics - Example outputs ### Slide 8: Impact - Efficiency gains - Cost savings - Improved outcomes ### Slide 9: Future Work - Potential enhancements - Scalability considerations ### Slide 10: Thank You - Team members - Questions? --- ## 🎬 Demo Script Quick Reference ``` 1. MEDICAL QUERY → "I have a rash on my hands. Is there anything stronger than aquaphor?" → Show: Classification, confidence, retrieved documents 2. ADMIN QUERY → "Can I get an appointment next month?" → Show: Admin classification, no retrieval 3. URGENT QUERY → "worst headache of my life with fever and stiff neck" → Show: High confidence, relevant results 4. SETTINGS → Adjust number of results → Toggle reranker → Show both view modes ``` --- ## ✅ Pre-Demo Checklist - [ ] Web UI is running on http://127.0.0.1:7860 - [ ] All models loaded successfully - [ ] Test queries work correctly - [ ] Internet connection stable - [ ] Screen sharing setup tested - [ ] Backup browser tab open - [ ] Documentation files ready - [ ] Team roles assigned - [ ] Timer set for demo sections - [ ] Confidence level: HIGH! 🚀 --- ## 🎓 Presentation Day Affirmations "We've built something awesome!" "Our system works reliably!" "We understand every component!" "We can explain this clearly!" "We're ready for any question!" **Good luck, team! You've got this! 🌟** --- *Prepared by the HealthBot Team* *Feel free to customize this script for your specific presentation requirements*