File size: 12,282 Bytes
6dc9d46
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
# CLI Chatbot User Guide
## Interactive Chat Interface for MediGuard AI RAG-Helper

**Date:** November 23, 2025  
**Status:** โœ… FULLY IMPLEMENTED AND OPERATIONAL

---

## ๐ŸŽฏ Quick Start

### Run the Chatbot
```powershell
python scripts/chat.py
```

### First Time Setup
Make sure you have:
1. โœ… Ollama running: `ollama serve`
2. โœ… Models pulled:
   ```powershell
   ollama pull llama3.1:8b-instruct
   ollama pull qwen2:7b
   ```
3. โœ… Vector store created: `python src/pdf_processor.py` (if not already done)

---

## ๐Ÿ’ฌ How to Use

### Example Conversations

#### **Example 1: Basic Biomarker Input**
```
You: My glucose is 185 and HbA1c is 8.2

๐Ÿ” Analyzing your input...
โœ… Found 2 biomarkers: Glucose, HbA1c
๐Ÿง  Predicting likely condition...
โœ… Predicted: Diabetes (85% confidence)
๐Ÿ“š Consulting medical knowledge base...
   (This may take 15-25 seconds...)

๐Ÿค– RAG-BOT:
======================================================================
Hi there! ๐Ÿ‘‹
Based on your biomarkers, I analyzed your results.

๐Ÿ”ด **Primary Finding:** Diabetes
   Confidence: 85%

โš ๏ธ **IMPORTANT SAFETY ALERTS:**
   โ€ข Glucose: CRITICAL: Glucose is 185.0 mg/dL, above critical threshold
     โ†’ SEEK IMMEDIATE MEDICAL ATTENTION

[... full analysis ...]
```

#### **Example 2: Multiple Biomarkers**
```
You: hemoglobin 10.5, RBC 3.8, MCV 78, platelets 180000

โœ… Found 4 biomarkers: Hemoglobin, RBC, MCV, Platelets
๐Ÿง  Predicting likely condition...
โœ… Predicted: Anemia (72% confidence)
```

#### **Example 3: With Patient Context**
```
You: I'm a 52 year old male, glucose 185, cholesterol 235, HDL 38

โœ… Found 3 biomarkers: Glucose, Cholesterol, HDL
โœ… Patient context: age=52, gender=male
```

---

## ๐Ÿ“‹ Available Commands

### `help` - Show Biomarker List
Displays all 24 supported biomarkers organized by category.

```
You: help

๐Ÿ“‹ Supported Biomarkers (24 total):

๐Ÿฉธ Blood Cells:
  โ€ข Hemoglobin, Platelets, WBC, RBC, Hematocrit, MCV, MCH, MCHC

๐Ÿ”ฌ Metabolic:
  โ€ข Glucose, Cholesterol, Triglycerides, HbA1c, LDL, HDL, Insulin, BMI

โค๏ธ Cardiovascular:
  โ€ข Heart Rate, Systolic BP, Diastolic BP, Troponin, C-reactive Protein

๐Ÿฅ Organ Function:
  โ€ข ALT, AST, Creatinine
```

### `example` - Run Demo Case
Runs a complete example of a Type 2 Diabetes patient with 11 biomarkers.

```
You: example

๐Ÿ“‹ Running Example: Type 2 Diabetes Patient
   52-year-old male with elevated glucose and HbA1c

๐Ÿ”„ Running analysis...
[... full RAG workflow execution ...]
```

### `quit` - Exit Chatbot
Exits the interactive session gracefully.

```
You: quit

๐Ÿ‘‹ Thank you for using MediGuard AI. Stay healthy!
```

---

## ๐Ÿฉบ Supported Biomarkers (24 Total)

### Blood Cells (8)
| Biomarker | Aliases | Example Input |
|-----------|---------|---------------|
| **Hemoglobin** | HGB, HB | "hemoglobin 13.5" |
| **Platelets** | PLT | "platelets 220000" |
| **WBC** | White Blood Cells | "WBC 7500" |
| **RBC** | Red Blood Cells | "RBC 4.8" |
| **Hematocrit** | HCT | "hematocrit 42" |
| **MCV** | Mean Corpuscular Volume | "MCV 85" |
| **MCH** | Mean Corpuscular Hemoglobin | "MCH 29" |
| **MCHC** | - | "MCHC 34" |

### Metabolic (8)
| Biomarker | Aliases | Example Input |
|-----------|---------|---------------|
| **Glucose** | Blood Sugar | "glucose 140" |
| **Cholesterol** | Total Cholesterol | "cholesterol 220" |
| **Triglycerides** | Trig | "triglycerides 180" |
| **HbA1c** | A1C, Hemoglobin A1c | "HbA1c 7.5" |
| **LDL** | LDL Cholesterol | "LDL 160" |
| **HDL** | HDL Cholesterol | "HDL 45" |
| **Insulin** | - | "insulin 18" |
| **BMI** | Body Mass Index | "BMI 28.5" |

### Cardiovascular (5)
| Biomarker | Aliases | Example Input |
|-----------|---------|---------------|
| **Heart Rate** | HR, Pulse | "heart rate 85" |
| **Systolic BP** | Systolic, SBP | "systolic 145" |
| **Diastolic BP** | Diastolic, DBP | "diastolic 92" |
| **Troponin** | - | "troponin 0.05" |
| **C-reactive Protein** | CRP | "CRP 8.5" |

### Organ Function (3)
| Biomarker | Aliases | Example Input |
|-----------|---------|---------------|
| **ALT** | Alanine Aminotransferase | "ALT 45" |
| **AST** | Aspartate Aminotransferase | "AST 38" |
| **Creatinine** | - | "creatinine 1.1" |

---

## ๐ŸŽจ Input Formats Supported

The chatbot accepts natural language input in various formats:

### Format 1: Conversational
```
My glucose is 140 and my HbA1c is 7.5
```

### Format 2: List Style
```
Hemoglobin 11.2, platelets 180000, cholesterol 235
```

### Format 3: Structured
```
glucose=185, HbA1c=8.2, HDL=38, triglycerides=210
```

### Format 4: With Context
```
I'm 52 years old male, glucose 185, cholesterol 235
```

### Format 5: Mixed
```
Blood test results: glucose is 140, my HbA1c came back at 7.5%, and cholesterol is 220
```

---

## ๐Ÿ” How It Works

### 1. Biomarker Extraction (LLM)
- Uses `llama3.1:8b-instruct` to extract biomarkers from natural language
- Normalizes biomarker names (e.g., "A1C" โ†’ "HbA1c")
- Extracts patient context (age, gender, BMI)

### 2. Disease Prediction (LLM)
- Uses `qwen2:7b` to predict disease based on biomarker patterns
- Returns: disease name, confidence score, probability distribution
- Fallback: Rule-based prediction if LLM fails

### 3. RAG Workflow Execution
- Runs complete 6-agent workflow:
  1. Biomarker Analyzer
  2. Disease Explainer (RAG)
  3. Biomarker-Disease Linker (RAG)
  4. Clinical Guidelines (RAG)
  5. Confidence Assessor
  6. Response Synthesizer

### 4. Conversational Formatting
- Converts technical JSON โ†’ friendly text
- Emoji indicators
- Safety alerts highlighted
- Clear structure with sections

---

## ๐Ÿ’พ Saving Reports

After each analysis, you'll be asked:

```
๐Ÿ’พ Save detailed report to file? (y/n):
```

If you choose **`y`**:
- Report saved to: `data/chat_reports/report_Diabetes_YYYYMMDD_HHMMSS.json`
- Contains: Input biomarkers + Complete analysis JSON
- Can be reviewed later or shared with healthcare providers

---

## โš ๏ธ Important Notes

### Minimum Requirements
- **At least 2 biomarkers** needed for analysis
- More biomarkers = more accurate predictions

### System Requirements
- **RAM:** 2GB minimum (2.5-3GB recommended)
- **Ollama:** Must be running (`ollama serve`)
- **Models:** llama3.1:8b-instruct, qwen2:7b

### Limitations
1. **Not a Medical Device** - For educational/informational purposes only
2. **No Real ML Model** - Uses LLM-based prediction (not trained ML model)
3. **Standard Units Only** - Enter values in standard medical units
4. **Manual Entry** - Must type biomarkers (no PDF upload yet)

---

## ๐Ÿ› Troubleshooting

### Issue 1: "Failed to initialize system"
**Cause:** Ollama not running or models not available

**Solution:**
```powershell
# Start Ollama
ollama serve

# Pull required models
ollama pull llama3.1:8b-instruct
ollama pull qwen2:7b
```

### Issue 2: "I couldn't find any biomarker values"
**Cause:** LLM couldn't extract biomarkers from input

**Solution:**
- Use clearer format: "glucose 140, HbA1c 7.5"
- Type `help` to see biomarker names
- Check spelling

### Issue 3: "Analysis failed: Ollama call failed"
**Cause:** Insufficient system memory or Ollama timeout

**Solution:**
- Close other applications
- Restart Ollama
- Try again with fewer biomarkers

### Issue 4: Unicode/Emoji Display Issues
**Solution:** Already handled! Script automatically sets UTF-8 encoding.

---

## ๐Ÿ“Š Example Output Structure

```
Hi there! ๐Ÿ‘‹
Based on your biomarkers, I analyzed your results.

๐Ÿ”ด **Primary Finding:** Diabetes
   Confidence: 87%

โš ๏ธ **IMPORTANT SAFETY ALERTS:**
   โ€ข Glucose: CRITICAL: Glucose is 185.0 mg/dL
     โ†’ SEEK IMMEDIATE MEDICAL ATTENTION

๐Ÿ” **Why this prediction?**
   โ€ข **Glucose** (185.0 mg/dL): Significantly elevated...
   โ€ข **HbA1c** (8.2%): Poor glycemic control...

โœ… **What You Should Do:**
   1. Consult healthcare provider immediately
   2. Bring lab results to appointment

๐ŸŒฑ **Lifestyle Recommendations:**
   1. Follow balanced diet
   2. Regular physical activity
   3. Monitor blood sugar

โ„น๏ธ **Important:** This is AI-assisted analysis, NOT medical advice.
   Please consult a healthcare professional.
```

---

## ๐Ÿš€ Performance

| Metric | Typical Value |
|--------|---------------|
| **Biomarker Extraction** | 3-5 seconds |
| **Disease Prediction** | 2-3 seconds |
| **RAG Workflow** | 15-25 seconds |
| **Total Time** | ~20-30 seconds |

---

## ๐Ÿ”ฎ Future Features (Planned)

### Phase 2 Enhancements
- [ ] **Multi-turn conversations** - Answer follow-up questions
- [ ] **PDF lab report upload** - Extract from scanned reports
- [ ] **Unit conversion** - Support mg/dL โ†” mmol/L
- [ ] **Trend tracking** - Compare with previous results
- [ ] **Voice input** - Speak biomarkers instead of typing

### Phase 3 Enhancements
- [ ] **Web interface** - Browser-based chat
- [ ] **Real ML model integration** - Professional disease prediction
- [ ] **Multi-language support** - Spanish, Chinese, etc.

---

## ๐Ÿ“š Technical Details

### Architecture
```
User Input (Natural Language)
    โ†“
extract_biomarkers() [llama3.1:8b]
    โ†“
predict_disease_llm() [qwen2:7b]
    โ†“
create_guild().run() [6 agents, RAG, LangGraph]
    โ†“
format_conversational()
    โ†“
Conversational Output
```

### Files
- **Main Script:** `scripts/chat.py` (~620 lines)
- **Config:** `config/biomarker_references.json`
- **Reports:** `data/chat_reports/*.json`

### Dependencies
- `langchain_community` - LLM interfaces
- `langchain_core` - Prompts
- Existing `src/` modules - Core RAG system

---

## โœ… Validation

### Tested Scenarios
โœ… Diabetes patient (glucose, HbA1c elevated)  
โœ… Anemia patient (low hemoglobin, MCV)  
โœ… Heart disease indicators (cholesterol, troponin)  
โœ… Minimal input (2 biomarkers)  
โœ… Invalid input handling  
โœ… Help command  
โœ… Example command  
โœ… Report saving  
โœ… Graceful exit  

---

## ๐ŸŽ“ Best Practices

### For Accurate Results
1. **Provide at least 3-5 biomarkers** for reliable analysis
2. **Include key indicators** for the condition you suspect
3. **Mention patient context** (age, gender) when relevant
4. **Use standard medical units** (mg/dL for glucose, % for HbA1c)

### Safety
1. **Always consult a doctor** - This is NOT medical advice
2. **Don't delay emergency care** - Critical alerts require immediate attention
3. **Share reports with healthcare providers** - Save and bring JSON reports

---

## ๐Ÿ“ž Support

### Questions?
- Review documentation: `docs/CLI_CHATBOT_IMPLEMENTATION_PLAN.md`
- Check system verification: `docs/SYSTEM_VERIFICATION.md`
- See project overview: `README.md`

### Issues?
- Check Ollama is running: `ollama list`
- Verify models are available
- Review error messages carefully

---

## ๐Ÿ“ Example Session

```
PS> python scripts/chat.py

======================================================================
๐Ÿค– MediGuard AI RAG-Helper - Interactive Chat
======================================================================

Welcome! I can help you understand your blood test results.

You can:
  1. Describe your biomarkers (e.g., 'My glucose is 140, HbA1c is 7.5')
  2. Type 'example' to see a sample diabetes case
  3. Type 'help' for biomarker list
  4. Type 'quit' to exit

======================================================================

๐Ÿ”ง Initializing medical knowledge system...
โœ… System ready!

You: my glucose is 185 and HbA1c is 8.2

๐Ÿ” Analyzing your input...
โœ… Found 2 biomarkers: Glucose, HbA1c
๐Ÿง  Predicting likely condition...
โœ… Predicted: Diabetes (85% confidence)
๐Ÿ“š Consulting medical knowledge base...
   (This may take 15-25 seconds...)

๐Ÿค– RAG-BOT:
======================================================================
[... full conversational response ...]
======================================================================

๐Ÿ’พ Save detailed report to file? (y/n): y
โœ… Report saved to: data/chat_reports/report_Diabetes_20251123_071530.json

You can:
  โ€ข Enter more biomarkers for a new analysis
  โ€ข Type 'quit' to exit

You: quit

๐Ÿ‘‹ Thank you for using MediGuard AI. Stay healthy!
```

---

**Status:** โœ… FULLY OPERATIONAL  
**Version:** 1.0  
**Last Updated:** November 23, 2025

*MediGuard AI RAG-Helper - Making medical insights accessible through conversation* ๐Ÿฅ๐Ÿ’ฌ