Spaces:

T0X1N
/

Agentic-RagBot

Sleeping

App Files Files Community

Agentic-RagBot / docs /archive /NAVIGATION_GUIDE.md

Nikhil Pravin Pise

docs: update all documentation to reflect current codebase state

aefac4f 19 days ago

preview code

raw

history blame contribute delete

32.9 kB

╔════════════════════════════════════════════════════════════════════════════╗ ║ 🗺️ NAVIGATION GUIDE: Using the RagBot 4-Month Roadmap ║ ║ How to use all the planning documents effectively ║ ╚════════════════════════════════════════════════════════════════════════════╝

You now have 5 comprehensive documents to guide your 4-month journey. Each serves a specific purpose. Here's how to use them together.

════════════════════════════════════════════════════════════════════════════════

📋 THE 5 DOCUMENTS ════════════════════════════════════════════════════════════════════════════════

1️⃣ IMPLEMENTATION_ROADMAP.md (THIS IS YOUR MASTER PLAN) ├─ 12-week breakdown with all 34 skills ├─ Phase 1-4 detailed task descriptions ├─ Success criteria for each task ├─ Code location hints └─ Use: Reference for detailed understanding of each skill

2️⃣ WEEK1_EXECUTION_PLAN.md (YOUR IMMEDIATE TODO LIST) ├─ This week's 6 tasks with hourly estimates ├─ Checkboxes for daily progress ├─ Useful commands to run ├─ Daily standup template └─ Use: Print this out, pin to monitor, check off daily

3️⃣ IMPLEMENTATION_STATUS_TRACKER.md (YOUR PROGRESS TRACKER) ├─ All 34 skills with status (TODO/IN-PROGRESS/DONE) ├─ Hours spent per skill ├─ Metrics tracking (coverage, latency, accuracy) ├─ Weekly checklist └─ Use: Update weekly, show progress to team

4️⃣ SKILL_TO_CODE_MAPPING.md (YOUR DEVELOPER REFERENCE) ├─ Where each skill applies in the codebase ├─ Which files to modify for each skill ├─ How skills fix the 6 critical issues ├─ Dependency graph └─ Use: When implementing a skill, find which code to change

5️⃣ This Document - NAVIGATION_GUIDE.md (YOU ARE HERE) ├─ How to use all 4 other documents ├─ Recommended reading order ├─ Quick workflows └─ Use: Getting started with the plan

🚀 GETTING STARTED (First 30 minutes) ════════════════════════════════════════════════════════════════════════════════

Read this document (5 minutes) ↓
Read WEEK1_EXECUTION_PLAN.md (10 minutes) ↓
Skim IMPLEMENTATION_ROADMAP.md Phase 1 (10 minutes) ↓
Bookmark SKILL_TO_CODE_MAPPING.md for reference ↓
Start Task 1.1 from WEEK1_EXECUTION_PLAN.md

📚 RECOMMENDED READING ORDER ════════════════════════════════════════════════════════════════════════════════

For the Project Manager/Team Lead:

This document (5 min)
IMPLEMENTATION_ROADMAP.md Summary section (5 min)
IMPLEMENTATION_STATUS_TRACKER.md (5 min)
Return to this document for weekly workflows

For the Developer/Engineer:

This document (5 min)
WEEK1_EXECUTION_PLAN.md (10 min)
SKILL_TO_CODE_MAPPING.md (10 min)
Specific skill section in IMPLEMENTATION_ROADMAP.md (5 min)
Read ~/.agents/skills//SKILL.md (varies)

For the QA/Test Specialist:

This document (5 min)
IMPLEMENTATION_ROADMAP.md Phase 2 (10 min)
SKILL_TO_CODE_MAPPING.md section on #22 Testing Patterns (5 min)
WEEK1_EXECUTION_PLAN.md Task 1.3 (5 min)

For the DevOps/Infrastructure Engineer:

This document (5 min)
IMPLEMENTATION_ROADMAP.md Phase 4 Week 9-10 (10 min)
SKILL_TO_CODE_MAPPING.md sections on #24, #25, #31 (5 min)
IMPLEMENTATION_STATUS_TRACKER.md (5 min)

🎯 TYPICAL WORKFLOWS ════════════════════════════════════════════════════════════════════════════════

WORKFLOW 1: Starting the Day ┌─────────────────────────────────────────────────────────────────────────┐ │ 1. Open WEEK1_EXECUTION_PLAN.md (or current week equivalent) │ │ └─ Find today's section │ │ │ │ 2. Click checkbox □ next to current task to mark as IN-PROGRESS │ │ │ │ 3. Open SKILL_TO_CODE_MAPPING.md │ │ └─ Find the skill for today's task │ │ └─ See which code files to modify │ │ │ │ 4. Open IMPLEMENTATION_ROADMAP.md │ │ └─ Find the Phase/Week/Task section for details │ │ │ │ 5. Read ~/.agents/skills//SKILL.md for guidance │ │ Example: ~/.agents/skills/owasp-security-check/SKILL.md │ │ │ │ 6. Implement, test, commit │ │ Command: git commit -m "feat: [Skill #X] [Description]" │ │ │ │ 7. Checkmark □ task as COMPLETE in WEEK1_EXECUTION_PLAN.md │ │ │ │ 8. Run: pytest tests/ -v --cov src (should pass/coverage increase) │ └─────────────────────────────────────────────────────────────────────────┘

WORKFLOW 2: Implementing a Specific Skill ┌─────────────────────────────────────────────────────────────────────────┐ │ Problem: "I need to implement Skill #8 (Hybrid Search)" │ │ │ │ Solution: │ │ 1. Find week/phase in IMPLEMENTATION_STATUS_TRACKER.md │ │ └─ Skill #8 is in Phase 3, Week 6 │ │ │ │ 2. Read IMPLEMENTATION_ROADMAP.md Phase 3 Week 6 section │ │ └─ Task description: "Implement hybrid search" │ │ └─ Duration: 4-6 hours │ │ └─ Actions: numbered steps to follow │ │ │ │ 3. Check SKILL_TO_CODE_MAPPING.md │ │ └─ Find "src/retrievers/hybrid_retriever.py (NEW)" │ │ └─ See which files to modify │ │ │ │ 4. Read ~/.agents/skills/hybrid-search-implementation/SKILL.md │ │ └─ Detailed implementation guidance │ │ │ │ 5. Code and test according to steps │ │ └─ Create src/retrievers/hybrid_retriever.py │ │ └─ Write tests in tests/test_hybrid_retriever.py │ │ └─ Run: pytest tests/test_hybrid_retriever.py -v │ │ │ │ 6. Update IMPLEMENTATION_STATUS_TRACKER.md │ │ └─ Mark Skill #8 as "✅ DONE" │ │ └─ Update hours actually spent │ │ └─ Update metric improvements │ └─────────────────────────────────────────────────────────────────────────┘

WORKFLOW 3: End of Week Progress Report ┌─────────────────────────────────────────────────────────────────────────┐ │ Every Friday at 5 PM: │ │ │ │ 1. Open IMPLEMENTATION_STATUS_TRACKER.md │ │ │ │ 2. For each task this week: │ │ □ Mark as "✅ DONE" if completed │ │ □ Update hours: planned [ ] → actual [X] │ │ □ Update metrics (coverage %, latency, accuracy change) │ │ │ │ 3. Run full test suite: │ │ $ pytest tests/ -v --cov src --cov-report=html │ │ └─ Record coverage percentage │ │ └─ Check for any new failures │ │ │ │ 4. Run Performance Benchmark: │ │ $ python tests/evaluation_metrics.py │ │ └─ Record response latency │ │ └─ Record accuracy metrics │ │ │ │ 5. Update Metrics section in IMPLEMENTATION_STATUS_TRACKER.md │ │ Week N: Coverage: [XX]%, Latency: [XX]s, Accuracy: [XX]% │ │ │ │ 6. Create team report: │ │ "Week 1: Completed 6/6 tasks. Coverage 70%→73%, auth implemented. │ │ No blockers. On track for Phase 1 completion by Feb 21." │ │ │ │ 7. Plan next week (Monday morning): │ │ - Check IMPLEMENTATION_ROADMAP.md for next phase tasks │ │ - Check dependencies (can we start without Skill X?) │ │ - Allocate resources │ └─────────────────────────────────────────────────────────────────────────┘

WORKFLOW 4: Fixing a Critical Issue (Example: Issue #1) ┌─────────────────────────────────────────────────────────────────────────┐ │ Problem: biomarker_flags not propagating through workflow │ │ │ │ Solution: │ │ 1. Read SKILL_TO_CODE_MAPPING.md │ │ └─ Search for "ISSUE #1" at top │ │ └─ See: "Primary Skills: #2, #3, #16" │ │ │ │ 2. Implementation order: │ │ Step 1: Check IMPLEMENTATION_ROADMAP.md for Skill #2 details │ │ Step 2: Check IMPLEMENTATION_ROADMAP.md for Skill #3 details │ │ Step 3: Check IMPLEMENTATION_ROADMAP.md for Skill #16 details │ │ │ │ 3. Code changes needed: │ │ + src/state.py (add missing fields) │ │ + src/agents/biomarker_analyzer.py (return flags) │ │ + src/agents/disease_explainer.py (preserve state) │ │ + api/app/models/response.py (unified schema) │ │ │ │ 4. Testing: │ │ + Write tests/test_state_propagation.py │ │ + Run: pytest tests/test_state_propagation.py -v │ │ + Run end-to-end: python scripts/test_chat_demo.py │ │ │ │ 5. Verification: │ │ - Log output shows flags present at each agent │ │ - Final response includes biomarker_flags │ │ - All tests passing │ │ │ │ 6. Commit: │ │ git commit -m "fix: [Skill #2, #3, #16] Propagate biomarker_flags" │ └─────────────────────────────────────────────────────────────────────────┘

WORKFLOW 5: Unblocking a Dependency ┌─────────────────────────────────────────────────────────────────────────┐ │ Scenario: Week 3 work is blocked, waiting for Week 2 to finish │ │ │ │ Check SKILL_TO_CODE_MAPPING.md → "SKILL DEPENDENCY GRAPH" │ │ └─ "Phase 2 requires Phase 1: #22, #26, #4, #13, #14, #5" │ │ └─ If Phase 1 delayed, check which Phase 2 skills are independent │ │ │ │ Independent Phase 2 work possible: │ │ • #26 (Design Patterns) can refactor without Phase 1 complete │ │ • #13 (Prompt Engineer) can improve prompts in isolation │ │ • Extend Phase 1 tests while Skill work continues │ │ │ │ Execution shift: │ │ 1. Run: grep -n "#26|#13" IMPLEMENTATION_ROADMAP.md │ │ 2. Start #26 or #13 work in parallel │ │ 3. Update IMPLEMENTATION_STATUS_TRACKER.md schedule │ │ 4. Reorder next week's tasks │ └─────────────────────────────────────────────────────────────────────────┘

📂 FOLDER STRUCTURE: Where Everything Lives ════════════════════════════════════════════════════════════════════════════════

RagBot (root) │ ├─ 📋 PLANNING DOCUMENTS (NEW) │ ├─ IMPLEMENTATION_ROADMAP.md ← Master 12-week plan │ ├─ WEEK1_EXECUTION_PLAN.md ← This week's tasks
│ ├─ IMPLEMENTATION_STATUS_TRACKER.md ← Progress tracking │ ├─ SKILL_TO_CODE_MAPPING.md ← Developer reference │ └─ NAVIGATION_GUIDE.md ← This file │ ├─ 🛠️ SKILLS REFERENCE │ └─ ~/.agents/skills/ (Global, all installed) │ ├─ owasp-security-check/SKILL.md │ ├─ api-security-hardening/SKILL.md │ ├─ python-testing-patterns/SKILL.md │ ├─ workflow-orchestration-patterns/SKILL.md │ ├─ api-rate-limiting/SKILL.md │ └─ [30 more skills...] │ ├─ 📝 IMPLEMENTATION PROGRESS │ ├─ src/ │ │ ├─ state.py (Fix by Skill #2: Week 1) │ │ ├─ workflow.py (Fix by Skill #2, #3: Weeks 1-2) │ │ ├─ exceptions.py (NEW - Skill #21: Week 2) │ │ ├─ agents/ │ │ │ ├─ base_agent.py (NEW - Skill #26: Week 3) │ │ │ ├─ biomarker_analyzer.py (Fix by Skills #4, #13: Week 3) │ │ │ ├─ disease_explainer.py (Fix by Skills #8, #11, #13: Week 6) │ │ │ └─ confidence_assessor.py (Fix by Skill #4, #13: Week 3) │ │ ├─ retrievers/ │ │ │ └─ hybrid_retriever.py (NEW - Skill #8: Week 6) │ │ ├─ chunking_strategy.py (NEW - Skill #9: Week 6) │ │ ├─ knowledge_graph.py (NEW - Skill #12: Week 7) │ │ ├─ memory_manager.py (NEW - Skill #28: Week 7) │ │ ├─ observability.py (NEW - Skill #27: Week 2) │ │ └─ llm_config.py (Fix by Skills #15: Week 8) │ │ │ ├─ api/app/ │ │ ├─ main.py (Fix by Skills #17, #25: Weeks 1, 9) │ │ ├─ models/ │ │ │ └─ response.py (NEW - Skill #16: Week 1) │ │ ├─ middleware/ │ │ │ ├─ auth.py (NEW - Skill #17: Week 1) │ │ │ ├─ input_validation.py (NEW - Skill #19: Week 2) │ │ │ └─ rate_limiter.py (NEW - Skill #20: Week 1) │ │ └─ webhooks/ (NEW if needed - Skill #33: Week 11) │ │ │ ├─ tests/ │ │ ├─ test_api_auth.py (NEW - Skill #22: Week 1) │ │ ├─ test_parametrized_*.py (NEW - Skill #22: Week 3) │ │ ├─ test_response_schema.py (NEW - Skill #22: Week 1) │ │ ├─ evaluation_metrics.py (NEW - Skill #14: Week 4) │ │ ├─ conftest.py (NEW - Skill #22: Week 1) │ │ └─ fixtures/ (NEW - Skill #22: Week 1) │ │ ├─ auth.py │ │ ├─ biomarkers.py │ │ └─ evaluation_patients.py │ │ │ ├─ .github/ │ │ ├─ workflows/ │ │ │ ├─ test.yml (NEW - Skill #24: Week 2) │ │ │ ├─ security.yml (NEW - Skill #24: Week 2) │ │ │ ├─ docker.yml (NEW - Skill #24: Week 2) │ │ │ └─ deploy.yml (NEW - Skill #31: Week 10) │ │ ├─ CODEOWNERS (NEW - Skill #30: Week 9) │ │ └─ pull_request_template.md (NEW - Skill #30: Week 9) │ │ │ └─ docs/ │ ├─ SECURITY_AUDIT.md (NEW - Skill #18: Week 1) │ ├─ REVIEW_GUIDELINES.md (NEW - Skill #23: Week 10) │ └─ API.md (Updated by Skill #29: Week 9)

🔄 ITERATIVE IMPROVEMENT LOOP ════════════════════════════════════════════════════════════════════════════════

Each week follows this cycle:

┌─────────────────────────────────────────────────────────────────────────┐ │ │ │ MONDAY WEDNESDAY-FRIDAY FRIDAY PM │ │ ┌─────────┐ ┌──────────────┐ ┌──────────┐ │ │ │ Plan │ ────────────→ │ Implement │ ─────────→ │ Report │ │ │ │ Week │ │ + Test │ │ Progress │ │ │ └─────────┘ └──────────────┘ └──────────┘ │ │ ↓ ↓ ↓ │ │ • Review next tasks • Run tests daily • Update Status │ │ • Check dependencies • Commit to git • Calculate metrics │ │ • Allocate resources • Fix issues as found • Plan next week │ │ │ └─────────────────────────────────────────────────────────────────────────┘

Metrics to track weekly: Coverage: [baseline] → [target] Latency: [baseline:25s] → [target:15-20s] Accuracy: [baseline:65%] → [target:80%] Tests: [count increase] Issues Resolved: [count] Skill Hours: [planned vs actual]

❓ FREQUENTLY ASKED QUESTIONS ════════════════════════════════════════════════════════════════════════════════

Q: "How do I know what skill to use?" A: SKILL_TO_CODE_MAPPING.md maps skills to problems. If fixing Issue #1, section at top says: "Primary Skills: #2, #3, #16"

Q: "What if I fall behind schedule?" A: 1. Check SKILL_TO_CODE_MAPPING.md "Skill Dependency Graph" 2. See which Phase 2+ skills are independent of delayed Phase 1 work 3. Start those in parallel to maintain progress 4. Reschedule Phase 1 blockers

Q: "How do I measure progress?" A: Update IMPLEMENTATION_STATUS_TRACKER.md weekly: • Mark tasks as DONE • Run: pytest tests/ --cov src (record coverage %) • Run: python test_chat_demo.py 10 times, measure latency • Update metrics row for the week

Q: "What if a skill doesn't match my needs?" A: Each skill has detailed Actions in IMPLEMENTATION_ROADMAP.md. These suggest typical usage. Apply only what's relevant to RagBot. The plan is flexible - adapt it to your reality.

Q: "When do I read the actual SKILL.md files?" A: When implementing a skill. Example: • Day 1: Read WEEK1_EXECUTION_PLAN.md Task 1.1 • Opens IMPLEMENTATION_ROADMAP.md Phase 1 Week 1 • Opens SKILL_TO_CODE_MAPPING.md to see code files • THEN reads ~/.agents/skills/owasp-security-check/SKILL.md for details • Implements according to all guidance combined

Q: "What if tests fail during implementation?" A: Expected! This is normal development. When a test fails:

Read the error message carefully
Identify which code is wrong (src/ or test/)
Fix the code (not the test)
Re-run: pytest -v
Commit when green

Q: "How do I handle merge conflicts?" A: Phase 1 work happens in parallel: • Task 1.3 (Auth tests) = tests/test_api_auth.py • Task 1.4 (State fixing) = src/agents/ • Task 1.5 (Schema) = api/app/models/response.py These are different files, minimal conflicts. If conflicts:

Read git conflict markers (<<<<, ====, >>>>)
Pick correct version or merge manually
Run: pytest to verify still works
git add [file]; git commit

✅ SUCCESS CRITERIA FOR PHASE 1 ════════════════════════════════════════════════════════════════════════════════

By end of Week 2 (Feb 21):

Code Quality: ✅ 23+ new tests written ✅ Coverage increased 70% → 75% ✅ All tests passing ✅ No linter warnings

Features: ✅ JWT authentication working on /api endpoints ✅ biomarker_flags & safety_alerts propagate through workflow ✅ Unified response schema (API + CLI) ✅ Prompt injection detection active ✅ Rate limiting enforced (10 req/min)

Documentation: ✅ SECURITY_AUDIT.md completed ✅ .github/workflows/test.yml running on PR

Team: ✅ All developers understand Phase 1 changes ✅ Code review standards documented ✅ Deployment checklist for next phase

🎓 CONTINUOUS LEARNING ════════════════════════════════════════════════════════════════════════════════

As you implement each skill:

Read the SKILL.md documentation thoroughly └─ Take notes on best practices
Understand the "Why" not just the "How" └─ Why hybrid search over semantic only? └─ Why knowledge graphs for medical reasoning?
Apply learnings beyond RagBot └─ These patterns work for any Python/ML/LLM project
Share knowledge with team └─ Each week, 30-min skill share session └─ "This week I learned about [Skill X]..."
Revisit Phase 1-2 skills when you hit Phase 3-4 └─ Patterns reinforce and become second nature └─ You'll notice connections between skills

📞 NEED HELP? ════════════════════════════════════════════════════════════════════════════════

Stuck on a task? Follow this decision tree:

Issue: Don't know which skill to use? → Check SKILL_TO_CODE_MAPPING.md → Find the problem area → See "Primary Skills:" section

Issue: Skill documentation unclear? → Read ~/.agents/skills//SKILL.md fully → Check subdirectories for examples/templates → Apply to your specific use case

Issue: Not progressing fast enough? → Consider parallel work (see WORKFLOW 5) → Skill dependency check → Allocate more developer time → Simplify scope temporarily

Issue: Test failures? → Read error message → Check SKILL_TO_CODE_MAPPING.md for code changes needed → Review the specific skill's error handling section → Fix code (not test)

Issue: Code doesn't integrate? → Check Phase 2 tasks for integration points → Verify unified schema matches → Run end-to-end tests → Check observability logs for clues

🏁 THE FINISH LINE ════════════════════════════════════════════════════════════════════════════════

After 12 weeks of executing this plan:

Your RagBot will be: ✅ Enterprise-grade (OWASP + HIPAA aligned) ✅ Well-tested (90%+ coverage) ✅ Fast (15-20s latency, -30% vs baseline) ✅ Accurate (80%+ disease prediction) ✅ Cost-optimized (-40% API costs) ✅ Properly documented (API docs, code reviews, guides) ✅ Fully deployed (CI/CD, monitoring, alerts) ✅ Knowledge-integrated (graphs, hybrid search, citations) ✅ Maintainable (design patterns, observability, error handling) ✅ Secure (auth, rate limiting, input validation)

Your team will be: ✅ Trained on 34 industry best practices ✅ Capable of maintaining and evolving the system ✅ Confident in deployment and monitoring ✅ Equipped with reusable patterns for future projects

Success looks like: "We deployed a production-ready medical AI system that is secure, fast, accurate, and maintainable. We did it systematically using industry best practices. We can confidently handle increases in patient load and evolve the system for new biomarkers."

Let's build something great. Start with WEEK1_EXECUTION_PLAN.md. 🚀