Spaces:
Sleeping
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β πΊοΈ NAVIGATION GUIDE: Using the RagBot 4-Month Roadmap β β How to use all the planning documents effectively β ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
You now have 5 comprehensive documents to guide your 4-month journey. Each serves a specific purpose. Here's how to use them together.
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
π THE 5 DOCUMENTS ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
1οΈβ£ IMPLEMENTATION_ROADMAP.md (THIS IS YOUR MASTER PLAN) ββ 12-week breakdown with all 34 skills ββ Phase 1-4 detailed task descriptions ββ Success criteria for each task ββ Code location hints ββ Use: Reference for detailed understanding of each skill
2οΈβ£ WEEK1_EXECUTION_PLAN.md (YOUR IMMEDIATE TODO LIST) ββ This week's 6 tasks with hourly estimates ββ Checkboxes for daily progress ββ Useful commands to run ββ Daily standup template ββ Use: Print this out, pin to monitor, check off daily
3οΈβ£ IMPLEMENTATION_STATUS_TRACKER.md (YOUR PROGRESS TRACKER) ββ All 34 skills with status (TODO/IN-PROGRESS/DONE) ββ Hours spent per skill ββ Metrics tracking (coverage, latency, accuracy) ββ Weekly checklist ββ Use: Update weekly, show progress to team
4οΈβ£ SKILL_TO_CODE_MAPPING.md (YOUR DEVELOPER REFERENCE) ββ Where each skill applies in the codebase ββ Which files to modify for each skill ββ How skills fix the 6 critical issues ββ Dependency graph ββ Use: When implementing a skill, find which code to change
5οΈβ£ This Document - NAVIGATION_GUIDE.md (YOU ARE HERE) ββ How to use all 4 other documents ββ Recommended reading order ββ Quick workflows ββ Use: Getting started with the plan
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
π GETTING STARTED (First 30 minutes) ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
- Read this document (5 minutes) β
- Read WEEK1_EXECUTION_PLAN.md (10 minutes) β
- Skim IMPLEMENTATION_ROADMAP.md Phase 1 (10 minutes) β
- Bookmark SKILL_TO_CODE_MAPPING.md for reference β
- Start Task 1.1 from WEEK1_EXECUTION_PLAN.md
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
π RECOMMENDED READING ORDER ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
For the Project Manager/Team Lead:
- This document (5 min)
- IMPLEMENTATION_ROADMAP.md Summary section (5 min)
- IMPLEMENTATION_STATUS_TRACKER.md (5 min)
- Return to this document for weekly workflows
For the Developer/Engineer:
- This document (5 min)
- WEEK1_EXECUTION_PLAN.md (10 min)
- SKILL_TO_CODE_MAPPING.md (10 min)
- Specific skill section in IMPLEMENTATION_ROADMAP.md (5 min)
- Read ~/.agents/skills//SKILL.md (varies)
For the QA/Test Specialist:
- This document (5 min)
- IMPLEMENTATION_ROADMAP.md Phase 2 (10 min)
- SKILL_TO_CODE_MAPPING.md section on #22 Testing Patterns (5 min)
- WEEK1_EXECUTION_PLAN.md Task 1.3 (5 min)
For the DevOps/Infrastructure Engineer:
- This document (5 min)
- IMPLEMENTATION_ROADMAP.md Phase 4 Week 9-10 (10 min)
- SKILL_TO_CODE_MAPPING.md sections on #24, #25, #31 (5 min)
- IMPLEMENTATION_STATUS_TRACKER.md (5 min)
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
π― TYPICAL WORKFLOWS ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
WORKFLOW 1: Starting the Day βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β 1. Open WEEK1_EXECUTION_PLAN.md (or current week equivalent) β β ββ Find today's section β β β β 2. Click checkbox β‘ next to current task to mark as IN-PROGRESS β β β β 3. Open SKILL_TO_CODE_MAPPING.md β β ββ Find the skill for today's task β β ββ See which code files to modify β β β β 4. Open IMPLEMENTATION_ROADMAP.md β β ββ Find the Phase/Week/Task section for details β β β β 5. Read ~/.agents/skills//SKILL.md for guidance β β Example: ~/.agents/skills/owasp-security-check/SKILL.md β β β β 6. Implement, test, commit β β Command: git commit -m "feat: [Skill #X] [Description]" β β β β 7. Checkmark β‘ task as COMPLETE in WEEK1_EXECUTION_PLAN.md β β β β 8. Run: pytest tests/ -v --cov src (should pass/coverage increase) β βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
WORKFLOW 2: Implementing a Specific Skill βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β Problem: "I need to implement Skill #8 (Hybrid Search)" β β β β Solution: β β 1. Find week/phase in IMPLEMENTATION_STATUS_TRACKER.md β β ββ Skill #8 is in Phase 3, Week 6 β β β β 2. Read IMPLEMENTATION_ROADMAP.md Phase 3 Week 6 section β β ββ Task description: "Implement hybrid search" β β ββ Duration: 4-6 hours β β ββ Actions: numbered steps to follow β β β β 3. Check SKILL_TO_CODE_MAPPING.md β β ββ Find "src/retrievers/hybrid_retriever.py (NEW)" β β ββ See which files to modify β β β β 4. Read ~/.agents/skills/hybrid-search-implementation/SKILL.md β β ββ Detailed implementation guidance β β β β 5. Code and test according to steps β β ββ Create src/retrievers/hybrid_retriever.py β β ββ Write tests in tests/test_hybrid_retriever.py β β ββ Run: pytest tests/test_hybrid_retriever.py -v β β β β 6. Update IMPLEMENTATION_STATUS_TRACKER.md β β ββ Mark Skill #8 as "β DONE" β β ββ Update hours actually spent β β ββ Update metric improvements β βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
WORKFLOW 3: End of Week Progress Report βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β Every Friday at 5 PM: β β β β 1. Open IMPLEMENTATION_STATUS_TRACKER.md β β β β 2. For each task this week: β β β‘ Mark as "β DONE" if completed β β β‘ Update hours: planned [ ] β actual [X] β β β‘ Update metrics (coverage %, latency, accuracy change) β β β β 3. Run full test suite: β β $ pytest tests/ -v --cov src --cov-report=html β β ββ Record coverage percentage β β ββ Check for any new failures β β β β 4. Run Performance Benchmark: β β $ python tests/evaluation_metrics.py β β ββ Record response latency β β ββ Record accuracy metrics β β β β 5. Update Metrics section in IMPLEMENTATION_STATUS_TRACKER.md β β Week N: Coverage: [XX]%, Latency: [XX]s, Accuracy: [XX]% β β β β 6. Create team report: β β "Week 1: Completed 6/6 tasks. Coverage 70%β73%, auth implemented. β β No blockers. On track for Phase 1 completion by Feb 21." β β β β 7. Plan next week (Monday morning): β β - Check IMPLEMENTATION_ROADMAP.md for next phase tasks β β - Check dependencies (can we start without Skill X?) β β - Allocate resources β βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
WORKFLOW 4: Fixing a Critical Issue (Example: Issue #1) βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β Problem: biomarker_flags not propagating through workflow β β β β Solution: β β 1. Read SKILL_TO_CODE_MAPPING.md β β ββ Search for "ISSUE #1" at top β β ββ See: "Primary Skills: #2, #3, #16" β β β β 2. Implementation order: β β Step 1: Check IMPLEMENTATION_ROADMAP.md for Skill #2 details β β Step 2: Check IMPLEMENTATION_ROADMAP.md for Skill #3 details β β Step 3: Check IMPLEMENTATION_ROADMAP.md for Skill #16 details β β β β 3. Code changes needed: β β + src/state.py (add missing fields) β β + src/agents/biomarker_analyzer.py (return flags) β β + src/agents/disease_explainer.py (preserve state) β β + api/app/models/response.py (unified schema) β β β β 4. Testing: β β + Write tests/test_state_propagation.py β β + Run: pytest tests/test_state_propagation.py -v β β + Run end-to-end: python scripts/test_chat_demo.py β β β β 5. Verification: β β - Log output shows flags present at each agent β β - Final response includes biomarker_flags β β - All tests passing β β β β 6. Commit: β β git commit -m "fix: [Skill #2, #3, #16] Propagate biomarker_flags" β βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
WORKFLOW 5: Unblocking a Dependency βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β Scenario: Week 3 work is blocked, waiting for Week 2 to finish β β β β Check SKILL_TO_CODE_MAPPING.md β "SKILL DEPENDENCY GRAPH" β β ββ "Phase 2 requires Phase 1: #22, #26, #4, #13, #14, #5" β β ββ If Phase 1 delayed, check which Phase 2 skills are independent β β β β Independent Phase 2 work possible: β β β’ #26 (Design Patterns) can refactor without Phase 1 complete β β β’ #13 (Prompt Engineer) can improve prompts in isolation β β β’ Extend Phase 1 tests while Skill work continues β β β β Execution shift: β β 1. Run: grep -n "#26|#13" IMPLEMENTATION_ROADMAP.md β β 2. Start #26 or #13 work in parallel β β 3. Update IMPLEMENTATION_STATUS_TRACKER.md schedule β β 4. Reorder next week's tasks β βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
π FOLDER STRUCTURE: Where Everything Lives ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
RagBot (root)
β
ββ π PLANNING DOCUMENTS (NEW)
β ββ IMPLEMENTATION_ROADMAP.md β Master 12-week plan
β ββ WEEK1_EXECUTION_PLAN.md β This week's tasks
β ββ IMPLEMENTATION_STATUS_TRACKER.md β Progress tracking
β ββ SKILL_TO_CODE_MAPPING.md β Developer reference
β ββ NAVIGATION_GUIDE.md β This file
β
ββ π οΈ SKILLS REFERENCE
β ββ ~/.agents/skills/ (Global, all installed)
β ββ owasp-security-check/SKILL.md
β ββ api-security-hardening/SKILL.md
β ββ python-testing-patterns/SKILL.md
β ββ workflow-orchestration-patterns/SKILL.md
β ββ api-rate-limiting/SKILL.md
β ββ [30 more skills...]
β
ββ π IMPLEMENTATION PROGRESS
β ββ src/
β β ββ state.py (Fix by Skill #2: Week 1)
β β ββ workflow.py (Fix by Skill #2, #3: Weeks 1-2)
β β ββ exceptions.py (NEW - Skill #21: Week 2)
β β ββ agents/
β β β ββ base_agent.py (NEW - Skill #26: Week 3)
β β β ββ biomarker_analyzer.py (Fix by Skills #4, #13: Week 3)
β β β ββ disease_explainer.py (Fix by Skills #8, #11, #13: Week 6)
β β β ββ confidence_assessor.py (Fix by Skill #4, #13: Week 3)
β β ββ retrievers/
β β β ββ hybrid_retriever.py (NEW - Skill #8: Week 6)
β β ββ chunking_strategy.py (NEW - Skill #9: Week 6)
β β ββ knowledge_graph.py (NEW - Skill #12: Week 7)
β β ββ memory_manager.py (NEW - Skill #28: Week 7)
β β ββ observability.py (NEW - Skill #27: Week 2)
β β ββ llm_config.py (Fix by Skills #15: Week 8)
β β
β ββ api/app/
β β ββ main.py (Fix by Skills #17, #25: Weeks 1, 9)
β β ββ models/
β β β ββ response.py (NEW - Skill #16: Week 1)
β β ββ middleware/
β β β ββ auth.py (NEW - Skill #17: Week 1)
β β β ββ input_validation.py (NEW - Skill #19: Week 2)
β β β ββ rate_limiter.py (NEW - Skill #20: Week 1)
β β ββ webhooks/ (NEW if needed - Skill #33: Week 11)
β β
β ββ tests/
β β ββ test_api_auth.py (NEW - Skill #22: Week 1)
β β ββ test_parametrized_*.py (NEW - Skill #22: Week 3)
β β ββ test_response_schema.py (NEW - Skill #22: Week 1)
β β ββ evaluation_metrics.py (NEW - Skill #14: Week 4)
β β ββ conftest.py (NEW - Skill #22: Week 1)
β β ββ fixtures/ (NEW - Skill #22: Week 1)
β β ββ auth.py
β β ββ biomarkers.py
β β ββ evaluation_patients.py
β β
β ββ .github/
β β ββ workflows/
β β β ββ test.yml (NEW - Skill #24: Week 2)
β β β ββ security.yml (NEW - Skill #24: Week 2)
β β β ββ docker.yml (NEW - Skill #24: Week 2)
β β β ββ deploy.yml (NEW - Skill #31: Week 10)
β β ββ CODEOWNERS (NEW - Skill #30: Week 9)
β β ββ pull_request_template.md (NEW - Skill #30: Week 9)
β β
β ββ docs/
β ββ SECURITY_AUDIT.md (NEW - Skill #18: Week 1)
β ββ REVIEW_GUIDELINES.md (NEW - Skill #23: Week 10)
β ββ API.md (Updated by Skill #29: Week 9)
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
π ITERATIVE IMPROVEMENT LOOP ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
Each week follows this cycle:
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β β β MONDAY WEDNESDAY-FRIDAY FRIDAY PM β β βββββββββββ ββββββββββββββββ ββββββββββββ β β β Plan β βββββββββββββ β Implement β ββββββββββ β Report β β β β Week β β + Test β β Progress β β β βββββββββββ ββββββββββββββββ ββββββββββββ β β β β β β β β’ Review next tasks β’ Run tests daily β’ Update Status β β β’ Check dependencies β’ Commit to git β’ Calculate metrics β β β’ Allocate resources β’ Fix issues as found β’ Plan next week β β β βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
Metrics to track weekly: Coverage: [baseline] β [target] Latency: [baseline:25s] β [target:15-20s] Accuracy: [baseline:65%] β [target:80%] Tests: [count increase] Issues Resolved: [count] Skill Hours: [planned vs actual]
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β FREQUENTLY ASKED QUESTIONS ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
Q: "How do I know what skill to use?" A: SKILL_TO_CODE_MAPPING.md maps skills to problems. If fixing Issue #1, section at top says: "Primary Skills: #2, #3, #16"
Q: "What if I fall behind schedule?" A: 1. Check SKILL_TO_CODE_MAPPING.md "Skill Dependency Graph" 2. See which Phase 2+ skills are independent of delayed Phase 1 work 3. Start those in parallel to maintain progress 4. Reschedule Phase 1 blockers
Q: "How do I measure progress?" A: Update IMPLEMENTATION_STATUS_TRACKER.md weekly: β’ Mark tasks as DONE β’ Run: pytest tests/ --cov src (record coverage %) β’ Run: python test_chat_demo.py 10 times, measure latency β’ Update metrics row for the week
Q: "What if a skill doesn't match my needs?" A: Each skill has detailed Actions in IMPLEMENTATION_ROADMAP.md. These suggest typical usage. Apply only what's relevant to RagBot. The plan is flexible - adapt it to your reality.
Q: "When do I read the actual SKILL.md files?" A: When implementing a skill. Example: β’ Day 1: Read WEEK1_EXECUTION_PLAN.md Task 1.1 β’ Opens IMPLEMENTATION_ROADMAP.md Phase 1 Week 1 β’ Opens SKILL_TO_CODE_MAPPING.md to see code files β’ THEN reads ~/.agents/skills/owasp-security-check/SKILL.md for details β’ Implements according to all guidance combined
Q: "What if tests fail during implementation?" A: Expected! This is normal development. When a test fails:
- Read the error message carefully
- Identify which code is wrong (src/ or test/)
- Fix the code (not the test)
- Re-run: pytest -v
- Commit when green
Q: "How do I handle merge conflicts?" A: Phase 1 work happens in parallel: β’ Task 1.3 (Auth tests) = tests/test_api_auth.py β’ Task 1.4 (State fixing) = src/agents/ β’ Task 1.5 (Schema) = api/app/models/response.py These are different files, minimal conflicts. If conflicts:
- Read git conflict markers (<<<<, ====, >>>>)
- Pick correct version or merge manually
- Run: pytest to verify still works
- git add [file]; git commit
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β SUCCESS CRITERIA FOR PHASE 1 ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
By end of Week 2 (Feb 21):
Code Quality: β 23+ new tests written β Coverage increased 70% β 75% β All tests passing β No linter warnings
Features: β JWT authentication working on /api endpoints β biomarker_flags & safety_alerts propagate through workflow β Unified response schema (API + CLI) β Prompt injection detection active β Rate limiting enforced (10 req/min)
Documentation: β SECURITY_AUDIT.md completed β .github/workflows/test.yml running on PR
Team: β All developers understand Phase 1 changes β Code review standards documented β Deployment checklist for next phase
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
π CONTINUOUS LEARNING ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
As you implement each skill:
Read the SKILL.md documentation thoroughly ββ Take notes on best practices
Understand the "Why" not just the "How" ββ Why hybrid search over semantic only? ββ Why knowledge graphs for medical reasoning?
Apply learnings beyond RagBot ββ These patterns work for any Python/ML/LLM project
Share knowledge with team ββ Each week, 30-min skill share session ββ "This week I learned about [Skill X]..."
Revisit Phase 1-2 skills when you hit Phase 3-4 ββ Patterns reinforce and become second nature ββ You'll notice connections between skills
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
π NEED HELP? ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
Stuck on a task? Follow this decision tree:
Issue: Don't know which skill to use? β Check SKILL_TO_CODE_MAPPING.md β Find the problem area β See "Primary Skills:" section
Issue: Skill documentation unclear? β Read ~/.agents/skills//SKILL.md fully β Check subdirectories for examples/templates β Apply to your specific use case
Issue: Not progressing fast enough? β Consider parallel work (see WORKFLOW 5) β Skill dependency check β Allocate more developer time β Simplify scope temporarily
Issue: Test failures? β Read error message β Check SKILL_TO_CODE_MAPPING.md for code changes needed β Review the specific skill's error handling section β Fix code (not test)
Issue: Code doesn't integrate? β Check Phase 2 tasks for integration points β Verify unified schema matches β Run end-to-end tests β Check observability logs for clues
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
π THE FINISH LINE ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
After 12 weeks of executing this plan:
Your RagBot will be: β Enterprise-grade (OWASP + HIPAA aligned) β Well-tested (90%+ coverage) β Fast (15-20s latency, -30% vs baseline) β Accurate (80%+ disease prediction) β Cost-optimized (-40% API costs) β Properly documented (API docs, code reviews, guides) β Fully deployed (CI/CD, monitoring, alerts) β Knowledge-integrated (graphs, hybrid search, citations) β Maintainable (design patterns, observability, error handling) β Secure (auth, rate limiting, input validation)
Your team will be: β Trained on 34 industry best practices β Capable of maintaining and evolving the system β Confident in deployment and monitoring β Equipped with reusable patterns for future projects
Success looks like: "We deployed a production-ready medical AI system that is secure, fast, accurate, and maintainable. We did it systematically using industry best practices. We can confidently handle increases in patient load and evolve the system for new biomarkers."
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
Let's build something great. Start with WEEK1_EXECUTION_PLAN.md. π
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ