Spaces:
Running
Running
File size: 29,739 Bytes
aefac4f | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 423 424 425 426 427 428 429 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444 445 446 447 448 449 450 451 452 453 454 455 456 457 458 459 460 461 462 463 464 465 466 467 468 469 470 471 472 473 474 475 476 477 478 479 480 481 482 483 484 485 486 487 488 489 490 491 492 493 494 495 496 497 498 499 500 501 502 503 504 505 506 507 508 509 510 511 512 513 514 515 516 517 518 519 520 521 522 523 524 525 526 527 528 529 530 531 532 533 534 535 536 537 538 539 540 541 542 543 544 545 546 547 548 549 550 551 552 553 554 555 556 557 558 559 560 561 562 563 564 565 566 567 568 569 570 571 572 573 574 575 576 577 578 579 580 581 582 583 584 585 586 587 588 589 590 591 592 593 594 595 596 597 598 599 600 601 602 603 604 605 606 607 608 609 610 611 612 613 614 615 616 617 618 619 620 621 622 623 624 625 626 627 628 629 630 631 632 633 634 635 636 637 638 639 640 641 642 643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661 662 663 664 665 666 667 668 669 670 671 672 673 674 675 676 677 678 679 680 681 682 683 684 685 686 687 688 689 690 691 | ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β π SKILL-TO-CODE MAPPING: Where Each Skill Applies in RagBot β
β Reference guide showing skill application locations β
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
This document maps each of the 34 skills to specific code files and critical
issues they resolve. Use this for quick lookup: "Where do I apply Skill #X?"
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
CRITICAL ISSUES MAPPING TO SKILLS
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
ISSUE #1: biomarker_flags & safety_alerts not propagating through workflow
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
Problem Location: src/state.py, src/agents/*.py, src/workflow.py
Affected Code:
ββ GuildState (missing fields)
ββ BiomarkerAnalyzerAgent.invoke() (only returns biomarkers)
ββ ResponseSynthesizerAgent.invoke() (fields missing in input)
ββ Workflow edges (state not fully passed)
Primary Skills:
β #2 Workflow Orchestration Patterns β Fix state passing
β #3 Multi-Agent Orchestration β Ensure deterministic flow
β #16 Structured Output β Enforce complete schema
Secondary Skills:
β’ #22 Testing Patterns β Write tests for state flow
β’ #27 Observability β Log state changes
Action: Read src/state.py β identify missing fields β update all agents to
return complete state β test end-to-end
ISSUE #2: Schema mismatch between workflow output & API formatter
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
Problem Location: src/workflow.py, api/app/models/ (missing or inconsistent)
Affected Code:
ββ ResponseSynthesizerAgent output structure (varies)
ββ api/app/services/ragbot.py format_response() (expects different keys)
ββ CLI scripts/chat.py (different field names)
ββ Tests referencing old schema
Primary Skills:
β #16 AI Wrapper/Structured Output β Create unified Pydantic model
β #22 Testing Patterns β Write schema validation tests
Secondary Skills:
β’ #27 Observability β Log schema mismatches (debugging)
Action: Create api/app/models/response.py with BaseAnalysisResponse β
update all agents to return it β validate in API
ISSUE #3: Prediction confidence forced to 0.5 (dangerous for medical)
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
Problem Location: src/agents/confidence_assessor.py, api/app/routes/analyze.py
Affected Code:
ββ ConfidenceAssessorAgent.invoke() (ignores actual assessment)
ββ Default response in analyze endpoint (hardcoded 0.5)
ββ CLI logic (no failure path for low confidence)
Primary Skills:
β #13 Senior Prompt Engineer β Better reasoning in assessor
β #14 LLM Evaluation β Benchmark accuracy
Secondary Skills:
β’ #4 Agentic Development β Decision logic improvements
β’ #22 Testing Patterns β Test confidence boundaries
β’ #27 Observability β Track confidence distributions
Action: Update confidence_assessor.py to use actual evidence β test with
multiple biomarker scenarios β Add high/medium/low confidence paths
ISSUE #4: Biomarker naming inconsistency (API vs CLI)
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
Problem Location: config/biomarker_references.json, src/agents/*, api/*
Affected Code:
ββ config/biomarker_references.json (canonical list)
ββ BiomarkerAnalyzerAgent (validation against reference)
ββ CLI scripts/chat.py (different naming)
ββ API endpoints (naming transformation)
Primary Skills:
β #9 Chunking Strategy β Include standard names in embedding
β #16 Structured Output β Enforce standard field names
Secondary Skills:
β’ #10 Embedding Pipeline β Index with canonical names
β’ #22 Testing Patterns β Test name transformation
β’ #27 Observability β Log name mismatches
Action: Create biomarker_normalizer() β apply in all code paths β add
mapping tests
ISSUE #5: JSON parsing breaks on malformed LLM output
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
Problem Location: api/app/services/extraction.py, src/agents/extraction code
Affected Code:
ββ LLM.predict() returns text
ββ json.loads() has no error handling
ββ Invalid JSON crashes endpoint
ββ No fallback strategy
Primary Skills:
β #5 Tool/Function Calling β Use function calling instead
β #21 Python Error Handling β Graceful degradation
Secondary Skills:
β’ #16 Structured Output β Pydantic validation
β’ #19 LLM Security β Prevent injection in JSON
β’ #27 Observability β Log parsing failures
β’ #14 LLM Evaluation β Track failure rate
Action: Replace json.loads() with Pydantic validator β implement retry logic
β add function calling as fallback
ISSUE #6: No citation enforcement in RAG outputs
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
Problem Location: src/agents/disease_explainer.py, response synthesis
Affected Code:
ββ retriever.retrieve() returns docs but citations dropped
ββ DiseaseExplainerAgent doesn't track sources
ββ ResponseSynthesizerAgent loses citation info
ββ API response has no source attribution
Primary Skills:
β #11 RAG Implementation β Enforce citations in loop
β #8 Hybrid Search β Better relevance = better cites
β #12 Knowledge Graph β Link to authoritative sources
Secondary Skills:
β’ #1 LangChain Architecture β Tool for citation tracking
β’ #7 RAG Agent Builder β Full RAG best practices
β’ #14 LLM Evaluation β Test for hallucinations
β’ #27 Observability β Track citation frequency
Action: Modify disease_explainer.py to preserve doc metadata β add citation
validation β return sources in API response
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
SKILL-BY-SKILL APPLICATION GUIDE
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
#1 LangChain Architecture
Phase: 3, Week 7
Apply To: src/agents/, src/services/
Key Files:
ββ src/agents/base_agent.py (NEW) - Create BaseAgent with LangChain patterns
ββ src/agents/*/invoke() - Add callbacks, chains, tools
ββ src/services/*.py - RunnableWithMessageHistory for conversation
Integration: Advanced chain composition, callbacks for metrics
Outcome: More sophisticated agent orchestration
Effort: 3-4 hours
#2 Workflow Orchestration Patterns
Phase: 1, Week 1 / Phase 4, Week 12 (final review)
Apply To: src/workflow.py, src/state.py
Key Files:
ββ src/state.py - REFACTOR GuildState with all fields
ββ src/workflow.py - REFACTOR state passing between agents
ββ src/agents/biomarker_analyzer.py - Return complete state
ββ src/agents/disease_explainer.py - Preserve incoming state
Integration: Fix Issue #1 (state propagation)
Outcome: biomarker_flags & safety_alerts flow through entire workflow
Effort: 4-6 hours (Week 1) + 2 hours (Week 12 refine)
#3 Multi-Agent Orchestration
Phase: 1, Week 2
Apply To: src/workflow.py
Key Files:
ββ src/workflow.py - Ensure deterministic agent order
ββ Parallel execution order documentation
Integration: Ensure agents execute in correct order with proper state passing
Outcome: Deterministic workflow execution
Effort: 3-4 hours
#4 Agentic Development
Phase: 2, Week 3
Apply To: src/agents/biomarker_analyzer.py, confidence_assessor.py
Key Files:
ββ BiomarkerAnalyzerAgent.invoke() - Add confidence thresholds
ββ ConfidenceAssessorAgent - Better decision logic
ββ Add reasoning trace to responses
Integration: Better medical decisions, alternatives for low confidence
Outcome: More reliable biomarker analysis
Effort: 3-4 hours
#5 Tool/Function Calling Patterns
Phase: 2, Week 4
Apply To: api/app/services/extraction.py, src/agents/extraction.py
Key Files:
ββ api/app/services/extraction.py - Define extraction tools/functions
ββ src/agents/ - Use function returns instead of JSON parsing
Integration: Fix Issue #5 (JSON parsing fragility)
Outcome: Structured LLM outputs guaranteed valid
Effort: 3-4 hours
#6 LLM Application Dev with LangChain
Phase: 4, Week 11
Apply To: src/agents/ (production patterns)
Key Files:
ββ src/agents/base_agent.py - Implement lifecycle (setup, execute, cleanup)
ββ All agents - Add retry logic, graceful degradation
ββ Agent composition patterns - Chain agents
Integration: Production-ready agent code
Outcome: Robust, maintainable agents with error recovery
Effort: 4-5 hours
#7 RAG Agent Builder
Phase: 4, Week 12
Apply To: src/agents/ (full review)
Key Files:
ββ src/agents/disease_explainer.py - RAG pattern review
ββ Ensure all responses cite sources
ββ Verify accuracy benchmarks
Integration: Full RAG agent validation before production
Outcome: Production-ready RAG agents
Effort: 4-5 hours
#8 Hybrid Search Implementation
Phase: 3, Week 6
Apply To: src/retrievers/ (NEW)
Key Files:
ββ src/retrievers/hybrid_retriever.py (NEW) - Combine BM25 + FAISS
ββ src/agents/disease_explainer.py - Use hybrid retriever
Integration: Better document retrieval (semantic + keyword)
Outcome: +15% recall on rare disease queries
Effort: 4-6 hours
#9 Chunking Strategy
Phase: 3, Week 6
Apply To: src/chunking_strategy.py (NEW), src/pdf_processor.py
Key Files:
ββ src/chunking_strategy.py (NEW) - Split by medical sections
ββ scripts/setup_embeddings.py - Use new chunking
ββ Re-chunk and re-embed medical_knowledge.faiss
Integration: Fix Issue #4 (naming), improve context window usage
Outcome: Better semantic chunks, improved retrieval quality
Effort: 4-5 hours
#10 Embedding Pipeline Builder
Phase: 3, Week 6
Apply To: src/llm_config.py, scripts/setup_embeddings.py
Key Files:
ββ src/llm_config.py - Consider medical embedding models
ββ scripts/setup_embeddings.py - Use new embeddings
ββ Benchmark embedding quality
Integration: Better semantic search for medical terminology
Outcome: Improved document relevance ranking
Effort: 3-4 hours
#11 RAG Implementation
Phase: 3, Week 6
Apply To: src/agents/disease_explainer.py
Key Files:
ββ src/agents/disease_explainer.py - Track and enforce citations
ββ src/models/response.py - Add sources field
ββ api/app/routes/analyze.py - Return sources
Integration: Fix Issue #6 (no citations), enforce medical accuracy
Outcome: All claims backed by sources
Effort: 3-4 hours
#12 Knowledge Graph Builder
Phase: 3, Week 7
Apply To: src/knowledge_graph.py (NEW)
Key Files:
ββ src/knowledge_graph.py (NEW) - Disease β Biomarker β Treatment graph
ββ Extract entities from medical PDFs
ββ src/agents/biomarker_analyzer.py - Use knowledge graph
ββ Create graph.html visualization
Integration: Better disease prediction via relationships
Outcome: Knowledge graph with 100+ nodes, 500+ edges
Effort: 6-8 hours
#13 Senior Prompt Engineer
Phase: 2, Week 3
Apply To: src/agents/ (all agent prompts)
Key Files:
ββ src/agents/biomarker_analyzer.py - Prompt: few-shot extraction
ββ src/agents/disease_explainer.py - Prompt: chain-of-thought reasoning
ββ src/agents/confidence_assessor.py - Prompt: decision logic
ββ src/agents/clinical_guidelines.py - Prompt: evidence-based
Integration: Fix Issue #3 (confidence), improve medical reasoning
Outcome: +15% accuracy improvement
Effort: 5-6 hours
#14 LLM Evaluation
Phase: 2, Week 4
Apply To: tests/evaluation_metrics.py (NEW)
Key Files:
ββ tests/evaluation_metrics.py (NEW) - Benchmarking suite
ββ tests/fixtures/evaluation_patients.py - Test scenarios
ββ Benchmark Groq vs Gemini performance
ββ Track before/after improvements
Integration: Measure all improvements quantitatively
Outcome: Clear metrics showing progress
Effort: 4-5 hours
#15 Cost-Aware LLM Pipeline
Phase: 3, Week 8
Apply To: src/llm_config.py
Key Files:
ββ src/llm_config.py - Model routing by complexity
ββ Implement caching (hash β result)
ββ Cost tracking and reporting
ββ Target: -40% cost reduction
Integration: Optimize API costs without sacrificing accuracy
Outcome: Lower operational costs
Effort: 4-5 hours
#16 AI Wrapper/Structured Output
Phase: 1, Week 1
Apply To: api/app/models/ (NEW and REFACTORED)
Key Files:
ββ api/app/models/response.py (NEW) - Create unified BaseAnalysisResponse
ββ api/app/services/ragbot.py - Use unified schema
ββ All agents - Match unified output
ββ API responses - Validate with Pydantic
Integration: Fix Issues #1, #2, #4, #5 (schema consistency)
Outcome: Single canonical response format
Effort: 3-5 hours
#17 API Security Hardening
Phase: 1, Week 1
Apply To: api/app/middleware/, api/main.py
Key Files:
ββ api/app/middleware/auth.py (NEW) - JWT auth
ββ api/main.py - Add security middleware chain
ββ CORS, headers, rate limiting
Integration: Secure REST API endpoints
Outcome: API hardened against common attacks
Effort: 4-6 hours
#18 OWASP Security Check
Phase: 1, Week 1
Apply To: docs/ (audit report)
Key Files:
ββ docs/SECURITY_AUDIT.md (NEW) - Security findings
ββ Scan api/ and src/ for vulnerabilities
ββ Create tickets for each issue
Integration: Establish security baseline
Outcome: All vulnerabilities documented and prioritized
Effort: 2-3 hours
#19 LLM Security
Phase: 1, Week 2
Apply To: api/app/middleware/input_validation.py (NEW)
Key Files:
ββ api/app/middleware/input_validation.py (NEW) - Input sanitization
ββ Detect prompt injection attempts
ββ Validate biomarker inputs
ββ Escape special characters
Integration: Fix Issue #5 (JSON safety), prevent prompt injection
Outcome: Inputs validated before LLM processing
Effort: 3-4 hours
#20 API Rate Limiting
Phase: 1, Week 1
Apply To: api/app/middleware/rate_limiter.py (NEW)
Key Files:
ββ api/app/middleware/rate_limiter.py (NEW) - Token bucket limiter
ββ api/main.py - Add to middleware chain
ββ Tiered limits (free/pro based on API key)
Integration: Protect API from abuse
Outcome: Rate limiting in place
Effort: 2-3 hours
#21 Python Error Handling
Phase: 2, Week 2
Apply To: src/exceptions.py (NEW), src/agents/
Key Files:
ββ src/exceptions.py (NEW) - Custom exception hierarchy
ββ RagBotException, BiomarkerValidationError, LLMTimeoutError, etc.
ββ All agents - Replace generic try-except
ββ API - Proper error responses
Integration: Graceful error handling throughout system
Outcome: No uncaught exceptions, useful error messages
Effort: 3-4 hours
#22 Python Testing Patterns
Phase: 1, Week 1 + Phase 2, Week 3 (primary), Week 4
Apply To: tests/ (throughout project)
Key Files:
ββ tests/conftest.py - Shared fixtures
ββ tests/fixtures/ - auth, biomarkers, patients
ββ tests/test_api_auth.py - Auth tests (Week 1)
ββ tests/test_parametrized_*.py - 50+ parametrized tests (Week 3)
ββ tests/test_response_schema.py - Schema validation (Week 1)
ββ 80-90% code coverage
Integration: Comprehensive test suite ensures reliability
Outcome: 125+ tests, 90%+ coverage
Effort: 10-13 hours total
#23 Code Review Excellence
Phase: 4, Week 10
Apply To: docs/REVIEW_GUIDELINES.md (NEW), all PRs
Key Files:
ββ docs/REVIEW_GUIDELINES.md (NEW) - Medical code review standards
ββ Apply to all Phase 1-3 pull requests
ββ Self-review checklist
Integration: Maintain code quality
Outcome: Clear review guidelines
Effort: 2-3 hours
#24 GitHub Actions Templates
Phase: 1, Week 2
Apply To: .github/workflows/ (NEW)
Key Files:
ββ .github/workflows/test.yml - Run tests on PR
ββ .github/workflows/security.yml - Security checks
ββ .github/workflows/docker.yml - Build Docker images
Integration: Automated CI/CD pipeline
Outcome: Tests run automatically
Effort: 2-3 hours
#25 FastAPI Templates
Phase: 4, Week 9
Apply To: api/app/main.py, api/app/dependencies.py
Key Files:
ββ api/app/main.py - REFACTOR with best practices
ββ Async patterns, dependency injection
ββ Connection pooling, caching headers
ββ Health check endpoints
Integration: Production-grade FastAPI configuration
Outcome: Optimized API performance
Effort: 3-4 hours
#26 Python Design Patterns
Phase: 2, Week 3
Apply To: src/agents/base_agent.py (NEW), src/agents/
Key Files:
ββ src/agents/base_agent.py (NEW) - Extract common pattern
ββ Factory pattern for agent creation
ββ Composition over inheritance
ββ Refactor BiomarkerAnalyzerAgent, etc.
Integration: Cleaner, more maintainable code
Outcome: Reduced coupling, better abstractions
Effort: 4-5 hours
#27 Python Observability
Phase: 1, Week 2 (logging) / Phase 4, Week 10 (metrics) / Phase 2, Week 5
Apply To: src/, api/app/
Key Files:
ββ src/observability.py (NEW) - Logging infrastructure (Week 2)
ββ All agents - Add structured JSON logging
ββ src/monitoring/ (NEW) - Prometheus metrics (Week 10)
ββ Track latency, accuracy, costs
Integration: Visibility into system behavior
Outcome: JSON logs, metrics at /metrics
Effort: 12-15 hours total
#28 Memory Management
Phase: 3, Week 7
Apply To: src/memory_manager.py (NEW)
Key Files:
ββ src/memory_manager.py (NEW) - Sliding window memory
ββ Context compression for conversation history
ββ Token usage optimization
Integration: Handle long conversations without exceeding limits
Outcome: 20-30% token savings
Effort: 3-4 hours
#29 API Docs Generator
Phase: 4, Week 9
Apply To: api/app/routes/ (documentation)
Key Files:
ββ api/app/routes/*.py - Enhance docstrings
ββ Add examples to endpoints
ββ Auto-generates /docs (Swagger UI), /redoc
Integration: API discoverable by developers
Outcome: Interactive API documentation
Effort: 2-3 hours
#30 GitHub PR Review Workflow
Phase: 4, Week 9
Apply To: .github/ (NEW)
Key Files:
ββ .github/CODEOWNERS - Code ownership rules
ββ .github/pull_request_template.md - PR checklist
ββ Branch protection rules
Integration: Establish code review standards
Outcome: Consistent PR quality
Effort: 2-3 hours
#31 CI-CD Best Practices
Phase: 4, Week 10
Apply To: .github/workflows/deploy.yml (NEW)
Key Files:
ββ .github/workflows/deploy.yml (NEW) - Deployment pipeline
ββ Build β Test β Staging β Canary β Production
ββ Environment management (.env files)
Integration: Automated, safe deployments
Outcome: Confident production deployments
Effort: 3-4 hours
#32 Frontend Accessibility (OPTIONAL)
Phase: 4, Week 10
Apply To: examples/web_interface/ (if building web UI)
Key Files:
ββ examples/web_interface/ - WCAG 2.1 AA compliance
Integration: Accessible web interface (if needed)
Outcome: Screen-reader friendly, keyboard navigable
Effort: 2-3 hours (skip if CLI only)
#33 Webhook Receiver Hardener (OPTIONAL)
Phase: 4, Week 11
Apply To: api/app/webhooks/ (NEW, if integrations needed)
Key Files:
ββ api/app/webhooks/ (NEW) - Webhook handlers
ββ Signature verification, replay protection
Integration: Secure webhook handling for EHR integrations
Outcome: Protected webhook endpoints
Effort: 2-3 hours (skip if no webhooks)
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
QUICK LOOKUP: BY FILE
api/app/main.py
ββ #17 API Security Hardening (JWT middleware)
ββ #20 Rate Limiting (rate limiter middleware)
ββ #25 FastAPI Templates (async patterns)
ββ #24 GitHub Actions (workflow) (CI/CD reference)
ββ #29 API Docs Generator (docstrings)
api/app/models/response.py (NEW)
ββ #16 AI Wrapper/Structured Output (unified schema)
ββ #22 Testing Patterns (Pydantic validation)
api/app/middleware/ (NEW)
ββ auth.py #17 API Security Hardening
ββ input_validation.py #19 LLM Security
ββ rate_limiter.py #20 API Rate Limiting
src/state.py
ββ #2 Workflow Orchestration (fix state fields)
ββ #16 Structured Output (enforce schema)
ββ #22 Testing Patterns (state tests)
src/workflow.py
ββ #2 Workflow Orchestration (state passing)
ββ #3 Multi-Agent Orchestration (agent order)
ββ #27 Observability (logging)
src/agents/base_agent.py (NEW)
ββ #26 Python Design Patterns (factory, composition)
ββ #6 LLM App Dev LangChain (lifecycle)
ββ #21 Error Handling (graceful degradation)
ββ #27 Observability (logging)
src/agents/biomarker_analyzer.py
ββ #4 Agentic Development (confidence thresholds)
ββ #13 Senior Prompt Engineer (prompt optimization)
ββ #2 Workflow Orchestration (return complete state)
ββ #12 Knowledge Graph (use relationships)
src/agents/disease_explainer.py
ββ #8 Hybrid Search (retriever)
ββ #11 RAG Implementation (enforcement)
ββ #13 Senior Prompt Engineer (chain-of-thought)
ββ #1 LangChain Architecture (advanced patterns)
ββ #7 RAG Agent Builder (RAG best practices)
src/agents/confidence_assessor.py
ββ #4 Agentic Development (decision logic)
ββ #13 Senior Prompt Engineer (better reasoning)
ββ #14 LLM Evaluation (benchmark)
ββ #22 Testing Patterns (confidence tests)
src/agents/clinical_guidelines.py
ββ #13 Senior Prompt Engineer (evidence-based)
ββ #1 LangChain Architecture (advanced retrieval)
src/exceptions.py (NEW)
ββ #21 Python Error Handling (exception hierarchy)
ββ #27 Observability (error logging)
src/retrievers/hybrid_retriever.py (NEW)
ββ #8 Hybrid Search Implementation (BM25 + FAISS)
ββ #9 Chunking Strategy (better chunks)
ββ #10 Embedding Pipeline (semantic search)
ββ #27 Observability (retrieval metrics)
src/chunking_strategy.py (NEW)
ββ #9 Chunking Strategy (medical section splitting)
ββ #10 Embedding Pipeline (prepare for embedding)
ββ #4 Agentic Development (standardization)
src/knowledge_graph.py (NEW)
ββ #12 Knowledge Graph Builder (extract relationships)
ββ #13 Senior Prompt Engineer (entity extraction prompt)
ββ #1 LangChain Architecture (graph traversal)
src/memory_manager.py (NEW)
ββ #28 Memory Management (sliding window, compression)
ββ #15 Cost-Aware Pipeline (token optimization)
src/llm_config.py
ββ #15 Cost-Aware LLM Pipeline (model routing, caching)
ββ #10 Embedding Pipeline (embedding model config)
ββ #27 Observability (cost tracking)
src/observability.py (NEW)
ββ #27 Python Observability (logging, metrics)
ββ #21 Error Handling (error tracking)
ββ #14 LLM Evaluation (metric collection)
src/monitoring/ (NEW)
ββ #27 Python Observability (metrics, dashboards)
tests/conftest.py
ββ #22 Python Testing Patterns (shared fixtures)
tests/fixtures/
ββ auth.py #22 Testing Patterns
ββ biomarkers.py #22 Testing Patterns
ββ evaluation_patients.py #14 LLM Evaluation
tests/test_api_auth.py (NEW)
ββ #22 Python Testing Patterns
ββ #17 API Security Hardening
ββ #25 FastAPI Templates
tests/test_parametrized_*.py (NEW)
ββ #22 Python Testing Patterns
tests/evaluation_metrics.py (NEW)
ββ #14 LLM Evaluation
.github/workflows/
ββ test.yml #24 GitHub Actions Templates
ββ security.yml #18 OWASP Check + #24 Actions
ββ docker.yml #24 Actions
ββ deploy.yml #31 CI-CD Best Practices
.github/
ββ CODEOWNERS #30 GitHub PR Review Workflow
ββ pull_request_template.md #30 Workflow
ββ branch protection rules
docs/
ββ SECURITY_AUDIT.md #18 OWASP Check
ββ REVIEW_GUIDELINES.md #23 Code Review Excellence
ββ API.md (updated by #29 API Docs Generator)
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
SKILL DEPENDENCY GRAPH
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
Phase 1 must finish before Phase 2:
#18, #17, #22, #2, #16, #20, #3, #19, #21, #27, #24
β
Phase 2 requires Phase 1:
#22, #26, #4, #13, #14, #5
β
Phase 3 requires Phases 1-2:
#8, #9, #10, #11, #12, #1, #28, #15
β
Phase 4 requires Phases 1-3:
#25, #29, #30, #27, #23, #31, #32*, #6, #33*, #7
Within phases, some order dependencies:
- #16 should complete before other Phase 1 work finalizes
- #13 should complete before #14 evaluation
- #8, #9, #10 should coordinate (hybrid search β chunking β embeddings)
- #11 depends on #8 (retriever first)
- #12 depends on #13 (prompt engineering for entity extraction)
- #27 used 3 times (Week 2, Week 5, Week 10)
- #22 used 2 times (Week 1, Weeks 3-4)
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
DAILY WORKFLOW
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
1. Open the skill SKILL.md documented in ~/.agents/skills/<skill-name>/
2. Read the relevant section for your task
3. Apply to specific code files listed above
4. Write tests immediately (use #22 Testing Patterns)
5. Commit with clear message: "feat: [Skill #X] [Description]"
6. Track in IMPLEMENTATION_STATUS_TRACKER.md
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
|