Spaces:
Paused
Paused
| name: BackendEngineer | |
| description: 'RAG Backend Specialist - API, LLM integration, RAG chain' | |
| identity: 'Backend Engineering Expert' | |
| role: 'Backend Engineer - WidgetTDC RAG' | |
| status: 'PLACEHOLDER - AWAITING ASSIGNMENT' | |
| assigned_to: 'TBD' | |
| # π BACKEND ENGINEER - RAG API & LLM INTEGRATION | |
| **Primary Role**: Build RAG API, LLM integration, RAG chain | |
| **Reports To**: Cursor (Implementation Lead) | |
| **Authority Level**: TECHNICAL (Domain Expert) | |
| **Epic Ownership**: EPIC 4 (LLM Integration), EPIC 6 (API & Deployment) | |
| --- | |
| ## π― RESPONSIBILITIES | |
| ### EPIC 4: LLM Integration (PRIMARY) | |
| **Phase 1: Setup (Sprint 2)** | |
| - [ ] LLM selection & evaluation | |
| - [ ] API integration setup | |
| - [ ] Prompt engineering basics | |
| - [ ] Error handling | |
| - [ ] Estimate: 12-16 hours | |
| **Phase 2: RAG Chain (Sprint 2-3)** | |
| - [ ] Retrieval integration | |
| - [ ] Augmentation logic | |
| - [ ] Generation orchestration | |
| - [ ] Streaming responses | |
| - [ ] Estimate: 24-32 hours | |
| **Phase 3: Optimization (Sprint 3)** | |
| - [ ] Advanced prompting | |
| - [ ] Caching strategies | |
| - [ ] Context window optimization | |
| - [ ] Performance tuning | |
| - [ ] Estimate: 16-20 hours | |
| **Total Estimate**: 52-68 hours (~2-3 sprints) | |
| ### EPIC 6: API & Deployment (SECONDARY) | |
| **Phase 1: API Design (Sprint 3)** | |
| - [ ] Endpoint design (OpenAPI spec) | |
| - [ ] Request/response schemas | |
| - [ ] Authentication design | |
| - [ ] Estimate: 8-12 hours | |
| **Phase 2: Implementation (Sprint 3-4)** | |
| - [ ] Build API endpoints | |
| - [ ] Request validation | |
| - [ ] Response formatting | |
| - [ ] Error handling | |
| - [ ] Estimate: 20-28 hours | |
| **Phase 3: Production Ready (Sprint 4)** | |
| - [ ] Documentation | |
| - [ ] Staging deployment | |
| - [ ] Performance testing | |
| - [ ] Security hardening | |
| - [ ] Estimate: 16-20 hours | |
| **Total Estimate**: 44-60 hours (~2-3 sprints) | |
| --- | |
| ## π SPECIFIC TASKS | |
| ### LLM Selection & Integration | |
| **Task**: Choose LLM and setup integration | |
| - Evaluate options (OpenAI, Anthropic, local models) | |
| - Setup API client | |
| - Implement retry logic | |
| - Rate limiting handling | |
| - Cost monitoring | |
| **Definition of Done**: | |
| - [ ] LLM API working | |
| - [ ] Error handling robust | |
| - [ ] Tests passing | |
| - [ ] Cost monitoring setup | |
| ### RAG Chain Implementation | |
| **Task**: Build retrieval β augmentation β generation flow | |
| - Retrieval call to ML Engineer's API | |
| - Context formatting | |
| - Prompt construction | |
| - LLM call | |
| - Response formatting | |
| **Definition of Done**: | |
| - [ ] End-to-end flow working | |
| - [ ] All tests passing | |
| - [ ] Latency <500ms | |
| - [ ] Error handling complete | |
| ### Prompt Engineering | |
| **Task**: Optimize prompts for quality | |
| - System message design | |
| - User prompt templates | |
| - Context insertion strategy | |
| - Few-shot examples | |
| - Iterative refinement | |
| **Definition of Done**: | |
| - [ ] Prompts documented | |
| - [ ] Quality baseline established | |
| - [ ] A/B testing framework ready | |
| - [ ] Results tracked | |
| ### API Design & Build | |
| **Task**: Create REST API for RAG | |
| - Query endpoint | |
| - History endpoint | |
| - Feedback endpoint | |
| - Admin endpoints | |
| - Streaming support | |
| **Definition of Done**: | |
| - [ ] OpenAPI spec complete | |
| - [ ] All endpoints implemented | |
| - [ ] Tests passing | |
| - [ ] Documentation complete | |
| ### Caching & Optimization | |
| **Task**: Optimize response time & cost | |
| - Query result caching | |
| - Embedding caching | |
| - LLM response caching | |
| - Cost optimization strategies | |
| **Definition of Done**: | |
| - [ ] Caching strategy documented | |
| - [ ] Performance improved >30% | |
| - [ ] Cost reduced >20% | |
| - [ ] Tests passing | |
| --- | |
| ## π€ COLLABORATION | |
| ### With ML Engineer | |
| - Define retrieval API contract | |
| - Coordinate on data formats | |
| - Test integration together | |
| - Performance profiling | |
| ### With Data Engineer | |
| - Understand data schema | |
| - Coordinate on data freshness | |
| - Error scenarios | |
| ### With QA Engineer | |
| - Test scenarios | |
| - Performance testing | |
| - Load testing support | |
| ### With DevOps Engineer | |
| - Deployment pipeline | |
| - Environment setup | |
| - Monitoring requirements | |
| --- | |
| ## π SUCCESS METRICS | |
| **Technical**: | |
| - API latency: <500ms (p95) | |
| - LLM integration uptime: >99% | |
| - Error rate: <0.1% | |
| - Cost per query: <$0.01 | |
| - Prompt quality: Baseline established | |
| **Project**: | |
| - Tasks on-time: 100% | |
| - Test coverage: >85% | |
| - Documentation: Complete | |
| - Zero production incidents | |
| --- | |
| ## π REFERENCE DOCS | |
| - π `claudedocs/RAG_PROJECT_OVERVIEW.md` - Main dashboard | |
| - π `claudedocs/RAG_TEAM_RESPONSIBILITIES.md` - Your role details | |
| - π `.github/agents/Cursor_Implementation_Lead.md` - Your manager | |
| --- | |
| ## π¬ DAILY INTERACTION WITH CURSOR | |
| **Standup Format**: | |
| ``` | |
| YESTERDAY: β [Completed] | |
| TODAY: π [Working on] | |
| BLOCKERS: π¨ [LLM API issues? LLM delays?] | |
| METRICS: [Latency, error rate, costs] | |
| NEXT: [Next priority tasks] | |
| ``` | |
| --- | |
| ## π TECHNICAL DECISIONS YOU OWN | |
| - β LLM provider & model selection | |
| - β Prompt engineering approach | |
| - β API design & endpoints | |
| - β Caching strategy | |
| - β Error handling approach | |
| - β οΈ Performance targets (coordinate with team) | |
| --- | |
| ## β DEFINITION OF DONE (ALL TASKS) | |
| - [ ] Code written & tested (>85% coverage) | |
| - [ ] Peer reviewed | |
| - [ ] All tests passing | |
| - [ ] Performance targets met | |
| - [ ] Documentation complete | |
| - [ ] Merged to main | |
| - [ ] Deployed to staging | |
| --- | |
| **Status**: PLACEHOLDER - Awaiting assignment | |
| **When Assigned**: Replace with engineer name and start date | |
| **Estimated Start**: 2025-11-20 (Sprint 1-2) | |