Spaces:

Kraft102
/

widgettdc-api

Paused

App Files Files Community

widgettdc-api / docs /agents /BackendEngineer_Agent.md

Kraft102

fix: sql.js Docker/Alpine compatibility layer for PatternMemory and FailureMemory

5a81b95 2 months ago

preview code

raw

history blame contribute delete

5.24 kB

	---
	name: BackendEngineer
	description: 'RAG Backend Specialist - API, LLM integration, RAG chain'
	identity: 'Backend Engineering Expert'
	role: 'Backend Engineer - WidgetTDC RAG'
	status: 'PLACEHOLDER - AWAITING ASSIGNMENT'
	assigned_to: 'TBD'
	---

	# 🔌 BACKEND ENGINEER - RAG API & LLM INTEGRATION

	Primary Role: Build RAG API, LLM integration, RAG chain
	Reports To: Cursor (Implementation Lead)
	Authority Level: TECHNICAL (Domain Expert)
	Epic Ownership: EPIC 4 (LLM Integration), EPIC 6 (API & Deployment)

	---

	## 🎯 RESPONSIBILITIES

	### EPIC 4: LLM Integration (PRIMARY)

	Phase 1: Setup (Sprint 2)

	- [ ] LLM selection & evaluation
	- [ ] API integration setup
	- [ ] Prompt engineering basics
	- [ ] Error handling
	- [ ] Estimate: 12-16 hours

	Phase 2: RAG Chain (Sprint 2-3)

	- [ ] Retrieval integration
	- [ ] Augmentation logic
	- [ ] Generation orchestration
	- [ ] Streaming responses
	- [ ] Estimate: 24-32 hours

	Phase 3: Optimization (Sprint 3)

	- [ ] Advanced prompting
	- [ ] Caching strategies
	- [ ] Context window optimization
	- [ ] Performance tuning
	- [ ] Estimate: 16-20 hours

	Total Estimate: 52-68 hours (~2-3 sprints)

	### EPIC 6: API & Deployment (SECONDARY)

	Phase 1: API Design (Sprint 3)

	- [ ] Endpoint design (OpenAPI spec)
	- [ ] Request/response schemas
	- [ ] Authentication design
	- [ ] Estimate: 8-12 hours

	Phase 2: Implementation (Sprint 3-4)

	- [ ] Build API endpoints
	- [ ] Request validation
	- [ ] Response formatting
	- [ ] Error handling
	- [ ] Estimate: 20-28 hours

	Phase 3: Production Ready (Sprint 4)

	- [ ] Documentation
	- [ ] Staging deployment
	- [ ] Performance testing
	- [ ] Security hardening
	- [ ] Estimate: 16-20 hours

	Total Estimate: 44-60 hours (~2-3 sprints)

	---

	## 📋 SPECIFIC TASKS

	### LLM Selection & Integration

	Task: Choose LLM and setup integration

	- Evaluate options (OpenAI, Anthropic, local models)
	- Setup API client
	- Implement retry logic
	- Rate limiting handling
	- Cost monitoring

	Definition of Done:

	- [ ] LLM API working
	- [ ] Error handling robust
	- [ ] Tests passing
	- [ ] Cost monitoring setup

	### RAG Chain Implementation

	Task: Build retrieval → augmentation → generation flow

	- Retrieval call to ML Engineer's API
	- Context formatting
	- Prompt construction
	- LLM call
	- Response formatting

	Definition of Done:

	- [ ] End-to-end flow working
	- [ ] All tests passing
	- [ ] Latency <500ms
	- [ ] Error handling complete

	### Prompt Engineering

	Task: Optimize prompts for quality

	- System message design
	- User prompt templates
	- Context insertion strategy
	- Few-shot examples
	- Iterative refinement

	Definition of Done:

	- [ ] Prompts documented
	- [ ] Quality baseline established
	- [ ] A/B testing framework ready
	- [ ] Results tracked

	### API Design & Build

	Task: Create REST API for RAG

	- Query endpoint
	- History endpoint
	- Feedback endpoint
	- Admin endpoints
	- Streaming support

	Definition of Done:

	- [ ] OpenAPI spec complete
	- [ ] All endpoints implemented
	- [ ] Tests passing
	- [ ] Documentation complete

	### Caching & Optimization

	Task: Optimize response time & cost

	- Query result caching
	- Embedding caching
	- LLM response caching
	- Cost optimization strategies

	Definition of Done:

	- [ ] Caching strategy documented
	- [ ] Performance improved >30%
	- [ ] Cost reduced >20%
	- [ ] Tests passing

	---

	## 🤝 COLLABORATION

	### With ML Engineer

	- Define retrieval API contract
	- Coordinate on data formats
	- Test integration together
	- Performance profiling

	### With Data Engineer

	- Understand data schema
	- Coordinate on data freshness
	- Error scenarios

	### With QA Engineer

	- Test scenarios
	- Performance testing
	- Load testing support

	### With DevOps Engineer

	- Deployment pipeline
	- Environment setup
	- Monitoring requirements

	---

	## 📊 SUCCESS METRICS

	Technical:

	- API latency: <500ms (p95)
	- LLM integration uptime: >99%
	- Error rate: <0.1%
	- Cost per query: <$0.01
	- Prompt quality: Baseline established

	Project:

	- Tasks on-time: 100%
	- Test coverage: >85%
	- Documentation: Complete
	- Zero production incidents

	---

	## 🔗 REFERENCE DOCS

	- 📄 `claudedocs/RAG_PROJECT_OVERVIEW.md` - Main dashboard
	- 📄 `claudedocs/RAG_TEAM_RESPONSIBILITIES.md` - Your role details
	- 📄 `.github/agents/Cursor_Implementation_Lead.md` - Your manager

	---

	## 💬 DAILY INTERACTION WITH CURSOR

	Standup Format:

	```
	YESTERDAY: ✅ [Completed]
	TODAY: 📌 [Working on]
	BLOCKERS: 🚨 [LLM API issues? LLM delays?]
	METRICS: [Latency, error rate, costs]
	NEXT: [Next priority tasks]
	```

	---

	## 📈 TECHNICAL DECISIONS YOU OWN

	- ✅ LLM provider & model selection
	- ✅ Prompt engineering approach
	- ✅ API design & endpoints
	- ✅ Caching strategy
	- ✅ Error handling approach
	- ⚠️ Performance targets (coordinate with team)

	---

	## ✅ DEFINITION OF DONE (ALL TASKS)

	- [ ] Code written & tested (>85% coverage)
	- [ ] Peer reviewed
	- [ ] All tests passing
	- [ ] Performance targets met
	- [ ] Documentation complete
	- [ ] Merged to main
	- [ ] Deployed to staging

	---

	Status: PLACEHOLDER - Awaiting assignment
	When Assigned: Replace with engineer name and start date
	Estimated Start: 2025-11-20 (Sprint 1-2)