File size: 8,418 Bytes
31f0e50
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
# Product Requirements Document: ScamShield AI
## Agentic Honeypot System for Scam Detection & Intelligence Extraction

**Version:** 1.0  
**Date:** January 26, 2026  
**Owner:** Product & Architecture Team  
**Target Competition:** India AI Impact Buildathon 2026 - Challenge 2  
**Submission Deadline:** February 5, 2026

---

## EXECUTIVE SUMMARY

ScamShield AI is an autonomous AI-powered honeypot system designed to detect scam messages, actively engage scammers using believable personas, and extract actionable intelligence including bank accounts, UPI IDs, and phishing links. Built exclusively with free-tier technologies, the system targets 90%+ detection accuracy and multi-turn conversational engagement in English and Hindi.

**Target Outcome:** TOP 10 ranking from 40,000 participants through technical superiority and production-grade implementation.

---

## PROBLEM STATEMENT

### Market Context
- 500,000+ scam calls/messages daily in India (TRAI 2025)
- β‚Ή60+ crore daily losses to fraud
- 47% of Indians affected by or know victims of scam fraud
- Predominant scams: UPI fraud, fake loans, police/bank impersonation

### Solution Gap
Existing solutions focus on passive detection. ScamShield AI introduces active engagement to extract intelligence while scammers remain unaware they're interacting with an AI system.

---

## PRODUCT VISION

**Mission:** Proactively combat digital fraud through autonomous AI agents that gather actionable intelligence from scammers.

**Core Differentiators:**
1. **Active Engagement:** Multi-turn conversations (up to 20 turns) vs. single-response detection
2. **Intelligence Extraction:** Structured extraction of UPI IDs, bank accounts, IFSC codes, phone numbers, phishing links
3. **Persona Simulation:** Dynamic, believable personas (elderly, eager victim, confused user)
4. **Bilingual:** Native English + Hindi support with Hinglish handling
5. **Cost-Effective:** 100% free-tier implementation

---

## TARGET USERS

**Primary:** Competition judges evaluating via Mock Scammer API integration

**Secondary (Future):**
- Financial institutions (banks, payment providers)
- Law enforcement agencies
- Consumer protection organizations
- Telecom operators

---

## PRODUCT REQUIREMENTS

### Phase 1: Text-Based Honeypot (Feb 5, 2026)

#### FR-1: Scam Detection
- **Requirement:** Classify incoming messages as scam/not-scam with confidence scores
- **Accuracy Target:** β‰₯90% on test dataset
- **Languages:** English, Hindi, Hinglish (code-mixed)
- **Confidence Threshold:** 0.7 (trigger engagement)

#### FR-2: Agentic Engagement
- **Requirement:** Conduct multi-turn conversations to extract intelligence
- **Turn Range:** 1-20 turns per conversation
- **Persona Types:** Elderly (60+), Eager Victim (middle-aged), Confused User (young)
- **Strategy:** Progressive engagement (interest β†’ confusion β†’ probing)

#### FR-3: Intelligence Extraction
- **Requirement:** Extract and validate financial/contact information
- **Target Types:**
  - UPI IDs (e.g., user@paytm)
  - Bank Account Numbers (9-18 digits)
  - IFSC Codes (11 characters, format: XXXX0XXXXXX)
  - Phone Numbers (Indian mobile: +91XXXXXXXXXX or 10-digit)
  - Phishing Links (http/https URLs)
- **Precision Target:** β‰₯85%
- **Recall Target:** β‰₯80%

#### FR-4: API Integration
- **Requirement:** REST API endpoint for competition testing
- **Response Time:** <2 seconds per request
- **Format:** Structured JSON output
- **Session Management:** Stateful conversation tracking

#### FR-5: State Persistence
- **Requirement:** Maintain conversation context across turns
- **Storage:** PostgreSQL (logs), Redis (active sessions), ChromaDB (embeddings)
- **Session Expiry:** 1 hour for active sessions

### Phase 2: Audio Extension (Post-Competition)
- Whisper-based audio transcription
- Voice deepfake detection
- Phone call integration

---

## NON-FUNCTIONAL REQUIREMENTS

### Performance
- API Latency: <2s (p95), <1s (p50)
- Throughput: 100 requests/minute
- Concurrent Sessions: 50+

### Reliability
- Uptime: 99%+ during competition testing window
- Error Rate: <1%
- Graceful degradation on LLM rate limits

### Scalability
- Horizontal scaling via containerization
- Stateless API design (state in external stores)
- Database connection pooling

### Security
- No storage of real user PII
- Anonymization of extracted phone numbers
- Safe engagement (no provocation/threats)
- Data retention: 30 days max

### Compliance
- DPDP Act 2023 adherence
- Ethical AI guidelines (no harm principle)
- Transparent data handling

---

## SUCCESS METRICS

### Competition Metrics
1. **Detection Accuracy:** >90% (true positive rate)
2. **False Positive Rate:** <5%
3. **Engagement Quality:** >10 turns average
4. **Extraction Precision:** >85%
5. **Response Time:** <2s per request
6. **System Uptime:** 99%+ during testing

### Technical Metrics
- Code Coverage: >80%
- Documentation Completeness: 100%
- API Compliance: 100% (all endpoints functional)
- Error Handling: All edge cases covered

---

## CONSTRAINTS & ASSUMPTIONS

### Constraints
- **Cost:** $0 operational cost (free tier only)
- **Time:** 10 days to production deployment
- **Languages:** English + Hindi only (no Gujarati/Tamil/etc.)
- **Modality:** Text only in Phase 1

### Assumptions
- Competition provides functional Mock Scammer API
- Groq API maintains 30 req/min free tier
- Test dataset representative of real scam messages
- Judges evaluate on detection accuracy, engagement quality, extraction precision

---

## DEPENDENCIES

### External Services
- Groq Cloud API (LLM)
- Supabase (PostgreSQL)
- Redis Cloud (cache)
- Hugging Face (model downloads)

### Critical Risks
| Risk | Impact | Mitigation |
|------|--------|-----------|
| Groq rate limits | High | Retry logic, exponential backoff, request queueing |
| Model loading time | Medium | Load at startup, cache in memory |
| Database connectivity | High | Connection pooling, auto-reconnect, local fallback |
| Competition API changes | Medium | Flexible schema design, extensive pre-testing |

---

## OUT OF SCOPE (Phase 1)

- Audio/voice call handling (Phase 2)
- Real-time phone system integration
- Automated police reporting
- Gujarati or other regional languages
- Web scraping of phishing sites
- Blockchain/cryptocurrency scam detection (unless text-based)

---

## ACCEPTANCE CRITERIA

**Phase 1 Launch Readiness:**
1. βœ… API endpoint deployed and publicly accessible
2. βœ… Health check endpoint returns 200 OK
3. βœ… Detection accuracy β‰₯85% on 100+ test cases
4. βœ… Extraction precision β‰₯80% on validation dataset
5. βœ… Response time <2s for 95% of requests
6. βœ… Multi-turn engagement averages >8 turns
7. βœ… Hindi and English both functional
8. βœ… JSON output matches specified schema
9. βœ… Monitoring dashboard active
10. βœ… Documentation complete (API docs, deployment guide)

---

## ROADMAP

### Week 1 (Jan 26 - Feb 1): Core Development
- Days 1-2: Project setup, dependencies, database initialization
- Days 3-4: Detection module (IndicBERT, language detection)
- Days 5-6: Agentic module (LangGraph, Groq integration, personas)
- Day 7: Extraction module (spaCy NER, regex patterns)

### Week 2 (Feb 2 - Feb 5): Testing & Deployment
- Day 8: Integration and end-to-end testing
- Day 9: Unit/integration/load testing
- Day 10: Production deployment to Render/Railway
- Day 11: Final testing and competition submission

---

## APPENDIX

### Technology Stack
- **Detection:** IndicBERT (ai4bharat/indic-bert), langdetect
- **LLM:** Groq Llama 3.1 70B (free tier)
- **Orchestration:** LangGraph + LangChain
- **Extraction:** spaCy (en_core_web_sm), regex patterns

- **API:** FastAPI + Uvicorn + Pydantic

- **Storage:** PostgreSQL, Redis, ChromaDB (all local/free)

- **Deployment:** Docker, Render/Railway



### Key Performance Indicators

- Scam detection calls: target 1000+ during competition testing

- Average engagement turns: target 12

- Intelligence pieces extracted per conversation: target 2.5

- System uptime during judging window: 99.9%



---



**Document Status:** Approved for Implementation  

**Next Steps:** Proceed to FRD.md for detailed functional specifications