Spaces: Abeshith/Voice-Bot-RAG

Abeshith committed on Jun 2

Commit

1813edc

0 Parent(s):

Voice BOT RAG Initial Commit

This view is limited to 50 files because it contains too many changes.

Files changed (50)

.gitignore +2 -0
README.md +389 -0
START_SYSTEM.ps1 +64 -0
backend/__init__.py +1 -0
backend/__pycache__/__init__.cpython-311.pyc +0 -0
backend/__pycache__/config.cpython-311.pyc +0 -0
backend/__pycache__/main.cpython-311.pyc +0 -0
backend/__pycache__/voice_bot_controller.cpython-311.pyc +0 -0
backend/config.py +68 -0
backend/main.py +241 -0
backend/voice_bot_controller.py +134 -0
data/latency_results.json +142 -0
data/load_sample_data.py +201 -0
data/sessions.db +0 -0
data/test_latency.json +22 -0
frontend/__pycache__/streamlit_app.cpython-311.pyc +0 -0
frontend/streamlit_app.py +739 -0
orchestration/__init__.py +1 -0
orchestration/__pycache__/__init__.cpython-311.pyc +0 -0
orchestration/__pycache__/langgraph_workflow.cpython-311.pyc +0 -0
orchestration/__pycache__/latency_tracker.cpython-311.pyc +0 -0
orchestration/__pycache__/state.cpython-311.pyc +0 -0
orchestration/langgraph_workflow.py +119 -0
orchestration/latency_tracker.py +123 -0
orchestration/nodes/__init__.py +1 -0
orchestration/nodes/__pycache__/__init__.cpython-311.pyc +0 -0
orchestration/nodes/__pycache__/context_builder.cpython-311.pyc +0 -0
orchestration/nodes/__pycache__/entity_extraction.cpython-311.pyc +0 -0
orchestration/nodes/__pycache__/intent_detection.cpython-311.pyc +0 -0
orchestration/nodes/__pycache__/memory_persistence.cpython-311.pyc +0 -0
orchestration/nodes/__pycache__/response_generation.cpython-311.pyc +0 -0
orchestration/nodes/__pycache__/retrieval_router.cpython-311.pyc +0 -0
orchestration/nodes/__pycache__/sentiment_analysis.cpython-311.pyc +0 -0
orchestration/nodes/__pycache__/sentiment_hybrid.cpython-311.pyc +0 -0
orchestration/nodes/__pycache__/tts_generation.cpython-311.pyc +0 -0
orchestration/nodes/__pycache__/validation.cpython-311.pyc +0 -0
orchestration/nodes/context_builder.py +60 -0
orchestration/nodes/entity_extraction.py +58 -0
orchestration/nodes/intent_detection.py +61 -0
orchestration/nodes/memory_persistence.py +45 -0
orchestration/nodes/response_generation.py +93 -0
orchestration/nodes/retrieval_router.py +51 -0
orchestration/nodes/sentiment_analysis.py +49 -0
orchestration/nodes/sentiment_hybrid.py +133 -0
orchestration/nodes/tts_generation.py +72 -0
orchestration/nodes/validation.py +61 -0
orchestration/state.py +67 -0
rag/__init__.py +1 -0
rag/__pycache__/__init__.cpython-311.pyc +0 -0
rag/__pycache__/cache_manager.cpython-311.pyc +0 -0

.gitignore ADDED

	@@ -0,0 +1,2 @@

+venv/
+.env

README.md ADDED

	@@ -0,0 +1,389 @@

+# Voice RAG Bot - AI Customer Support System
+**Status**: ✅ **FULLY FUNCTIONAL** | Latest Update: May 30, 2026
+## 📋 Quick Overview
+Voice RAG Bot is an intelligent AI customer support system that:
+- 🎤 **Accepts voice input** via microphone or audio file upload
+- 🧠 **Processes with LLM** (Groq) for intent detection and response generation
+- 📚 **Retrieves relevant context** from knowledge base and customer history using vector search
+- 😊 **Analyzes sentiment** to provide empathetic, sentiment-aware responses
+- 🔊 **Generates speech output** via text-to-speech
+- 📊 **Orchestrates 9-node workflow** using LangGraph
+**Tech Stack**: Faster Whisper (STT) → LangGraph (9 nodes) → Groq LLM → Qdrant (Vector DB) → gTTS (TTS)
+---
+## 🚀 Quick Start (3 Steps)
+### Step 1: Prerequisites
+- Docker Desktop running (for Qdrant)
+- Python 3.11+
+- Git (optional)
+### Step 2: Start Qdrant (Vector Database)
+```bash
+docker run -p 6333:6333 qdrant/qdrant:latest
+```
+Leave this running in background. ✅ System will auto-create collections.
+### Step 3: Start Voice RAG Bot
+```bash
+cd "d:\Voice RAG Bot\voice-rag-bot"
+# Activate virtual environment
+.\venv\Scripts\Activate.ps1
+# Run startup script (starts backend + Streamlit)
+.\START_SYSTEM.ps1
+```
+**Or start services manually:**
+Terminal 1 (Backend):
+```bash
+.\venv\Scripts\Activate.ps1
+python backend/main.py
+# Runs on http://localhost:8000
+```
+Terminal 2 (Frontend):
+```bash
+.\venv\Scripts\Activate.ps1
+streamlit run frontend/streamlit_app.py
+# Opens http://localhost:8501
+```
+---
+## 📖 Usage Guide
+### Via Streamlit Frontend (Recommended)
+1. **Open Browser**: http://localhost:8501
+2. **Enter Customer ID**: Unique identifier for customer (enables history tracking)
+3. **Choose Input Method**:
+   - **Option A**: Click 🎤 **Record** → Speak your message → **Process Audio**
+   - **Option B**: Upload audio file (MP3/WAV)
+   - **Option C**: Type message directly in text area
+4. **View Results** (automatically displayed):
+   - 📝 Generated Response
+   - 🎯 Detected Intent (+ confidence)
+   - 😊 Sentiment Analysis (+ confidence)
+   - 🏷️ Extracted Entities
+   - 📚 Knowledge Base context (if relevant)
+   - 📜 Customer History (if relevant)
+   - 🔊 Audio playback of response
+### Via REST API (For Integration)
+**Process Audio:**
+```bash
+curl -X POST "http://localhost:8000/process-audio?customer_id=CUST_001" \
+  -F "file=@voice_message.wav"
+```
+**Process Text:**
+```bash
+# user_input and customer_id are query parameters, so they must be URL-encoded in the URL
+curl -X POST "http://localhost:8000/process-text?user_input=I%20want%20to%20return%20my%20laptop&customer_id=CUST_001"
+```
+**Health Check:**
+```bash
+curl http://localhost:8000/health
+```
+---
+## 📊 System Architecture
+```
+Input Layer
+  ├─ 🎤 Audio Input (Streamlit st.audio_input)
+  └─ 📝 Text Input (Streamlit text area)
+         ↓
+Speech-to-Text
+  └─ Faster Whisper (base model, CPU inference)
+         ↓
+Orchestration Layer (LangGraph - 9 Nodes)
+  1. sentiment_analysis (DistilBERT)
+  2. entity_extraction (BERT-base-NER)
+  3. intent_detection (Groq LLM)
+  4. retrieval_router (Qdrant search)
+  5. context_builder (Format prompt)
+  6. response_generation (Groq LLM)
+  7. validation (Hallucination checks)
+  8. memory_persistence (Qdrant upsert)
+  9. tts_generation (gTTS)
+         ↓
+Output Layer
+  ├─ 📝 Text Response
+  ├─ 😊 Sentiment-aware Tone
+  ├─ 🔊 Audio File (MP3)
+  └─ 🎯 Intent Classification
+```
+---
+## 🔧 Configuration
+**Environment Variables** (`.env`):
+```
+GROQ_API_KEY=your_groq_api_key_here
+QDRANT_URL=http://localhost:6333
+BACKEND_URL=http://localhost:8000
+VECTOR_DIMENSION=1024
+EMBEDDING_MODEL=BAAI/bge-m3
+GROQ_MODEL=openai/gpt-oss-20b
+KB_COLLECTION_NAME=knowledge_base
+HISTORY_COLLECTION_NAME=customer_history
+WHISPER_MODEL=base
+```
+---
+## 📝 Sample Data
+Load sample data (4 KB documents + 4 customer history records):
+```bash
+.\venv\Scripts\Activate.ps1
+python data/load_sample_data.py
+```
+**Included Data:**
+- KB Documents: Return Policy, Shipping Info, Warranty Info, Account Management
+- Customer History: 4 interactions (complaints, refunds, inquiries)
+---
+## 🧪 Testing
+### Quick Verification
+```bash
+# Test complete pipeline (end-to-end)
+.\venv\Scripts\Activate.ps1
+python tests/test_full_integration.py
+```
+**Expected Output**: ✅ FULL INTEGRATION TEST PASSED
+### Component Status
+- ✅ All 9 nodes connected and working
+- ✅ FastAPI endpoints operational
+- ✅ Qdrant vector search functional
+- ✅ LLM integration responding
+- ✅ Audio processing working
+- ✅ Sample data loadable
+---
+## 🎯 Intent Types Supported
+| Intent | Example | Response |
+|--------|---------|----------|
+| `refund_request` | "I want to return this" | Empathetic, processing info |
+| `order_status` | "Where's my order?" | Tracking info |
+| `product_inquiry` | "Tell me about...?" | Product details |
+| `billing_issue` | "My charge was wrong" | Empathetic, billing process |
+| `warranty_claim` | "Product broke" | Warranty eligibility info |
+| `account_management` | "Change my password" | Account instructions |
+| `general_support` | "How do I...?" | General assistance |
+| `complaint` | "This is unacceptable" | Empathetic, resolution steps |
+| `other` | Misc questions | General help |
+---
+## 📊 Response Quality Factors
+1. **Sentiment Detection**: POSITIVE/NEGATIVE/NEUTRAL classification
+2. **Confidence Scores**: 0-1 for both intent and sentiment
+3. **Context Retrieval**: Up to 3 KB documents + customer history
+4. **Tone Matching**: Empathetic for negative, professional for neutral, friendly for positive
+5. **Hallucination Prevention**: Validation layer checks for accuracy
+---
+## 🐛 Troubleshooting
+### Issue: "Backend Not Connected"
+**Solution**: Ensure FastAPI backend is running
+```bash
+python backend/main.py
+```
+### Issue: "Qdrant Connection Error"
+**Solution**: Start Qdrant Docker container
+```bash
+docker run -p 6333:6333 qdrant/qdrant:latest
+```
+### Issue: "Groq API Error"
+**Solution**: Check GROQ_API_KEY in `.env` file
+```bash
+# Verify the key is present in .env (the backend loads this file directly)
+Select-String -Path .env -Pattern "^GROQ_API_KEY="
+```
+### Issue: "Audio Processing Timeout"
+**Solution**: Processing may take 30-60 seconds for audio
+- First run downloads models (Whisper, BGE-M3, DistilBERT)
+- Subsequent runs are faster
+- Ensure sufficient disk space (~5GB)
+### Issue: "Module Not Found"
+**Solution**: Reinstall dependencies
+```bash
+.\venv\Scripts\Activate.ps1
+pip install -r requirements.txt
+```
+---
+## 📁 Project Structure
+```
+d:\Voice RAG Bot\voice-rag-bot\
+├── backend/
+│   ├── main.py                 FastAPI server
+│   └── config.py               Configuration
+├── frontend/
+│   └── streamlit_app.py        Web UI
+├── orchestration/
+│   ├── langgraph_workflow.py   9-node workflow
+│   ├── state.py                State management
+│   └── nodes/                  Individual nodes
+│       ├── sentiment_analysis.py
+│       ├── entity_extraction.py
+│       ├── intent_detection.py
+│       ├── retrieval_router.py
+│       ├── context_builder.py
+│       ├── response_generation.py
+│       ├── validation.py
+│       ├── memory_persistence.py
+│       └── tts_generation.py
+├── rag/
+│   ├── qdrant_manager.py       Vector DB client
+│   └── embedding_manager.py    BGE-M3 embeddings
+├── data/
+│   ├── load_sample_data.py     Sample data loader
+│   └── audio_output/           Generated audio files
+├── tests/
+│   └── test_full_integration.py End-to-end test
+├── .env                        Configuration
+├── requirements.txt            Dependencies
+├── START_SYSTEM.ps1           Quick start script
+└── venv/                       Python environment
+```
+---
+## 🔄 Workflow Execution (Behind the Scenes)
+1. **sentiment_analysis**: Input → DistilBERT → POSITIVE/NEGATIVE/NEUTRAL
+2. **entity_extraction**: Input → BERT-NER → Extract names, locations, etc.
+3. **intent_detection**: Input → Groq LLM → 9-intent classification
+4. **retrieval_router**: Intent → Qdrant search → 3 KB docs + customer history
+5. **context_builder**: Format contexts → Unified prompt
+6. **response_generation**: Prompt → Groq LLM → Response text
+7. **validation**: Check hallucinations → Retry if needed
+8. **memory_persistence**: Embed response → Upsert to Qdrant
+9. **tts_generation**: Response text → gTTS → MP3 audio file
+---
+## 📊 Performance Metrics (Approximate)
+| Component | Time | Notes |
+|-----------|------|-------|
+| STT (Audio → Text) | 5-15s | Depends on audio length |
+| Sentiment Analysis | 0.5s | DistilBERT inference |
+| Entity Extraction | 0.5s | BERT-NER inference |
+| Intent Detection | 1-2s | Groq API call |
+| KB Search | 0.2s | Qdrant vector search |
+| Response Generation | 2-5s | Groq streaming |
+| Validation | 0.5s | Local checks |
+| TTS Generation | 2-5s | gTTS processing |
+| **Total End-to-End** | **12-35s** | First run slower (model loading) |
+---
+## 💡 Tips & Tricks
+### Faster Processing
+- Use text input instead of audio (skips STT)
+- System caches models after first run
+- Keep audio messages under 30 seconds
+### Better Responses
+- Use clear, grammatically correct input
+- Provide context ("purchased last week" vs "bought before")
+- Specify what you need (return, refund, replacement)
+### Debugging
+- Check `backend/main.py` logs for errors
+- View Qdrant collections: http://localhost:6333/api/swagger/index.html
+- Monitor Streamlit server in terminal for issues
+---
+## 🚀 Next Steps
+1. **Load Sample Data**: `python data/load_sample_data.py`
+2. **Test with Demo Scenarios**: Use Streamlit to test various intents
+3. **Customize KB Documents**: Add your own documents to Qdrant
+4. **Fine-tune Prompts**: Edit prompts in `prompts/` directory
+5. **Production Deployment**: Add authentication, rate limiting, monitoring
+---
+## 📞 Support & References
+**Documentation Files:**
+- `data/DATA_REQUIREMENTS.md` - Data schema documentation
+- `.env` - Environment configuration
+**API Endpoints:**
+- `POST /process-audio` - Audio input endpoint
+- `POST /process-text` - Text input endpoint
+- `GET /health` - Health check
+**Backend Logs:**
+- Location: Console output when running `python backend/main.py`
+- Check for errors, model loading, API calls
+---
+## 📝 License & Attribution
+**Components**:
+- **Groq LLM**: Free tier, gpt-oss-20b model
+- **Faster Whisper**: SYSTRAN's CTranslate2 reimplementation of OpenAI Whisper (MIT License)
+- **LangGraph**: LangChain (Open Source)
+- **Qdrant**: Open source vector database
+- **BGE-M3**: BAAI embeddings model
+- **DistilBERT**: Hugging Face transformers
+- **gTTS**: Google Text-to-Speech
+---
+## ✅ Verification Checklist
+Before considering system "ready for production":
+- [ ] Backend running on http://localhost:8000
+- [ ] Qdrant running on http://localhost:6333
+- [ ] Streamlit frontend accessible at http://localhost:8501
+- [ ] Sample data loaded (`python data/load_sample_data.py`)
+- [ ] Integration test passing (`python tests/test_full_integration.py`)
+- [ ] Audio input working (record or upload)
+- [ ] All 9 nodes executing (check logs)
+- [ ] Response generation working
+- [ ] Audio playback working
+- [ ] History tracking working (multiple messages same customer)
+---
+**Built with ❤️ | Last Updated: May 30, 2026**
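
Note on calling the text endpoint from code: the `/process-text` handler in `backend/main.py` declares `user_input` and `customer_id` as plain `str` parameters, which FastAPI reads from the query string. The sketch below builds such a request with only the standard library. The helper names `build_process_text_url` and `post_text` are illustrative assumptions and are not part of the repository.

```python
import json
import urllib.parse
import urllib.request


def build_process_text_url(base_url: str, user_input: str, customer_id: str) -> str:
    """Return the /process-text URL with both parameters URL-encoded in the query string."""
    query = urllib.parse.urlencode({"user_input": user_input, "customer_id": customer_id})
    return f"{base_url.rstrip('/')}/process-text?{query}"


def post_text(base_url: str, user_input: str, customer_id: str, timeout: float = 120.0) -> dict:
    """POST to the backend and decode the JSON reply (assumes the backend is running)."""
    url = build_process_text_url(base_url, user_input, customer_id)
    request = urllib.request.Request(url, data=b"", method="POST")
    with urllib.request.urlopen(request, timeout=timeout) as reply:
        return json.loads(reply.read().decode("utf-8"))


if __name__ == "__main__":
    # Only builds and prints the URL; call post_text(...) once the backend is up.
    print(build_process_text_url("http://localhost:8000", "I want to return my laptop", "CUST_001"))
```

The generous default timeout is a guess based on the README's own end-to-end estimates; adjust it to the latency you actually observe.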

START_SYSTEM.ps1 ADDED

	@@ -0,0 +1,64 @@

+# Voice RAG Bot - System Startup Script
+# Starts FastAPI backend and Streamlit frontend
+Write-Host "=================================="
+Write-Host "Voice RAG Bot - System Startup"
+Write-Host "=================================="
+Write-Host ""
+# Check if venv exists
+if (-not (Test-Path "venv\Scripts\Activate.ps1")) {
+    Write-Host "ERROR: Virtual environment not found!"
+    Write-Host "Please run: python -m venv venv"
+    exit 1
+}
+# Activate venv
+Write-Host "[1/3] Activating virtual environment..."
+& .\venv\Scripts\Activate.ps1
+# Check if Qdrant is running
+Write-Host "[2/3] Checking Qdrant connection..."
+try {
+    $response = Invoke-WebRequest -Uri "http://localhost:6333/healthz" -UseBasicParsing -TimeoutSec 2
+    if ($response.StatusCode -eq 200) {
+        Write-Host "✅ Qdrant is running on localhost:6333"
+    }
+} catch {
+    Write-Host "⚠️  WARNING: Cannot connect to Qdrant on localhost:6333"
+    Write-Host "   Make sure Docker is running and Qdrant container is active"
+    Write-Host "   Run: docker run -p 6333:6333 qdrant/qdrant:latest"
+}
+Write-Host ""
+Write-Host "[3/3] Starting services..."
+Write-Host ""
+# Start backend in a separate process
+Write-Host "Starting FastAPI backend on http://localhost:8000"
+$backendProcess = Start-Process -NoNewWindow -FilePath "python" -ArgumentList "backend/main.py" -PassThru
+Start-Sleep -Seconds 3
+# Announce the frontend (Streamlit itself is launched further below, in the foreground)
+Write-Host "Starting Streamlit frontend on http://localhost:8501"
+Write-Host ""
+Write-Host "=================================="
+Write-Host "Backend launched; starting frontend..."
+Write-Host "=================================="
+Write-Host ""
+Write-Host "Frontend URL: http://localhost:8501"
+Write-Host "Backend API: http://localhost:8000"
+Write-Host ""
+Write-Host "Backend PID: $($backendProcess.Id)"
+Write-Host ""
+Write-Host "To stop the backend, run: Stop-Process -Id $($backendProcess.Id)"
+Write-Host ""
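+# Start Streamlit in the foreground (blocks until Streamlit exits)
+python -m streamlit run frontend/streamlit_app.py
+# Cleanup: stop the backend if it is still running
+Write-Host ""
+Write-Host "Stopping backend..."
+Stop-Process -Id $backendProcess.Id -Force -ErrorAction SilentlyContinue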
+Write-Host "Shutdown complete."
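
Note on startup ordering: the script waits a fixed three seconds after launching the backend. One possible alternative, sketched here in Python under the assumption that "ready" simply means the TCP port accepts connections, is to poll the port until it opens. The function name `wait_for_port` is hypothetical and does not exist in the repository.

```python
import socket
import time


def wait_for_port(host: str, port: int, timeout: float = 30.0, interval: float = 0.5) -> bool:
    """Poll until a TCP connection to host:port succeeds or the timeout elapses."""
    deadline = time.monotonic() + timeout
    while True:
        try:
            # A successful connect means something is listening on the port.
            with socket.create_connection((host, port), timeout=interval):
                return True
        except OSError:
            if time.monotonic() >= deadline:
                return False
            time.sleep(interval)


# Example (not executed here): wait_for_port("localhost", 8000) before opening the frontend.
```

An open port does not guarantee that models have finished loading, so a request to `GET /health` afterwards is still a reasonable second check.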

backend/__init__.py ADDED

	@@ -0,0 +1 @@

+"""Voice RAG Bot Backend Package"""

backend/__pycache__/__init__.cpython-311.pyc ADDED

Binary file (202 Bytes).

backend/__pycache__/config.cpython-311.pyc ADDED

Binary file (2.91 kB).

backend/__pycache__/main.cpython-311.pyc ADDED

Binary file (19.2 kB).

backend/__pycache__/voice_bot_controller.cpython-311.pyc ADDED

Binary file (7.9 kB).

backend/config.py ADDED

	@@ -0,0 +1,68 @@

+"""
+Central Configuration Management using Pydantic Settings
+Loads environment variables from .env file
+"""
+from pydantic_settings import BaseSettings
+from typing import Optional
+from pathlib import Path
+class Settings(BaseSettings):
+    """Application configuration loaded from environment variables"""
+    # Groq LLM Configuration
+    groq_api_key: str
+    groq_model: str = "llama-3.3-70b-versatile"
+    groq_temperature: float = 0.7
+    groq_max_tokens: int = 1024
+    # Qdrant Vector Database Configuration
+    qdrant_url: str = "http://localhost:6333"
+    qdrant_api_key: Optional[str] = None  # Optional for local Docker setup
+    # Embedding Model Configuration
+    embedding_model: str = "BAAI/bge-m3"
+    embedding_batch_size: int = 32
+    # Collection Names
+    kb_collection_name: str = "knowledge_base"
+    history_collection_name: str = "customer_history"
+    # Vector Dimensions (BGE-M3 uses 1024 dimensions)
+    vector_dimension: int = 1024
+    # Model Configuration for NLP Tasks
+    sentiment_model: str = "distilbert-base-uncased-finetuned-sst-2-english"
+    # Application Configuration
+    app_name: str = "Voice RAG Bot"
+    app_version: str = "1.0.0"
+    debug_mode: bool = False
+    # Conversation Memory
+    max_conversation_history: int = 10
+    summary_interval: int = 5  # Generate summary every 5 turns
+    # Audio Configuration
+    sample_rate: int = 16000  # 16kHz for Whisper
+    audio_format: str = "wav"
+    class Config:
+        """Pydantic config for reading from .env file"""
+        env_file = str(Path(__file__).parent.parent / ".env")
+        case_sensitive = False
+        extra = "ignore"  # Ignore unknown fields from .env
+    def __repr__(self) -> str:
+        """String representation (hides API keys)"""
+        return (
+            f"Settings("
+            f"groq_model={self.groq_model}, "
+            f"qdrant_url={self.qdrant_url}, "
+            f"embedding_model={self.embedding_model})"
+        )
+# Global settings instance
+settings = Settings()
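
Note on precedence: `Settings` gives every field except `groq_api_key` a default, and values from `.env` or the environment override those defaults. That is why the README's `GROQ_MODEL=openai/gpt-oss-20b` would win over the coded default `llama-3.3-70b-versatile`. The sketch below imitates that behaviour with a standard-library dataclass rather than pydantic-settings, so it is only an approximation. `SketchSettings` and `load_settings` are made-up names.

```python
import os
from dataclasses import dataclass, fields
from typing import Mapping, Optional


@dataclass(frozen=True)
class SketchSettings:
    # Defaults copied from backend/config.py; only three fields are shown for brevity.
    groq_model: str = "llama-3.3-70b-versatile"
    qdrant_url: str = "http://localhost:6333"
    vector_dimension: int = 1024


def load_settings(environ: Optional[Mapping[str, str]] = None) -> SketchSettings:
    """Build settings where environment values (matched case-insensitively) override defaults."""
    source = os.environ if environ is None else environ
    lowered = {key.lower(): value for key, value in source.items()}
    overrides = {}
    for field in fields(SketchSettings):
        if field.name in lowered:
            # Cast the raw string to the type of the field's default (str or int here).
            overrides[field.name] = type(field.default)(lowered[field.name])
    return SketchSettings(**overrides)


if __name__ == "__main__":
    print(load_settings({"GROQ_MODEL": "openai/gpt-oss-20b"}))
```

Unlike the real `Settings` class, this stand-in performs no validation beyond the type cast and does not read a `.env` file.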

backend/main.py ADDED

	@@ -0,0 +1,241 @@

+"""
+FastAPI Backend for Voice RAG Bot
+Handles audio input, STT conversion, workflow orchestration, and response generation
+"""
+import logging
+import asyncio
+import sys
+from pathlib import Path
+from typing import Optional
+from io import BytesIO
+# Add project root to path for imports
+sys.path.insert(0, str(Path(__file__).parent.parent))
+from fastapi import FastAPI, UploadFile, File, HTTPException
+from fastapi.middleware.cors import CORSMiddleware
+from pydantic import BaseModel
+import uvicorn
+# Import configuration
+from backend.config import settings
+# Import workflow
+from orchestration.langgraph_workflow import run_workflow
+from orchestration.latency_tracker import get_tracker, reset_tracker
+# Import STT (Faster Whisper)
+from faster_whisper import WhisperModel
+# Configure logging
+logging.basicConfig(level=logging.INFO)
+logger = logging.getLogger(__name__)
+# ============================================================================
+# MODELS
+# ============================================================================
+class ProcessAudioResponse(BaseModel):
+    """Response model for audio processing"""
+    response_text: str
+    audio_path: Optional[str]
+    intent: dict
+    sentiment: dict
+    entities: Optional[dict]
+    kb_context: str
+    history_context: str
+class HealthResponse(BaseModel):
+    """Health check response"""
+    status: str
+    llm_model: str
+    qdrant_url: str
+    whisper_model: str
+# ============================================================================
+# FASTAPI APP INITIALIZATION
+# ============================================================================
+app = FastAPI(
+    title="Voice RAG Bot Backend",
+    description="AI-powered customer service bot with RAG and voice interface",
+    version="1.0.0"
+)
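+# Add CORS middleware for frontend communication
+# NOTE: wildcard origins combined with credentials is very permissive;
+# restrict allow_origins to the frontend's URL before any public deployment.
+app.add_middleware(
+    CORSMiddleware,
+    allow_origins=["*"],
+    allow_credentials=True,
+    allow_methods=["*"],
+    allow_headers=["*"],
+)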
+# ============================================================================
+# GLOBAL STATE
+# ============================================================================
+whisper_model = WhisperModel("base", device="cpu", compute_type="int8")
+def extract_audio_content(audio_bytes: bytes) -> str:
+    try:
+        audio_file = BytesIO(audio_bytes)
+        tracker = get_tracker()
+        # Start timing before transcription: faster-whisper returns a lazy generator,
+        # so the decoding work happens while the segments are consumed below.
+        tracker.start("whisper_stt")
+        segments, _ = whisper_model.transcribe(audio_file, language="en")
+        transcribed_text = " ".join([segment.text for segment in segments])
+        tracker.end("whisper_stt")
+        if not transcribed_text.strip():
+            return "No speech detected"
+        return transcribed_text
+    except Exception as e:
+        logger.error(f"STT Error: {str(e)}")
+        raise HTTPException(status_code=400, detail=f"STT failed: {str(e)}")
+async def run_workflow_async(user_input: str, customer_id: str) -> dict:
+    try:
+        return await run_workflow(user_input, customer_id)
+    except Exception as e:
+        logger.error(f"Workflow Error: {str(e)}")
+        raise HTTPException(status_code=500, detail=f"Workflow failed: {str(e)}")
+@app.get("/health", response_model=HealthResponse)
+async def health_check():
+    return {
+        "status": "healthy",
+        "llm_model": settings.groq_model,
+        "qdrant_url": settings.qdrant_url,
+        "whisper_model": "base"
+    }
+@app.post("/process-audio", response_model=ProcessAudioResponse)
+async def process_audio(
+    file: UploadFile = File(...),
+    customer_id: str = "DEFAULT_CUSTOMER"
+):
+    try:
+        reset_tracker()
+        tracker = get_tracker()
+        tracker.start_total()
+        audio_bytes = await file.read()
+        user_input = extract_audio_content(audio_bytes)
+        final_state = await run_workflow_async(user_input, customer_id)
+        response = ProcessAudioResponse(
+            response_text=final_state.get("response", ""),
+            audio_path=final_state.get("final_audio_path"),
+            intent=final_state.get("intent", {}),
+            sentiment=final_state.get("sentiment", {}),
+            entities=final_state.get("entities"),
+            kb_context=final_state.get("kb_context", ""),
+            history_context=final_state.get("history_context", "")
+        )
+        return response
+    except HTTPException:
+        raise
+    except Exception as e:
+        logger.error(f"Unexpected error: {str(e)}", exc_info=True)
+        raise HTTPException(status_code=500, detail=f"Processing failed: {str(e)}")
+@app.post("/process-text", response_model=ProcessAudioResponse)
+async def process_text(
+    user_input: str,
+    customer_id: str = "DEFAULT_CUSTOMER"
+):
+    try:
+        final_state = await run_workflow_async(user_input, customer_id)
+        return ProcessAudioResponse(
+            response_text=final_state.get("response", ""),
+            audio_path=final_state.get("final_audio_path"),
+            intent=final_state.get("intent", {}),
+            sentiment=final_state.get("sentiment", {}),
+            entities=final_state.get("entities"),
+            kb_context=final_state.get("kb_context", ""),
+            history_context=final_state.get("history_context", "")
+        )
+    except HTTPException:
+        # Preserve the status code set by run_workflow_async instead of re-wrapping it
+        raise
+    except Exception as e:
+        logger.error(f"Error: {str(e)}", exc_info=True)
+        raise HTTPException(status_code=500, detail=f"Processing failed: {str(e)}")
+@app.get("/")
+async def root():
+    return {
+        "name": "Voice RAG Bot Backend",
+        "version": "1.0.0",
+        "endpoints": {
+            "health": "GET /health",
+            "process_audio": "POST /process-audio (requires audio file)",
+            "process_text": "POST /process-text (requires text input)",
+            "voice_bot_start": "POST /voice-bot/start",
+            "voice_bot_message": "POST /voice-bot/message",
+            "voice_bot_end": "POST /voice-bot/end",
+            "voice_bot_history": "GET /voice-bot/history",
+            "docs": "GET /docs (Swagger UI)"
+        }
+    }
+from backend.voice_bot_controller import get_voice_bot_controller
+@app.post("/voice-bot/start")
+async def voice_bot_start(customer_id: str = "CUST_DEFAULT"):
+    try:
+        controller = get_voice_bot_controller()
+        return await controller.start_session(customer_id)
+    except Exception as e:
+        raise HTTPException(status_code=500, detail=str(e))
+@app.post("/voice-bot/message")
+async def voice_bot_message(user_message: str):
+    try:
+        controller = get_voice_bot_controller()
+        return await controller.process_user_message(user_message)
+    except Exception as e:
+        raise HTTPException(status_code=500, detail=str(e))
+@app.post("/voice-bot/end")
+async def voice_bot_end():
+    try:
+        controller = get_voice_bot_controller()
+        return await controller.end_session()
+    except Exception as e:
+        raise HTTPException(status_code=500, detail=str(e))
+@app.get("/voice-bot/history")
+async def voice_bot_history():
+    try:
+        controller = get_voice_bot_controller()
+        return {"history": controller.get_session_history()}
+    except Exception as e:
+        raise HTTPException(status_code=500, detail=str(e))
+@app.on_event("startup")
+async def startup_event():
+    logger.info(f"Backend started - Config: {settings.groq_model}")
+@app.on_event("shutdown")
+async def shutdown_event():
+    logger.info("Backend shutdown")
+if __name__ == "__main__":
+    logger.info("Starting FastAPI server...")
+    uvicorn.run(
+        app,
+        host="0.0.0.0",
+        port=8000,
+        log_level="info"
+    )
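
Note on timing speech-to-text: `extract_audio_content` relies on faster-whisper's `transcribe`, which returns its segments as a lazy generator, so the decoding cost is paid while the segments are iterated. A latency measurement therefore has to wrap the consumption of the generator, not just the call. The sketch below shows the difference using a stand-in generator. `timed` and `fake_transcribe` are illustrative names, and the sleep only simulates decoding work.

```python
import time
from contextlib import contextmanager
from typing import Dict, Iterator


@contextmanager
def timed(name: str, sink: Dict[str, float]) -> Iterator[None]:
    """Record the wall-clock duration of the enclosed block under sink[name]."""
    start = time.perf_counter()
    try:
        yield
    finally:
        sink[name] = time.perf_counter() - start


def fake_transcribe(chunks: int) -> Iterator[str]:
    """Stand-in for a lazy transcriber: work happens only while iterating."""
    for index in range(chunks):
        time.sleep(0.02)  # simulated decoding cost per segment
        yield f"segment-{index}"


timings: Dict[str, float] = {}

with timed("call_only", timings):
    segments = fake_transcribe(3)  # returns immediately; nothing has been decoded yet

with timed("call_and_consume", timings):
    text = " ".join(fake_transcribe(3))  # iterating forces the actual work

print(timings)
```

In this sketch `call_only` should come out close to zero while `call_and_consume` reflects the simulated decoding time.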

backend/voice_bot_controller.py ADDED

	@@ -0,0 +1,134 @@

+"""Voice Bot Controller - Session management for conversations"""
+from typing import Dict, Any
+from datetime import datetime
+import asyncio
+from rag.session_manager import get_session_manager
+from rag.cache_manager import get_cache_manager
+from rag.tts_generator import get_tts_generator
+from orchestration.langgraph_workflow import run_workflow
+class VoiceBotController:
+    def __init__(self):
+        self.session_mgr = get_session_manager()
+        self.cache_mgr = get_cache_manager()
+        self.tts_gen = get_tts_generator()
+        self.current_session = None
+        self.customer_id = None
+        self.conversation_history = []
+    async def start_session(self, customer_id: str) -> Dict[str, Any]:
+        self.customer_id = customer_id
+        self.current_session = self.session_mgr.create_session(customer_id)
+        self.conversation_history = []
+        greeting = "Hello! How can I help you today?"
+        audio_path = self.tts_gen.generate_greeting(customer_id)
+        return {
+            "session_id": self.current_session,
+            "greeting": greeting,
+            "audio_path": audio_path,
+            "status": "listening"
+        }
+    async def process_user_message(self, user_message: str) -> Dict[str, Any]:
+        if not self.current_session:
+            return {"error": "No active session"}
+        self.session_mgr.add_message(self.current_session, "user", user_message)
+        cached_response = self.cache_mgr.get(self.customer_id, user_message)
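+        if cached_response:
+            # The cache stores the raw workflow result, whose text lives under "response";
+            # fall back to "response_text" in case the cache manager normalizes keys.
+            response_text = cached_response.get("response") or cached_response.get("response_text", "")
+            intent = cached_response.get("intent", {}).get("intent", "")
+            sentiment = cached_response.get("sentiment", {}).get("label", "")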
+        else:
+            try:
+                result = await run_workflow(user_message, self.customer_id)
+                response_text = result.get("response", "")
+                intent = result.get("intent", {}).get("intent", "")
+                sentiment = result.get("sentiment", {}).get("label", "")
+                self.cache_mgr.set(self.customer_id, user_message, result)
+            except Exception as e:
+                response_text = f"Error processing request: {str(e)}"
+                intent = "error"
+                sentiment = "NEGATIVE"
+        self.session_mgr.add_message(self.current_session, "assistant", response_text, intent=intent, sentiment=sentiment)
+        follow_up = self._generate_follow_up(intent, sentiment)
+        should_continue = self._should_continue(intent, sentiment)
+        audio_path = self.tts_gen.generate_audio(response_text, self.customer_id, self.current_session)
+        return {
+            "response": response_text,
+            "intent": intent,
+            "sentiment": sentiment,
+            "follow_up": follow_up,
+            "audio_path": audio_path,
+            "status": "listening" if should_continue else "done",
+            "session_id": self.current_session
+        }
+    def _generate_follow_up(self, intent: str, sentiment: str) -> str:
+        """Generate context-aware follow-up question"""
+        follow_ups = {
+            "refund_request": "Would you like assistance with starting a return?",
+            "product_inquiry": "Do you need more details about this product?",
+            "billing_issue": "Can I help you further with your billing concern?",
+            "warranty_claim": "Would you like to proceed with the warranty claim?",
+            "order_status": "Is there anything else about your order?",
+            "complaint": "How can I make this right for you?",
+            "general_support": "Is there anything else I can help you with?"
+        }
+        # Choose follow-up based on intent
+        if intent in follow_ups:
+            return follow_ups[intent]
+        # Default follow-ups based on sentiment
+        if sentiment == "NEGATIVE":
+            return "I apologize for the inconvenience. Is there anything else I can help resolve?"
+        elif sentiment == "POSITIVE":
+            return "Great! Is there anything else I can help you with today?"
+        else:
+            return "Is there anything else I can help you with?"
+    def _should_continue(self, intent: str, sentiment: str) -> bool:
+        """Determine if conversation should continue"""
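+        # Intended behavior: continue unless the user explicitly ends or the issue is resolved.
+        # TODO: end_indicators is not consulted yet; the user message is not passed to this method.
+        end_indicators = ["goodbye", "thanks", "that's it", "no thanks"]
+        # For now, always continue unless the workflow reported an error
+        return intent != "error"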
+    async def end_session(self) -> Dict[str, Any]:
+        if self.current_session:
+            session_id = self.current_session
+            self.session_mgr.close_session(session_id)
+            history = self.session_mgr.get_session_history(session_id)
+            # Clear the active session so later messages are rejected until start_session is called
+            self.current_session = None
+            return {
+                "session_id": session_id,
+                "status": "closed",
+                "message_count": len(history),
+                "farewell": "Thank you for contacting us. Goodbye!"
+            }
+        return {"error": "No active session"}
+    def get_session_history(self) -> list:
+        if not self.current_session:
+            return []
+        return self.session_mgr.get_session_history(self.current_session)
+# Global controller instance
+_voice_bot_controller = None
+def get_voice_bot_controller() -> VoiceBotController:
+    """Get or create global voice bot controller"""
+    global _voice_bot_controller
+    if _voice_bot_controller is None:
+        _voice_bot_controller = VoiceBotController()
+    return _voice_bot_controller
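
In `_should_continue` above, the `end_indicators` list is defined but never consulted, so the conversation only stops on an `error` intent. A minimal sketch of how the list could be used follows. It assumes the raw user text is available at that point, which the original signature does not provide; `should_continue` and `END_INDICATORS` are hypothetical names, not part of the commit.

```python
# Hypothetical standalone variant of _should_continue; names are illustrative.
END_INDICATORS = ("goodbye", "thanks", "that's it", "no thanks")


def should_continue(user_text: str, intent: str) -> bool:
    """Return False when the turn errored or the user signals they are done."""
    if intent == "error":
        return False
    normalized = user_text.lower()
    # Plain substring matching: simple, but "thanks, one more question" would also end the session.
    return not any(phrase in normalized for phrase in END_INDICATORS)
```

Substring matching is deliberately naive here. A real check would probably need to tell a closing "thanks" apart from a polite one in the middle of a request.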

data/latency_results.json ADDED

	@@ -0,0 +1,142 @@

+[
+  {
+    "timestamp": "2026-06-02T20:29:02.891174",
+    "total_time_ms": 38587.39,
+    "modules": {
+      "sentiment_analysis": 0.0,
+      "entity_extraction": 1627.82,
+      "intent_detection": 881.28,
+      "retrieval_router": 10918.91,
+      "context_builder": 0.0,
+      "response_generation": 1045.21,
+      "validation": 1.73,
+      "memory_persistence": 743.59,
+      "tts_generation": 23313.93,
+      "workflow_orchestration": 38587.39
+    },
+    "breakdown_percent": {
+      "sentiment_analysis": 0.0,
+      "entity_extraction": 2.1,
+      "intent_detection": 1.1,
+      "retrieval_router": 14.2,
+      "context_builder": 0.0,
+      "response_generation": 1.4,
+      "validation": 0.0,
+      "memory_persistence": 1.0,
+      "tts_generation": 30.2,
+      "workflow_orchestration": 50.0
+    }
+  },
+  {
+    "timestamp": "2026-06-02T20:32:27.270235",
+    "total_time_ms": 18292.14,
+    "modules": {
+      "sentiment_analysis": 0.0,
+      "entity_extraction": 1851.01,
+      "intent_detection": 879.92,
+      "retrieval_router": 11678.87,
+      "context_builder": 0.0,
+      "response_generation": 935.09,
+      "validation": 0.0,
+      "memory_persistence": 518.29,
+      "tts_generation": 2400.3,
+      "workflow_orchestration": 18292.14
+    },
+    "breakdown_percent": {
+      "sentiment_analysis": 0.0,
+      "entity_extraction": 5.1,
+      "intent_detection": 2.4,
+      "retrieval_router": 31.9,
+      "context_builder": 0.0,
+      "response_generation": 2.6,
+      "validation": 0.0,
+      "memory_persistence": 1.4,
+      "tts_generation": 6.6,
+      "workflow_orchestration": 50.0
+    }
+  },
+  {
+    "timestamp": "2026-06-02T20:33:09.830661",
+    "total_time_ms": 6769.27,
+    "modules": {
+      "sentiment_analysis": 0.0,
+      "entity_extraction": 489.84,
+      "intent_detection": 670.14,
+      "retrieval_router": 2088.21,
+      "context_builder": 0.0,
+      "response_generation": 850.0,
+      "validation": 0.0,
+      "memory_persistence": 602.15,
+      "tts_generation": 2051.77,
+      "workflow_orchestration": 6769.27
+    },
+    "breakdown_percent": {
+      "sentiment_analysis": 0.0,
+      "entity_extraction": 3.6,
+      "intent_detection": 5.0,
+      "retrieval_router": 15.4,
+      "context_builder": 0.0,
+      "response_generation": 6.3,
+      "validation": 0.0,
+      "memory_persistence": 4.5,
+      "tts_generation": 15.2,
+      "workflow_orchestration": 50.1
+    }
+  },
+  {
+    "timestamp": "2026-06-02T20:48:50.209913",
+    "total_time_ms": 7611.41,
+    "modules": {
+      "sentiment_analysis": 0.0,
+      "entity_extraction": 521.71,
+      "intent_detection": 869.67,
+      "retrieval_router": 1815.81,
+      "context_builder": 0.0,
+      "response_generation": 881.13,
+      "validation": 0.62,
+      "memory_persistence": 569.26,
+      "tts_generation": 2904.53,
+      "workflow_orchestration": 7611.41
+    },
+    "breakdown_percent": {
+      "sentiment_analysis": 0.0,
+      "entity_extraction": 3.4,
+      "intent_detection": 5.7,
+      "retrieval_router": 12.0,
+      "context_builder": 0.0,
+      "response_generation": 5.8,
+      "validation": 0.0,
+      "memory_persistence": 3.8,
+      "tts_generation": 19.1,
+      "workflow_orchestration": 50.2
+    }
+  },
+  {
+    "timestamp": "2026-06-02T20:50:03.048163",
+    "total_time_ms": 3904.49,
+    "modules": {
+      "sentiment_analysis": 0.0,
+      "entity_extraction": 451.61,
+      "intent_detection": 712.09,
+      "retrieval_router": 296.21,
+      "context_builder": 0.0,
+      "response_generation": 682.71,
+      "validation": 0.0,
+      "memory_persistence": 456.91,
+      "tts_generation": 1295.47,
+      "workflow_orchestration": 3904.49
+    },
+    "breakdown_percent": {
+      "sentiment_analysis": 0.0,
+      "entity_extraction": 5.8,
+      "intent_detection": 9.1,
+      "retrieval_router": 3.8,
+      "context_builder": 0.0,
+      "response_generation": 8.8,
+      "validation": 0.0,
+      "memory_persistence": 5.9,
+      "tts_generation": 16.6,
+      "workflow_orchestration": 50.1
+    }
+  }
+]
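
In the results above, `workflow_orchestration` equals `total_time_ms` in every run and is always reported as about 50%, while the other modules' percentages come out at roughly half of their share of the total. This is consistent with the percentages being computed against a sum that includes the wrapper timing itself, although the tracker code is not shown here. A minimal sketch of a breakdown that treats the wrapper as the denominator instead is given below; `breakdown_percent` and its `wrapper_key` parameter are assumptions for illustration, not the API of `latency_tracker.py`.

```python
from typing import Dict


def breakdown_percent(modules: Dict[str, float],
                      wrapper_key: str = "workflow_orchestration") -> Dict[str, float]:
    """Percent of total time per module, excluding the wrapper from the parts."""
    parts = {name: ms for name, ms in modules.items() if name != wrapper_key}
    # Use the wrapper timing as the total when present, else the sum of the parts.
    total = modules.get(wrapper_key) or sum(parts.values())
    if total <= 0:
        return {name: 0.0 for name in parts}
    return {name: round(100.0 * ms / total, 1) for name, ms in parts.items()}
```

With this denominator the per-module shares sum to at most 100%, and any remainder is time spent outside the instrumented modules.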

data/load_sample_data.py ADDED

	@@ -0,0 +1,201 @@

+"""
+Step 13: Load Sample Data into Qdrant
+Creates sample documents for knowledge base and customer history
+"""
+import sys
+from pathlib import Path
+# Add project root to path
+project_root = Path(__file__).parent.parent
+sys.path.insert(0, str(project_root))
+import asyncio
+import json
+import logging
+from datetime import datetime
+logging.basicConfig(level=logging.INFO)
+logger = logging.getLogger(__name__)
+print("\n" + "="*80)
+print("📚 LOADING SAMPLE DATA INTO QDRANT")
+print("="*80)
+# ============================================================================
+# SAMPLE DATA
+# ============================================================================
+# Knowledge Base Documents (Company Policies, FAQs)
+KB_DOCUMENTS = [
+    {
+        "id": "kb_001",
+        "title": "Return Policy",
+        "content": """
+        Return Policy: Customers can return unopened products within 30 days of purchase
+        for a full refund. Items must be in original condition with all packaging and accessories.
+        Refunds are processed within 5-7 business days. Shipping costs are non-refundable unless
+        the return is due to our error. For defective items, we offer replacements immediately.
+        """
+    },
+    {
+        "id": "kb_002",
+        "title": "Shipping Information",
+        "content": """
+        Shipping Options: We offer standard shipping (5-7 days), express shipping (2-3 days),
+        and overnight shipping. Standard shipping is free for orders over $50. Tracking
+        information is provided via email. All orders are insured. We ship to most countries
+        worldwide. International orders may have customs delays.
+        """
+    },
+    {
+        "id": "kb_003",
+        "title": "Product Warranty",
+        "content": """
+        Warranty Coverage: All electronics come with a 1-year manufacturer's warranty covering
+        defects in materials and workmanship. Warranty does not cover physical damage, water damage,
+        or normal wear. Warranty service is available through our support team or authorized
+        service centers. Extended warranty options are available for 2 or 3 years.
+        """
+    },
+    {
+        "id": "kb_004",
+        "title": "Account Management",
+        "content": """
+        Account Features: Create an account to track orders, save preferences, and manage
+        payment methods. Password requirements: minimum 8 characters with upper/lowercase,
+        numbers, and symbols. Two-factor authentication available for security. Account
+        information can be updated anytime in settings. Contact support to delete account.
+        """
+    }
+]
+# Customer History Records (Previous Interactions)
+CUSTOMER_HISTORY = [
+    {
+        "customer_id": "CUST_001",
+        "interaction_type": "complaint",
+        "text": "Customer complained about slow shipping on previous order. Resolution: expedited reshipment provided."
+    },
+    {
+        "customer_id": "CUST_001",
+        "interaction_type": "purchase",
+        "text": "Purchased laptop model XPS-15 on 2025-11-20. Status: delivered. Customer satisfied."
+    },
+    {
+        "customer_id": "CUST_002",
+        "interaction_type": "inquiry",
+        "text": "Asked about warranty coverage for defective phone. Explained 1-year coverage policy. Customer satisfied."
+    },
+    {
+        "customer_id": "CUST_002",
+        "interaction_type": "refund_request",
+        "text": "Requested refund for unopened tablet within 30-day window. Refund approved and processed."
+    }
+]
+# ============================================================================
+# LOAD DATA
+# ============================================================================
+async def load_sample_data():
+    """Load sample data into Qdrant"""
+    try:
+        from rag.qdrant_manager import qdrant_manager
+        from rag.embedding_manager import embedding_manager
+        print("\n[1] Initializing managers...")
+        print(f"    ✅ Qdrant Manager: {qdrant_manager}")
+        print(f"    ✅ Embedding Manager: {embedding_manager}")
+        # ========== LOAD KNOWLEDGE BASE ==========
+        print("\n[2] Loading Knowledge Base documents...")
+        print(f"    Documents to load: {len(KB_DOCUMENTS)}")
+        for doc in KB_DOCUMENTS:
+            try:
+                # Create document object for Qdrant
+                text = f"Title: {doc['title']}\n\n{doc['content']}"
+                logger.info(f"Adding KB doc: {doc['id']}")
+                # Add to knowledge base using qdrant_manager
+                qdrant_manager.add_to_kb(
+                    documents=[{
+                        "id": doc['id'],
+                        "text": text,
+                        "title": doc['title']
+                    }]
+                )
+                print(f"    ✅ {doc['title']} (ID: {doc['id']})")
+            except Exception as e:
+                print(f"    ❌ Error loading {doc['id']}: {str(e)}")
+        print("    ✅ Knowledge Base loaded")
+        # ========== LOAD CUSTOMER HISTORY ==========
+        print("\n[3] Loading Customer History...")
+        print(f"    Records to load: {len(CUSTOMER_HISTORY)}")
+        for record in CUSTOMER_HISTORY:
+            try:
+                logger.info(f"Adding history for {record['customer_id']}")
+                # Add to customer history using qdrant_manager
+                qdrant_manager.add_to_history(
+                    customer_id=record['customer_id'],
+                    text=record['text'],
+                    interaction_type=record['interaction_type']
+                )
+                print(f"    ✅ {record['customer_id']}: {record['interaction_type']} ({len(record['text'])} chars)")
+            except Exception as e:
+                print(f"    ❌ Error loading history for {record['customer_id']}: {str(e)}")
+        print("    ✅ Customer History loaded")
+        # ========== VERIFY DATA ==========
+        print("\n[4] Verifying loaded data...")
+        try:
+            kb_info = qdrant_manager.get_collection_info("knowledge_base")
+            print(f"    ✅ Knowledge Base:")
+            print(f"       - Name: knowledge_base")
+            print(f"       - Vector size: {kb_info.get('vector_size', 'N/A')}")
+            print(f"       - Points count: {kb_info.get('points_count', 'N/A')}")
+            hist_info = qdrant_manager.get_collection_info("customer_history")
+            print(f"    ✅ Customer History:")
+            print(f"       - Name: customer_history")
+            print(f"       - Vector size: {hist_info.get('vector_size', 'N/A')}")
+            print(f"       - Points count: {hist_info.get('points_count', 'N/A')}")
+        except Exception as e:
+            print(f"    ⚠️  Could not verify: {str(e)}")
+        print("\n" + "="*80)
+        print("✅ SAMPLE DATA LOADED SUCCESSFULLY")
+        print("="*80)
+        print("\n📊 Summary:")
+        print(f"   • Knowledge Base: {len(KB_DOCUMENTS)} documents loaded")
+        print(f"   • Customer History: {len(CUSTOMER_HISTORY)} records loaded")
+        print("\n🎯 Data is now available for:")
+        print("   • KB Search (retrieval_router node)")
+        print("   • Customer History Context (conditional retrieval)")
+        print("   • Personalized responses based on customer history")
+        print("="*80 + "\n")
+        return True
+    except Exception as e:
+        print(f"\n❌ ERROR: {str(e)}")
+        import traceback
+        traceback.print_exc()
+        return False
+if __name__ == "__main__":
+    print("\n🚀 Starting sample data loader...\n")
+    success = asyncio.run(load_sample_data())
+    sys.exit(0 if success else 1)
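
The loader above passes string IDs such as `kb_001` to `qdrant_manager.add_to_kb`. Qdrant itself accepts only unsigned integers or UUIDs as point IDs, so somewhere a mapping is needed; how `qdrant_manager` handles this is not shown in this part of the diff. One way to get stable IDs, so that rerunning the loader overwrites points rather than duplicating them, is a name-based UUID. The sketch below uses hypothetical names (`KB_NAMESPACE`, `point_id_for`) and a made-up namespace string.

```python
import uuid

# Hypothetical namespace; any fixed UUID works as long as it never changes.
KB_NAMESPACE = uuid.uuid5(uuid.NAMESPACE_DNS, "voice-bot-rag.example")


def point_id_for(doc_id: str) -> str:
    """Map a human-readable document ID to a deterministic UUID string."""
    return str(uuid.uuid5(KB_NAMESPACE, doc_id))
```

Because `uuid5` is deterministic, `point_id_for("kb_001")` yields the same value on every run, which makes repeated loads idempotent when the store upserts by ID.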

data/sessions.db ADDED

Binary file (32.8 kB).

data/test_latency.json ADDED

	@@ -0,0 +1,22 @@

+[
+  {
+    "timestamp": "2026-06-02T20:03:43.290341",
+    "total_time_ms": 51.15,
+    "modules": {
+      "test_module": 51.15
+    },
+    "breakdown_percent": {
+      "test_module": 100.0
+    }
+  },
+  {
+    "timestamp": "2026-06-02T20:04:07.232259",
+    "total_time_ms": 50.66,
+    "modules": {
+      "test_module": 50.66
+    },
+    "breakdown_percent": {
+      "test_module": 100.0
+    }
+  }
+]

frontend/__pycache__/streamlit_app.cpython-311.pyc ADDED

Binary file (32.2 kB).

frontend/streamlit_app.py ADDED

	@@ -0,0 +1,739 @@

+"""
+Streamlit Frontend - Voice RAG Bot
+Interactive UI for audio input, processing, and response playback
+"""
+import streamlit as st
+import requests
+import json
+import os
+import time
+import base64
+from pathlib import Path
+from datetime import datetime
+from typing import Optional, Dict, Any
+# Page configuration
+st.set_page_config(
+    page_title="Voice RAG Bot",
+    page_icon="🤖",
+    layout="wide",
+    initial_sidebar_state="expanded"
+)
+# Styling
+st.markdown("""
+<style>
+    .main {
+        padding: 0rem 1rem;
+    }
+    .stTabs [data-baseweb="tab-list"] button [data-testid="stMarkdownContainer"] p {
+        font-size: 1.1rem;
+        font-weight: 500;
+    }
+    .success-box {
+        padding: 1rem;
+        border-radius: 0.5rem;
+        background-color: #d4edda;
+        border: 1px solid #c3e6cb;
+        color: #155724;
+    }
+    .error-box {
+        padding: 1rem;
+        border-radius: 0.5rem;
+        background-color: #f8d7da;
+        border: 1px solid #f5c6cb;
+        color: #721c24;
+    }
+    .info-box {
+        padding: 1rem;
+        border-radius: 0.5rem;
+        background-color: #d1ecf1;
+        border: 1px solid #bee5eb;
+        color: #0c5460;
+    }
+</style>
+""", unsafe_allow_html=True)
+# ============================================================================
+# CONFIGURATION
+# ============================================================================
+BACKEND_URL = os.getenv("BACKEND_URL", "http://localhost:8000")
+DATA_DIR = Path("data/audio_output")
+DATA_DIR.mkdir(parents=True, exist_ok=True)
+# Session state initialization
+if "customer_id" not in st.session_state:
+    st.session_state.customer_id = "CUST_001"
+if "processing" not in st.session_state:
+    st.session_state.processing = False
+if "last_response" not in st.session_state:
+    st.session_state.last_response = None
+if "history" not in st.session_state:
+    st.session_state.history = []
+if "voice_bot_mode" not in st.session_state:
+    st.session_state.voice_bot_mode = False
+if "voice_bot_session" not in st.session_state:
+    st.session_state.voice_bot_session = None
+if "voice_bot_active" not in st.session_state:
+    st.session_state.voice_bot_active = False
+if "voice_bot_messages" not in st.session_state:
+    st.session_state.voice_bot_messages = []
+if "pending_audio" not in st.session_state:
+    st.session_state.pending_audio = None
+if "processing_audio" not in st.session_state:
+    st.session_state.processing_audio = False
+if "last_processed_audio_id" not in st.session_state:
+    st.session_state.last_processed_audio_id = None
+# ============================================================================
+# UTILITY FUNCTIONS
+# ============================================================================
+def check_backend_health() -> bool:
+    """Check if FastAPI backend is running"""
+    try:
+        response = requests.get(f"{BACKEND_URL}/health", timeout=5)
+        return response.status_code == 200
+    except Exception:
+        # Any failure (connection refused, timeout, bad URL) counts as unhealthy
+        return False
+def process_audio_file(audio_bytes: bytes, customer_id: str) -> Optional[Dict[str, Any]]:
+    """Send audio to backend for processing"""
+    try:
+        from io import BytesIO
+        # Send audio bytes directly as file to backend
+        with st.spinner("Processing audio... (may take 30-60 seconds)"):
+            # Create file-like object from bytes
+            audio_file = BytesIO(audio_bytes)
+            audio_file.name = f"audio_{datetime.now().strftime('%Y%m%d_%H%M%S')}.wav"
+            files = {"file": (audio_file.name, audio_file, "audio/wav")}
+            response = requests.post(
+                f"{BACKEND_URL}/process-audio",
+                files=files,
+                params={"customer_id": customer_id},
+                timeout=120
+            )
+        if response.status_code == 200:
+            result = response.json()
+            return result
+        else:
+            st.error(f"Backend error: {response.status_code}")
+            st.error(response.text)
+            return None
+    except requests.exceptions.Timeout:
+        st.error("Request timeout. Processing took too long.")
+        return None
+    except Exception as e:
+        st.error(f"Error processing audio: {str(e)}")
+        import traceback
+        st.error(traceback.format_exc())
+        return None
+def process_text_input(user_input: str, customer_id: str) -> Optional[Dict[str, Any]]:
+    """Send text to backend for processing"""
+    try:
+        with st.spinner("Processing text... (may take 20-30 seconds)"):
+            response = requests.post(
+                f"{BACKEND_URL}/process-text",
+                params={
+                    "user_input": user_input,
+                    "customer_id": customer_id
+                },
+                timeout=120
+            )
+        if response.status_code == 200:
+            return response.json()
+        else:
+            st.error(f"Backend error: {response.status_code}")
+            st.error(response.text)
+            return None
+    except requests.exceptions.Timeout:
+        st.error("Request timeout. Processing took too long.")
+        return None
+    except Exception as e:
+        st.error(f"Error processing text: {str(e)}")
+        return None
+def voice_bot_start(customer_id: str) -> Optional[Dict[str, Any]]:
+    """Start voice bot session"""
+    try:
+        response = requests.post(
+            f"{BACKEND_URL}/voice-bot/start",
+            params={"customer_id": customer_id},
+            timeout=60
+        )
+        if response.status_code == 200:
+            return response.json()
+        else:
+            st.error(f"Error starting voice bot: {response.status_code}")
+            return None
+    except Exception as e:
+        st.error(f"Error starting voice bot: {str(e)}")
+        return None
+def voice_bot_process_message(user_message: str) -> Optional[Dict[str, Any]]:
+    """Process message in voice bot session"""
+    try:
+        response = requests.post(
+            f"{BACKEND_URL}/voice-bot/message",
+            params={"user_message": user_message},
+            timeout=120
+        )
+        if response.status_code == 200:
+            return response.json()
+        else:
+            st.error(f"Backend error {response.status_code}: {response.text}")
+            return None
+    except Exception as e:
+        st.error(f"Backend connection error: {str(e)}")
+        return None
+def voice_bot_end() -> Optional[Dict[str, Any]]:
+    """End voice bot session"""
+    try:
+        response = requests.post(
+            f"{BACKEND_URL}/voice-bot/end",
+            timeout=10
+        )
+        if response.status_code == 200:
+            return response.json()
+        else:
+            return None
+    except Exception as e:
+        return None
+def display_response_results(response: Dict[str, Any]):
+    """Display formatted response from backend"""
+    # Display latency metrics first if available
+    latency_metrics = response.get("latency_metrics")
+    if latency_metrics:
+        st.markdown("### ⏱️ Performance Metrics")
+        total_time = latency_metrics.get("total_time_ms", 0)
+        modules = latency_metrics.get("modules", {})
+        breakdown = latency_metrics.get("breakdown_percent", {})
+        # Display total time prominently
+        col1, col2, col3 = st.columns(3)
+        with col1:
+            st.metric("Total Processing Time", f"{total_time:.0f} ms", f"{total_time/1000:.2f}s")
+        with col2:
+            fastest = min(modules.items(), key=lambda x: x[1]) if modules else ("N/A", 0)
+            st.metric("Fastest Module", fastest[0].replace("_", " ").title(), f"{fastest[1]:.0f} ms")
+        with col3:
+            slowest = max(modules.items(), key=lambda x: x[1]) if modules else ("N/A", 0)
+            st.metric("Slowest Module", slowest[0].replace("_", " ").title(), f"{slowest[1]:.0f} ms")
+        # Module breakdown with progress bars
+        with st.expander("📊 Detailed Module Breakdown", expanded=True):
+            st.markdown("#### Time per Module")
+            # Sort modules by time
+            sorted_modules = sorted(modules.items(), key=lambda x: x[1], reverse=True)
+            for module_name, time_ms in sorted_modules:
+                percent = breakdown.get(module_name, 0)
+                display_name = module_name.replace("_", " ").title()
+                col1, col2, col3 = st.columns([3, 1, 1])
+                with col1:
+                    st.write(f"**{display_name}**")
+                with col2:
+                    st.write(f"{time_ms:.2f} ms")
+                with col3:
+                    st.write(f"{percent:.1f}%")
+                # Progress bar
+                st.progress(percent / 100)
+        st.markdown("---")
+    # Create tabs for different result sections
+    tabs = st.tabs([
+        "📝 Response",
+        "🎯 Intent",
+        "😊 Sentiment",
+        "🏷️ Entities",
+        "📚 Knowledge Base",
+        "📜 History",
+        "🔊 Audio"
+    ])
+    # Tab 1: Main Response
+    with tabs[0]:
+        st.markdown("### Generated Response")
+        st.info(response.get("response_text", "No response generated"))
+        # Save to history
+        st.session_state.history.append({
+            "timestamp": datetime.now().isoformat(),
+            "customer_id": st.session_state.customer_id,
+            "response": response.get("response_text", ""),
+            "intent": response.get("intent", {}).get("intent", ""),
+            "sentiment": response.get("sentiment", {}).get("label", "")
+        })
+    # Tab 2: Intent Detection
+    with tabs[1]:
+        intent_data = response.get("intent", {})
+        col1, col2 = st.columns(2)
+        with col1:
+            st.metric("Detected Intent", intent_data.get("intent", "N/A"))
+        with col2:
+            confidence = intent_data.get("confidence", 0)
+            st.metric("Confidence", f"{confidence:.1%}")
+        # Intent explanation
+        intent_types = {
+            "refund_request": "Customer wants to return/refund a product",
+            "order_status": "Customer inquiring about order tracking",
+            "product_inquiry": "Customer asking product details",
+            "billing_issue": "Customer has billing/payment problems",
+            "warranty_claim": "Customer filing warranty claim",
+            "account_management": "Account settings/updates",
+            "general_support": "General support request",
+            "complaint": "Customer complaint",
+            "other": "Other inquiry"
+        }
+        intent = intent_data.get("intent", "")
+        if intent in intent_types:
+            st.write(f"**Category**: {intent_types[intent]}")
+    # Tab 3: Sentiment Analysis
+    with tabs[2]:
+        sentiment_data = response.get("sentiment", {})
+        label = sentiment_data.get("label", "NEUTRAL")
+        score = sentiment_data.get("score", 0)
+        # Color-coded sentiment display
+        if label == "POSITIVE":
+            color = "🟢"
+            tone = "Positive"
+        elif label == "NEGATIVE":
+            color = "🔴"
+            tone = "Negative"
+        else:
+            color = "🟡"
+            tone = "Neutral"
+        col1, col2 = st.columns(2)
+        with col1:
+            st.metric("Sentiment", f"{color} {tone}")
+        with col2:
+            st.metric("Confidence", f"{score:.1%}")
+        st.write(f"**Interpretation**: Response was generated with a {tone.lower()} tone")
+    # Tab 4: Entities
+    with tabs[3]:
+        entities = response.get("entities", {})
+        if entities:
+            for entity_type, values in entities.items():
+                if values:
+                    st.write(f"**{entity_type.upper()}**")
+                    for entity in values:
+                        st.write(f"  • {entity}")
+        else:
+            st.info("No entities extracted from input")
+    # Tab 5: Knowledge Base Context
+    with tabs[4]:
+        kb_context = response.get("kb_context", "")
+        if kb_context and isinstance(kb_context, str) and kb_context.strip() != "No relevant policies found.":
+            st.write("**Retrieved Documents:**")
+            st.write(kb_context)
+        else:
+            st.info("No KB documents retrieved")
+    # Tab 6: Customer History
+    with tabs[5]:
+        history_context = response.get("history_context", "")
+        if history_context and isinstance(history_context, str) and history_context.strip() != "No customer history available.":
+            st.write("**Customer History:**")
+            st.write(history_context)
+        else:
+            st.info("No customer history found")
+    # Tab 7: Audio Output
+    with tabs[6]:
+        audio_path = response.get("audio_path", "")
+        if audio_path and audio_path.strip():
+            try:
+                # Normalize path
+                audio_file_path = Path(audio_path.replace("\\", "/"))
+                if not audio_file_path.is_absolute():
+                    project_root = Path(__file__).parent.parent
+                    audio_file_path = project_root / audio_file_path
+                if audio_file_path.exists():
+                    st.write(f"**Audio file**: {audio_path}")
+                    with open(audio_file_path, "rb") as audio_file:
+                        st.audio(audio_file, format="audio/mp3")
+                else:
+                    st.warning(f"Audio file not found: {audio_file_path}")
+            except Exception as e:
+                st.error(f"Could not load audio file: {str(e)}")
+        else:
+            st.warning("No audio file generated")
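
The path handling in the Audio tab above (normalize backslashes, then resolve relative paths against the project root) is repeated later in the voice bot conversation view. A small helper could hold that logic in one place. This is a sketch only; `resolve_audio_path` is a hypothetical name and the commit does not define it.

```python
from pathlib import Path
from typing import Optional


def resolve_audio_path(audio_path: str, project_root: Path) -> Optional[Path]:
    """Normalize a backend-supplied audio path; return None for empty input."""
    if not audio_path or not audio_path.strip():
        return None
    # The backend may return Windows-style separators.
    candidate = Path(audio_path.strip().replace("\\", "/"))
    if not candidate.is_absolute():
        candidate = project_root / candidate
    return candidate
```

A caller would still check `exists()` on the result before opening it, as both call sites in the file already do.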
+# ============================================================================
+# MAIN UI LAYOUT
+# ============================================================================
+# Header
+st.title("🤖 Voice RAG Bot")
+st.markdown("AI Customer Support with Voice Recognition and Retrieval-Augmented Generation")
+# Sidebar
+with st.sidebar:
+    st.header("⚙️ Configuration")
+    # Backend status with refresh
+    col1, col2 = st.columns([3, 1])
+    with col1:
+        st.write("**Backend Status**")
+    with col2:
+        if st.button("🔄", help="Refresh status", key="refresh_health"):
+            st.rerun()
+    backend_healthy = check_backend_health()
+    if backend_healthy:
+        st.success("✅ Backend Connected")
+        st.caption(f"URL: {BACKEND_URL}")
+    else:
+        st.error("❌ Backend Not Connected")
+        st.error(f"Cannot reach {BACKEND_URL}")
+        st.info("**To fix:**")
+        st.code("python -m uvicorn backend.main:app --reload --port 8000", language="bash")
+        st.info("**Or use startup script:**")
+        st.code(".\\START_SYSTEM.ps1", language="powershell")
+    # Customer ID input
+    st.subheader("Customer Information")
+    customer_id = st.text_input(
+        "Customer ID",
+        value=st.session_state.customer_id,
+        help="Unique identifier for customer (used for history)"
+    )
+    st.session_state.customer_id = customer_id
+    st.divider()
+    # Model information
+    st.subheader("System Components")
+    st.write("**LLM**: Groq (gpt-oss-20b)")
+    st.write("**STT**: Faster Whisper (base)")
+    st.write("**Vector DB**: Qdrant")
+    st.write("**Embeddings**: BGE-M3 (1024-dim)")
+    st.write("**Sentiment**: DistilBERT")
+    st.write("**NER**: BERT-base-NER")
+# Main content
+st.divider()
+# Voice Bot Mode Toggle
+col1, col2, col3 = st.columns([1, 3, 1])
+with col1:
+    voice_bot_enabled = st.toggle("🤖 Voice Bot Mode", value=st.session_state.voice_bot_mode, key="voice_bot_toggle")
+    st.session_state.voice_bot_mode = voice_bot_enabled
+if voice_bot_enabled:
+    # Voice Bot Interface
+    st.markdown("### 🎙️ Voice Bot Assistant")
+    if not st.session_state.voice_bot_active:
+        # Start button
+        col1, col2, col3 = st.columns([1, 2, 1])
+        with col2:
+            if st.button("🎙️ Start Conversation", use_container_width=True, key="start_voice_bot"):
+                with st.spinner("Starting voice bot..."):
+                    result = voice_bot_start(st.session_state.customer_id)
+                    if result:
+                        st.session_state.voice_bot_session = result.get("session_id")
+                        st.session_state.voice_bot_active = True
+                        greeting_audio = result.get("audio_path", "")
+                        st.session_state.voice_bot_messages = [
+                            {
+                                "role": "assistant",
+                                "content": result.get("greeting"),
+                                "audio_path": greeting_audio
+                            }
+                        ]
+                        st.rerun()
+    else:
+        # Conversation display
+        st.markdown("#### Conversation")
+        # Display conversation history
+        for msg in st.session_state.voice_bot_messages:
+            if msg["role"] == "assistant":
+                with st.chat_message("assistant", avatar="🤖"):
+                    st.write(msg["content"])
+                    # Play audio if available
+                    audio_path = msg.get("audio_path", "")
+                    if audio_path and audio_path.strip():
+                        try:
+                            # Normalize path and check in project root
+                            audio_file_path = Path(audio_path.replace("\\", "/"))
+                            if not audio_file_path.is_absolute():
+                                project_root = Path(__file__).parent.parent
+                                audio_file_path = project_root / audio_file_path
+                            if audio_file_path.exists():
+                                with open(audio_file_path, "rb") as audio_file:
+                                    audio_bytes = audio_file.read()
+                                    audio_b64 = base64.b64encode(audio_bytes).decode()
+                                    st.markdown(f"""
+                                    <audio autoplay controls style="width: 100%;">
+                                        <source src="data:audio/mpeg;base64,{audio_b64}" type="audio/mpeg">
+                                    </audio>
+                                    """, unsafe_allow_html=True)
+                            else:
+                                st.caption(f"⚠️ Audio file not found: {audio_file_path}")
+                        except Exception as e:
+                            st.caption(f"⚠️ Error loading audio: {str(e)}")
+            else:
+                with st.chat_message("user", avatar="👤"):
+                    st.write(msg["content"])
+        # Voice conversation section
+        st.markdown("---")
+        st.markdown("#### 🎤 Record your message:")
+        # Voice input - Store audio in session state
+        audio_bytes = st.audio_input(
+            "Record your message",
+            label_visibility="collapsed",
+            key="voice_bot_audio_input"
+        )
+        # If new audio recorded, store it with unique ID
+        if audio_bytes:
+            # id() is not stable across Streamlit reruns; key on the recorded bytes instead
+            audio_id = hash(audio_bytes.getvalue())
+            if audio_id != st.session_state.last_processed_audio_id:
+                st.session_state.pending_audio = audio_bytes
+                st.session_state.last_processed_audio_id = audio_id
+                st.session_state.processing_audio = True
+        # Process pending audio (happens on next render after audio is saved)
+        if st.session_state.pending_audio and st.session_state.processing_audio:
+            # Clear the flag immediately so a rerun does not process the same audio twice
+            st.session_state.processing_audio = False
+            st.info("🎤 Processing audio...")
+            try:
+                from io import BytesIO
+                from faster_whisper import WhisperModel
+                # Convert UploadedFile to bytes if needed
+                audio_data = st.session_state.pending_audio
+                if hasattr(audio_data, 'read'):
+                    audio_data = audio_data.read()
+                st.info("Loading Whisper model...")
+                @st.cache_resource
+                def load_whisper():
+                    return WhisperModel("base", device="cpu", compute_type="int8")
+                whisper = load_whisper()
+                st.success("✅ Whisper model loaded")
+                st.info("Transcribing audio...")
+                audio_file = BytesIO(audio_data)
+                segments, info = whisper.transcribe(audio_file, language="en")
+                transcribed_text = " ".join([segment.text for segment in segments])
+                if transcribed_text.strip():
+                    st.success(f"✅ Transcribed: {transcribed_text}")
+                    # Add user message
+                    st.session_state.voice_bot_messages.append({
+                        "role": "user",
+                        "content": f"🎤 {transcribed_text}"
+                    })
+                    st.info("🤖 Sending to bot...")
+                    result = voice_bot_process_message(transcribed_text)
+                    if result:
+                        response = result.get("response", "")
+                        audio_path = result.get("audio_path", "")
+                        if response:
+                            st.success("✅ Bot responded")
+                            # Add ONLY ONE bot response
+                            st.session_state.voice_bot_messages.append({
+                                "role": "assistant",
+                                "content": response,
+                                "audio_path": audio_path
+                            })
+                            # Clear pending audio immediately
+                            st.session_state.pending_audio = None
+                            st.session_state.processing_audio = False
+                        else:
+                            st.error("❌ Bot response is empty")
+                            st.session_state.pending_audio = None
+                            st.session_state.processing_audio = False
+                    else:
+                        st.error("❌ Backend returned None")
+                        st.session_state.pending_audio = None
+                        st.session_state.processing_audio = False
+                else:
+                    st.warning("⚠️ No speech detected in audio")
+                    st.session_state.pending_audio = None
+                    st.session_state.processing_audio = False
+            except Exception as e:
+                st.error(f"❌ Error: {str(e)}")
+                st.session_state.pending_audio = None
+                st.session_state.processing_audio = False
+                import traceback
+                st.write(traceback.format_exc())
+        # End conversation button
+        st.markdown("---")
+        if st.button("🛑 End Conversation", use_container_width=True, key="end_voice_bot"):
+            with st.spinner("Ending session..."):
+                result = voice_bot_end()
+                st.session_state.voice_bot_active = False
+                st.session_state.voice_bot_messages = []
+                st.success("✅ Session ended. Thank you!")
+                st.rerun()
+else:
+    # Regular Input Tabs
+    st.markdown("### 💬 Manual Input Mode")
+    # Tabs for input methods
+    input_tab1, input_tab2 = st.tabs(["🎤 Audio Input", "📝 Text Input"])
+    with input_tab1:
+        st.subheader("Upload or Record Audio")
+        col1, col2 = st.columns(2)
+        with col1:
+            st.write("**Option 1: Record Audio**")
+            audio_data = st.audio_input(
+                "Record your message",
+                label_visibility="collapsed",
+                key="audio_input"
+            )
+            if audio_data:
+                st.success("Audio recorded successfully!")
+                if st.button("🔄 Process Audio", key="process_audio_btn"):
+                    response = process_audio_file(audio_data.getvalue(), st.session_state.customer_id)
+                    if response:
+                        st.session_state.last_response = response
+                        st.success("✅ Processing complete!")
+                        st.rerun()
+        with col2:
+            st.write("**Option 2: Upload Audio File**")
+            uploaded_file = st.file_uploader(
+                "Upload an MP3 or WAV file",
+                type=["mp3", "wav"],
+                label_visibility="collapsed"
+            )
+            if uploaded_file:
+                st.success(f"File uploaded: {uploaded_file.name}")
+                if st.button("🔄 Process Uploaded Audio", key="process_uploaded_btn"):
+                    response = process_audio_file(uploaded_file.getvalue(), st.session_state.customer_id)
+                    if response:
+                        st.session_state.last_response = response
+                        st.success("✅ Processing complete!")
+                        st.rerun()
+    with input_tab2:
+        st.subheader("Enter Text Directly")
+        # Text area for input
+        user_input = st.text_area(
+            "Enter your message",
+            placeholder="E.g., 'I want to return my defective laptop purchased last week'",
+            height=100,
+            label_visibility="collapsed"
+        )
+        if user_input:
+            col1, col2, col3 = st.columns([1, 1, 2])
+            with col1:
+                if st.button("🚀 Process Text", use_container_width=True):
+                    response = process_text_input(user_input, st.session_state.customer_id)
+                    if response:
+                        st.session_state.last_response = response
+                        st.success("✅ Processing complete!")
+                        st.rerun()
+            with col2:
+                if st.button("🔄 Clear", use_container_width=True):
+                    st.rerun()
+            with col3:
+                st.caption("ℹ️ Processing may take 20-30 seconds")
+# Display last response if available
+st.divider()
+if st.session_state.last_response:
+    st.subheader("📊 Latest Results")
+    display_response_results(st.session_state.last_response)
+# Conversation history
+st.divider()
+with st.expander("📜 Conversation History"):
+    if st.session_state.history:
+        for i, record in enumerate(st.session_state.history, 1):
+            with st.container(border=True):
+                col1, col2, col3, col4 = st.columns(4)
+                with col1:
+                    st.caption(f"Time: {record['timestamp'][:16]}")
+                with col2:
+                    st.caption(f"Customer: {record['customer_id']}")
+                with col3:
+                    st.caption(f"Intent: {record['intent']}")
+                with col4:
+                    st.caption(f"Sentiment: {record['sentiment']}")
+                st.write(record['response'][:150] + "..." if len(record['response']) > 150 else record['response'])
+    else:
+        st.info("No conversation history yet")
+# Footer
+st.divider()
+st.markdown("""
+---
+**Voice RAG Bot** | Powered by Groq LLM, Qdrant Vector DB, and LangGraph Orchestration
+For technical support, refer to the backend logs at `backend/main.py`
+""")
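Note on the recorded-audio deduplication in `frontend/streamlit_app.py`: the voice section decides whether a recording is new by comparing an identifier against `last_processed_audio_id`. A minimal sketch of a content-based identifier follows, assuming the raw bytes of the recording are available. The helper names `audio_fingerprint` and `is_new_recording` are hypothetical and do not exist in the repository.

```python
import hashlib
from typing import Optional, Tuple


def audio_fingerprint(data: bytes) -> str:
    """Return a stable hex digest of the recorded audio bytes."""
    return hashlib.sha256(data).hexdigest()


def is_new_recording(data: bytes, last_fingerprint: Optional[str]) -> Tuple[bool, str]:
    """Compare a recording against the previously processed fingerprint.

    Returns (is_new, fingerprint) so the caller can store the fingerprint
    (for example in st.session_state) once it decides to process the audio.
    """
    fingerprint = audio_fingerprint(data)
    return fingerprint != last_fingerprint, fingerprint
```

Because the digest depends only on the bytes, the same recording yields the same identifier on every rerun, while a new recording with different bytes yields a different one.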

orchestration/__init__.py ADDED Viewed

	@@ -0,0 +1 @@


+"""Voice RAG Bot Orchestration Package"""

orchestration/langgraph_workflow.py ADDED Viewed

	@@ -0,0 +1,119 @@

+"""LangGraph Workflow - 9-node orchestration pipeline"""
+from langgraph.graph import StateGraph, END, START
+from orchestration.state import ConversationState
+from typing import Any, Dict
+import logging
+from orchestration.latency_tracker import get_tracker, reset_tracker
+# Import all nodes
+from orchestration.nodes.sentiment_hybrid import sentiment_analysis_hybrid as sentiment_analysis_node
+from orchestration.nodes.entity_extraction import entity_extraction_node
+from orchestration.nodes.intent_detection import intent_detection_node
+from orchestration.nodes.retrieval_router import retrieval_router_node
+from orchestration.nodes.context_builder import context_builder_node
+from orchestration.nodes.response_generation import response_generation_node
+from orchestration.nodes.validation import validation_node
+from orchestration.nodes.memory_persistence import memory_persistence_node
+from orchestration.nodes.tts_generation import tts_generation_node
+logger = logging.getLogger(__name__)
+def build_workflow() -> StateGraph:
+    workflow = StateGraph(ConversationState)
+    workflow.add_node("sentiment_analysis", sentiment_analysis_node)
+    workflow.add_node("entity_extraction", entity_extraction_node)
+    workflow.add_node("intent_detection", intent_detection_node)
+    workflow.add_node("retrieval_router", retrieval_router_node)
+    workflow.add_node("context_builder", context_builder_node)
+    workflow.add_node("response_generation", response_generation_node)
+    workflow.add_node("validation", validation_node)
+    workflow.add_node("memory_persistence", memory_persistence_node)
+    workflow.add_node("tts_generation", tts_generation_node)
+    workflow.add_edge(START, "sentiment_analysis")
+    workflow.add_edge(START, "entity_extraction")
+    workflow.add_edge("sentiment_analysis", "intent_detection")
+    workflow.add_edge("entity_extraction", "intent_detection")
+    workflow.add_edge("intent_detection", "retrieval_router")
+    workflow.add_edge("retrieval_router", "context_builder")
+    workflow.add_edge("context_builder", "response_generation")
+    workflow.add_edge("response_generation", "validation")
+    def should_regenerate(state: ConversationState) -> str:
+        return "memory_persistence" if state.get("validation_passed", False) else "response_generation"
+    workflow.add_conditional_edges("validation", should_regenerate, {"memory_persistence": "memory_persistence", "response_generation": "response_generation"})
+    workflow.add_edge("memory_persistence", "tts_generation")
+    workflow.add_edge("tts_generation", END)
+    return workflow
+# Compile the workflow
+workflow = build_workflow()
+compiled_workflow = workflow.compile()
+async def run_workflow(user_input: str, customer_id: str) -> Dict[str, Any]:
+    """
+    Execute the complete workflow
+    Args:
+        user_input: Text from STT (user's speech converted to text)
+        customer_id: Unique customer identifier
+    Returns:
+        Complete state with response, audio path, and metadata
+    """
+    try:
+        # Reset and start tracking
+        reset_tracker()
+        tracker = get_tracker()
+        tracker.start_total()
+        tracker.start("workflow_orchestration")
+        # Initialize state
+        initial_state: ConversationState = {
+            "user_input": user_input,
+            "customer_id": customer_id,
+            "intent": {"intent": "unknown", "confidence": 0.0},
+            "sentiment": {"label": "NEUTRAL", "score": 0.5},
+            "entities": None,
+            "conversation_summary": "",
+            "kb_context": "",
+            "history_context": "",
+            "response": "",
+            "validation_passed": False,
+            "final_audio_path": None
+        }
+        # Run workflow
+        final_state = await compiled_workflow.ainvoke(initial_state)
+        tracker.end("workflow_orchestration")
+        # Save and print results
+        latency_results = tracker.save_to_file()
+        tracker.print_summary()
+        # Convert to regular dict and add latency info
+        result_dict = dict(final_state)
+        result_dict["latency_metrics"] = latency_results
+        logger.info(f"Total workflow time: {latency_results['total_time_ms']} ms")
+        return result_dict
+    except Exception as e:
+        logger.error(f"Workflow execution failed: {str(e)}")
+        logger.error(f"Error type: {type(e).__name__}")
+        import traceback
+        logger.error(traceback.format_exc())
+        raise
+def get_workflow_graph():
+    return compiled_workflow.get_graph().draw_mermaid()
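Note on the validation loop in `orchestration/langgraph_workflow.py`: `should_regenerate` routes back to `response_generation` whenever `validation_passed` is false, with no bound on the number of retries, so a response that never validates keeps cycling until the graph runtime stops the run. A minimal sketch of a capped router follows. The `regeneration_attempts` state field and the `MAX_REGENERATIONS` constant are assumptions for illustration and are not part of `ConversationState` in this commit.

```python
from typing import Any, Dict

# Hypothetical cap; the repository does not define a retry limit.
MAX_REGENERATIONS = 2


def route_after_validation(state: Dict[str, Any]) -> str:
    """Pick the next node name, giving up on regeneration after a fixed number of tries."""
    if state.get("validation_passed", False):
        return "memory_persistence"
    # "regeneration_attempts" is an assumed counter that the
    # response_generation node would increment on every run.
    if state.get("regeneration_attempts", 0) >= MAX_REGENERATIONS:
        return "memory_persistence"
    return "response_generation"
```

The function returns the same two node names used in the existing conditional-edge mapping, so it could be passed to `add_conditional_edges` in place of `should_regenerate` once the counter field exists.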

orchestration/latency_tracker.py ADDED Viewed

	@@ -0,0 +1,123 @@

+"""
+Latency Tracker - Track execution time for each module
+"""
+import time
+import json
+from pathlib import Path
+from typing import Dict, Any
+from datetime import datetime
+class LatencyTracker:
+    """Track latency for each processing module"""
+    def __init__(self):
+        self.timings: Dict[str, float] = {}
+        self.start_times: Dict[str, float] = {}
+        self.total_start = None
+    def start_total(self):
+        """Start tracking total execution time"""
+        self.total_start = time.time()
+    def start(self, module_name: str):
+        """Start timing a module"""
+        self.start_times[module_name] = time.time()
+    def end(self, module_name: str):
+        """End timing a module"""
+        if module_name in self.start_times:
+            elapsed = time.time() - self.start_times[module_name]
+            self.timings[module_name] = round(elapsed * 1000, 2)  # Convert to ms
+            del self.start_times[module_name]
+    def get_results(self) -> Dict[str, Any]:
+        """Get all timing results"""
+        total_time = round((time.time() - self.total_start) * 1000, 2) if self.total_start else 0
+        return {
+            "timestamp": datetime.now().isoformat(),
+            "total_time_ms": total_time,
+            "modules": self.timings,
+            "breakdown_percent": self._calculate_percentages()
+        }
+    def _calculate_percentages(self) -> Dict[str, float]:
+        """Calculate percentage of total time for each module"""
+        total = sum(self.timings.values())
+        if total == 0:
+            return {}
+        return {
+            module: round((time_ms / total) * 100, 1)
+            for module, time_ms in self.timings.items()
+        }
+    def save_to_file(self, filepath: str = "data/latency_results.json"):
+        """Save results to JSON file"""
+        results = self.get_results()
+        path = Path(filepath)
+        path.parent.mkdir(parents=True, exist_ok=True)
+        # Append to existing results
+        existing = []
+        if path.exists():
+            try:
+                with open(path, 'r') as f:
+                    loaded = json.load(f)
+                # Only a list is a valid history; any other shape starts fresh
+                existing = loaded if isinstance(loaded, list) else []
+            except (json.JSONDecodeError, OSError):
+                # Unreadable or corrupt file: start fresh rather than fail the request
+                existing = []
+        existing.append(results)
+        # Keep only last 100 results
+        if len(existing) > 100:
+            existing = existing[-100:]
+        with open(path, 'w') as f:
+            json.dump(existing, f, indent=2)
+        return results
+    def print_summary(self):
+        """Print formatted summary"""
+        results = self.get_results()
+        print("\n" + "="*60)
+        print("LATENCY TRACKING RESULTS")
+        print("="*60)
+        print(f"Total Time: {results['total_time_ms']} ms")
+        print("\nModule Breakdown:")
+        print("-"*60)
+        for module, time_ms in results['modules'].items():
+            percent = results['breakdown_percent'].get(module, 0)
+            bar = "#" * int(percent / 2)  # Visual bar
+            print(f"{module:25} {time_ms:8.2f} ms  {percent:5.1f}%  {bar}")
+        print("="*60 + "\n")
+# Global tracker instance
+_tracker = None
+def get_tracker() -> LatencyTracker:
+    """Get or create global tracker instance"""
+    global _tracker
+    if _tracker is None:
+        _tracker = LatencyTracker()
+    return _tracker
+def reset_tracker():
+    """Reset the global tracker"""
+    global _tracker
+    _tracker = LatencyTracker()
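Note on `orchestration/latency_tracker.py`: every node pairs `tracker.start(name)` with `tracker.end(name)` by hand, so each exit path has to remember the `end` call. A minimal sketch of a context-manager variant follows. `MiniTracker` and its `track` method are hypothetical stand-ins, not the repository's `LatencyTracker`, and the sketch uses `time.perf_counter()`, a monotonic clock intended for measuring intervals.

```python
import time
from contextlib import contextmanager
from typing import Dict, Iterator


class MiniTracker:
    """Stand-in tracker that records elapsed milliseconds per module."""

    def __init__(self) -> None:
        self.timings: Dict[str, float] = {}

    @contextmanager
    def track(self, module_name: str) -> Iterator[None]:
        start = time.perf_counter()
        try:
            yield
        finally:
            # Runs on normal exit and on exceptions alike.
            elapsed_ms = (time.perf_counter() - start) * 1000
            self.timings[module_name] = round(elapsed_ms, 2)
```

A node body would then sit inside `with tracker.track("entity_extraction"):` and the timing would be recorded whether the body returns or raises.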

orchestration/nodes/__init__.py ADDED Viewed

	@@ -0,0 +1 @@


+"""Voice RAG Bot Orchestration Nodes"""

orchestration/nodes/context_builder.py ADDED Viewed

	@@ -0,0 +1,60 @@

+"""Context formatter for LLM prompts"""
+from orchestration.state import ConversationState
+from typing import Dict, Any
+from orchestration.latency_tracker import get_tracker
+def context_builder_node(state: ConversationState) -> Dict[str, Any]:
+    """
+    Build complete context string from all available information
+    for LLM to generate response
+    Combines:
+    - User input
+    - Intent & sentiment
+    - KB context
+    - Customer history
+    - Conversation summary
+    Returns:
+        State update (no new fields added, just confirmation)
+    """
+    tracker = get_tracker()
+    tracker.start("context_builder")
+    # Extract components
+    sentiment_label = state['sentiment']['label']
+    sentiment_score = state['sentiment']['score']
+    intent = state['intent']['intent']
+    kb_context = state['kb_context']
+    history_context = state['history_context']
+    conversation_summary = state['conversation_summary']
+    entities = state.get('entities', {})
+    # Build prompt context (this will be used by response_generation node)
+    # We just validate all components exist, they'll be used by next node
+    # Prepare formatted context for logging/debugging
+    formatted_context = f"""
+=== UNIFIED CONTEXT ===
+User Intent: {intent}
+User Sentiment: {sentiment_label} (confidence: {sentiment_score:.2f})
+KB Context:
+{kb_context}
+Customer History:
+{history_context}
+Conversation Summary:
+{conversation_summary}
+Entities Detected:
+{entities}
+"""
+    tracker.end("context_builder")
+    # Return minimal state update - context is already in state
+    # Next node (response_generation) will use these state fields directly
+    return {}

orchestration/nodes/entity_extraction.py ADDED Viewed

	@@ -0,0 +1,58 @@

+"""Named entity extraction using BERT-NER"""
+from transformers import pipeline
+from orchestration.state import ConversationState
+from typing import Dict, Any
+from orchestration.latency_tracker import get_tracker
+# Global model cache
+_ner_model = None
+def get_ner_model():
+    """Load NER model once and cache"""
+    global _ner_model
+    if _ner_model is None:
+        _ner_model = pipeline(
+            "token-classification",
+            model="dslim/bert-base-NER",
+            aggregation_strategy="simple"
+        )
+    return _ner_model
+def entity_extraction_node(state: ConversationState) -> Dict[str, Any]:
+    """
+    Extract named entities from user input
+    Uses token classification model to identify entity types
+    Returns:
+        state update with entities field
+    """
+    tracker = get_tracker()
+    tracker.start("entity_extraction")
+    try:
+        ner_pipeline = get_ner_model()
+        # Extract entities
+        entities_raw = ner_pipeline(state['user_input'])
+        # Format entities as dict with types
+        entities_dict = {}
+        for entity in entities_raw:
+            entity_type = entity['entity_group']
+            if entity_type not in entities_dict:
+                entities_dict[entity_type] = []
+            entities_dict[entity_type].append(entity['word'])
+        # Return formatted entities
+        tracker.end("entity_extraction")
+        if entities_dict:
+            return {"entities": entities_dict}
+        else:
+            return {"entities": None}
+    except Exception as e:
+        tracker.end("entity_extraction")
+        return {"entities": None}
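Note on `orchestration/nodes/entity_extraction.py`: the node groups the pipeline output by `entity_group` and keeps every `word`, including repeats. A minimal sketch of the same grouping as a standalone function follows, with duplicate removal and an optional score cutoff added. The function name `group_entities` and the `min_score` parameter are assumptions, and the input is sample data shaped like the aggregated token-classification output (dicts with `entity_group`, `word`, and `score`).

```python
from collections import defaultdict
from typing import Any, Dict, List, Optional


def group_entities(
    entities_raw: List[Dict[str, Any]], min_score: float = 0.0
) -> Optional[Dict[str, List[str]]]:
    """Group entity words by type, skipping low-score and duplicate entries."""
    grouped: Dict[str, List[str]] = defaultdict(list)
    for entity in entities_raw:
        if entity.get("score", 1.0) < min_score:
            continue
        words = grouped[entity["entity_group"]]
        if entity["word"] not in words:
            words.append(entity["word"])
    # Mirror the node's convention: None when nothing was found.
    return dict(grouped) or None


sample = [
    {"entity_group": "ORG", "word": "Acme", "score": 0.99},
    {"entity_group": "ORG", "word": "Acme", "score": 0.98},
    {"entity_group": "LOC", "word": "Paris", "score": 0.40},
]
```

Keeping the grouping in a pure function makes it testable without loading the NER model.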

orchestration/nodes/intent_detection.py ADDED Viewed

	@@ -0,0 +1,61 @@

+"""Intent classification using Groq LLM"""
+from langchain_groq import ChatGroq
+from langchain_core.prompts import PromptTemplate
+from orchestration.state import ConversationState
+from typing import Dict, Any
+import json
+from backend.config import settings
+from orchestration.latency_tracker import get_tracker
+def intent_detection_node(state: ConversationState) -> Dict[str, Any]:
+    """
+    Detect user intent using Groq LLM
+    Returns:
+        state update with intent field:
+        {"intent": {"intent": "...", "confidence": float}}
+    """
+    tracker = get_tracker()
+    tracker.start("intent_detection")
+    # Initialize Groq LLM
+    llm = ChatGroq(
+        model=settings.groq_model,
+        temperature=0.3,  # Low temp for consistent intent detection
+        groq_api_key=settings.groq_api_key
+    )
+    # Prompt template for intent detection
+    intent_prompt = PromptTemplate(
+        input_variables=["user_input"],
+        template="""Analyze the user's input and determine their intent. Respond ONLY with JSON.
+User Input: {user_input}
+Possible intents: complaint, refund_request, inquiry, account_issue, escalation, billing, product_question, order_status, other
+Response format:
+{{
+  "intent": "<selected_intent>",
+  "confidence": <0.0-1.0>
+}}"""
+    )
+    # Generate intent using chain pattern
+    chain = intent_prompt | llm
+    response = chain.invoke({"user_input": state['user_input']})
+    try:
+        # Parse JSON response from LLM
+        intent_data = json.loads(response.content.strip())
+    except json.JSONDecodeError:
+        # Fallback if JSON parsing fails
+        intent_data = {
+            "intent": "other",
+            "confidence": 0.5
+        }
+    tracker.end("intent_detection")
+    return {"intent": intent_data}
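Note on `orchestration/nodes/intent_detection.py`: `json.loads(response.content.strip())` succeeds only when the model returns bare JSON, and the parsed object is passed on without checking that it contains an `intent` key, which later nodes read as `state['intent']['intent']`. A minimal sketch of a more defensive parser follows. The name `parse_intent` is hypothetical, and the allowed set is copied from the prompt in this file.

```python
import json
import re
from typing import Any, Dict

ALLOWED_INTENTS = {
    "complaint", "refund_request", "inquiry", "account_issue", "escalation",
    "billing", "product_question", "order_status", "other",
}
FALLBACK = {"intent": "other", "confidence": 0.5}


def parse_intent(raw: str) -> Dict[str, Any]:
    """Extract and validate the intent JSON from raw LLM output."""
    # Take the outermost {...} span so code fences or surrounding prose are ignored.
    match = re.search(r"\{.*\}", raw, flags=re.DOTALL)
    if match is None:
        return dict(FALLBACK)
    try:
        data = json.loads(match.group(0))
    except json.JSONDecodeError:
        return dict(FALLBACK)
    intent = data.get("intent") if isinstance(data, dict) else None
    if not isinstance(intent, str) or intent not in ALLOWED_INTENTS:
        return dict(FALLBACK)
    try:
        confidence = float(data.get("confidence", 0.5))
    except (TypeError, ValueError):
        confidence = 0.5
    # Clamp to the 0.0-1.0 range requested in the prompt.
    return {"intent": intent, "confidence": min(max(confidence, 0.0), 1.0)}
```

Every return path yields a dict with both keys, so downstream nodes can index `['intent']` safely.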

orchestration/nodes/memory_persistence.py ADDED Viewed

	@@ -0,0 +1,45 @@

+"""Conversation storage to Qdrant customer history"""
+from orchestration.state import ConversationState
+from rag.qdrant_manager import qdrant_manager
+from typing import Dict, Any
+from orchestration.latency_tracker import get_tracker
+def memory_persistence_node(state: ConversationState) -> Dict[str, Any]:
+    """
+    Store conversation turn to customer history collection in Qdrant
+    Enables historical context retrieval for repeat customers
+    Stores:
+    - Customer ID (for filtering)
+    - User input + response
+    - Intent classification
+    - Sentiment
+    - Timestamp
+    Returns:
+        state update (minimal, side-effect is primary)
+    """
+    tracker = get_tracker()
+    tracker.start("memory_persistence")
+    # Combine user input and response for storage
+    conversation_text = f"User: {state['user_input']}\nAssistant: {state['response']}"
+    # Determine interaction type from intent for categorization
+    intent = state['intent']['intent']
+    interaction_type = intent
+    # Store to Qdrant customer history
+    qdrant_manager.add_to_history(
+        customer_id=state['customer_id'],
+        text=conversation_text,
+        interaction_type=interaction_type
+    )
+    # Update conversation summary in memory (every 5 turns)
+    # For now, just store the current exchange
+    tracker.end("memory_persistence")
+    return {}

orchestration/nodes/response_generation.py ADDED Viewed

	@@ -0,0 +1,93 @@

+"""Response generation using Groq LLM"""
+from langchain_groq import ChatGroq
+from langchain_core.prompts import PromptTemplate
+from orchestration.state import ConversationState
+from typing import Dict, Any
+import logging
+from backend.config import settings
+from orchestration.latency_tracker import get_tracker
+logger = logging.getLogger(__name__)
+def response_generation_node(state: ConversationState) -> Dict[str, Any]:
+    """
+    Generate final response using Groq LLM
+    Incorporates KB context, customer history, intent, and sentiment
+    Returns:
+        state update with response field
+    """
+    tracker = get_tracker()
+    tracker.start("response_generation")
+    try:
+        logger.info("Response Generation: Initializing Groq LLM...")
+        # Initialize Groq LLM
+        llm = ChatGroq(
+            model=settings.groq_model,
+            temperature=settings.groq_temperature,
+            max_tokens=settings.groq_max_tokens,
+            groq_api_key=settings.groq_api_key
+        )
+        # Determine tone based on sentiment
+        sentiment_label = state['sentiment']['label']
+        tone_instruction = {
+            "POSITIVE": "Use a friendly, upbeat tone.",
+            "NEGATIVE": "Use an empathetic, understanding tone. Acknowledge frustration.",
+            "NEUTRAL": "Use a professional, helpful tone."
+        }.get(sentiment_label, "Use a professional tone.")
+        # Build response prompt
+        response_prompt = PromptTemplate(
+            input_variables=[
+                "user_input",
+                "intent",
+                "kb_context",
+                "history_context",
+                "tone_instruction"
+            ],
+            template="""You are a helpful customer service AI assistant.
+User Intent: {intent}
+{tone_instruction}
+Knowledge Base Context:
+{kb_context}
+Customer History:
+{history_context}
+User Message: {user_input}
+Provide a helpful, accurate response based on the context above. Keep response concise (2-3 sentences).
+If you don't have relevant information, say so clearly."""
+        )
+        # Generate response using chain pattern
+        logger.info("🤖 Response Generation: Invoking LLM chain...")
+        chain = response_prompt | llm
+        response = chain.invoke({
+            "user_input": state['user_input'],
+            "intent": state['intent']['intent'],
+            "kb_context": state['kb_context'],
+            "history_context": state['history_context'],
+            "tone_instruction": tone_instruction
+        })
+        response_text = response.content.strip()
+        logger.info(f"✅ Response generated: '{response_text[:80]}...'")
+        tracker.end("response_generation")
+        return {"response": response_text}
+    except Exception as e:
+        logger.error(f"❌ Response generation failed: {type(e).__name__}: {str(e)}")
+        import traceback
+        logger.error(traceback.format_exc())
+        tracker.end("response_generation")
+        # Return fallback response
+        return {"response": "I apologize, but I encountered an error processing your request. Please try again."}

orchestration/nodes/retrieval_router.py ADDED Viewed

	@@ -0,0 +1,51 @@

+"""Dual RAG routing - knowledge base + customer history"""
+from orchestration.state import ConversationState
+from rag.qdrant_manager import qdrant_manager
+from typing import Dict, Any
+from orchestration.latency_tracker import get_tracker
+def retrieval_router_node(state: ConversationState) -> Dict[str, Any]:
+    """
+    Dual RAG retrieval strategy:
+    1. Always search knowledge base for relevant policies/docs
+    2. For complaint, refund_request, escalation, and billing intents, or when sentiment is NEGATIVE, also search customer history
+    Returns:
+        state update with kb_context and history_context
+    """
+    tracker = get_tracker()
+    tracker.start("retrieval_router")
+    user_input = state['user_input']
+    customer_id = state['customer_id']
+    intent = state['intent']['intent']  # Intent classification result
+    # Always retrieve from knowledge base
+    kb_results = qdrant_manager.search_kb(user_input, limit=3)
+    kb_context = "\n".join([
+        f"- [{r['source']}] {r['text']} (relevance: {r['score']:.2f})"
+        for r in kb_results
+    ])
+    # Conditionally retrieve from customer history
+    history_context = ""
+    history_intents = ["complaint", "refund_request", "escalation", "billing_inquiry", "billing", "negative_sentiment"]
+    if intent in history_intents or state['sentiment']['label'] == "NEGATIVE":
+        history_results = qdrant_manager.search_history(
+            user_input,
+            customer_id,
+            limit=3
+        )
+        history_context = "\n".join([
+            f"- [{r['interaction_type']}] {r['text']} (relevance: {r['score']:.2f})"
+            for r in history_results
+        ])
+    tracker.end("retrieval_router")
+    return {
+        "kb_context": kb_context if kb_results else "No relevant policies found.",
+        "history_context": history_context if history_context else "No customer history available."
+    }
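Note on `orchestration/nodes/retrieval_router.py`: the decision to query customer history combines an intent allow-list with a sentiment check. A minimal sketch of that decision as a pure function follows. The name `needs_history` is hypothetical, and the set is limited to the allow-list entries that also appear in the intent prompt of `intent_detection.py`.

```python
from typing import FrozenSet

# Entries of the node's allow-list that the intent prompt can actually produce.
HISTORY_INTENTS: FrozenSet[str] = frozenset(
    {"complaint", "refund_request", "escalation", "billing"}
)


def needs_history(intent: str, sentiment_label: str) -> bool:
    """Return True when customer history should be searched as well as the KB."""
    return intent in HISTORY_INTENTS or sentiment_label == "NEGATIVE"
```

Separating the rule from the Qdrant calls lets the routing be checked without a running vector database.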

orchestration/nodes/sentiment_analysis.py ADDED Viewed

	@@ -0,0 +1,49 @@

+"""
+Sentiment Analysis Node - Emotion detection using DistilBERT
+Analyzes user input sentiment for tone-aware response generation
+"""
+from transformers import pipeline
+from orchestration.state import ConversationState
+from typing import Dict, Any
+# Global model cache
+_sentiment_model = None
+def get_sentiment_model():
+    """Load model once and cache"""
+    global _sentiment_model
+    if _sentiment_model is None:
+        _sentiment_model = pipeline(
+            "sentiment-analysis",
+            model="distilbert-base-uncased-finetuned-sst-2-english"
+        )
+    return _sentiment_model
+def sentiment_analysis_node(state: ConversationState) -> Dict[str, Any]:
+    """
+    Analyze sentiment of user input using DistilBERT
+    Returns:
+        state update with sentiment field populated:
+        {"sentiment": {"label": "POSITIVE|NEGATIVE", "score": float}}
+        (NEUTRAL is returned only by the error fallback below)
+    """
+    try:
+        # Use cached model
+        sentiment_pipeline = get_sentiment_model()
+        # Analyze ONLY user input
+        result = sentiment_pipeline(state['user_input'])[0]
+        sentiment = {
+            # The SST-2 model emits only POSITIVE or NEGATIVE; NEUTRAL comes solely from the error fallback below
+            "label": result['label'].upper(),
+            "score": result['score']
+        }
+        return {"sentiment": sentiment}
+    except Exception:
+        # Default to neutral if the model fails to load or run
+        return {"sentiment": {"label": "NEUTRAL", "score": 0.5}}
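
The SST-2 checkpoint used above is a binary classifier, so the pipeline itself returns only POSITIVE or NEGATIVE. One possible way to obtain a NEUTRAL label is to treat low-confidence predictions as neutral. The sketch below assumes a cutoff of 0.75, which is an illustrative value and not something the commit defines; `to_three_way` is a hypothetical helper:

```python
from typing import Dict, Union

# Hypothetical cutoff: predictions below this confidence are treated as NEUTRAL.
NEUTRAL_THRESHOLD = 0.75


def to_three_way(
    result: Dict[str, Union[str, float]],
    threshold: float = NEUTRAL_THRESHOLD,
) -> Dict[str, Union[str, float]]:
    """Map a binary pipeline result ({"label", "score"}) onto three labels."""
    label = str(result["label"]).upper()
    score = float(result["score"])
    if score < threshold:
        # Low confidence in either direction is reported as NEUTRAL.
        return {"label": "NEUTRAL", "score": score}
    return {"label": label, "score": score}
```

Whether a confidence cutoff is a good proxy for neutrality depends on the data, so the threshold would need tuning against real utterances.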

orchestration/nodes/sentiment_hybrid.py ADDED

	@@ -0,0 +1,133 @@

+"""Hybrid sentiment classifier - keyword-based + model fallback"""
+from orchestration.state import ConversationState
+from typing import Dict, Any
+from transformers import pipeline
+from orchestration.latency_tracker import get_tracker
+# Global model cache
+_sentiment_model = None
+def get_sentiment_model():
+    """Load model once and cache"""
+    global _sentiment_model
+    if _sentiment_model is None:
+        _sentiment_model = pipeline(
+            "sentiment-analysis",
+            model="distilbert-base-uncased-finetuned-sst-2-english"
+        )
+    return _sentiment_model
+def sentiment_analysis_hybrid(state: ConversationState) -> Dict[str, Any]:
+    """
+    Hybrid sentiment classification:
+    1. Check FAQ keywords → NEUTRAL
+    2. Check explicit sentiment words → POSITIVE/NEGATIVE
+    3. Fall back to DistilBERT model
+    Returns:
+        {"sentiment": {"label": "POSITIVE|NEGATIVE|NEUTRAL", "score": float, "reason": str}}
+    """
+    tracker = get_tracker()
+    tracker.start("sentiment_analysis")
+    user_input = state['user_input'].lower()
+    # Step 1: FAQ keywords → NEUTRAL unless a strong sentiment word is also present (domain-specific)
+    faq_keywords = [
+        "policy", "return", "warranty", "shipping", "account",
+        "details", "information", "how", "what", "when", "where",
+        "can i", "do you", "tell me", "help", "need", "about"
+    ]
+    if any(kw in user_input for kw in faq_keywords):
+        # Still check for strong sentiment words within FAQ
+        strong_negative = ["frustrated", "angry", "hate", "terrible", "broken", "worst", "useless"]
+        strong_positive = ["thank", "love", "great", "excellent", "amazing", "perfect"]
+        if any(word in user_input for word in strong_negative):
+            tracker.end("sentiment_analysis")
+            return {
+                "sentiment": {
+                    "label": "NEGATIVE",
+                    "score": 0.95,
+                    "reason": "Complaint with sentiment"
+                }
+            }
+        elif any(word in user_input for word in strong_positive):
+            tracker.end("sentiment_analysis")
+            return {
+                "sentiment": {
+                    "label": "POSITIVE",
+                    "score": 0.95,
+                    "reason": "Praise with sentiment"
+                }
+            }
+        else:
+            # Pure FAQ question = NEUTRAL
+            tracker.end("sentiment_analysis")
+            return {
+                "sentiment": {
+                    "label": "NEUTRAL",
+                    "score": 0.99,
+                    "reason": "FAQ inquiry"
+                }
+            }
+    # Step 2: Explicit strong sentiment words
+    strong_negative = [
+        "frustrated", "angry", "hate", "terrible", "broken",
+        "worst", "useless", "disaster", "awful", "horrible",
+        "unacceptable", "disgusted", "disappointed"
+    ]
+    strong_positive = [
+        "thank", "love", "great", "excellent", "amazing",
+        "perfect", "wonderful", "fantastic", "awesome", "impressed"
+    ]
+    if any(word in user_input for word in strong_negative):
+        tracker.end("sentiment_analysis")
+        return {
+            "sentiment": {
+                "label": "NEGATIVE",
+                "score": 0.95,
+                "reason": "Strong negative sentiment"
+            }
+        }
+    if any(word in user_input for word in strong_positive):
+        tracker.end("sentiment_analysis")
+        return {
+            "sentiment": {
+                "label": "POSITIVE",
+                "score": 0.95,
+                "reason": "Strong positive sentiment"
+            }
+        }
+    # Step 3: Fall back to DistilBERT model
+    try:
+        sentiment_pipeline = get_sentiment_model()
+        result = sentiment_pipeline(state['user_input'])[0]
+        tracker.end("sentiment_analysis")
+        return {
+            "sentiment": {
+                "label": result['label'].upper(),
+                "score": result['score'],
+                "reason": "Model inference"
+            }
+        }
+    except Exception:
+        # Default to neutral if the model fails to load or run
+        tracker.end("sentiment_analysis")
+        return {
+            "sentiment": {
+                "label": "NEUTRAL",
+                "score": 0.5,
+                "reason": "Error - defaulting to neutral"
+            }
+        }
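
The keyword checks above use substring containment, so a keyword also matches inside longer words: in Python, `"hate" in "whatever"` and `"how" in "show"` are both true. A minimal sketch of whole-word matching with `re`, assuming whole-word behavior is what is intended; `contains_keyword` is a hypothetical helper and not part of the commit:

```python
import re
from typing import Iterable


def contains_keyword(text: str, keywords: Iterable[str]) -> bool:
    """Return True if any keyword occurs in text as a whole word or phrase."""
    escaped = [re.escape(kw) for kw in keywords]
    if not escaped:
        return False
    # \b anchors the match at word boundaries; re.escape keeps phrases literal.
    pattern = r"\b(?:" + "|".join(escaped) + r")\b"
    return re.search(pattern, text.lower()) is not None
```

One trade-off: with whole-word matching, `"thank"` no longer matches "thanks", so the keyword lists would need the inflected forms added explicitly.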

orchestration/nodes/tts_generation.py ADDED

	@@ -0,0 +1,72 @@

+"""Text-to-speech generation using gTTS"""
+from gtts import gTTS
+from orchestration.state import ConversationState
+from typing import Dict, Any
+import os
+import logging
+from pathlib import Path
+from orchestration.latency_tracker import get_tracker
+logger = logging.getLogger(__name__)
+def tts_generation_node(state: ConversationState) -> Dict[str, Any]:
+    """
+    Convert response text to speech using gTTS
+    Saves audio file and returns path
+    Returns:
+        state update with final_audio_path field
+    """
+    tracker = get_tracker()
+    tracker.start("tts_generation")
+    response_text = state.get('response', '')
+    # Validate response text exists
+    if not response_text or len(response_text.strip()) == 0:
+        logger.warning("⚠️ TTS: No response text to convert to speech")
+        tracker.end("tts_generation")
+        return {"final_audio_path": None}
+    # Create output directory if it doesn't exist
+    audio_dir = Path("data/audio_output")
+    try:
+        audio_dir.mkdir(parents=True, exist_ok=True)
+    except Exception as e:
+        logger.error(f"❌ TTS: Failed to create audio directory: {e}")
+        tracker.end("tts_generation")
+        return {"final_audio_path": None}
+    # Generate unique filename
+    customer_id = state.get('customer_id', 'UNKNOWN')
+    import datetime
+    timestamp = datetime.datetime.now().strftime("%Y%m%d_%H%M%S_%f")[:19]
+    audio_filename = f"bot_response_{customer_id}_{timestamp}.mp3"
+    audio_path = audio_dir / audio_filename
+    try:
+        logger.info(f"📢 TTS: Generating audio for: '{response_text[:50]}...'")
+        # Generate TTS
+        tts = gTTS(text=response_text, lang='en', slow=False)
+        tts.save(str(audio_path))
+        # Verify file was created
+        if audio_path.exists():
+            file_size = audio_path.stat().st_size
+            logger.info(f"✅ TTS: Audio generated successfully ({file_size} bytes) -> {audio_path}")
+            final_audio_path = str(audio_path)
+        else:
+            logger.error(f"❌ TTS: save() completed but no file was found at {audio_path}")
+            final_audio_path = None
+    except Exception as e:
+        logger.error(f"❌ TTS generation failed: {type(e).__name__}: {str(e)}")
+        import traceback
+        logger.error(traceback.format_exc())
+        final_audio_path = None
+    tracker.end("tts_generation")
+    return {"final_audio_path": final_audio_path}
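
The filename timestamp above is built with `%f` (six microsecond digits) and then sliced to 19 characters, which keeps three of those digits, i.e. millisecond precision. A small sketch that makes this explicit; `build_audio_filename` is a hypothetical helper, and passing the time in as a parameter is an assumption made here so the result is reproducible:

```python
import datetime


def build_audio_filename(customer_id: str, now: datetime.datetime) -> str:
    """Build a per-response MP3 filename with a millisecond-precision timestamp."""
    # "%Y%m%d_%H%M%S_%f" renders 22 characters; the first 19 keep milliseconds.
    timestamp = now.strftime("%Y%m%d_%H%M%S_%f")[:19]
    return f"bot_response_{customer_id}_{timestamp}.mp3"
```

Two responses for the same customer within the same millisecond would still map to the same filename, so a random suffix may be worth adding if that case can occur.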

orchestration/nodes/validation.py ADDED

	@@ -0,0 +1,61 @@

+"""Response quality validation"""
+from orchestration.state import ConversationState
+from typing import Dict, Any
+import logging
+from orchestration.latency_tracker import get_tracker
+logger = logging.getLogger(__name__)
+def validation_node(state: ConversationState) -> Dict[str, Any]:
+    """
+    Validate generated response against quality criteria:
+    1. Length checks (not too short, not too long)
+    2. Tone-sentiment consistency
+    3. Basic sanity checks
+    Returns:
+        state update with validation_passed boolean
+    """
+    tracker = get_tracker()
+    tracker.start("validation")
+    response = state.get('response', '')
+    sentiment = state.get('sentiment', {}).get('label', 'NEUTRAL')
+    # Check 1: Response length (50 to 500 characters, inclusive)
+    response_length = len(response)
+    length_valid = 50 <= response_length <= 500
+    # Check 2: Tone-sentiment consistency
+    tone_checks = {
+        "NEGATIVE": {
+            "forbidden_words": ["happy", "excellent", "amazing"],
+            "required_sentiment": ["understand", "apologize", "help"]
+        },
+        "POSITIVE": {
+            "forbidden_words": ["sorry", "problem", "issue"],
+            "required_sentiment": ["great", "happy", "enjoy"]
+        }
+    }
+    response_lower = response.lower()
+    tone_valid = True
+    if sentiment in tone_checks:
+        checks = tone_checks[sentiment]
+        # Only forbidden words are checked; the required_sentiment lists are currently unused
+        forbidden_present = any(word in response_lower for word in checks.get("forbidden_words", []))
+        tone_valid = not forbidden_present
+    # Check 3: Response not empty
+    content_valid = len(response.strip()) > 0
+    # Overall validation
+    validation_passed = length_valid and content_valid and tone_valid
+    logger.info(f"✓ Validation: length={response_length} ({length_valid}), content={content_valid}, tone={tone_valid} -> {'PASS' if validation_passed else 'FAIL'}")
+    tracker.end("validation")
+    return {"validation_passed": validation_passed}
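
In the node above, `tone_checks` defines `required_sentiment` word lists, but only `forbidden_words` is consulted. If the required words are also meant to be enforced, one possible reading is sketched below. Requiring at least one word from the list is an assumption, and `tone_is_consistent` is a hypothetical helper:

```python
from typing import Dict, List

# Word lists copied from the validation node above.
TONE_CHECKS: Dict[str, Dict[str, List[str]]] = {
    "NEGATIVE": {
        "forbidden_words": ["happy", "excellent", "amazing"],
        "required_sentiment": ["understand", "apologize", "help"],
    },
    "POSITIVE": {
        "forbidden_words": ["sorry", "problem", "issue"],
        "required_sentiment": ["great", "happy", "enjoy"],
    },
}


def tone_is_consistent(response: str, sentiment_label: str) -> bool:
    """Check a response against the forbidden and required word lists."""
    checks = TONE_CHECKS.get(sentiment_label)
    if checks is None:
        return True  # NEUTRAL (or unknown) labels have no tone rules
    text = response.lower()
    has_forbidden = any(word in text for word in checks["forbidden_words"])
    # Assumption: at least one required word must appear.
    has_required = any(word in text for word in checks["required_sentiment"])
    return has_required and not has_forbidden
```

This version is stricter than the committed check, so some responses that pass today would fail under it.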

orchestration/state.py ADDED

	@@ -0,0 +1,67 @@

+"""
+LangGraph State Definition - Central State Management for Voice RAG Bot Workflow
+Defines all data flowing through the orchestration pipeline
+"""
+from typing import TypedDict, List, Optional, Dict, Any
+class ConversationState(TypedDict):
+    """
+    Complete state passed through LangGraph nodes
+    Fields:
+    - user_input: Original text from voice input (after STT)
+    - customer_id: Unique customer identifier for history tracking
+    - intent: Intent detection result with confidence score
+    - sentiment: Sentiment analysis result with label and confidence
+    - entities: Extracted entities from user input (optional)
+    - conversation_summary: LLM-generated summary of conversation
+    - kb_context: Retrieved context from knowledge base
+    - history_context: Retrieved context from customer history (persistent memory)
+    - response: Final LLM-generated response text
+    - validation_passed: Boolean flag for response validation
+    - final_audio_path: Path to generated TTS audio file
+    """
+    # Input & Context
+    user_input: str
+    customer_id: str
+    # NLP Analysis Results
+    intent: Dict[str, Any]  # {"intent": "...", "confidence": float}
+    sentiment: Dict[str, Any]  # {"label": "POSITIVE|NEGATIVE|NEUTRAL", "score": float}
+    entities: Optional[Dict[str, Any]]  # {"entity_type": [...], ...}
+    # Memory Management
+    conversation_summary: str  # LLM-generated summary
+    # RAG Contexts
+    kb_context: str  # Knowledge base retrieval results
+    history_context: str  # Customer history retrieval results
+    # Response Generation
+    response: str  # Final LLM-generated response
+    # Validation & Output
+    validation_passed: bool
+    final_audio_path: Optional[str]
+class ConversationStateOptional(TypedDict, total=False):
+    """
+    Optional version of ConversationState for partial updates
+    Allows nodes to update only the fields they produce
+    """
+    user_input: str
+    customer_id: str
+    intent: Dict[str, Any]
+    sentiment: Dict[str, Any]
+    entities: Optional[Dict[str, Any]]
+    conversation_summary: str
+    kb_context: str
+    history_context: str
+    response: str
+    validation_passed: bool
+    final_audio_path: Optional[str]
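
Because `ConversationStateOptional` is declared with `total=False`, a node can return a dict containing only the keys it produces. A minimal sketch of that pattern with a trimmed-down state; `MiniState`, `fake_sentiment_node`, and the manual merge in `apply_update` are illustrative stand-ins, and LangGraph's own state merging is not reproduced here:

```python
from typing import Any, Dict, TypedDict


class MiniState(TypedDict, total=False):
    """Trimmed-down stand-in for ConversationStateOptional."""
    user_input: str
    sentiment: Dict[str, Any]
    response: str


def fake_sentiment_node(state: MiniState) -> MiniState:
    """Return only the field this node produces."""
    return {"sentiment": {"label": "NEUTRAL", "score": 0.5}}


def apply_update(state: MiniState, update: MiniState) -> Dict[str, Any]:
    """Merge a partial update into a copy of the state, leaving the input untouched."""
    return {**state, **update}
```

Keys absent from the update keep their previous values, which is what lets each node stay unaware of the fields it does not own.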

rag/__init__.py ADDED

	@@ -0,0 +1 @@


+"""Voice RAG Bot RAG (Retrieval-Augmented Generation) Package"""

rag/__pycache__/__init__.cpython-311.pyc ADDED

Binary file (227 Bytes).

rag/__pycache__/cache_manager.cpython-311.pyc ADDED

Binary file (6.14 kB).