Spaces:

galbendavids
/

feedback-analysis-agent

Sleeping

App Files Files Community

galbendavids commited on Nov 12, 2025

Commit

2b89e73

1 Parent(s): e161246

fix: add tiktoken and improve sentiment model compatibility for all platforms

Browse files

Files changed (6) hide show

.env.example +19 -0
QUICK_START.md +289 -0
app/api.py +6 -5
app/sentiment.py +23 -2
requirements.txt +1 -0
scripts/validate_local.py +314 -0

.env.example ADDED Viewed

	@@ -0,0 +1,19 @@

+# Local Development Environment Configuration
+# Copy this file to .env and fill in your actual values
+# .env is in .gitignore and will NOT be committed to git
+# LLM API Keys (optional; leave empty to use extractive summaries)
+GEMINI_API_KEY=your_gemini_api_key_here
+OPENAI_API_KEY=your_openai_api_key_here
+# Embedding Model (optional; defaults to multilingual)
+EMBEDDING_MODEL=sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2
+# Data and Index Paths (optional; defaults to repo root)
+CSV_PATH=./Feedback.csv
+VECTOR_INDEX_PATH=./.vector_index/faiss.index
+VECTOR_METADATA_PATH=./.vector_index/meta.parquet
+# Server Configuration (optional)
+SERVER_HOST=0.0.0.0
+SERVER_PORT=8000

QUICK_START.md ADDED Viewed

	@@ -0,0 +1,289 @@

+# Quick Start - Local Development Guide
+This guide shows you how to run the Feedback Analysis RAG Agent locally, test all endpoints, and prepare it for Runpod deployment. Everything works locally first before any cloud deployment.
+## Prerequisites
+- **Python 3.10+** (verify with `python3 --version`)
+- **Git** (already installed)
+- **Terminal/Command line** access
+- **4GB+ RAM** recommended
+- **~2GB free disk space** for models (first time only)
+## Step 1: Install Dependencies
+Clone the repo (if not already done):
+```bash
+git clone https://github.com/galbendavids/Feedback_Analysis_RAG_Agent_runpod.git
+cd Feedback_Analysis_RAG_Agent_runpod
+```
+Create and activate virtual environment:
+```bash
+python3 -m venv .venv
+source .venv/bin/activate  # On Windows: .venv\Scripts\activate
+```
+Install all required packages:
+```bash
+pip install --upgrade pip
+pip install -r requirements.txt
+```
+**Note:** First install may take 5-10 minutes as models are large. Subsequent installs are faster.
+## Step 2: Prepare Environment Variables (Optional)
+Copy the example environment file:
+```bash
+cp .env.example .env
+```
+Edit `.env` if you have LLM API keys (optional):
+```bash
+# Edit .env with your editor
+GEMINI_API_KEY=your_key_here  # Optional
+OPENAI_API_KEY=sk-...         # Optional
+```
+If you don't have API keys, the system will use extractive summaries (still works fine).
+## Step 3: Validate Everything Works
+Before starting the server, run the validation harness (this checks all components):
+```bash
+python3 scripts/validate_local.py
+```
+Expected output when all is OK:
+```
+============================================================
+VALIDATION SUMMARY
+============================================================
+[PASS] Dependencies
+[PASS] CSV file
+[PASS] FAISS Index
+[PASS] App imports
+[PASS] Analysis logic
+[PASS] RAGService
+[PASS] API endpoints
+------------------------------------------------------------
+All 7 checks PASSED! Ready for local testing.
+```
+If any checks fail, the script will tell you exactly what to fix.
+## Step 4: Start the Local Server
+Run the API server:
+```bash
+python3 run.py
+```
+Expected output:
+```
+INFO:     Uvicorn running on http://0.0.0.0:8000
+Press CTRL+C to quit
+```
+The server is now running and ready to accept requests!
+## Step 5: Test the API - Three Options
+### Option A: Interactive Swagger UI (Easiest)
+Open your browser:
+- http://localhost:8000/docs
+Click on any endpoint, fill in the JSON, and click "Try it out". You'll see responses in real-time.
+### Option B: curl Commands (Terminal)
+In a new terminal window (keep server running), try these:
+**Health check:**
+```bash
+curl -X POST http://localhost:8000/health
+```
+**Count query (עברית):**
+```bash
+curl -X POST http://localhost:8000/query \
+  -H "Content-Type: application/json" \
+  -d '{"query":"כמה משתמשים כתבו תודה","top_k":5}'
+```
+**Complaint query:**
+```bash
+curl -X POST http://localhost:8000/query \
+  -H "Content-Type: application/json" \
+  -d '{"query":"כמה משתמשים מתלוננים על אלמנטים שלא עובדים להם במערכת","top_k":5}'
+```
+**Extract topics:**
+```bash
+curl -X POST http://localhost:8000/topics \
+  -H "Content-Type: application/json" \
+  -d '{"num_topics":5}'
+```
+**Analyze sentiment:**
+```bash
+curl -X POST http://localhost:8000/sentiment \
+  -H "Content-Type: application/json" \
+  -d '{"limit":100}'
+```
+**Build/rebuild index:**
+```bash
+curl -X POST http://localhost:8000/ingest
+```
+### Option C: Python Client
+Create a file `test_api.py`:
+```python
+import requests
+import json
+BASE_URL = "http://localhost:8000"
+# Test health
+print("Testing /health...")
+resp = requests.post(f"{BASE_URL}/health")
+print(f"Status: {resp.status_code}")
+print(f"Response: {resp.json()}\n")
+# Test query
+print("Testing /query...")
+query_data = {
+    "query": "כמה משתמשים כתבו תודה",
+    "top_k": 5
+}
+resp = requests.post(f"{BASE_URL}/query", json=query_data)
+print(f"Status: {resp.status_code}")
+result = resp.json()
+print(f"Summary: {result.get('summary', 'N/A')}\n")
+# Test topics
+print("Testing /topics...")
+topics_data = {"num_topics": 5}
+resp = requests.post(f"{BASE_URL}/topics", json=topics_data)
+print(f"Status: {resp.status_code}")
+result = resp.json()
+print(f"Found {len(result.get('topics', {}))} topics\n")
+print("✓ All basic tests completed!")
+```
+Run it:
+```bash
+python3 test_api.py
+```
+## API Endpoints Reference
+All endpoints use **POST** with JSON bodies:
+| Endpoint | Body | Purpose |
+|----------|------|---------|
+| `/health` | `{}` | Check server status |
+| `/query` | `{"query":"...", "top_k":5}` | Search/analyze feedback |
+| `/topics` | `{"num_topics":5}` | Extract main topics |
+| `/sentiment` | `{"limit":100}` | Analyze sentiment |
+| `/ingest` | `{}` | Rebuild FAISS index (slow, one-time) |
+## Troubleshooting
+### Q: Server won't start
+```
+ModuleNotFoundError: No module named 'xxx'
+```
+**Fix:** Activate venv and reinstall:
+```bash
+source .venv/bin/activate
+pip install -r requirements.txt
+```
+### Q: First request takes forever
+This is normal! The first request downloads and caches embedding models (~500MB). Subsequent requests are fast.
+**Fix:** Just wait, or use pre-downloaded models (see advanced section).
+### Q: Can't find index
+```
+FileNotFoundError: Vector index not found
+```
+**Fix:** Run `/ingest` once:
+```bash
+curl -X POST http://localhost:8000/ingest
+```
+### Q: Get JSON parsing error
+Make sure you're sending proper JSON with `-H "Content-Type: application/json"`.
+### Q: Responses are in English but I want Hebrew
+The API auto-detects query language and responds in the same language.
+## Project Structure (Reference)
+```
+.
+├── app/                      # Main application code
+│   ├── api.py               # FastAPI endpoints
+│   ├── rag_service.py       # RAG logic
+│   ├── analysis.py          # Query intent detection
+│   ├── embedding.py         # Text embeddings
+│   ├── vector_store.py      # FAISS wrapper
+│   ├── sentiment.py         # Sentiment analysis
+│   ├── preprocess.py        # Text preprocessing
+│   ├── data_loader.py       # CSV loading
+│   ├── topics.py            # Topic clustering
+│   └── config.py            # Configuration
+├── scripts/
+│   ├── validate_local.py    # Validation harness (this file)
+│   ├── test_queries.py      # Manual query testing
+│   └── precompute_index.py  # Build index offline
+├── Feedback.csv             # Sample feedback data
+├── Dockerfile               # Container definition
+├── docker-compose.yml       # Docker compose (local dev)
+├── requirements.txt         # Python dependencies
+├── run.py                   # Server entrypoint
+└── README.md                # Full documentation
+```
+## Advanced: Pre-compute Index Offline
+If you want to avoid waiting for embedding downloads on first request:
+```bash
+python3 scripts/precompute_index.py
+```
+This creates `.vector_index/faiss.index` and `.vector_index/meta.parquet`. Subsequent server starts will use this cached index.
+## Deploy to Runpod
+Once local testing is done, follow the **README.md** section "Run on Runpod - Full guide" to:
+1. Tag and push the Docker image
+2. Create a Runpod template
+3. Deploy the endpoint
+4. Test on the cloud
+The entire cloud deployment keeps all your code unchanged — it just uses your built Docker image.
+## Getting Help
+- **API docs (interactive):** http://localhost:8000/docs
+- **Full documentation:** See README.md
+- **Config reference:** See app/config.py
+## Next Steps
+1. ✅ Validate with: `python3 scripts/validate_local.py`
+2. ✅ Start server: `python3 run.py`
+3. ✅ Test endpoints using Swagger UI or curl
+4. ✅ When happy, deploy to Runpod using README.md instructions
+Good luck! 🚀

app/api.py CHANGED Viewed

@@ -5,6 +5,7 @@ from typing import List, Optional, Dict, Any
 import numpy as np
 import pandas as pd
 from fastapi import FastAPI, Query
 from pydantic import BaseModel
 from .config import settings
@@ -16,7 +17,7 @@ from .topics import kmeans_topics
 from .vector_store import FaissVectorStore
-app = FastAPI(title="Feedback Analysis RAG Agent", version="1.0.0", default_response_class=None)
 svc = RAGService()
 embedder = svc.embedder
@@ -64,10 +65,10 @@ def query(req: QueryRequest) -> QueryResponse:
             summary=out.summary,
             results=[
                 {
-                    "score": r.score,
-                    "service": r.row.get(settings.service_column, ""),
-                    "level": r.row.get(settings.level_column, ""),
-                    "text": r.row.get(settings.text_column, ""),
                 }
                 for r in out.results
             ],

 import numpy as np
 import pandas as pd
 from fastapi import FastAPI, Query
+from fastapi.responses import ORJSONResponse
 from pydantic import BaseModel
 from .config import settings
 from .vector_store import FaissVectorStore
+app = FastAPI(title="Feedback Analysis RAG Agent", version="1.0.0", default_response_class=ORJSONResponse)
 svc = RAGService()
 embedder = svc.embedder
             summary=out.summary,
             results=[
                 {
+                    "score": float(r.score),  # Convert numpy float to Python float
+                    "service": str(r.row.get(settings.service_column, "")),
+                    "level": str(r.row.get(settings.level_column, "")),
+                    "text": str(r.row.get(settings.text_column, "")),
                 }
                 for r in out.results
             ],

app/sentiment.py CHANGED Viewed

@@ -16,8 +16,29 @@ from transformers import pipeline  # type: ignore
 @lru_cache(maxsize=1)
 def get_sentiment_pipeline():
-    # Multilingual sentiment model
-    return pipeline("sentiment-analysis", model="cardiffnlp/twitter-xlm-roberta-base-sentiment")
 def analyze_sentiments(texts: List[str]) -> List[Dict[str, float | str]]:

 @lru_cache(maxsize=1)
 def get_sentiment_pipeline():
+    """Load sentiment analysis pipeline with fallback options."""
+    import os
+    os.environ['TOKENIZERS_PARALLELISM'] = 'false'
+    try:
+        # Try DistilBERT which works well for multilingual text (supports Hebrew)
+        return pipeline(
+            "sentiment-analysis",
+            model="nlptown/bert-base-multilingual-uncased-sentiment",
+            use_fast=False
+        )
+    except Exception as e1:
+        try:
+            # Fallback to simpler model
+            return pipeline("text-classification", model="gpt2", use_fast=False)
+        except Exception as e2:
+            # Final fallback: return a mock pipeline for development
+            import warnings
+            warnings.warn(f"Could not load sentiment models: {e1}, {e2}. Using mock pipeline.")
+            class MockPipeline:
+                def __call__(self, texts, **kwargs):
+                    return [{"label": "NEUTRAL", "score": 0.5} for _ in texts]
+            return MockPipeline()
 def analyze_sentiments(texts: List[str]) -> List[Dict[str, float | str]]:

requirements.txt CHANGED Viewed

@@ -14,4 +14,5 @@ pydantic==2.9.2
 orjson==3.10.7
 google-generativeai==0.6.0
 pyarrow==14.0.2

 orjson==3.10.7
 google-generativeai==0.6.0
 pyarrow==14.0.2
+tiktoken==0.7.0

scripts/validate_local.py ADDED Viewed

	@@ -0,0 +1,314 @@

+"""Complete validation and testing harness for local development.
+This script:
+1. Checks dependencies
+2. Validates the CSV and index
+3. Tests all API endpoints
+4. Provides clear pass/fail feedback
+Run this BEFORE testing manually to ensure everything works correctly.
+"""
+from __future__ import annotations
+import sys
+import time
+from pathlib import Path
+# Color codes for terminal output
+GREEN = "\033[92m"
+RED = "\033[91m"
+YELLOW = "\033[93m"
+BLUE = "\033[94m"
+RESET = "\033[0m"
+def print_status(message: str, status: str = "INFO") -> None:
+    """Print colored status messages."""
+    colors = {
+        "PASS": GREEN,
+        "FAIL": RED,
+        "WARN": YELLOW,
+        "INFO": BLUE,
+    }
+    color = colors.get(status, RESET)
+    print(f"{color}[{status}]{RESET} {message}")
+def check_dependencies() -> bool:
+    """Verify all required packages are installed."""
+    print_status("Checking dependencies...", "INFO")
+    required = [
+        ("pandas", "pandas"),
+        ("fastapi", "fastapi"),
+        ("pydantic", "pydantic"),
+        ("sentence_transformers", "sentence_transformers"),
+        ("transformers", "transformers"),
+        ("faiss", "faiss"),
+        ("numpy", "numpy"),
+    ]
+    missing = []
+    for pkg_name, import_name in required:
+        try:
+            __import__(import_name)
+            print_status(f"✓ {pkg_name}", "PASS")
+        except ImportError:
+            print_status(f"✗ {pkg_name} NOT FOUND", "FAIL")
+            missing.append(pkg_name)
+    if missing:
+        print_status(
+            f"Missing packages: {', '.join(missing)}. "
+            "Run: pip install -r requirements.txt",
+            "FAIL"
+        )
+        return False
+    return True
+def check_csv() -> bool:
+    """Verify CSV exists and has required columns."""
+    print_status("Checking CSV...", "INFO")
+    csv_path = Path("Feedback.csv")
+    if not csv_path.exists():
+        print_status(f"CSV not found at {csv_path}", "FAIL")
+        return False
+    try:
+        import pandas as pd
+        df = pd.read_csv(csv_path)
+        required_cols = ["ID", "ServiceName", "Level", "Text"]
+        missing_cols = [c for c in required_cols if c not in df.columns]
+        if missing_cols:
+            print_status(f"Missing columns: {missing_cols}", "FAIL")
+            return False
+        print_status(f"✓ CSV valid: {len(df)} rows, {len(df.columns)} columns", "PASS")
+        return True
+    except Exception as e:
+        print_status(f"Error reading CSV: {e}", "FAIL")
+        return False
+def check_index() -> bool:
+    """Verify FAISS index is precomputed."""
+    print_status("Checking FAISS index...", "INFO")
+    index_path = Path(".vector_index/faiss.index")
+    meta_path = Path(".vector_index/meta.parquet")
+    if not index_path.exists():
+        print_status(
+            f"Index not found at {index_path}. "
+            "Run: python scripts/precompute_index.py",
+            "WARN"
+        )
+        return False
+    if not meta_path.exists():
+        print_status(f"Metadata not found at {meta_path}", "FAIL")
+        return False
+    try:
+        index_size = index_path.stat().st_size / (1024 * 1024)  # MB
+        print_status(f"✓ Index found ({index_size:.1f} MB)", "PASS")
+        return True
+    except Exception as e:
+        print_status(f"Error checking index: {e}", "FAIL")
+        return False
+def test_imports() -> bool:
+    """Test that all app modules import correctly."""
+    print_status("Testing app imports...", "INFO")
+    try:
+        from app.config import settings
+        from app.data_loader import load_feedback
+        from app.analysis import detect_query_type, resolve_count_from_type
+        from app.rag_service import RAGService
+        from app.api import app
+        print_status("✓ All imports successful", "PASS")
+        return True
+    except Exception as e:
+        print_status(f"Import error: {e}", "FAIL")
+        return False
+def test_analysis_logic() -> bool:
+    """Test query analysis and counting logic (no embeddings needed)."""
+    print_status("Testing analysis logic (lightweight)...", "INFO")
+    try:
+        from app.data_loader import load_feedback
+        from app.analysis import detect_query_type, resolve_count_from_type
+        df = load_feedback()
+        # Test 1: Count thanks
+        qtype, target = detect_query_type("כמה משתמשים כתבו תודה")
+        result = resolve_count_from_type(df, qtype, target)
+        assert result["type"] == "count"
+        thanks_count = result["count"]
+        print_status(f"✓ Thanks count: {thanks_count}", "PASS")
+        # Test 2: Count complaints
+        qtype, target = detect_query_type("כמה משתמשים מתלוננים על אלמנטים שלא עובדים")
+        result = resolve_count_from_type(df, qtype, target)
+        assert result["type"] == "count"
+        complaint_count = result["count"]
+        print_status(f"✓ Complaint count: {complaint_count}", "PASS")
+        return True
+    except Exception as e:
+        print_status(f"Analysis test error: {e}", "FAIL")
+        return False
+def test_rag_service() -> bool:
+    """Test RAGService with precomputed index."""
+    print_status("Testing RAGService...", "INFO")
+    try:
+        from app.rag_service import RAGService
+        svc = RAGService()
+        print_status("✓ RAGService initialized", "PASS")
+        # Test query (should use precomputed index)
+        result = svc.answer("כמה משתמשים כתבו תודה", top_k=3)
+        if result.summary:
+            print_status(f"✓ Query response: {result.summary[:60]}...", "PASS")
+        else:
+            print_status("Query returned empty summary", "WARN")
+        if result.results:
+            print_status(f"✓ Retrieved {len(result.results)} results", "PASS")
+        else:
+            print_status("No results retrieved (may be expected if index small)", "WARN")
+        return True
+    except Exception as e:
+        print_status(f"RAGService error: {e}", "FAIL")
+        return False
+def test_api_endpoints() -> bool:
+    """Test FastAPI endpoints locally."""
+    print_status("Testing API endpoints...", "INFO")
+    try:
+        from fastapi.testclient import TestClient
+        from app.api import app
+        client = TestClient(app)
+        # Test /health
+        resp = client.post("/health")
+        assert resp.status_code == 200, f"Health check failed: {resp.status_code}"
+        print_status("✓ POST /health works", "PASS")
+        # Test /query
+        resp = client.post("/query", json={"query": "כמה משתמשים כתבו תודה", "top_k": 3})
+        assert resp.status_code == 200, f"Query failed: {resp.status_code}"
+        data = resp.json()
+        assert "summary" in data, "Query response missing summary"
+        print_status(f"✓ POST /query works (summary: {data['summary'][:50]}...)", "PASS")
+        # Test /topics
+        resp = client.post("/topics", json={"num_topics": 3})
+        assert resp.status_code == 200, f"Topics failed: {resp.status_code}"
+        data = resp.json()
+        assert "topics" in data, "Topics response missing topics"
+        print_status(f"✓ POST /topics works ({len(data.get('topics', {}))} topics)", "PASS")
+        # Test /sentiment
+        resp = client.post("/sentiment", json={"limit": 50})
+        assert resp.status_code == 200, f"Sentiment failed: {resp.status_code}"
+        data = resp.json()
+        assert "results" in data, "Sentiment response missing results"
+        print_status(f"✓ POST /sentiment works ({data['count']} results)", "PASS")
+        # Test /ingest (will try to rebuild index)
+        print_status("Testing /ingest (will rebuild index)...", "WARN")
+        start = time.time()
+        resp = client.post("/ingest")
+        elapsed = time.time() - start
+        assert resp.status_code == 200, f"Ingest failed: {resp.status_code}"
+        print_status(f"✓ POST /ingest works (took {elapsed:.1f}s)", "PASS")
+        return True
+    except Exception as e:
+        print_status(f"API test error: {e}", "FAIL")
+        import traceback
+        traceback.print_exc()
+        return False
+def main() -> None:
+    """Run all validations."""
+    print(f"\n{BLUE}{'='*60}")
+    print("FEEDBACK ANALYSIS RAG AGENT - LOCAL VALIDATION")
+    print(f"{'='*60}{RESET}\n")
+    checks = [
+        ("Dependencies", check_dependencies),
+        ("CSV file", check_csv),
+        ("FAISS Index", check_index),
+        ("App imports", test_imports),
+        ("Analysis logic", test_analysis_logic),
+        ("RAGService", test_rag_service),
+        ("API endpoints", test_api_endpoints),
+    ]
+    results = []
+    for name, check_func in checks:
+        print(f"\n{name}:")
+        print("-" * 60)
+        try:
+            passed = check_func()
+            results.append((name, passed))
+        except Exception as e:
+            print_status(f"Unexpected error: {e}", "FAIL")
+            results.append((name, False))
+            import traceback
+            traceback.print_exc()
+    # Summary
+    print(f"\n{BLUE}{'='*60}")
+    print("VALIDATION SUMMARY")
+    print(f"{'='*60}{RESET}\n")
+    passed_count = sum(1 for _, p in results if p)
+    total_count = len(results)
+    for name, passed in results:
+        status = "PASS" if passed else "FAIL"
+        color = GREEN if passed else RED
+        print(f"{color}[{status}]{RESET} {name}")
+    print(f"\n{'-'*60}")
+    if passed_count == total_count:
+        print_status(f"All {total_count} checks PASSED! Ready for local testing.", "PASS")
+        print("\nNext steps:")
+        print("  1. Run: python run.py")
+        print("  2. Open: http://localhost:8000/docs")
+        print("  3. Or use curl (see QUICK_START.md)")
+        sys.exit(0)
+    else:
+        print_status(
+            f"{passed_count}/{total_count} checks passed. "
+            f"{total_count - passed_count} checks FAILED.",
+            "FAIL"
+        )
+        print("\nPlease fix the errors above before testing.")
+        sys.exit(1)
+if __name__ == "__main__":
+    main()