FOIA_Doc_Search / README_PROD.md
GodsDevProject's picture
Upload 29 files
0f4b74d verified
|
raw
history blame
896 Bytes

FOIA HF Document Search — Production Build

Entry Point

  • app.py — orchestrates ingestion, semantic search, export, and audit hooks.

Ingestion Adapters (Present)

  • CIA Reading Room
  • FBI Vault
  • Generic Public FOIA

Missing / Stubbed Adapters (Recommended)

  • DoD (incl. components)
  • NSA
  • DIA
  • DHS
  • DEA
  • ICE

Vector Backend Assumptions

  • Current code supports abstract vector ops.
  • Recommended backends:
    • FAISS (local)
    • Chroma (persistent)
    • HuggingFace embeddings
    • OpenAI embeddings (optional)

Live Federated Search Upgrade

  • Async querying via asyncio + httpx
  • Adapter interface with rate limits
  • Response caching + deduplication
  • Circuit breakers for abuse prevention

Compliance

  • Respect robots.txt where applicable
  • Rate limiting per agency
  • Redaction before export
  • Audit logging enabled

Build Timestamp

2026-01-09T23:51:16.728748Z