--- title: ESG Document Intelligence Platform emoji: 🌿 colorFrom: green colorTo: blue sdk: gradio sdk_version: 6.9.0 app_file: app.py pinned: false license: apache-2.0 short_description: HyperRAG + Discourse Graph for ESG Report Analysis --- # 🌿 Multimodal ESG Document Intelligence Platform > **HyperRAG + Discourse Graph Reasoning for ESG Report Analysis** Upload any ESG / Sustainability PDF report and get: - πŸ’¬ **Contextual Q&A** β€” ask questions about the report, answered with page-level evidence - πŸ“Š **ESG Pillar Scores** β€” keyword-based E, S, G scoring + sector detection - 🚨 **Greenwashing Detection** β€” flags unsubstantiated claims with exact page references - πŸ•ΈοΈ **Discourse Graph Insights** β€” models relationships between claims, evidence, policies and metrics ## Architecture ``` PDF β†’ Text Extraction (pdfplumber) β†’ Chunking (400-word windows, 80-word overlap) β†’ Embeddings (sentence-transformers/all-MiniLM-L6-v2) β†’ Qdrant Vector Index (in-memory) β†’ Discourse Graph (NetworkX DiGraph) claims ──supported_by──▢ evidence policies ──measured_by──▢ metrics β†’ HyperRAG Retrieval vector search + graph neighbourhood expansion β†’ Flan-T5 Answer Generation ``` ## Key Technologies | Layer | Technology | |-------|-----------| | Vector Store | Qdrant (in-memory) | | Embeddings | `all-MiniLM-L6-v2` | | LLM | `google/flan-t5-base` | | Graph | NetworkX DiGraph | | Retrieval | HyperRAG (vector + graph) | | UI | Gradio | ## Usage 1. **Upload** an ESG report PDF in the *Upload & Process* tab 2. Click **Process Document** β€” wait ~30–60 s for indexing 3. Switch to any analysis tab and explore! ## Limitations - ESG scores are keyword-density heuristics (not certified ratings) - `flan-t5-base` is used for CPU compatibility; swap in a larger model for production - Greenwashing detection is pattern-based and requires expert review ## Running Locally ```bash git clone https://huggingface.co/spaces//esg-intelligence cd esg-intelligence pip install -r requirements.txt python app.py ``` ## License Apache 2.0 β€” research & demonstration use only.