---
title: ESG Document Intelligence Platform
emoji: 🌿
colorFrom: green
colorTo: blue
sdk: gradio
sdk_version: 6.9.0
app_file: app.py
pinned: false
license: apache-2.0
short_description: HyperRAG + Discourse Graph for ESG Report Analysis
---
# 🌿 Multimodal ESG Document Intelligence Platform

> **HyperRAG + Discourse Graph Reasoning for ESG Report Analysis**
Upload any ESG / Sustainability PDF report and get:

- 💬 **Contextual Q&A**: ask questions about the report, answered with page-level evidence
- 📊 **ESG Pillar Scores**: keyword-based E, S, G scoring + sector detection
- 🚨 **Greenwashing Detection**: flags unsubstantiated claims with exact page references
- 🕸️ **Discourse Graph Insights**: models relationships between claims, evidence, policies and metrics
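
As a rough illustration of how pattern-based flagging with page references can work, the sketch below marks sentences that contain a claim-like term but no figure. The pattern lists and the `flag_unsubstantiated` helper are hypothetical; the patterns actually used by the app are not documented in this README.

```python
import re

# Hypothetical patterns for illustration only; not the app's real pattern set.
VAGUE_CLAIM = re.compile(
    r"\b(eco-friendly|sustainable|green|carbon neutral|net zero)\b", re.I
)
# Assumption: any digit in the sentence counts as quantitative support.
EVIDENCE = re.compile(r"\d")


def flag_unsubstantiated(pages: list[str]) -> list[tuple[int, str]]:
    """Return (page_number, sentence) for claim-like sentences with no figure."""
    flags = []
    for page_no, text in enumerate(pages, start=1):
        # Naive sentence split on ., ! or ? followed by whitespace.
        for sentence in re.split(r"(?<=[.!?])\s+", text):
            if VAGUE_CLAIM.search(sentence) and not EVIDENCE.search(sentence):
                flags.append((page_no, sentence.strip()))
    return flags
```

Each flag carries a 1-based page number so a reviewer can check the claim in context; a heuristic like this produces candidates for expert review, not verdicts.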
## Architecture

```
PDF → Text Extraction (pdfplumber)
    → Chunking (400-word windows, 80-word overlap)
    → Embeddings (sentence-transformers/all-MiniLM-L6-v2)
    → Qdrant Vector Index (in-memory)
    → Discourse Graph (NetworkX DiGraph)
         claims ──supported_by──▶ evidence
         policies ──measured_by──▶ metrics
    → HyperRAG Retrieval
         vector search + graph neighbourhood expansion
    → Flan-T5 Answer Generation
```
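
The chunking step in the pipeline above can be sketched as a sliding window over words. This is a minimal sketch assuming whitespace tokenisation; the function name `chunk_words` is hypothetical and the code in `app.py` may differ.

```python
def chunk_words(text: str, window: int = 400, overlap: int = 80) -> list[str]:
    """Split text into overlapping word windows."""
    if not 0 <= overlap < window:
        raise ValueError("overlap must be non-negative and smaller than window")
    words = text.split()
    step = window - overlap  # 320 words between consecutive window starts
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + window]))
        if start + window >= len(words):
            break  # this window already reaches the end of the text
    return chunks
```

With the defaults, consecutive chunks share 80 words, so a sentence that straddles a window boundary still appears whole in at least one chunk.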
## Key Technologies

| Layer | Technology |
|-------|-----------|
| Vector Store | Qdrant (in-memory) |
| Embeddings | `all-MiniLM-L6-v2` |
| LLM | `google/flan-t5-base` |
| Graph | NetworkX DiGraph |
| Retrieval | HyperRAG (vector + graph) |
| UI | Gradio |
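
To make the "vector + graph" retrieval row concrete, here is a dependency-free sketch of the general idea: rank chunks by cosine similarity, then expand the top hits along outgoing graph edges. A plain dict of successor lists stands in for the NetworkX DiGraph and a brute-force scan stands in for Qdrant; `hybrid_retrieve`, its parameters, and the choice to follow only outgoing edges are assumptions for illustration, not the app's actual implementation.

```python
import math


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def hybrid_retrieve(query_vec, chunk_vecs, graph, top_k=2, hops=1):
    """Return top_k chunk ids by similarity, then nodes reachable within `hops` edges.

    chunk_vecs: dict mapping chunk id -> embedding vector
    graph: dict mapping node id -> list of successor ids (directed edges)
    """
    ranked = sorted(
        chunk_vecs, key=lambda cid: cosine(query_vec, chunk_vecs[cid]), reverse=True
    )
    selected = ranked[:top_k]  # seeds from vector search
    frontier = list(selected)
    for _ in range(hops):
        next_frontier = []
        for node in frontier:
            for neighbour in graph.get(node, []):
                if neighbour not in selected:
                    selected.append(neighbour)
                    next_frontier.append(neighbour)
        frontier = next_frontier
    return selected
```

For example, if a claim chunk is the best vector match and has a `supported_by` edge to an evidence chunk, the evidence chunk is pulled into the context even when its own embedding is far from the query.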
## Usage

1. **Upload** an ESG report PDF in the *Upload & Process* tab
2. Click **Process Document** and wait ~30–60 s for indexing
3. Switch to any analysis tab and explore!
## Limitations

- ESG scores are keyword-density heuristics (not certified ratings)
- `flan-t5-base` is used for CPU compatibility; swap in a larger model for production
- Greenwashing detection is pattern-based and requires expert review
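
To show why keyword-density scores are only heuristics, the sketch below counts pillar keywords per 1,000 words. The keyword lists, the per-1,000-words normalisation, and the `pillar_scores` name are all assumptions for illustration; the vocabulary and scaling used by the app are not given in this README.

```python
import re

# Hypothetical keyword lists; the app's real vocabulary is not documented here.
PILLAR_KEYWORDS = {
    "E": {"emissions", "carbon", "energy", "water", "waste"},
    "S": {"employees", "diversity", "safety", "community", "training"},
    "G": {"board", "audit", "compliance", "ethics", "governance"},
}


def pillar_scores(text: str) -> dict[str, float]:
    """Return keyword hits per 1,000 words for each ESG pillar."""
    words = re.findall(r"[a-z]+", text.lower())
    if not words:
        return {pillar: 0.0 for pillar in PILLAR_KEYWORDS}
    return {
        pillar: 1000 * sum(w in keywords for w in words) / len(words)
        for pillar, keywords in PILLAR_KEYWORDS.items()
    }
```

A score like this rises whenever a report mentions a topic more often, regardless of whether performance is good or bad, which is why it cannot substitute for a certified rating.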
## Running Locally

```bash
git clone https://huggingface.co/spaces/<your-username>/esg-intelligence
cd esg-intelligence
pip install -r requirements.txt
python app.py
```
## License

Apache 2.0. Research & demonstration use only.