esg-intelligence / README.md
GirishaBuilds01's picture
Update README.md
3d20a80 verified
---
title: ESG Document Intelligence Platform
emoji: ๐ŸŒฟ
colorFrom: green
colorTo: blue
sdk: gradio
sdk_version: 6.9.0
app_file: app.py
pinned: false
license: apache-2.0
short_description: HyperRAG + Discourse Graph for ESG Report Analysis
---
# ๐ŸŒฟ Multimodal ESG Document Intelligence Platform
> **HyperRAG + Discourse Graph Reasoning for ESG Report Analysis**
Upload any ESG / Sustainability PDF report and get:
- ๐Ÿ’ฌ **Contextual Q&A** โ€” ask questions about the report, answered with page-level evidence
- ๐Ÿ“Š **ESG Pillar Scores** โ€” keyword-based E, S, G scoring + sector detection
- ๐Ÿšจ **Greenwashing Detection** โ€” flags unsubstantiated claims with exact page references
- ๐Ÿ•ธ๏ธ **Discourse Graph Insights** โ€” models relationships between claims, evidence, policies and metrics
## Architecture
```
PDF โ†’ Text Extraction (pdfplumber)
โ†’ Chunking (400-word windows, 80-word overlap)
โ†’ Embeddings (sentence-transformers/all-MiniLM-L6-v2)
โ†’ Qdrant Vector Index (in-memory)
โ†’ Discourse Graph (NetworkX DiGraph)
claims โ”€โ”€supported_byโ”€โ”€โ–ถ evidence
policies โ”€โ”€measured_byโ”€โ”€โ–ถ metrics
โ†’ HyperRAG Retrieval
vector search + graph neighbourhood expansion
โ†’ Flan-T5 Answer Generation
```
## Key Technologies
| Layer | Technology |
|-------|-----------|
| Vector Store | Qdrant (in-memory) |
| Embeddings | `all-MiniLM-L6-v2` |
| LLM | `google/flan-t5-base` |
| Graph | NetworkX DiGraph |
| Retrieval | HyperRAG (vector + graph) |
| UI | Gradio |
## Usage
1. **Upload** an ESG report PDF in the *Upload & Process* tab
2. Click **Process Document** โ€” wait ~30โ€“60 s for indexing
3. Switch to any analysis tab and explore!
## Limitations
- ESG scores are keyword-density heuristics (not certified ratings)
- `flan-t5-base` is used for CPU compatibility; swap in a larger model for production
- Greenwashing detection is pattern-based and requires expert review
## Running Locally
```bash
git clone https://huggingface.co/spaces/<your-username>/esg-intelligence
cd esg-intelligence
pip install -r requirements.txt
python app.py
```
## License
Apache 2.0 โ€” research & demonstration use only.