RAG / README.md
sumitnewold's picture
Upload 10 files
76bd1fc verified
|
Raw
History Blame Contribute Delete
1.13 kB
---
title: FinAgent Infosys AR QA
emoji: πŸ“Š
colorFrom: blue
colorTo: indigo
sdk: streamlit
sdk_version: 1.58.0
app_file: app.py
pinned: false
license: mit
---
# πŸ“Š FinAgent β€” Infosys Annual Report Q&A
An agentic **Retrieval-Augmented Generation** app over the Infosys Integrated
Annual Report 2024-25. Hybrid retrieval (BM25 + dense + HyDE) β†’ cross-encoder
reranking β†’ LangGraph multi-agent orchestration β†’ FinBERT sentiment β†’ inline
Self-RAG faithfulness check, served with Streamlit.
## Setup on Hugging Face Spaces
This Space needs **one secret** to answer questions:
- **`GROQ_API_KEY`** β€” add it under **Settings β†’ Variables and secrets β†’ New secret**.
Get a free key at https://console.groq.com/keys
The UI and models load without it; only the LLM answers require it.
## Stack
Groq `llama-3.3-70b-versatile` Β· LangGraph Β· LangChain Β· ChromaDB (pre-built,
shipped in `finance_db/`) Β· HuggingFace embeddings (`all-mpnet-base-v2`) Β·
cross-encoder (`ms-marco-MiniLM-L-6-v2`) Β· FinBERT.
> First boot downloads ~1 GB of models and takes ~1–2 min. CPU basic (free,
> 16 GB RAM) is sufficient.