Digi-Biz - Clean Project Structure
β CLEANED UP!
All documentation moved to docs/, unused files removed.
π Final Structure
digi-biz/
βββ π Core Files
β βββ app.py # Streamlit app (MAIN)
β βββ api.py # FastAPI backend (alternative)
β βββ requirements.txt # Python dependencies
β βββ .env # Environment variables
β βββ .env.example # Example env file
β
βββ π€ Backend (Python)
β βββ backend/
β β βββ api/main.py # FastAPI server
β β βββ agents/ # 8 AI agents
β β β βββ file_discovery.py
β β β βββ document_parsing.py
β β β βββ table_extraction.py
β β β βββ media_extraction.py
β β β βββ vision_agent.py
β β β βββ indexing.py
β β β βββ schema_mapping_v2.py # NEW - Generic extraction
β β β βββ validation_agent.py
β β βββ models/
β β β βββ schemas.py # Data models
β β β βββ enums.py
β β βββ parsers/ # Document parsers
β β βββ utils/ # Utilities
β
βββ π Frontend (Next.js - Optional)
β βββ frontend/
β β βββ src/app/ # Next.js pages
β β βββ src/lib/api.ts # API client
β β βββ package.json
β
βββ π Documentation
β βββ docs/ # ALL .md files moved here
β β βββ README.md # Project overview
β β βββ HACKATHON_QUICKSTART.md
β β βββ CURRENT_STATUS.md
β β βββ [20+ more docs]
β βββ README.md # Main README (root)
β
βββ πΎ Storage
β βββ storage/
β βββ profiles/ # Generated profiles (JSON)
β βββ extracted/ # Extracted media
β
βββ π§ͺ Tests
βββ tests/
βββ agents/ # Agent tests
π― What's Kept
Essential Files:
- β
app.py- Streamlit app (primary interface) - β
backend/- All Python backend code - β
requirements.txt- Dependencies - β
.env- Configuration
Documentation:
- β
All
.mdfiles βdocs/folder - β
README.md- Clean, hackathon-ready
Optional:
- β οΈ
frontend/- Next.js (can be removed if not using) - β οΈ
tests/- Unit tests (keep for development)
ποΈ What Was Removed
- β
test_*.pyfiles (root level) - β
debug_*.pyfiles - β
resume.py - β Old agent versions (
schema_mapping.py,schema_mapping_simple.py) - β Duplicate/unused files
π Quick Start (Clean)
# 1. Install
pip install -r requirements.txt
# 2. Configure
cp .env.example .env
# Edit .env with your Groq API key
# 3. Run
streamlit run app.py
π File Count
| Category | Count |
|---|---|
| Core Files | 5 |
| Backend Agents | 8 |
| Backend Utils | 6 |
| Documentation | 26 (in docs/) |
| Tests | 5 |
| Total Python Files | ~30 |
Clean, organized, and hackathon-ready! π