Digi-Biz / docs /README.md
Deployment Bot
Automated deployment to Hugging Face
255cbd1

Digi-Biz πŸ“„

Agentic Business Digitization Framework

Transform unstructured business documents into structured digital profiles using AI agents.

Tests Python 3.10+ License: MIT


πŸš€ Quick Start

1. Install Dependencies

pip install -r requirements.txt

2. Configure Groq API

Get your free API key at https://console.groq.com

Create .env file:

GROQ_API_KEY=gsk_your_key_here
GROQ_VISION_MODEL=meta-llama/llama-4-scout-17b-16e-instruct

3. Run the App

streamlit run app.py

Open http://localhost:8501


✨ Features

βœ… Multi-Agent Pipeline - 6 specialized agents
βœ… Groq Vision - Image analysis with Llama-4-Scout (17B)
βœ… Vectorless RAG - Fast document retrieval
βœ… Production-Ready - Error handling, validation, logging
βœ… Interactive UI - Streamlit web interface


πŸ“Š What It Does

  1. Upload ZIP with business documents
  2. AI Agents Process:
    • File Discovery β†’ Classify files
    • Document Parsing β†’ Extract text/tables
    • Table Extraction β†’ Detect & classify
    • Media Extraction β†’ Extract images
    • Vision Analysis β†’ Describe images (Groq)
    • Indexing β†’ Build search index (RAG)
  3. View Results in interactive UI

🎯 Example Use Cases

Restaurant Digitization

  • Upload: Menu PDFs, food photos, price lists
  • Output: Digital menu with prices, food descriptions, categories

Travel Agency

  • Upload: Tour brochures, itinerary PDFs, destination photos
  • Output: Tour packages with itineraries, pricing, descriptions

Retail Store

  • Upload: Product catalogs, inventory spreadsheets, product photos
  • Output: Product inventory with descriptions, prices, categories

πŸ“ Project Structure

digi-biz/
β”œβ”€β”€ backend/agents/        # 6 AI agents
β”œβ”€β”€ backend/models/        # Data schemas
β”œβ”€β”€ backend/utils/         # Utilities
β”œβ”€β”€ tests/agents/          # Test suites
β”œβ”€β”€ app.py                 # Streamlit app
β”œβ”€β”€ requirements.txt       # Dependencies
└── docs/                  # Documentation

πŸ§ͺ Testing

All agents are thoroughly tested:

# Run all tests
pytest tests/ -v

# Test coverage
pytest tests/ --cov=backend

Test Results: 66/66 tests passing βœ…


πŸ“– Documentation


πŸ”§ Configuration

Environment Variables (.env)

# Groq API (required)
GROQ_API_KEY=gsk_xxxxx
GROQ_MODEL=gpt-oss-120b
GROQ_VISION_MODEL=meta-llama/llama-4-scout-17b-16e-instruct

# Optional: Ollama fallback
OLLAMA_HOST=http://localhost:11434

# Processing limits
MAX_FILE_SIZE=524288000    # 500MB
MAX_FILES_PER_ZIP=100

πŸŽ“ Agents

# Agent Purpose Status
1 File Discovery Extract & classify ZIP files βœ…
2 Document Parsing Parse PDF/DOCX βœ…
3 Table Extraction Detect & classify tables βœ…
4 Media Extraction Extract images/videos βœ…
5 Vision Agent Analyze images (Groq) βœ…
6 Indexing Agent Build RAG index βœ…

πŸ“Š Performance

Task Time
File Discovery (10 files) ~1-2s
Document Parsing (10 pages) ~0.5s
Table Extraction (5 tables) ~0.5s
Vision Analysis (1 image) ~2s
Total (typical folder) <2 min

πŸ› οΈ Tech Stack

  • Backend: Python 3.10+, Pydantic, asyncio
  • Document Parsing: pdfplumber, python-docx, openpyxl
  • Vision AI: Groq API (Llama-4-Scout-17B)
  • Frontend: Streamlit
  • Testing: pytest

πŸ“ License

MIT License - See LICENSE for details


🀝 Contributing

  1. Fork the repo
  2. Create feature branch
  3. Add tests
  4. Submit PR

πŸ“ž Support


Made with ❀️ using AI Agents