Multi_Modal_RAG / README.md
Sameer-Handsome173's picture
Update README.md
055f5c1 verified
metadata
title: Multimodal RAG Service
emoji: πŸš€
colorFrom: blue
colorTo: purple
sdk: docker
app_port: 7860
pinned: false

πŸš€ Multimodal RAG Service

A unified service for PDF ingestion and multimodal querying using RAG (Retrieval-Augmented Generation).

Features

  • πŸ“„ PDF text extraction
  • πŸ“Š Table extraction
  • πŸ–ΌοΈ Image extraction
  • πŸ€– Multimodal summarization
  • πŸ” Vector similarity search
  • πŸ’¬ Context-aware answering

API Endpoints

  • POST /ingest - Upload and process PDF files
  • POST /query?question=YOUR_QUESTION&k=5 - Query the system
  • GET /stats - View system statistics
  • GET /docs - Interactive API documentation

Usage

Visit the deployed space and use the /docs endpoint for interactive API documentation.