Spaces:
Sleeping
Sleeping
metadata
title: Multimodal RAG Service
emoji: π
colorFrom: blue
colorTo: purple
sdk: docker
app_port: 7860
pinned: false
π Multimodal RAG Service
A unified service for PDF ingestion and multimodal querying using RAG (Retrieval-Augmented Generation).
Features
- π PDF text extraction
- π Table extraction
- πΌοΈ Image extraction
- π€ Multimodal summarization
- π Vector similarity search
- π¬ Context-aware answering
API Endpoints
POST /ingest- Upload and process PDF filesPOST /query?question=YOUR_QUESTION&k=5- Query the systemGET /stats- View system statisticsGET /docs- Interactive API documentation
Usage
Visit the deployed space and use the /docs endpoint for interactive API documentation.