Spaces:
Runtime error
Runtime error
metadata
title: Menu OCR + Food Card Generator π½οΈ
emoji: π§Ύ
colorFrom: blue
colorTo: green
sdk: docker
sdk_version: '1.0'
app_file: main.py
pinned: false
π½οΈ AI-Powered Menu OCR & Food Card Generator
This project allows you to upload an image of a restaurant menu, extract food details using Google Gemini 1.5 Flash, and generate photorealistic food cards using Stable Diffusion β all inside a clean FastAPI + Jinja2 web app.
π§ Features
- β OCR & extraction using Google Gemini 1.5 Flash
- β Photorealistic image generation with Stable Diffusion
- β Responsive UI with Jinja2 templating
- β FastAPI backend with static image serving
- β Deployable on Hugging Face Spaces or Docker
π¦ Tech Stack
- FastAPI + Uvicorn
- Google Generative AI (
google-generativeai) - Hugging Face Diffusers (Stable Diffusion)
- PyTorch with CUDA support
- Jinja2 Templates
- PIL, Matplotlib, Multipart
π§ͺ How It Works
- Upload a restaurant menu image.
- Gemini extracts food names, descriptions, prices, and categories.
- Stable Diffusion generates high-quality images of each food item.
- Final cards are rendered in a neat grid layout via Jinja2.
π How to Run
π¦ Local Development
Install dependencies:
pip install -r requirements.txt