jonloporto's picture
Update README.md
606a244 verified

A newer version of the Gradio SDK is available: 6.5.1

Upgrade
metadata
title: Image to Voice
emoji: 🎤
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 6.2.0
app_file: app.py
pinned: false

Image to Voice Converter

Convert images to text descriptions and then to speech audio!

How it works

  1. Upload an image
  2. The AI analyzes the image and generates a text description
  3. The text is converted to speech using a text-to-speech model
  4. Download the audio file

Technologies Used

  • Hugging Face Transformers: For image-to-text conversion
  • Supertonic TTS: For text-to-speech synthesis
  • Gradio: For the web interface