---
title: Image to Voice
emoji: 🎤
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 6.2.0
app_file: app.py
pinned: false
---

# Image to Voice Converter

Turn an image into a text description, then turn that description into speech audio.

## How it works

1. Upload an image
2. An image-to-text model analyzes the image and generates a text description
3. A text-to-speech model converts the description to speech
4. Download the generated audio file
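
The steps above can be sketched as a small two-stage pipeline. This is a minimal illustration, not the code in `app.py`: the names `image_to_voice`, `write_wav`, `fake_captioner`, and `fake_synthesizer` are hypothetical, and the two stand-in functions only mark where the real image-to-text and text-to-speech models would plug in.

```python
import math
import struct
import wave
from typing import Callable, Sequence, Tuple

# Hypothetical interfaces: the real app's model wrappers are not shown in this README.
Captioner = Callable[[object], str]  # image -> text description
Synthesizer = Callable[[str], Tuple[Sequence[float], int]]  # text -> (samples, sample rate)


def write_wav(path: str, samples: Sequence[float], sample_rate: int) -> str:
    """Write mono float samples in [-1, 1] as a 16-bit PCM WAV file."""
    frames = b"".join(
        struct.pack("<h", int(max(-1.0, min(1.0, s)) * 32767)) for s in samples
    )
    with wave.open(path, "wb") as wav_file:
        wav_file.setnchannels(1)   # mono
        wav_file.setsampwidth(2)   # 2 bytes = 16-bit samples
        wav_file.setframerate(sample_rate)
        wav_file.writeframes(frames)
    return path


def image_to_voice(
    image: object,
    captioner: Captioner,
    synthesizer: Synthesizer,
    out_path: str = "speech.wav",
) -> Tuple[str, str]:
    """Steps 2-4: describe the image, synthesize speech, save a downloadable file."""
    caption = captioner(image).strip()
    if not caption:
        raise ValueError("captioner returned an empty description")
    samples, sample_rate = synthesizer(caption)
    return caption, write_wav(out_path, samples, sample_rate)


def fake_captioner(image: object) -> str:
    """Stand-in for the image-to-text model (assumption, returns fixed text)."""
    return "a placeholder description"


def fake_synthesizer(text: str) -> Tuple[Sequence[float], int]:
    """Stand-in for the TTS model: a 440 Hz tone, 0.1 s per word."""
    sample_rate = 16000
    n_samples = sample_rate // 10 * len(text.split())
    samples = [
        0.3 * math.sin(2 * math.pi * 440 * i / sample_rate) for i in range(n_samples)
    ]
    return samples, sample_rate


if __name__ == "__main__":
    caption, audio_path = image_to_voice(None, fake_captioner, fake_synthesizer)
    print(caption, audio_path)
```

Passing the two models in as plain callables keeps the pipeline independent of any particular model, so either stage can be swapped without touching the other.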

## Technologies Used

- **Hugging Face Transformers**: For image-to-text conversion
- **Supertonic TTS**: For text-to-speech synthesis
- **Gradio**: For the web interface
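
As a rough sketch of how two of these pieces might be wired together, the snippet below wraps a Transformers image-to-text pipeline and builds a Gradio interface around a handler. It makes several assumptions: `extract_caption`, `make_captioner`, and `build_demo` are hypothetical names, the model is left as a parameter because this README does not name one, the `"image-to-text"` task name is the one used in Transformers 4.x and may differ in other versions, and the Supertonic TTS call is omitted because its API is not described here.

```python
from typing import Callable, List, Tuple


def extract_caption(outputs: List[dict]) -> str:
    """Read the text from an image-to-text pipeline result.

    Assumes the usual result shape: a list of dicts with a 'generated_text' key.
    """
    if not outputs:
        return ""
    return str(outputs[0].get("generated_text", "")).strip()


def make_captioner(model_name: str) -> Callable[[object], str]:
    """Wrap a Transformers pipeline as an image -> caption function.

    model_name is supplied by the caller; no specific model is assumed.
    """
    from transformers import pipeline  # imported lazily: large dependency

    pipe = pipeline("image-to-text", model=model_name)
    return lambda image: extract_caption(pipe(image))


def build_demo(describe_and_speak: Callable[[object], Tuple[str, str]]):
    """Build a Gradio UI around a handler returning (description, audio file path)."""
    import gradio as gr  # imported lazily so the helpers above work without Gradio

    return gr.Interface(
        fn=describe_and_speak,
        inputs=gr.Image(type="pil", label="Image"),
        outputs=[
            gr.Textbox(label="Description"),
            gr.Audio(type="filepath", label="Speech"),
        ],
        title="Image to Voice Converter",
    )
```

Calling `build_demo(handler).launch()` would start the interface, where `handler` combines a captioner with whatever text-to-speech function the app uses.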