---
title: Image to Voice
emoji: 🎤
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 6.2.0
app_file: app.py
pinned: false
---

# Image to Voice Converter

Convert images to text descriptions and then to speech audio!

## How it works

1. Upload an image
2. The AI analyzes the image and generates a text description
3. The text is converted to speech using a text-to-speech model
4. Download the audio file
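
The steps above can be sketched as a small pipeline. This is a minimal illustration, not the code in `app.py`: the function names are hypothetical, the `Salesforce/blip-image-captioning-base` checkpoint is an assumed choice of captioning model, and `synthesize` is a placeholder callable standing in for the text-to-speech model, whose actual API is not shown in this README. The `image-to-text` pipeline task is available in Transformers 4.x and may differ in other versions.

```python
import struct
import tempfile
import wave


def load_captioner(model_name="Salesforce/blip-image-captioning-base"):
    """Load an image captioning pipeline (assumed model; imported lazily)."""
    from transformers import pipeline

    return pipeline("image-to-text", model=model_name)


def write_wav(path, samples, sample_rate):
    """Write mono float samples in [-1, 1] to a 16-bit PCM WAV file."""
    frames = b"".join(
        struct.pack("<h", int(max(-1.0, min(1.0, s)) * 32767)) for s in samples
    )
    with wave.open(path, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)  # 2 bytes per sample = 16-bit audio
        wav.setframerate(sample_rate)
        wav.writeframes(frames)


def image_to_voice(image, captioner, synthesize, out_path=None):
    """Caption an image, synthesize speech for the caption, save it as WAV.

    `captioner` is expected to return a list of dicts with a
    "generated_text" key. `synthesize` is a hypothetical callable that
    takes text and returns (samples, sample_rate).
    """
    caption = captioner(image)[0]["generated_text"].strip()
    samples, sample_rate = synthesize(caption)
    if out_path is None:
        out_path = tempfile.NamedTemporaryFile(suffix=".wav", delete=False).name
    write_wav(out_path, samples, sample_rate)
    return caption, out_path
```

Passing the captioner and synthesizer in as arguments keeps the pipeline independent of any particular model, so either one can be swapped or stubbed out without touching the rest.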

## Technologies Used

- **Hugging Face Transformers**: For image-to-text conversion
- **Supertonic TTS**: For text-to-speech synthesis
- **Gradio**: For the web interface
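
As a rough sketch of how the Gradio interface might tie these pieces together, the example below wires an image input to a text output and an audio output. It assumes nothing about the real `app.py`: `describe_and_speak` and `build_demo` are hypothetical names, and the handler returns a fixed caption and one second of silence where a real app would call the captioning and text-to-speech models.

```python
import tempfile
import wave


def describe_and_speak(image):
    """Placeholder handler returning (caption, path_to_wav).

    A real handler would caption `image` and synthesize speech from the
    caption; this stand-in only shows the shape of the return value.
    """
    caption = "placeholder caption"
    path = tempfile.NamedTemporaryFile(suffix=".wav", delete=False).name
    with wave.open(path, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)
        wav.setframerate(16000)
        wav.writeframes(b"\x00\x00" * 16000)  # one second of silence
    return caption, path


def build_demo():
    """Build the web interface (Gradio is imported lazily)."""
    import gradio as gr

    return gr.Interface(
        fn=describe_and_speak,
        inputs=gr.Image(type="pil", label="Image"),
        outputs=[
            gr.Textbox(label="Description"),
            gr.Audio(type="filepath", label="Speech"),
        ],
        title="Image to Voice Converter",
    )
```

Calling `build_demo().launch()` would start the app. Returning a file path from the handler lets the audio component offer the result for download.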