Update README.md
README.md
CHANGED
|
@@ -8,5 +8,38 @@ sdk_version: 5.18.0
|
|
| 8 |
app_file: app.py
|
| 9 |
pinned: false
|
| 10 |
---
|
| 11 |
|
| 12 |
-
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
|
| 11 |
+
Here's a refined version of your text with improved clarity and flow:
|
| 12 |
+
|
| 13 |
+
---
|
| 20 |
+
|
| 21 |
+
### **Storytelling Text-to-Speech (TTS) Application**
|
| 22 |
+
|
| 23 |
+
#### **Project Objectives**
|
| 24 |
+
This project aims to develop an interactive Text-to-Speech (TTS) application that enables users to input stories and have them narrated using diverse voices in multiple languages.
|
| 25 |
+
|
| 26 |
+
#### **Implemented Pipelines**
|
| 27 |
+
The application uses the Edge TTS service for voice synthesis, which offers neural voices in multiple languages. Key features:
|
| 28 |
+
|
| 29 |
+
- **Text Input:** Users can enter the story they wish to convert into speech.
|
| 30 |
+
- **Language Selection:** A dropdown menu allows users to choose from various languages, including English and Arabic.
|
| 31 |
+
- **Speaker Selection:** Based on the selected language, users can pick from a list of available speakers.
|
| 32 |
+
- **Audio Generation:** Clicking the "Generate Magical Audio" button processes the text and produces an audio file.
|
| 33 |
+
- **Audio Playback:** The generated audio file is displayed for playback, so users can listen to the narration.
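
The language-dependent speaker list and the audio-generation step above could be sketched as follows. This is a minimal illustration, not the Space's actual `app.py`: the `VOICES` table, the display names, the specific voice IDs, and the helpers `speakers_for`, `resolve_voice`, and `synthesize` are all assumptions made for the example. Only `edge_tts.Communicate(...).save(...)` comes from the `edge-tts` package itself.

```python
# Hypothetical catalog: language -> {display name: Edge TTS voice ID}.
# The voice IDs are examples; the Space may expose a different set.
VOICES = {
    "English": {
        "Aria (US)": "en-US-AriaNeural",
        "Guy (US)": "en-US-GuyNeural",
    },
    "Arabic": {
        "Hamed (SA)": "ar-SA-HamedNeural",
        "Salma (EG)": "ar-EG-SalmaNeural",
    },
}


def speakers_for(language: str) -> list[str]:
    """Return the speaker names offered for a language (empty if unknown)."""
    return list(VOICES.get(language, {}))


def resolve_voice(language: str, speaker: str) -> str:
    """Map a (language, speaker) pair to a voice ID, rejecting mismatches."""
    try:
        return VOICES[language][speaker]
    except KeyError:
        raise ValueError(f"{speaker!r} is not available for {language!r}") from None


async def synthesize(text: str, language: str, speaker: str,
                     out_path: str = "story.mp3") -> str:
    """Narrate `text` to an MP3 file and return the file path."""
    import edge_tts  # third-party (pip install edge-tts); needs network access

    if not text.strip():
        raise ValueError("story text is empty")
    communicate = edge_tts.Communicate(text, resolve_voice(language, speaker))
    await communicate.save(out_path)
    return out_path
```

With network access, `asyncio.run(synthesize("Once upon a time...", "English", "Aria (US)"))` would write `story.mp3`. Keeping the catalog in one dictionary means the language and speaker dropdowns cannot drift out of sync.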
|
| 34 |
+
|
| 35 |
+
#### **How to Use the Interface**
|
| 36 |
+
1. **Enter Your Story:** Type your text into the provided input field.
|
| 37 |
+
2. **Select Language:** Choose your preferred language from the dropdown menu.
|
| 38 |
+
3. **Pick a Speaker:** Select a speaker corresponding to the chosen language.
|
| 39 |
+
4. **Generate Audio:** Click the "Generate Magical Audio" button to create the narration.
|
| 40 |
+
5. **Listen to the Output:** Once generated, the audio file will be available for playback.
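
The five steps above map onto a small Gradio layout. The sketch below assumes Gradio's Blocks API (the Space metadata pins `sdk_version: 5.18.0`) and the `edge-tts` package. The component labels, the `VOICES` table, and the helper names are hypothetical, and the real `app.py` may be organised differently.

```python
import tempfile

VOICES = {  # hypothetical language -> Edge TTS voice IDs table
    "English": ["en-US-AriaNeural", "en-US-GuyNeural"],
    "Arabic": ["ar-SA-HamedNeural", "ar-EG-SalmaNeural"],
}


def speaker_choices(language):
    """Return (choices, default) for the speaker dropdown of a language."""
    choices = VOICES.get(language, [])
    return choices, (choices[0] if choices else None)


def build_demo():
    """Assemble the UI; third-party imports are deferred to this call."""
    import edge_tts
    import gradio as gr

    def on_language_change(language):
        # Step 3 depends on step 2: refresh the speakers for the new language.
        choices, default = speaker_choices(language)
        return gr.Dropdown(choices=choices, value=default)

    async def on_generate(story, speaker):
        # Step 4: synthesize the story and return a file path for playback.
        if not story.strip():
            raise gr.Error("Please enter a story first.")
        with tempfile.NamedTemporaryFile(suffix=".mp3", delete=False) as tmp:
            path = tmp.name
        await edge_tts.Communicate(story, speaker).save(path)
        return path

    with gr.Blocks() as demo:
        story = gr.Textbox(label="Your story", lines=6)            # step 1
        language = gr.Dropdown(choices=list(VOICES), value="English",
                               label="Language")                   # step 2
        choices, default = speaker_choices("English")
        speaker = gr.Dropdown(choices=choices, value=default,
                              label="Speaker")                     # step 3
        button = gr.Button("Generate Magical Audio")               # step 4
        audio = gr.Audio(label="Narration", type="filepath")       # step 5

        language.change(on_language_change, inputs=language, outputs=speaker)
        button.click(on_generate, inputs=[story, speaker], outputs=audio)
    return demo
```

Calling `build_demo().launch()` would start the interface locally.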
|
| 41 |
+
|
| 42 |
+
#### **Justification for Model and Pipeline Choices**
|
| 43 |
+
- **Edge TTS Service:** Chosen for its neural voice synthesis, which produces natural-sounding speech and supports multiple languages, including the English and Arabic options offered in the app.
|
| 44 |
+
- **User-Friendly Interface:** The application is built with Gradio, which supplies the text input, dropdowns, button, and audio playback described above without custom front-end code.
|
| 45 |
|
|
|