Spaces:

geeaiml
/

Audio_stories

Running

App Files Files Community

geeaiml commited on Feb 26, 2025

Commit

1a956dd

verified ·

1 Parent(s): 8c6b704

Update README.md

Browse files

Files changed (1) hide show

README.md +34 -1

README.md CHANGED Viewed

@@ -8,5 +8,38 @@ sdk_version: 5.18.0
 app_file: app.py
 pinned: false
 ---
-Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

 app_file: app.py
 pinned: false
 ---
+Here's a refined version of your text with improved clarity and flow:
+---
+Here's a refined version of your text with improved clarity and flow:
+---
+Here's a refined version of your text with improved clarity and flow:
+---
+### **Storytelling Text-to-Speech (TTS) Application**
+#### **Project Objectives**
+This project aims to develop an interactive Text-to-Speech (TTS) application that enables users to input stories and have them narrated using diverse voices in multiple languages.
+#### **Implemented Pipelines**
+The application leverages the Edge TTS service for high-quality voice synthesis, offering a range of neural voices across different languages. Key functionalities include:
+- **Text Input:** Users can enter the story they wish to convert into speech.
+- **Language Selection:** A dropdown menu allows users to choose from various languages, including English and Arabic.
+- **Speaker Selection:** Based on the selected language, users can pick from a list of available speakers.
+- **Audio Generation:** Clicking the “Generate Magical Audio” button processes the text and produces an audio file.
+- **Audio Playback:** The generated audio file is displayed, allowing users to listen to the narration.
+#### **How to Use the Interface**
+1. **Enter Your Story:** Type your text into the provided input field.
+2. **Select Language:** Choose your preferred language from the dropdown menu.
+3. **Pick a Speaker:** Select a speaker corresponding to the chosen language.
+4. **Generate Audio:** Click the "Generate Magical Audio" button to create the narration.
+5. **Listen to the Output:** Once generated, the audio file will be available for playback.
+#### **Justification for Model and Pipeline Choices**
+- **Edge TTS Service:** Chosen for its advanced neural voice synthesis, Edge TTS delivers natural-sounding speech and supports multiple languages, enhancing the storytelling experience.
+- **User-Friendly Interface:** The application is built using Gradio, ensuring an intuitive and interactive user experience for seamless text-to-speech conversion.