metadata
title: TTS Gallery
emoji: 📣
colorFrom: purple
colorTo: pink
sdk: gradio
sdk_version: 5.44.1
app_file: app.py
pinned: true
TTS Gallery
This demo showcases several text-to-speech (TTS) models side by side, with support for English and Chinese.
Features
- Text-to-speech generation for English and Chinese
- Gradio web interface for easy interaction
- Real-time audio generation and playback
- Example texts for quick testing
- Support for multiple TTS architectures including seq2seq models
Requirements
- Python 3.8 or higher
- Required Python packages (installed automatically by Hugging Face):
  - chatterbox-tts
  - gradio
  - torchaudio
  - torch
Usage
- Enter text in the input box
- Select the language (English or Chinese)
- Click "Generate Speech"
- Listen to the generated audio
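The steps above map onto a small Gradio layout. The following is only a minimal sketch, not the actual `app.py` of this Space: `generate_speech` and `build_demo` are hypothetical names, and the synthesis step is replaced by a placeholder that writes a short sine tone to a WAV file so the example runs without any TTS model installed.

```python
import math
import struct
import tempfile
import wave

LANGUAGES = ("English", "Chinese")


def generate_speech(text: str, language: str) -> str:
    """Placeholder for a real TTS call: writes a short tone and returns the WAV path."""
    if not text.strip():
        raise ValueError("Please enter some text.")
    if language not in LANGUAGES:
        raise ValueError(f"Unsupported language: {language}")

    sample_rate = 16000
    # Assumption for the sketch: tone length grows with the text, capped at 2 seconds.
    duration_s = min(0.05 * len(text), 2.0)
    n_samples = int(sample_rate * duration_s)
    frames = b"".join(
        struct.pack("<h", int(8000 * math.sin(2 * math.pi * 440 * i / sample_rate)))
        for i in range(n_samples)
    )

    tmp = tempfile.NamedTemporaryFile(suffix=".wav", delete=False)
    tmp.close()
    with wave.open(tmp.name, "wb") as wav:
        wav.setnchannels(1)   # mono
        wav.setsampwidth(2)   # 16-bit samples
        wav.setframerate(sample_rate)
        wav.writeframes(frames)
    return tmp.name


def build_demo():
    """Wire the four usage steps into a Gradio Blocks app (gradio is imported lazily)."""
    import gradio as gr

    with gr.Blocks(title="TTS demo sketch") as demo:
        text = gr.Textbox(label="Text")
        language = gr.Radio(choices=list(LANGUAGES), value="English", label="Language")
        button = gr.Button("Generate Speech")
        audio = gr.Audio(label="Generated audio", type="filepath")
        button.click(fn=generate_speech, inputs=[text, language], outputs=audio)
    return demo
```

Calling `build_demo().launch()` would start the interface locally. In a real app, the body of `generate_speech` would call the selected TTS model instead of producing a tone.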
Supported Languages
- English
- Chinese
Supported Models
- Chatterbox: Industrial-grade multilingual TTS solution
- KittenTTS: High-quality TTS with voice cloning capabilities
- Piper: Local on-device TTS with multiple voice options
- Faster Whisper: High-performance speech recognition model for audio transcription
- Kokoro: Lightweight TTS model with 82M parameters, Apache-licensed for production and personal use
Examples
The interface includes example texts for both languages to help you get started quickly.
Notes
- The first generation may take a moment as the model loads
- Subsequent generations will be faster
- For best results, use clear and properly punctuated text
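The slow first generation and faster later ones are typical of loading a model on first use and then reusing it. One common way to get that behavior is to cache the loaded model, sketched below under stated assumptions: `get_model` is a hypothetical helper, the `time.sleep` call and the returned dictionary stand in for real weight loading and a real model object, and this is not necessarily how the Space itself is implemented.

```python
import time
from functools import lru_cache


@lru_cache(maxsize=None)
def get_model(name: str) -> dict:
    """Load a model once per name; later calls return the cached object."""
    time.sleep(0.2)  # stand-in for slow weight loading (assumption for the sketch)
    return {"name": name}  # hypothetical placeholder for a real model object
```

The first `get_model("kokoro")` call pays the loading cost, and every later call with the same name returns the same cached object immediately.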