Spaces:

crackuser
/

voiceclone-dev

Running

App Files Files Community

crackuser commited on Sep 10, 2025

Commit

60dcf48

verified ·

1 Parent(s): 9a26f4f

Update README.md

Browse files

Files changed (1) hide show

README.md +61 -15

README.md CHANGED Viewed

@@ -1,20 +1,66 @@
-# Voice Cloning Studio
-Real voice-to-voice and text-to-speech cloning using XTTS-v2 and Whisper.
-## Features
-- **Voice-to-Voice Cloning**: Transform input audio using reference voice
-- **Text-to-Speech**: Generate speech in cloned voice
-- **Multi-language Support**: 8+ languages supported
-- **High Quality**: Professional 24kHz audio output
-## How to Use
-1. Upload reference voice (6+ seconds)
-2. Either upload input audio or enter text
-3. Click clone/generate button
-4. Download result
-## Technical Details
-- **TTS Model**: XTTS-v2 (Coqui AI)
-- **Speech Recognition**: Whisper (OpenAI)
 - **Languages**: English, Spanish, French, German, Italian, Portuguese, Chinese, Japanese

+---
+title: Voice Cloning Studio
+emoji: 🎤
+colorFrom: blue
+colorTo: purple
+sdk: gradio
+sdk_version: "4.44.0"
+app_file: app.py
+pinned: false
+preload_from_hub:
+- coqui/XTTS-v2
+- openai/whisper-base
+---
+# 🎭 Voice Cloning Studio
+Real voice-to-voice and text-to-speech cloning using XTTS-v2 and Whisper AI.
+## ✨ Features
+- **🎤 Voice-to-Voice Cloning**: Transform input audio using reference voice characteristics
+- **📝 Text-to-Speech**: Generate speech in any cloned voice
+- **🌍 Multi-language Support**: 8+ languages supported
+- **🎵 High Quality**: Professional 24kHz audio output
+- **⚡ Real-time Processing**: Fast voice cloning with XTTS-v2
+## 🚀 How to Use
+### Voice-to-Voice Cloning
+1. **Upload Reference Voice** - 6+ seconds of clear speech from the person to clone
+2. **Upload Input Audio** - Speech content you want to transform
+3. **Select Language** - Choose target language
+4. **Click "Clone Voice"** - AI will extract content and apply reference voice
+5. **Download Result** - New audio with same content, different voice
+### Text-to-Speech Cloning
+1. **Upload Reference Voice** - Voice sample to clone
+2. **Enter Text** - Type what you want the cloned voice to say
+3. **Generate Speech** - Create natural speech in the cloned voice
+4. **Download Result** - High-quality synthesized audio
+## 🔧 Technical Details
+- **TTS Model**: XTTS-v2 (Coqui AI) - State-of-the-art voice cloning
+- **Speech Recognition**: Whisper (OpenAI) - Accurate transcription
 - **Languages**: English, Spanish, French, German, Italian, Portuguese, Chinese, Japanese
+- **Quality**: 24kHz professional audio generation
+- **Processing**: CPU/GPU optimized with automatic fallbacks
+## 💡 Tips for Best Results
+- **Reference Audio**: Use clear, single-speaker recordings with minimal background noise
+- **Length**: 6-10 seconds of reference audio works best
+- **Quality**: Higher quality input leads to better cloning results
+- **Language**: Match reference voice language when possible for optimal results
+## 🛠️ Built With
+- [XTTS-v2](https://huggingface.co/coqui/XTTS-v2) - Voice cloning model
+- [Whisper](https://github.com/openai/whisper) - Speech recognition
+- [Gradio](https://gradio.app/) - Web interface
+- [HuggingFace Spaces](https://huggingface.co/spaces) - Hosting platform
+---
+**Note**: This space implements real voice cloning technology. Please use responsibly and respect others' voice rights and privacy.