Commit 60dcf48 (verified) by crackuser · 1 parent: 9a26f4f

Update README.md

Files changed (1): README.md +61 -15
@@ -1,20 +1,66 @@
- # Voice Cloning Studio

- Real voice-to-voice and text-to-speech cloning using XTTS-v2 and Whisper.

- ## Features
- - **Voice-to-Voice Cloning**: Transform input audio using reference voice
- - **Text-to-Speech**: Generate speech in cloned voice
- - **Multi-language Support**: 8+ languages supported
- - **High Quality**: Professional 24kHz audio output

- ## How to Use
- 1. Upload reference voice (6+ seconds)
- 2. Either upload input audio or enter text
- 3. Click clone/generate button
- 4. Download result

- ## Technical Details
- - **TTS Model**: XTTS-v2 (Coqui AI)
- - **Speech Recognition**: Whisper (OpenAI)
  - **Languages**: English, Spanish, French, German, Italian, Portuguese, Chinese, Japanese
+ ---
+ title: Voice Cloning Studio
+ emoji: 🎤
+ colorFrom: blue
+ colorTo: purple
+ sdk: gradio
+ sdk_version: "4.44.0"
+ app_file: app.py
+ pinned: false
+ preload_from_hub:
+ - coqui/XTTS-v2
+ - openai/whisper-base
+ ---
 
+ # 🎭 Voice Cloning Studio

+ Real voice-to-voice and text-to-speech cloning using XTTS-v2 and Whisper AI.

+ ## ✨ Features

+ - **🎤 Voice-to-Voice Cloning**: Transform input audio using reference voice characteristics
+ - **📝 Text-to-Speech**: Generate speech in any cloned voice
+ - **🌍 Multi-language Support**: 8+ languages supported
+ - **🎵 High Quality**: Professional 24kHz audio output
+ - **⚡ Real-time Processing**: Fast voice cloning with XTTS-v2
+
+ ## 🚀 How to Use
+
+ ### Voice-to-Voice Cloning
+ 1. **Upload Reference Voice** - 6+ seconds of clear speech from the person to clone
+ 2. **Upload Input Audio** - Speech content you want to transform
+ 3. **Select Language** - Choose target language
+ 4. **Click "Clone Voice"** - AI will extract content and apply reference voice
+ 5. **Download Result** - New audio with same content, different voice
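
The voice-to-voice steps above read as a two-stage pipeline: recognize the words in the input audio, then speak them again in the reference voice. The diff does not include `app.py`, so the following is only a minimal sketch of that flow. `clone_voice`, `transcribe`, and `synthesize` are hypothetical names, and the two stages are passed in as plain callables so the example runs without either model installed.

```python
from typing import Callable

# Hypothetical stage signatures; the real app.py is not shown in this diff.
Transcribe = Callable[[str], str]              # input audio path -> recognized text
Synthesize = Callable[[str, str, str], bytes]  # text, reference path, language -> audio


def clone_voice(reference_wav: str, input_wav: str, language: str,
                transcribe: Transcribe, synthesize: Synthesize) -> bytes:
    """Extract the spoken content of input_wav, then re-speak it in the reference voice."""
    text = transcribe(input_wav).strip()
    if not text:
        raise ValueError("no speech recognized in the input audio")
    return synthesize(text, reference_wav, language)


# Stand-in stages so the sketch runs on its own (assumptions, not real models).
def fake_transcribe(path: str) -> str:
    return " hello world "


def fake_synthesize(text: str, reference_wav: str, language: str) -> bytes:
    return f"{language}|{reference_wav}|{text}".encode()


result = clone_voice("ref.wav", "input.wav", "en", fake_transcribe, fake_synthesize)
```

In a real Space, `transcribe` would presumably wrap a Whisper model and `synthesize` would wrap XTTS-v2 with the reference clip as the speaker sample. Keeping the stages injectable also makes the empty-transcript case easy to exercise.
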
+
+ ### Text-to-Speech Cloning
+ 1. **Upload Reference Voice** - Voice sample to clone
+ 2. **Enter Text** - Type what you want the cloned voice to say
+ 3. **Generate Speech** - Create natural speech in the cloned voice
+ 4. **Download Result** - High-quality synthesized audio
+
+ ## 🔧 Technical Details
+
+ - **TTS Model**: XTTS-v2 (Coqui AI) - State-of-the-art voice cloning
+ - **Speech Recognition**: Whisper (OpenAI) - Accurate transcription
  - **Languages**: English, Spanish, French, German, Italian, Portuguese, Chinese, Japanese
+ - **Quality**: 24kHz professional audio generation
+ - **Processing**: CPU/GPU optimized with automatic fallbacks
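
The README does not say how the "automatic fallbacks" in the Processing bullet work. One common pattern is to use the GPU when one is available and fall back to the CPU otherwise. The sketch below assumes the app runs on PyTorch; `pick_device` is a hypothetical helper, not something taken from the commit.

```python
def pick_device() -> str:
    """Return "cuda" when PyTorch reports a usable GPU, otherwise "cpu"."""
    try:
        import torch  # optional here; treated as absent if not installed
    except ImportError:
        return "cpu"
    return "cuda" if torch.cuda.is_available() else "cpu"


device = pick_device()
print(f"running on {device}")
```

The resulting string can then be passed to whatever model-loading call the app uses, so the same code path works on CPU-only and GPU hardware.
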
+
+ ## 💡 Tips for Best Results
+
+ - **Reference Audio**: Use clear, single-speaker recordings with minimal background noise
+ - **Length**: 6-10 seconds of reference audio works best
+ - **Quality**: Higher quality input leads to better cloning results
+ - **Language**: Match reference voice language when possible for optimal results
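
The length tip above can be checked before a clip is uploaded. This is a small sketch using only Python's standard `wave` module, so it handles uncompressed WAV files only. The helper names and the hard 6-10 second bounds are assumptions drawn from the tip, not behavior confirmed by the commit.

```python
import os
import tempfile
import wave


def wav_duration_seconds(path: str) -> float:
    """Duration of a PCM WAV file, computed from its header (frames / sample rate)."""
    with wave.open(path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def reference_length_ok(path: str, min_s: float = 6.0, max_s: float = 10.0) -> bool:
    """True when the clip falls inside the 6-10 second range suggested above."""
    return min_s <= wav_duration_seconds(path) <= max_s


def write_silence(path: str, seconds: float, rate: int = 24000) -> None:
    """Write a mono 16-bit silent WAV, used here only to exercise the check."""
    with wave.open(path, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)
        wav.setframerate(rate)
        wav.writeframes(b"\x00\x00" * int(seconds * rate))


# Usage: a 7-second clip passes the check, a 3-second clip does not.
with tempfile.TemporaryDirectory() as tmp:
    clip = os.path.join(tmp, "ref.wav")
    write_silence(clip, 7.0)
    long_enough = reference_length_ok(clip)
    write_silence(clip, 3.0)
    too_short = not reference_length_ok(clip)
```

A duration check says nothing about noise or the number of speakers, so it covers only the Length tip.
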
+
+ ## 🛠️ Built With
+
+ - [XTTS-v2](https://huggingface.co/coqui/XTTS-v2) - Voice cloning model
+ - [Whisper](https://github.com/openai/whisper) - Speech recognition
+ - [Gradio](https://gradio.app/) - Web interface
+ - [HuggingFace Spaces](https://huggingface.co/spaces) - Hosting platform
+
+ ---
+
+ **Note**: This space implements real voice cloning technology. Please use responsibly and respect others' voice rights and privacy.