Thomcles's picture
Update README.md
2b5c020 verified
---
license: cc0-1.0
language:
- cs
base_model:
- ResembleAI/chatterbox
pipeline_tag: text-to-speech
---
# Chatterbox Czech
## **training quality TTS with low ressource data**
<div align="center"><img width="400px" src="https://www.shutterstock.com/image-vector/travel-czech-republic-culture-elements-600nw-2588019031.jpg" alt="Czech-image" /></div>
### demo audios:
"Dobrý den, vítáme vás v našem testu syntézy řeči"
<audio controls src="https://huggingface.co/Thomcles/Chatterbox-TTS-French/resolve/main/demo_audios/cs_0.mp3">Your browser does not support audio.</audio>
"Tři sta třiatřicet stříbrných křepelek přeletělo přes tři stříbrné střechy"
<audio controls src="https://huggingface.co/Thomcles/Chatterbox-TTS-French/resolve/main/demo_audios/cs_1.mp3">Your browser does not support audio.</audio>
"Kolik stojí devět tisíc osm set sedmdesát pět korun ?"
<audio controls src="https://huggingface.co/Thomcles/Chatterbox-TTS-French/resolve/main/demo_audios/cs_2.mp3">Your browser does not support audio.</audio>
"Prosím, nastav hlasitost na sedmdesát procent a přehraj znovu"
<audio controls src="https://huggingface.co/Thomcles/Chatterbox-TTS-French/resolve/main/demo_audios/cs_3.mp3">Your browser does not support audio.</audio>
"Doktor Křivohlavý napsal článek o umělé inteligenci"
<audio controls src="https://huggingface.co/Thomcles/Chatterbox-TTS-French/resolve/main/demo_audios/cs_4.mp3">Your browser does not support audio.</audio>
"Zvon zvoní, z dálky zní, ozvěna se vrací do údolí"
<audio controls src="https://huggingface.co/Thomcles/Chatterbox-TTS-French/resolve/main/demo_audios/cs_5.mp3">Your browser does not support audio.</audio>
### 💻 Inference Code
First, download the file from huggingface and place it in the current directory.
The pypi version is delayed, so you must use the github version.
```
!git clone https://github.com/resemble-ai/chatterbox.git chatterbox_git
```
```
pip install chatterbox-tts
```
```python
from chatterbox_git.src.chatterbox import mtl_tts
import torchaudio as ta
from safetensors.torch import load_file as load_safetensors
device = "cpu" # or mps or cuda
multilingual_model = mtl_tts.ChatterboxMultilingualTTS.from_pretrained(device=device)
# ----
# Then download the file from huggingface and place it in the current directory.
# ----
t3_state = load_safetensors("t3_cs.safetensors", device="cpu")
multilingual_model.t3.load_state_dict(t3_state)
multilingual_model.t3.to(device).eval()
czech_text = "Dobrý den, vítáme vás v našem testu syntézy řeči"
wav_czech = multilingual_model.generate(czech_text)
ta.save("test-cs.wav", wav_czech, multilingual_model.sr)
```
## contact :
e-mail : cyprienoucortex@gmail.com
## ☕ Support
I trained this model from my own financial resources with the sole aim of offering it to the huggingface open source community.
This model has cost me a lot of money. If you find this checkpoint useful and would like to support my work, you can do it via Ko-fi:
<p align="center">
<a href="https://ko-fi.com/thomcles" target="_blank" rel="noopener noreferrer">
<img src="https://storage.ko-fi.com/cdn/kofi3.png?v=3" alt="Buy Me a Coffee at ko-fi.com" width="200" rel="noopener noreferrer"/>
</a>
</p>