MiraTTS / README.md
YatharthS's picture
Update README.md
ff750bd verified
---
tags:
- tts
- audio
language:
- en
- zh
pipeline_tag: text-to-speech
license: cc-by-nc-sa-4.0
---
## MiraTTS
This is the model for the [MiraTTS](https://github.com/ysharma3501/MiraTTS) repository.
MiraTTS is a high quality TTS model that can generate clear and realistic speech at speeds as fast as 100x realtime.
## Key benefits
- Incredibly fast: Over 100x realtime by using Lmdeploy and batching.
- High quality: Generates clear and crisp 48khz audio outputs which is much higher quality then most models.
- Memory efficient: Works within 6gb vram.
- Low latency: Latency can be low as 100ms.
- Voice cloning: Can voice clone any voice with good quality.
Random samples, non cherry picked:
<audio controls>
<source src="https://huggingface.co/YatharthS/MiraTTS/resolve/main/example2.wav" type="audio/wav">
Your browser does not support the audio element.
</audio>
<audio controls>
<source src="https://huggingface.co/YatharthS/MiraTTS/resolve/main/example3.wav" type="audio/wav">
Your browser does not support the audio element.
</audio>
<audio controls>
<source src="https://huggingface.co/YatharthS/MiraTTS/resolve/main/example1.wav" type="audio/wav">
Your browser does not support the audio element.
</audio>
Thanks to Gapeleon for creating a great space for this model, you can try it here: https://huggingface.co/spaces/Gapeleon/Mira-TTS
If you find this model/code helpful, please give a like or star. Thank you.
Please check out the [github repo](https://github.com/ysharma3501/MiraTTS) for usage and finetuning notebooks.