	---
	tags:
	- tts
	- audio
	language:
	- en
	- zh
	pipeline_tag: text-to-speech
	license: cc-by-nc-sa-4.0
	---

	## MiraTTS

	This is the model for the [MiraTTS](https://github.com/ysharma3501/MiraTTS) repository.
	MiraTTS is a high-quality TTS model that generates clear, realistic speech at up to 100x realtime.

	## Key benefits
	- Incredibly fast: Over 100x realtime when using LMDeploy and batching.
	- High quality: Generates clear, crisp 48 kHz audio, which is much higher quality than most models produce.
	- Memory efficient: Works within 6 GB of VRAM.
	- Low latency: Latency can be as low as 100 ms.
	- Voice cloning: Can clone any voice with good quality.
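
As a rough illustration of what the speed and sample-rate figures above mean, the sketch below computes a realtime factor (seconds of audio produced per second of wall-clock time) and the sample count of a clip at 48 kHz. The helper names `realtime_factor` and `num_samples` are hypothetical and not part of MiraTTS, and the numbers are made up for the arithmetic rather than measured.

```python
def realtime_factor(audio_seconds: float, wall_seconds: float) -> float:
    """Return seconds of audio produced per second of wall-clock time."""
    if wall_seconds <= 0:
        raise ValueError("wall_seconds must be positive")
    return audio_seconds / wall_seconds


def num_samples(audio_seconds: float, sample_rate_hz: int = 48_000) -> int:
    """Return the number of samples in a mono clip at the given sample rate."""
    return round(audio_seconds * sample_rate_hz)


# Hypothetical numbers for illustration, not benchmark results:
# a batch that yields 600 s of audio after 6 s of compute.
rtf = realtime_factor(audio_seconds=600.0, wall_seconds=6.0)
print(f"{rtf:.0f}x realtime")  # 100x realtime

# A 10-second mono clip at 48 kHz.
print(num_samples(10.0))  # 480000
```

Throughput figures like this usually depend on batching, since the total audio across the whole batch is divided by a single wall-clock time. Per-request latency is a separate measurement.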

	Random samples (not cherry-picked):
	<audio controls>
	<source src="https://huggingface.co/YatharthS/MiraTTS/resolve/main/example2.wav" type="audio/wav">
	Your browser does not support the audio element.
	</audio>
	<audio controls>
	<source src="https://huggingface.co/YatharthS/MiraTTS/resolve/main/example3.wav" type="audio/wav">
	Your browser does not support the audio element.
	</audio>
	<audio controls>
	<source src="https://huggingface.co/YatharthS/MiraTTS/resolve/main/example1.wav" type="audio/wav">
	Your browser does not support the audio element.
	</audio>

	Thanks to Gapeleon for creating a great Space for this model. You can try it here: https://huggingface.co/spaces/Gapeleon/Mira-TTS

	If you find this model or code helpful, please give it a like or a star. Thank you.

	Please check out the [GitHub repo](https://github.com/ysharma3501/MiraTTS) for usage and fine-tuning notebooks.