susnato
/

clvp_dev

Feature Extraction

Model card Files Files and versions

clvp_dev / README.md

susnato's picture

Create README.md

8819783 over 2 years ago

|

history blame contribute delete

661 Bytes

	DISCLAIMER : I do not own any weights present in this repository. All weights belong to the author of the
	paper - "Better speech synthesis through scaling", James Betker . I am storing the weights(temporarily) for the `tortoise-tts` integration
	to Huggingface. Please refer to this [PR](https://github.com/huggingface/transformers/pull/24745) to know more.




	<h3><u>About</u></h3>

	CLVP model is an integral part of `tortoise-tts` presented in the paper - "Better speech synthesis through scaling" by James Betker.
	CLVP uses an architecture similar to the CLIP text encoder, except it uses two of them: one for text
	tokens and the other for MEL tokens.