Create README.md
Browse files
README.md
ADDED
|
@@ -0,0 +1,12 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
**DISCLAIMER** : I do not own any weights present in this repository. All weights belong to the author of the
|
| 2 |
+
paper - "Better speech synthesis through scaling", James Betker . I am storing the weights(temporarily) for the `tortoise-tts` integration
|
| 3 |
+
to Huggingface. Please refer to this [PR](https://github.com/huggingface/transformers/pull/24745) to know more.
|
| 4 |
+
|
| 5 |
+
|
| 6 |
+
|
| 7 |
+
|
| 8 |
+
<h3><u>About</u></h3>
|
| 9 |
+
|
| 10 |
+
CLVP model is an integral part of `tortoise-tts` presented in the paper - "Better speech synthesis through scaling" by James Betker.
|
| 11 |
+
CLVP uses an architecture similar to the CLIP text encoder, except it uses two of them: one for text
|
| 12 |
+
tokens and the other for MEL tokens.
|