YourTTS paper
Browse files
README.md
CHANGED
|
@@ -50,7 +50,10 @@ xVAPitch_5820651 model sample: <audio controls>
|
|
| 50 |
Your browser does not support the audio element.
|
| 51 |
</audio>
|
| 52 |
|
| 53 |
-
|
|
|
|
|
|
|
|
|
|
| 54 |
- Multi-head attention with Relative Positional embedding - https://arxiv.org/pdf/1809.04281.pdf
|
| 55 |
- Transformer with Relative Potional Encoding- https://arxiv.org/abs/1803.02155
|
| 56 |
- SDP - https://arxiv.org/pdf/2106.06103.pdf
|
|
|
|
| 50 |
Your browser does not support the audio element.
|
| 51 |
</audio>
|
| 52 |
|
| 53 |
+
Papers:
|
| 54 |
+
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for Everyone - https://arxiv.org/abs/2112.02418
|
| 55 |
+
|
| 56 |
+
Referenced papers within code:
|
| 57 |
- Multi-head attention with Relative Positional embedding - https://arxiv.org/pdf/1809.04281.pdf
|
| 58 |
- Transformer with Relative Potional Encoding- https://arxiv.org/abs/1803.02155
|
| 59 |
- SDP - https://arxiv.org/pdf/2106.06103.pdf
|