Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
Paper
• 1910.10288 • Published
German, male single-speaker Tacotron2 TTS model trained with dynamic convolutional attention (DCA) on the Thorsten Voice dataset.
Trained by @Thorsten-Voice and originally published at: https://github.com/coqui-ai/TTS/releases/v0.6.1_models/
With Coqui TTS:
from TTS.api import TTS
tts = TTS('tts_models/de/thorsten/tacotron2-DCA')
tts.tts_to_file("Guten Tag", file_path="output.wav")