Voice latents pre-computed in voices/

For use in the MRQ Voice Cloning WebUI:

Requires the tokenizer used in training, and code changes to disable the text cleaners. At minimum, change `english_cleaners` to `basic_cleaners`.
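
Why this matters: in the tacotron2-style cleaners these codebases use, `english_cleaners` ASCII-folds the text, which strips German umlauts and ß, while `basic_cleaners` only lowercases and collapses whitespace. The snippet below is an illustrative stand-in (not the actual library code) showing the difference:

```python
# Illustrative stand-ins, not the real cleaners from tortoise/DLAS: they only
# demonstrate why ASCII-folding breaks German text while basic cleaning keeps it.
import re
import unicodedata

_whitespace_re = re.compile(r"\s+")

def collapse_whitespace(text):
    return _whitespace_re.sub(" ", text).strip()

def basic_cleaners_sketch(text):
    # lowercase + collapse whitespace; umlauts and ß survive
    return collapse_whitespace(text.lower())

def english_cleaners_sketch(text):
    # rough stand-in for the ASCII-folding step of english_cleaners
    ascii_text = unicodedata.normalize("NFKD", text).encode("ascii", "ignore").decode()
    return collapse_whitespace(ascii_text.lower())

print(basic_cleaners_sketch("Schöne Grüße aus München"))    # schöne grüße aus münchen
print(english_cleaners_sketch("Schöne Grüße aus München"))  # schone grue aus munchen
```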

Code changes (a helper-script sketch that applies all of these edits follows the list):

`modules\tortoise-tts\tortoise\utils\tokenizer.py`
Line 201: replace `txt = english_cleaners(txt)` with `txt = basic_cleaners(txt)`.

`modules\tortoise-tts\build\lib\tortoise\utils\tokenizer.py`
Line 201: replace `txt = english_cleaners(txt)` with `txt = basic_cleaners(txt)`.

`modules\dlas\dlas\data\audio\paired_voice_audio_dataset.py`
Line 133: replace `return text_to_sequence(txt, ['english_cleaners'])` with `return text_to_sequence(txt, ['basic_cleaners'])`.

`modules\dlas\dlas\data\audio\voice_tokenizer.py`
Line 14: change `from dlas.models.audio.tts.tacotron2.text.cleaners import english_cleaners` to `from dlas.models.audio.tts.tacotron2.text.cleaners import english_cleaners, basic_cleaners`.
Line 85: change `txt = english_cleaners(txt)` to `txt = basic_cleaners(txt)`.
Line 134: change `word = english_cleaners(word)` to `word = basic_cleaners(word)`.
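
If you would rather script the edits than make them by hand, a sketch like the one below applies the same substitutions. It is not part of this repo or the WebUI; the paths are the ones listed above (relative to the WebUI root, Windows-style separators), and it assumes the quoted lines appear verbatim, so review the files afterwards.

```python
# Sketch of a helper that applies the english_cleaners -> basic_cleaners edits
# listed above. Run it from the WebUI root; paths use Windows-style separators
# exactly as given in the list, so adjust them for other platforms.
from pathlib import Path

EDITS = {
    r"modules\tortoise-tts\tortoise\utils\tokenizer.py": [
        ("txt = english_cleaners(txt)", "txt = basic_cleaners(txt)"),
    ],
    r"modules\tortoise-tts\build\lib\tortoise\utils\tokenizer.py": [
        ("txt = english_cleaners(txt)", "txt = basic_cleaners(txt)"),
    ],
    r"modules\dlas\dlas\data\audio\paired_voice_audio_dataset.py": [
        ("return text_to_sequence(txt, ['english_cleaners'])",
         "return text_to_sequence(txt, ['basic_cleaners'])"),
    ],
    r"modules\dlas\dlas\data\audio\voice_tokenizer.py": [
        ("from dlas.models.audio.tts.tacotron2.text.cleaners import english_cleaners",
         "from dlas.models.audio.tts.tacotron2.text.cleaners import english_cleaners, basic_cleaners"),
        ("txt = english_cleaners(txt)", "txt = basic_cleaners(txt)"),
        ("word = english_cleaners(word)", "word = basic_cleaners(word)"),
    ],
}

for name, replacements in EDITS.items():
    path = Path(name)
    text = path.read_text(encoding="utf-8")
    for old, new in replacements:
        if old not in text:
            print(f"WARNING: pattern not found in {name}: {old}")
        text = text.replace(old, new)
    path.write_text(text, encoding="utf-8")
    print(f"patched {name}")
```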

Copy and paste German text into the tokenizer tester on the Utilities tab, and you should see it tokenized with all of the special characters and no [UNK].
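
The tokenizer tester in the WebUI is the easiest check, but you can also sanity-check the tokenizer directly with the Hugging Face `tokenizers` library. The file name below is a placeholder; point it at the tokenizer JSON from this repo.

```python
# Quick check outside the WebUI: encode German text with the trained tokenizer
# and confirm no [UNK] tokens appear. The file name is a placeholder; use the
# tokenizer JSON from this repo.
from tokenizers import Tokenizer

tok = Tokenizer.from_file("tokenizers/german_tokenizer.json")  # placeholder path

text = "schöne grüße aus münchen"  # already lowercased, as basic_cleaners would produce
tokens = tok.encode(text).tokens
print(tokens)
assert "[UNK]" not in tokens, "German characters are missing from the tokenizer"
```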

---
license: other
language: