| datasets: | |
| - mesolitica/TTS | |
| language: | |
| - ms | |
| # StyleTTS2 MS | |
| Forked at https://github.com/mesolitica/StyleTTS2-MS, only trained on first stage. | |
| ## Pre-trained modules | |
| 1. Forked original [yl4579/AuxiliaryASR](https://github.com/yl4579/AuxiliaryASR) at [mesolitica/AuxiliaryASR-Phonemizer](https://github.com/mesolitica/AuxiliaryASR-Phonemizer) to use `ms` phonemizer and trained on [mesolitica/tts-combine-annotated](https://huggingface.co/datasets/mesolitica/tts-combine-annotated) dataset. | |
| 2. Forked original [PL-BERT](https://arxiv.org/abs/2301.08810) at [malaysia-ai/PL-BERT-MS](https://github.com/malaysia-ai/PL-BERT-MS) to use custom word tokenizer and pretrained on Malay Wikipedia and local news. | |
| ## Checkpoints | |
| We uploaded full checkpoints with optimizer states at [checkpoints-first-stage](checkpoints-first-stage). | |
| ## Dataset | |
| We train on [Mesolitica/TTS](https://huggingface.co/datasets/mesolitica/TTS). |