# StyleTTS2 + Vocos with LibriTTS Dataset ``` StyleTTS2 ├── README.md └── Vocos └── LibriTTS └── [checkpoint files] ``` This model was trained using the train-clean-100 and train-clean-360 subsets of the LibriTTS dataset. ## Model Information - **Model Architecture**: StyleTTS2 + Vocos - **Training Data**: LibriTTS train-clean-100 + train-clean-360 - **License**: MIT The training and inference code can be found at: [StyleTTS2-Vocos](https://github.com/5Hyeons/StyleTTS2-Vocos) ## License This model is released under the MIT License. This is one of the most permissive open-source licenses, allowing for both commercial and non-commercial use, modification, and distribution. --- # StyleTTS2 + Vocos with AIHUB Dataset ``` StyleTTS2 ├── README.md └── Vocos └── AIHUB └── [checkpoint files] ``` This model was trained using multiple datasets from AIHUB: 1. **Korean Data** (~1000 hours) - Source: [감성 및 발화스타일 동시 고려 음성합성 데이터](https://www.aihub.or.kr/aihubdata/data/view.do?currMenu=115&topMenu=100&dataSetSn=71349) 2. **English & Japanese Data** (~1000 hours) - Source: [다국어 통·번역 낭독체 데이터](https://www.aihub.or.kr/aihubdata/data/view.do?currMenu=115&topMenu=100&dataSetSn=71524) 3. **Chinese Data** (~500 hours) - Source: [한-영 및 한-중 음성발화 데이터](https://www.aihub.or.kr/aihubdata/data/view.do?currMenu=&topMenu=&aihubDataSe=data&dataSetSn=71261) Total samples: ~1.4M ## Model Information - **Model Architecture**: StyleTTS2 + Vocos - **Training Data**: AIHUB Multilingual Dataset - **License**: CC BY-NC 4.0 The training and inference code can be found at: [StyleTTS2-Vocos](https://github.com/5Hyeons/StyleTTS2-Vocos) ## License This model is released under the CC BY-NC 4.0 License. This license allows for non-commercial use, modification, and distribution, as long as appropriate credit is given.