# StyleTTS2 + Vocos with LibriTTS Dataset

```
StyleTTS2
├── README.md
└── Vocos
    └── LibriTTS
        └── [checkpoint files]
```

This model was trained using the train-clean-100 and train-clean-360 subsets of the LibriTTS dataset.

## Model Information

- **Model Architecture**: StyleTTS2 + Vocos
- **Training Data**: LibriTTS train-clean-100 + train-clean-360
- **License**: MIT

The training and inference code can be found at: [StyleTTS2-Vocos](https://github.com/5Hyeons/StyleTTS2-Vocos)

## License

This model is released under the MIT License. This is one of the most permissive open-source licenses, allowing for both commercial and non-commercial use, modification, and distribution.

---

# StyleTTS2 + Vocos with AIHUB Dataset

```
StyleTTS2
├── README.md
└── Vocos
    └── AIHUB
        └── [checkpoint files]
```

This model was trained using multiple datasets from AIHUB:

1. **Korean Data** (~1000 hours)
   - Source: [감성 및 발화스타일 동시 고려 음성합성 데이터](https://www.aihub.or.kr/aihubdata/data/view.do?currMenu=115&topMenu=100&dataSetSn=71349)

2. **English & Japanese Data** (~1000 hours)
   - Source: [다국어 통·번역 낭독체 데이터](https://www.aihub.or.kr/aihubdata/data/view.do?currMenu=115&topMenu=100&dataSetSn=71524)

3. **Chinese Data** (~500 hours)
   - Source: [한-영 및 한-중 음성발화 데이터](https://www.aihub.or.kr/aihubdata/data/view.do?currMenu=&topMenu=&aihubDataSe=data&dataSetSn=71261)

Total samples: ~1.4M

## Model Information

- **Model Architecture**: StyleTTS2 + Vocos
- **Training Data**: AIHUB Multilingual Dataset
- **License**: CC BY-NC 4.0

The training and inference code can be found at: [StyleTTS2-Vocos](https://github.com/5Hyeons/StyleTTS2-Vocos)

## License

This model is released under the CC BY-NC 4.0 License. This license allows for non-commercial use, modification, and distribution, as long as appropriate credit is given.