File size: 2,047 Bytes
ddc047c
 
 
 
 
 
 
 
 
cc17649
 
 
 
 
371e46d
cc17649
 
 
 
 
ddc047c
 
 
 
 
 
 
 
cc17649
 
 
 
ddc047c
cc17649
 
 
ddc047c
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
cc17649
 
ddc047c
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
# StyleTTS2 + Vocos with LibriTTS Dataset

```

StyleTTS2

β”œβ”€β”€ README.md

└── Vocos

    └── LibriTTS

        └── [checkpoint files]

```

This model was trained using the train-clean-100 and train-clean-360 subsets of the LibriTTS dataset.

## Model Information

- **Model Architecture**: StyleTTS2 + Vocos
- **Training Data**: LibriTTS train-clean-100 + train-clean-360
- **License**: MIT

The training and inference code can be found at: [StyleTTS2-Vocos](https://github.com/5Hyeons/StyleTTS2-Vocos)

## License

This model is released under the MIT License. This is one of the most permissive open-source licenses, allowing for both commercial and non-commercial use, modification, and distribution.

---

# StyleTTS2 + Vocos with AIHUB Dataset

```

StyleTTS2

β”œβ”€β”€ README.md

└── Vocos

    └── AIHUB

        └── [checkpoint files]

```

This model was trained using multiple datasets from AIHUB:

1. **Korean Data** (~1000 hours)
   - Source: [감성 및 λ°œν™”μŠ€νƒ€μΌ λ™μ‹œ κ³ λ € μŒμ„±ν•©μ„± 데이터](https://www.aihub.or.kr/aihubdata/data/view.do?currMenu=115&topMenu=100&dataSetSn=71349)

2. **English & Japanese Data** (~1000 hours)
   - Source: [λ‹€κ΅­μ–΄ ν†΅Β·λ²ˆμ—­ 낭독체 데이터](https://www.aihub.or.kr/aihubdata/data/view.do?currMenu=115&topMenu=100&dataSetSn=71524)

3. **Chinese Data** (~500 hours)
   - Source: [ν•œ-영 및 ν•œ-쀑 μŒμ„±λ°œν™” 데이터](https://www.aihub.or.kr/aihubdata/data/view.do?currMenu=&topMenu=&aihubDataSe=data&dataSetSn=71261)

Total samples: ~1.4M

## Model Information

- **Model Architecture**: StyleTTS2 + Vocos
- **Training Data**: AIHUB Multilingual Dataset
- **License**: CC BY-NC 4.0

The training and inference code can be found at: [StyleTTS2-Vocos](https://github.com/5Hyeons/StyleTTS2-Vocos)

## License

This model is released under the CC BY-NC 4.0 License. This license allows for non-commercial use, modification, and distribution, as long as appropriate credit is given.