Update README.md
Browse files
README.md
CHANGED
|
@@ -22,13 +22,13 @@ network specifications and layer configurations are provided in Appendix A.1 The
|
|
| 22 |
ogy of DistilCodec follows a similar approach to HiFiGAN, incorporating three types of
|
| 23 |
discriminators: Multi-Period Discriminator (MPD), Multi-Scale Discriminator (MSD), and Multi-
|
| 24 |
STFT Discriminator (MSFTFD). Here is the architecture of Distilcodec:
|
| 25 |
-
** |
|
| 28 |
|-----------------------------|--------------------------|
|
| 29 |
| Chinese Audiobook | 38000 |
|
| 30 |
| Chinese Common Audio | 20000 |
|
| 31 |
-
| English
|
| 32 |
| Music | 2000 |
|
| 33 |
| **Total** | **100000** |
|
| 34 |
|
|
|
|
| 22 |
ogy of DistilCodec follows a similar approach to HiFiGAN, incorporating three types of
|
| 23 |
discriminators: Multi-Period Discriminator (MPD), Multi-Scale Discriminator (MSD), and Multi-
|
| 24 |
STFT Discriminator (MSFTFD). Here is the architecture of Distilcodec:
|
| 25 |
+

|
| 26 |
Distribution of DistilCodec training data is shown in below table:
|
| 27 |
| **Data Category** | **Data Size (in hours)** |
|
| 28 |
|-----------------------------|--------------------------|
|
| 29 |
| Chinese Audiobook | 38000 |
|
| 30 |
| Chinese Common Audio | 20000 |
|
| 31 |
+
| English Speech | 40000 |
|
| 32 |
| Music | 2000 |
|
| 33 |
| **Total** | **100000** |
|
| 34 |
|