Update README.md
Browse files
README.md
CHANGED
|
@@ -17,8 +17,7 @@ The Joint Laboratory of International Digital Economy Academy (IDEA) and Emdoor,
|
|
| 17 |
The foundational network architecture of DistilCodec adopts an Encoder-VQ-Decoder framework
|
| 18 |
similar to that proposed in Soundstream. The encoder employs a ConvNeXt-V2 structure,
|
| 19 |
while the vector quantization module implements the GRFVQ scheme. The decoder
|
| 20 |
-
employs a ConvTranspose1d based architectural configuration similar to HiFiGAN.
|
| 21 |
-
network specifications and layer configurations are provided in Appendix A.1 The training methodol-
|
| 22 |
ogy of DistilCodec follows a similar approach to HiFiGAN, incorporating three types of
|
| 23 |
discriminators: Multi-Period Discriminator (MPD), Multi-Scale Discriminator (MSD), and Multi-
|
| 24 |
STFT Discriminator (MSFTFD). Here is the architecture of Distilcodec:
|
|
|
|
| 17 |
The foundational network architecture of DistilCodec adopts an Encoder-VQ-Decoder framework
|
| 18 |
similar to that proposed in Soundstream. The encoder employs a ConvNeXt-V2 structure,
|
| 19 |
while the vector quantization module implements the GRFVQ scheme. The decoder
|
| 20 |
+
employs a ConvTranspose1d based architectural configuration similar to HiFiGAN. The training methodol-
|
|
|
|
| 21 |
ogy of DistilCodec follows a similar approach to HiFiGAN, incorporating three types of
|
| 22 |
discriminators: Multi-Period Discriminator (MPD), Multi-Scale Discriminator (MSD), and Multi-
|
| 23 |
STFT Discriminator (MSFTFD). Here is the architecture of Distilcodec:
|