Ray0323 committed on
Commit d46fafa · verified · 1 Parent(s): f5733ad

Update README.md

Files changed (1)
  1. README.md +1 -2
README.md CHANGED
@@ -17,8 +17,7 @@ The Joint Laboratory of International Digital Economy Academy (IDEA) and Emdoor,
  The foundational network architecture of DistilCodec adopts an Encoder-VQ-Decoder framework
  similar to that proposed in Soundstream. The encoder employs a ConvNeXt-V2 structure,
  while the vector quantization module implements the GRFVQ scheme. The decoder
- employs a ConvTranspose1d based architectural configuration similar to HiFiGAN. Detailed
- network specifications and layer configurations are provided in Appendix A.1 The training methodol-
+ employs a ConvTranspose1d based architectural configuration similar to HiFiGAN. The training methodol-
  ogy of DistilCodec follows a similar approach to HiFiGAN, incorporating three types of
  discriminators: Multi-Period Discriminator (MPD), Multi-Scale Discriminator (MSD), and Multi-
  STFT Discriminator (MSFTFD). Here is the architecture of Distilcodec: