IDEA-Emdoor/DistilCodec-v1.0

Ray0323 committed on May 22, 2025

Commit d46fafa · verified · 1 Parent(s): f5733ad

Update README.md

Files changed (1)

README.md +1 -2

README.md CHANGED

@@ -17,8 +17,7 @@ The Joint Laboratory of International Digital Economy Academy (IDEA) and Emdoor,
 The foundational network architecture of DistilCodec adopts an Encoder-VQ-Decoder framework
 similar to that proposed in Soundstream. The encoder employs a ConvNeXt-V2 structure,
 while the vector quantization module implements the GRFVQ scheme. The decoder
-employs a ConvTranspose1d based architectural configuration similar to HiFiGAN. Detailed
-network specifications and layer configurations are provided in Appendix A.1 The training methodol-
+employs a ConvTranspose1d based architectural configuration similar to HiFiGAN. The training methodol-
 ogy of DistilCodec follows a similar approach to HiFiGAN, incorporating three types of
 discriminators: Multi-Period Discriminator (MPD), Multi-Scale Discriminator (MSD), and Multi-
 STFT Discriminator (MSFTFD). Here is the architecture of Distilcodec: