YatharthS committed on
Commit
c01369b
·
verified ·
1 Parent(s): 995410d

Update README.md

Files changed (1)
  1. README.md +14 -2
README.md CHANGED
@@ -1,6 +1,18 @@
1
  ---
2
  license: cc-by-4.0
3
  ---
4
- LayaCodec: A High-Fidelity Open-Source Codec for Foundational Generative AI
5
 
6
- Coming soon...
1
  ---
2
  license: cc-by-4.0
3
  ---
4
+ # LayaCodec
5
 
6
+ LayaCodec: Rapid, High-Fidelity Audio Compression: Reaching the Pareto Frontier in Neural Audio Codecs
7
+
8
+
9
+ LayaCodec is a neural audio codec/tokenizer that encodes 16 kHz audio at a rate of 12.5 to 50 tokens per second (t/s) using a single codebook of 8192 entries, and decodes it into 44.1 kHz audio.
10
+ This allows for much faster and more scalable TTS models compared to other modern codecs, for several reasons:
11
+ 1. **Much** lower token rates than other single-pass codecs such as Xcodec2 (50 t/s), Snac (83 t/s), Dac (774 t/s), etc.
12
+ 2. **Much** smaller codebook size (8192) compared to Xcodec2 (65536), for faster TTS model training.
13
+ 3. Over 40x faster than most diffusion-based codecs, allowing for **much** simpler and larger-scale TTS models where the codec is not the bottleneck.
14
+ 4. Decodes audio at 44.1 kHz, which gives much higher quality than the common 24 kHz or 16 kHz sampling rates.
15
+
16
+ This is still a work in progress: the model has only seen a few hundred hours of training data, yet the quality is already surprisingly good. It still needs more training.
17
+
18
+ Thanks very much to the authors of FocalCodec and Anime-XCodec2.
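
To put the token rates and codebook sizes quoted in the README into perspective, here is a small back-of-the-envelope sketch. It is not part of LayaCodec; the helper names and the 10-second clip length are hypothetical and chosen only for illustration. It assumes each token is one index into a single codebook, so bits per token is `log2(codebook_size)`. That assumption is applied only to LayaCodec, because the README does not say how many codebooks the other codecs use.

```python
import math

# Token rates (tokens per second) as quoted in the README text above.
TOKEN_RATES = {
    "LayaCodec (lowest)": 12.5,
    "LayaCodec (highest)": 50.0,
    "Xcodec2": 50.0,
    "Snac": 83.0,
    "Dac": 774.0,
}

LAYA_CODEBOOK_SIZE = 8192   # from the README
INPUT_SAMPLE_RATE = 16_000  # Hz, encoder input per the README
OUTPUT_SAMPLE_RATE = 44_100  # Hz, decoder output per the README


def tokens_for(seconds: float, rate: float) -> int:
    """Tokens a TTS model would have to generate for `seconds` of audio."""
    return math.ceil(seconds * rate)


def bits_per_token(codebook_size: int) -> float:
    """Information carried by one index into a single codebook."""
    return math.log2(codebook_size)


def bitrate_bps(rate: float, codebook_size: int) -> float:
    """Bitrate in bits per second, assuming one codebook index per token."""
    return rate * bits_per_token(codebook_size)


def samples_per_token(sample_rate: int, rate: float) -> float:
    """Audio samples covered by a single token at the given sample rate."""
    return sample_rate / rate


if __name__ == "__main__":
    clip_seconds = 10.0  # hypothetical clip length
    for name, rate in TOKEN_RATES.items():
        print(f"{name:20s} {tokens_for(clip_seconds, rate):6d} tokens")

    for rate in (12.5, 50.0):
        print(
            f"LayaCodec @ {rate} t/s: "
            f"{bitrate_bps(rate, LAYA_CODEBOOK_SIZE):.1f} bit/s, "
            f"{samples_per_token(INPUT_SAMPLE_RATE, rate):.0f} input samples/token, "
            f"{samples_per_token(OUTPUT_SAMPLE_RATE, rate):.0f} output samples/token"
        )
```

Under these assumptions, the sequence length a TTS model has to predict scales linearly with the token rate, which is why a lower rate translates fairly directly into shorter sequences and cheaper training and inference.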