Update README.md
Browse files
README.md
CHANGED
|
@@ -8,8 +8,8 @@ LayaCodec: Rapid, High-Fidelity Audio Compression: Reaching the Pareto Frontier
|
|
| 8 |
|
| 9 |
This is a neural audio codec/tokenizer that encodes 16khz at a rate from 12.5 t/s to 50 t/s using a single 8192 size codebook and decodes it into 44.1khz audio.
|
| 10 |
This allows for much faster and scalable TTS models compared to othern modern codecs for several reasons.
|
| 11 |
-
1. **Much**lower token rates than other single pass codecs such as Xcodec2(50 t/s), Snac(83 t/s), Dac(774 t/s), etc.
|
| 12 |
-
2. **Much** smaller codebook size(8192) compared to Xcodec2(65536) for faster TTS model training speed
|
| 13 |
3. Over 40x faster then most diffusion based codecs allowing for **much** simpler and larger scale TTS models where codecs are not the bottleneck.
|
| 14 |
4. Decodes audio into 44.1khz which is much higher quality then the common 24khz or 16khz sampling rate.
|
| 15 |
|
|
|
|
| 8 |
|
| 9 |
This is a neural audio codec/tokenizer that encodes 16khz at a rate from 12.5 t/s to 50 t/s using a single 8192 size codebook and decodes it into 44.1khz audio.
|
| 10 |
This allows for much faster and scalable TTS models compared to othern modern codecs for several reasons.
|
| 11 |
+
1. **Much** lower token rates than other single pass codecs such as Xcodec2(50 t/s), Snac(83 t/s), Dac(774 t/s), etc.
|
| 12 |
+
2. **Much** smaller codebook size(8192) compared to Xcodec2(65536) for faster TTS model training speed.
|
| 13 |
3. Over 40x faster then most diffusion based codecs allowing for **much** simpler and larger scale TTS models where codecs are not the bottleneck.
|
| 14 |
4. Decodes audio into 44.1khz which is much higher quality then the common 24khz or 16khz sampling rate.
|
| 15 |
|