neuphonic
/

neucodec

speech-language-models

Model card Files Files and versions

jiamengjiameng commited on Aug 8, 2025

Commit

46c990c

·

verified ·

1 Parent(s): 1a22b93

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -19,12 +19,12 @@ datasets:
 NeuCodec is a Finite Scalar Quantisation (FSQ) based 0.8kbps audio codec for speech tokenization.
 It takes advantage of the following features:
-* It uses both audio ([BigCodec](https://arxiv.org/pdf/2409.05377)) and semantic ([Wav2Vec2-BERT](https://huggingface.co/facebook/w2v-bert-2.0)) encoders.
 * We make use of Finite Scalar Quantisation (FSQ) resulting in a single vector for the quantised output, which makes it ideal for downstream modeling with Speech Language Models.
 * At 50 tokens/sec and 16 bits per token, the overall bit-rate is 0.8kbps.
 * The codec takes in 16kHz input and outputs 24kHz using an upsampling decoder.
-Our work is largely based on extending the work of [X-Codec2.0](https://huggingface.co/HKUSTAudio/xcodec2).
 - **Developed by:** Neuphonic
 - **Model type:** Neural Audio Codec

 NeuCodec is a Finite Scalar Quantisation (FSQ) based 0.8kbps audio codec for speech tokenization.
 It takes advantage of the following features:
 * We make use of Finite Scalar Quantisation (FSQ) resulting in a single vector for the quantised output, which makes it ideal for downstream modeling with Speech Language Models.
 * At 50 tokens/sec and 16 bits per token, the overall bit-rate is 0.8kbps.
 * The codec takes in 16kHz input and outputs 24kHz using an upsampling decoder.
+* The FSQ encoding scheme allows for bit-level error resistance suitable for unreliable and noisy channels.
+NeuCodec is largely based on extending the work of [X-Codec2.0](https://huggingface.co/HKUSTAudio/xcodec2).
 - **Developed by:** Neuphonic
 - **Model type:** Neural Audio Codec