Update README.md
Browse files
README.md
CHANGED
|
@@ -183,11 +183,22 @@ The Low Frame-rate Speech Codec is trained on a total of 28.7k hrs of speech dat
|
|
| 183 |
|
| 184 |
We evaluated our codec using multiple objective audio quality metrics across two distinct test sets. Additionally, we compared our model's performance with state-of-the-art codecs. For further details, please refer to [our paper](https://arxiv.org/abs/2409.12117).
|
| 185 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 186 |
| Dataset | Squim MOS (β) |SI-SDR(β) |Mel Dist. (β) |STFT Dist.(β) | CER (β)|
|
| 187 |
|:-----------:|:----------:|:----------:|:----------:|:-----------:|:-----------:|
|
| 188 |
| MLS | 4.43 | 4.46 | 0.147 | 0.061 | 2.09 |
|
| 189 |
| DAPS | 4.68 | 6.93 | 0.142 | 0.058 | 0.86 |
|
| 190 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 191 |
|
| 192 |
|
| 193 |
## Software Integration
|
|
|
|
| 183 |
|
| 184 |
We evaluated our codec using multiple objective audio quality metrics across two distinct test sets. Additionally, we compared our model's performance with state-of-the-art codecs. For further details, please refer to [our paper](https://arxiv.org/abs/2409.12117).
|
| 185 |
|
| 186 |
+
Please note that the released checkpoint yields slightly different results compared to those reported in the paper. Due to legal data constraints, we retrained the model after removing one speaker from the training set. This retraining was performed for 170k steps, compared to the original 124k steps, leading to slight improvements across almost all metrics.
|
| 187 |
+
|
| 188 |
+
Paper results:
|
| 189 |
+
|
| 190 |
| Dataset | Squim MOS (β) |SI-SDR(β) |Mel Dist. (β) |STFT Dist.(β) | CER (β)|
|
| 191 |
|:-----------:|:----------:|:----------:|:----------:|:-----------:|:-----------:|
|
| 192 |
| MLS | 4.43 | 4.46 | 0.147 | 0.061 | 2.09 |
|
| 193 |
| DAPS | 4.68 | 6.93 | 0.142 | 0.058 | 0.86 |
|
| 194 |
|
| 195 |
+
Released checkpoint results:
|
| 196 |
+
|
| 197 |
+
| Dataset | Squim MOS (β) |SI-SDR(β) |Mel Dist. (β) |STFT Dist.(β) | CER (β)|
|
| 198 |
+
|:-----------:|:----------:|:----------:|:----------:|:-----------:|:-----------:|
|
| 199 |
+
| MLS | 4.43 | 4.77 | 0.143 | 0.060 | 2.16 |
|
| 200 |
+
| DAPS | 4.69 | 8.07 | 0.136 | 0.056 | 0.77 |
|
| 201 |
+
|
| 202 |
|
| 203 |
|
| 204 |
## Software Integration
|