Update README.md
README.md
CHANGED
@@ -38,7 +38,7 @@ base_model:
 * [Acknowledgments](#acknowledgments)
 * [Citation](#citation)
 
-
+
 
 ## Highlights
 
@@ -47,7 +47,7 @@ base_model:
 * **Same model, two samplers:** Use the displacement sampler for 1-step (or few-step) inference; fall back to Euler along the instantaneous field if you prefer multi-step.
 * **Competitive & fast:** strong ESTOI / SI-SDR / DNSMOS with **very low RTF** on VoiceBank-DEMAND.
 
-
+
 
 ## What’s inside
 
@@ -56,7 +56,7 @@ base_model:
 * **Audio front-end**: complex STFT pipeline; configurable transforms & normalization.
 * **Metrics**: PESQ, ESTOI, SI-SDR; end-to-end **RTF** measurement.
 
-
+
 
 ## Quick start
 
@@ -149,7 +149,7 @@ python evaluate.py \
 > `evaluate.py` writes **enhanced WAVs**.
 > If `--odesolver` is not given, it **auto-picks** (`euler_mf` when MF-SE was used; otherwise `euler`).
 
-
+
 
 ## Configuration
 
@@ -169,7 +169,7 @@ Common flags you may want to tweak:
 
 * Defined in `backbones/` and `SpecsDataModule` (STFT, transforms, normalization)
 
-
+
 
 ## Repository structure
 
@@ -211,19 +211,19 @@ Many design choices (complex STFT pipeline, training infrastructure) are inspire
 
 * **VoiceBank–DEMAND (16 kHz)**: We have hosted the weight files on Google Drive and added the link here.— [Google Drive Link](https://drive.google.com/file/d/1QAxgd5BWrxiNi0q2qD3n1Xcv6bW0X86-/view?usp=sharing)
 
-
+
 
 ## Acknowledgments
 
 We gratefully acknowledge **Prof. Xie Chen’s group (X-LANCE Lab, SJTU)** for their **valuable guidance and support** on training practices and engineering tips that helped this work a lot.
 
-
+
 
 ## Citation
 
 * **Citation:** The paper is currently under review. We will add a BibTeX entry and article link once available.
 
-
+
 
 
 **Questions or issues?** Please open a GitHub issue or pull request.
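
Note on the context at README line 150: the README says `evaluate.py` auto-picks a solver when `--odesolver` is omitted (`euler_mf` when MF-SE was used, otherwise `euler`). The fallback it describes can be sketched roughly as below. This is only an illustration of the stated rule, not the repository's code: the helper name `pick_odesolver` and the flag `trained_with_mf_se` are hypothetical, and only the solver names `euler_mf` and `euler` come from the README.

```python
from typing import Optional


def pick_odesolver(odesolver: Optional[str], trained_with_mf_se: bool) -> str:
    """Hypothetical helper mirroring the README's auto-pick rule."""
    if odesolver is not None:
        # An explicit --odesolver value is used as given.
        return odesolver
    # No solver given: choose based on how the checkpoint was trained.
    return "euler_mf" if trained_with_mf_se else "euler"


if __name__ == "__main__":
    print(pick_odesolver(None, True))      # flag omitted, MF-SE checkpoint
    print(pick_odesolver(None, False))     # flag omitted, other checkpoint
    print(pick_odesolver("euler", True))   # explicit choice is kept
```

Under this reading, passing `--odesolver euler` with an MF-SE checkpoint would select the multi-step Euler sampler that the Highlights section mentions as the fallback.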