Commit 535a9f5 (verified) · committed by liduojia · 1 parent: e6aaf0d

Update README.md

Files changed (1)
  1. README.md +8 -8
README.md CHANGED
@@ -38,7 +38,7 @@ base_model:
  * [Acknowledgments](#acknowledgments)
  * [Citation](#citation)
 
- ---
 
  ## Highlights
 
@@ -47,7 +47,7 @@ base_model:
  * **Same model, two samplers:** Use the displacement sampler for 1-step (or few-step) inference; fall back to Euler along the instantaneous field if you prefer multi-step.
  * **Competitive & fast:** strong ESTOI / SI-SDR / DNSMOS with **very low RTF** on VoiceBank-DEMAND.
 
- ---
 
  ## What’s inside
 
@@ -56,7 +56,7 @@ base_model:
  * **Audio front-end**: complex STFT pipeline; configurable transforms & normalization.
  * **Metrics**: PESQ, ESTOI, SI-SDR; end-to-end **RTF** measurement.
 
- ---
 
  ## Quick start
 
@@ -149,7 +149,7 @@ python evaluate.py \
  > `evaluate.py` writes **enhanced WAVs**.
  > If `--odesolver` is not given, it **auto-picks** (`euler_mf` when MF-SE was used; otherwise `euler`).
 
- ---
 
  ## Configuration
 
@@ -169,7 +169,7 @@ Common flags you may want to tweak:
 
  * Defined in `backbones/` and `SpecsDataModule` (STFT, transforms, normalization)
 
- ---
 
  ## Repository structure
 
@@ -211,19 +211,19 @@ Many design choices (complex STFT pipeline, training infrastructure) are inspire
 
  * **VoiceBank–DEMAND (16 kHz)**: We have hosted the weight files on Google Drive and added the link here.— [Google Drive Link](https://drive.google.com/file/d/1QAxgd5BWrxiNi0q2qD3n1Xcv6bW0X86-/view?usp=sharing)
 
- ---
 
  ## Acknowledgments
 
  We gratefully acknowledge **Prof. Xie Chen’s group (X-LANCE Lab, SJTU)** for their **valuable guidance and support** on training practices and engineering tips that helped this work a lot.
 
- ---
 
  ## Citation
 
  * **Citation:** The paper is currently under review. We will add a BibTeX entry and article link once available.
 
- ---
 
 
  **Questions or issues?** Please open a GitHub issue or pull request.
 
  * [Acknowledgments](#acknowledgments)
  * [Citation](#citation)
 
+
 
  ## Highlights
 
 
  * **Same model, two samplers:** Use the displacement sampler for 1-step (or few-step) inference; fall back to Euler along the instantaneous field if you prefer multi-step.
  * **Competitive & fast:** strong ESTOI / SI-SDR / DNSMOS with **very low RTF** on VoiceBank-DEMAND.
 
+
 
  ## What’s inside
 
 
  * **Audio front-end**: complex STFT pipeline; configurable transforms & normalization.
  * **Metrics**: PESQ, ESTOI, SI-SDR; end-to-end **RTF** measurement.
 
+
 
  ## Quick start
 
 
  > `evaluate.py` writes **enhanced WAVs**.
  > If `--odesolver` is not given, it **auto-picks** (`euler_mf` when MF-SE was used; otherwise `euler`).
 
+
 
  ## Configuration
 
 
 
  * Defined in `backbones/` and `SpecsDataModule` (STFT, transforms, normalization)
 
+
 
  ## Repository structure
 
 
 
  * **VoiceBank–DEMAND (16 kHz)**: We have hosted the weight files on Google Drive and added the link here.— [Google Drive Link](https://drive.google.com/file/d/1QAxgd5BWrxiNi0q2qD3n1Xcv6bW0X86-/view?usp=sharing)
 
+
 
  ## Acknowledgments
 
  We gratefully acknowledge **Prof. Xie Chen’s group (X-LANCE Lab, SJTU)** for their **valuable guidance and support** on training practices and engineering tips that helped this work a lot.
 
+
 
  ## Citation
 
  * **Citation:** The paper is currently under review. We will add a BibTeX entry and article link once available.
 
+
 
 
  **Questions or issues?** Please open a GitHub issue or pull request.
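
The solver auto-pick rule quoted in the `evaluate.py` context lines (`euler_mf` when MF-SE was used, otherwise `euler`) can be sketched as below. This is only an illustration under assumptions, not code from the repository: the function name `pick_odesolver` and the boolean `used_mfse` are hypothetical, and how `evaluate.py` actually detects an MF-SE checkpoint is not shown in this diff.

```python
from typing import Optional


def pick_odesolver(requested: Optional[str], used_mfse: bool) -> str:
    """Hypothetical helper mirroring the README note on `--odesolver`.

    requested: value passed via `--odesolver`, or None if the flag was omitted.
    used_mfse: assumed flag saying whether the checkpoint was trained with MF-SE.
    """
    # An explicit choice on the command line always wins.
    if requested is not None:
        return requested
    # Otherwise fall back to the auto-pick rule stated in the README.
    return "euler_mf" if used_mfse else "euler"


if __name__ == "__main__":
    print(pick_odesolver(None, used_mfse=True))
    print(pick_odesolver(None, used_mfse=False))
    print(pick_odesolver("euler", used_mfse=True))
```

Keeping the rule in one small function makes the default explicit and lets an explicit `--odesolver` value override it without special cases elsewhere.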