Add pipeline tag and update citation info
#2
by nielsr HF Staff - opened
README.md
CHANGED
|
@@ -1,10 +1,12 @@
|
|
| 1 |
---
|
| 2 |
-
license: mit
|
| 3 |
-
datasets:
|
| 4 |
-
- JacobLinCool/VoiceBank-DEMAND-16k
|
| 5 |
base_model:
|
| 6 |
- liduojia/MeanFlowSE
|
|
|
|
|
|
|
|
|
|
|
|
|
| 7 |
---
|
|
|
|
| 8 |
<div align="center">
|
| 9 |
<p align="center">
|
| 10 |
<h1>MeanFlowSE — One-Step Generative Speech Enhancement</h1>
|
|
@@ -88,7 +90,7 @@ Defaults assume 16 kHz audio, centered frames, Hann windows, and a complex STFT
|
|
| 88 |
|
| 89 |
```bash
|
| 90 |
# Edit DATA_DIR and GPUs inside the script if needed
|
| 91 |
-
bash train_vbd.sh
|
| 92 |
```
|
| 93 |
|
| 94 |
Or run directly:
|
|
@@ -130,7 +132,7 @@ Use the helper script:
|
|
| 130 |
MODE=onestep STEPS=1 \
|
| 131 |
TEST_DATA_DIR=<BASE_DIR> \
|
| 132 |
CKPT_INPUT=path/to/best.ckpt \
|
| 133 |
-
bash run_inference.sh
|
| 134 |
```
|
| 135 |
|
| 136 |
Or call the evaluator:
|
|
@@ -221,10 +223,16 @@ We gratefully acknowledge **Prof. Xie Chen’s group (X-LANCE Lab, SJTU)** for t
|
|
| 221 |
|
| 222 |
## Citation
|
| 223 |
|
| 224 |
-
|
| 225 |
-
|
| 226 |
-
|
| 227 |
-
|
| 228 |
-
|
| 229 |
-
|
| 230 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
|
|
|
|
|
|
|
|
|
| 2 |
base_model:
|
| 3 |
- liduojia/MeanFlowSE
|
| 4 |
+
datasets:
|
| 5 |
+
- JacobLinCool/VoiceBank-DEMAND-16k
|
| 6 |
+
license: mit
|
| 7 |
+
pipeline_tag: audio-to-audio
|
| 8 |
---
|
| 9 |
+
|
| 10 |
<div align="center">
|
| 11 |
<p align="center">
|
| 12 |
<h1>MeanFlowSE — One-Step Generative Speech Enhancement</h1>
|
|
|
|
| 90 |
|
| 91 |
```bash
|
| 92 |
# Edit DATA_DIR and GPUs inside the script if needed
|
| 93 |
+
bash scripts/train_vbd.sh
|
| 94 |
```
|
| 95 |
|
| 96 |
Or run directly:
|
|
|
|
| 132 |
MODE=onestep STEPS=1 \
|
| 133 |
TEST_DATA_DIR=<BASE_DIR> \
|
| 134 |
CKPT_INPUT=path/to/best.ckpt \
|
| 135 |
+
bash scripts/run_inference.sh
|
| 136 |
```
|
| 137 |
|
| 138 |
Or call the evaluator:
|
|
|
|
| 223 |
|
| 224 |
## Citation
|
| 225 |
|
| 226 |
+
Our article has been accepted by **ICASSP 2026**.
|
| 227 |
+
|
| 228 |
+
```bibtex
|
| 229 |
+
@misc{li2025meanflowseonestepgenerativespeech,
|
| 230 |
+
title = {MeanFlowSE: one-step generative speech enhancement via conditional mean flow},
|
| 231 |
+
author = {Duojia Li and Shenghui Lu and Hongchen Pan and Zongyi Zhan and Qingyang Hong and Lin Li},
|
| 232 |
+
year = {2025},
|
| 233 |
+
eprint = {2509.14858},
|
| 234 |
+
archivePrefix = {arXiv},
|
| 235 |
+
primaryClass = {cs.SD},
|
| 236 |
+
url = {https://arxiv.org/abs/2509.14858}
|
| 237 |
+
}
|
| 238 |
+
```
|