---
license: cc-by-nc-sa-4.0
pipeline_tag: audio-to-audio
---
## Reference
```bibtex
@misc{wang2025solospeechenhancingintelligibilityquality,
      title={SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline},
      author={Helin Wang and Jiarui Hai and Dongchao Yang and Chen Chen and Kai Li and Junyi Peng and Thomas Thebaud and Laureano Moro Velazquez and Jesus Villalba and Najim Dehak},
      year={2025},
      eprint={2505.19314},
      archivePrefix={arXiv},
      primaryClass={eess.AS},
      url={https://arxiv.org/abs/2505.19314},
}
```
GitHub repository: https://github.com/WangHelin1997/SoloSpeech

Project page: https://wanghelin1997.github.io/SoloSpeech-Demo/