File size: 737 Bytes
21d8c9a
 
d5f6989
21d8c9a
 
c15e4ad
a20eb37
c15e4ad
 
 
 
 
 
 
 
a20eb37
d5f6989
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
---
license: cc-by-nc-sa-4.0
pipeline_tag: audio-to-audio
---

## Reference
```
@misc{wang2025solospeechenhancingintelligibilityquality,
      title={SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline}, 
      author={Helin Wang and Jiarui Hai and Dongchao Yang and Chen Chen and Kai Li and Junyi Peng and Thomas Thebaud and Laureano Moro Velazquez and Jesus Villalba and Najim Dehak},
      year={2025},
      eprint={2505.19314},
      archivePrefix={arXiv},
      primaryClass={eess.AS},
      url={https://arxiv.org/abs/2505.19314}, 
}
```

Github repository: https://github.com/WangHelin1997/SoloSpeech

Project page: https://wanghelin1997.github.io/SoloSpeech-Demo/