---
license: mit
language:
- en
tags:
- accent-conversion
- speech
- conformer
- flow-matching
library_name: cosyaccent
---

# CosyAccent

Model weights for **CosyAccent: Duration-Controllable Accent Normalization Using Source-Synthesis Training Data** (ICASSP 2026).

- **Paper:** https://arxiv.org/abs/2602.19166v1
- **Data:** https://huggingface.co/datasets/Piping/L2-LibriTTSR
- **Code & documentation:** https://github.com/P1ping/CosyAccent

## Usage

See the GitHub repository for full installation and inference instructions.

Quick start (run from a clone of the GitHub repository):

```bash
pip install -r requirements.txt
python infer_wav.py \
  --source_wav /path/to/source.wav \
  --output_wav outputs/result.wav
```
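
To convert several recordings, the quick-start command can be wrapped in a small script. The following is a minimal sketch, not part of the official tooling: it assumes it is run from the directory containing `infer_wav.py` and that the two flags shown above are sufficient. The helper names `build_command` and `convert_directory` and the `_converted` output suffix are hypothetical.

```python
import subprocess
import sys
from pathlib import Path


def build_command(source_wav: Path, output_dir: Path) -> list[str]:
    """Build one infer_wav.py call using only the flags from the quick start."""
    # The "_converted" suffix is an illustrative naming choice, not a repo convention.
    output_wav = output_dir / f"{source_wav.stem}_converted.wav"
    return [
        sys.executable, "infer_wav.py",
        "--source_wav", str(source_wav),
        "--output_wav", str(output_wav),
    ]


def convert_directory(input_dir: str, output_dir: str, dry_run: bool = True) -> list[list[str]]:
    """Build (and optionally run) one command per .wav file in input_dir."""
    out = Path(output_dir)
    commands = [build_command(p, out) for p in sorted(Path(input_dir).glob("*.wav"))]
    if not dry_run:
        out.mkdir(parents=True, exist_ok=True)
    for cmd in commands:
        if dry_run:
            # Only show what would be executed.
            print(" ".join(cmd))
        else:
            # Assumes infer_wav.py is in the current working directory.
            subprocess.run(cmd, check=True)
    return commands
```

Calling `convert_directory("inputs", "outputs")` prints the commands without running them; pass `dry_run=False` to execute them. Each call starts a new process, so the model is reloaded for every file, which is simple but slow for large batches.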

## Citation

```bibtex
@inproceedings{bai2026cosyaccent,
  title={CosyAccent: Duration-Controllable Accent Normalization Using Source-Synthesis Training Data},
  author={Bai, Qibing and Shi, Shuhao and Wang, Shuai and Ju, Yukai and Wang, Yannan and Li, Haizhou},
  booktitle={ICASSP 2026},
  year={2026}
}
```