Update model card: add paper and code links, update pipeline tag

#1
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +30 -14
README.md CHANGED
@@ -1,23 +1,27 @@
1
  ---
2
- license: mit
3
  library_name: parallel-wavegan
4
- pipeline_tag: text-to-speech
 
5
  tags:
6
- - vocoder
7
- - parallel-wavegan
8
- - joycent
9
- - mandarin
10
- - 16khz
11
  ---
12
 
13
  # Joycent ParallelWaveGAN Vocoder
14
 
15
- This repository stores the ParallelWaveGAN vocoder used by Joycent Mandarin
16
- accent text-to-speech inference.
 
 
 
 
17
 
18
- The model generates 16 kHz audio from 80-bin mel spectrograms. Keep
19
- `checkpoint-50000steps.pkl` and `config.yml` in the same directory when loading
20
- the model with ParallelWaveGAN:
21
 
22
  ```python
23
  import yaml
@@ -31,5 +35,17 @@ vocoder.remove_weight_norm()
31
  vocoder.eval()
32
  ```
33
 
34
- The Joycent implementation and inference instructions are available in the
35
- [Joycent repository](https://github.com/anonymous-accent-tts/Joycent_demo).
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
 
2
  library_name: parallel-wavegan
3
+ license: mit
4
+ pipeline_tag: audio-to-audio
5
  tags:
6
+ - vocoder
7
+ - parallel-wavegan
8
+ - joycent
9
+ - mandarin
10
+ - 16khz
11
  ---
12
 
13
  # Joycent ParallelWaveGAN Vocoder
14
 
15
+ This repository stores the ParallelWaveGAN vocoder used by Joycent Mandarin accent text-to-speech inference, as presented in the paper [Joycent: Diffusion-based Accent TTS without Accented Phone Prediction](https://huggingface.co/papers/2606.16417).
16
+
17
+ The model generates 16 kHz audio from 80-bin mel spectrograms.
18
+
19
+ - **Official Code:** [oshindow/Joycent-code](https://github.com/oshindow/Joycent-code)
20
+ - **Project Page:** [Demos](https://anonymous-accent-tts.github.io/Joycent-demo/)
21
 
22
+ ## Usage
23
+
24
+ Keep `checkpoint-50000steps.pkl` and `config.yml` in the same directory when loading the model with ParallelWaveGAN:
25
 
26
  ```python
27
  import yaml
 
35
  vocoder.eval()
36
  ```
37
 
38
+ The Joycent implementation and inference instructions are available in the [official repository](https://github.com/oshindow/Joycent-code).
39
+
40
+ ## Citation
41
+
42
+ ```bibtex
43
+ @misc{wang2026joycentdiffusionbasedaccenttts,
44
+ title={Joycent: Diffusion-based Accent TTS without Accented Phone Prediction},
45
+ author={Xintong Wang and Ye Wang},
46
+ year={2026},
47
+ eprint={2606.16417},
48
+ archivePrefix={arXiv},
49
+ primaryClass={cs.SD},
50
+ }
51
+ ```