File size: 1,084 Bytes
ecab193 edef1a2 8e847b5 edef1a2 ecab193 edef1a2 ecab193 edef1a2 87cf724 d93b2ad 87cf724 bfd505b d93b2ad 87cf724 bfd505b 87cf724 d93b2ad bfd505b 1f1c98f bfd505b 87cf724 1f6cf4c |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 |
---
language:
- zh
- en
tags:
- speech-synthesis
- speech-to-speech
- voice-conversion
- pytorch
- audio
- chinese-tts
- multi-speaker
- convolution
- encoder-decoder
license: apache-2.0
datasets:
- vctk
library_name: pytorch
---
# Convbased
Github: [https://github.com/Convbased/Convbased-Studio](https://github.com/Convbased/Convbased-Studio)
This project focuses on training high-quality pre-trained models.
| Feature Extraction | Vocoder | Sample Rate 40k | Sample Rate 48k |
|-----------|--------|-----|-----|
| contentvec | hifigannsf | β | β
|
| contentvec | sifigan | β | β
|
| contentvec | bigvgan | β
| β |
| spin | hifigannsf | β | β
|
| spin | sifigan | β | β
|
| spin-v2 | bigvgan | β
| β |
| chinese-hubert-base | hifigannsf | β | β
|
*Training code from [Applio](https://github.com/IAHispano/Applio).*
*Dedicated to advancing Chinese speech synthesis technology. These base models have been used for fine-tuning most models at [Convbased Studio](https://weights.chat/).* |