---

language: 
  - zh
  - en
tags:
  - speech-synthesis
  - speech-to-speech
  - voice-conversion
  - pytorch
  - audio
  - chinese-tts
  - multi-speaker
  - convolution
  - encoder-decoder
license: apache-2.0
datasets:
- vctk
library_name: pytorch
---


# Convbased

GitHub: [https://github.com/Convbased/Convbased-Studio](https://github.com/Convbased/Convbased-Studio)

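This project focuses on training high-quality pre-trained models. The table below lists which combinations of feature extractor, vocoder, and sample rate are currently available.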


| Feature Extraction | Vocoder | 40 kHz | 48 kHz |
|-----------|--------|-----|-----|
| contentvec | hifigannsf | ❌ | ✅ |
| contentvec | sifigan | ❌ | ✅ |
| contentvec | bigvgan | ✅ | ❌ |
| spin | hifigannsf | ❌ | ✅ |
| spin | sifigan | ❌ | ✅ |
| spin-v2 | bigvgan | ✅ | ❌ |
| chinese-hubert-base | hifigannsf | ❌ | ✅ |
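
To pick a compatible combination programmatically, the table above can be transcribed into a small lookup. This is only a minimal sketch: the names `AVAILABLE`, `supported_sample_rates`, and `is_available` are hypothetical and not part of this repository or of Applio, and it assumes "40k" and "48k" mean 40000 Hz and 48000 Hz.

```python
# Availability matrix transcribed by hand from the table above.
# Keys are (feature_extraction, vocoder); values are sample rates in Hz.
# Assumption: "40k" / "48k" in the table mean 40000 Hz / 48000 Hz.
AVAILABLE = {
    ("contentvec", "hifigannsf"): {48000},
    ("contentvec", "sifigan"): {48000},
    ("contentvec", "bigvgan"): {40000},
    ("spin", "hifigannsf"): {48000},
    ("spin", "sifigan"): {48000},
    ("spin-v2", "bigvgan"): {40000},
    ("chinese-hubert-base", "hifigannsf"): {48000},
}


def supported_sample_rates(feature_extraction, vocoder):
    """Return the sorted sample rates (Hz) listed for a combination.

    Combinations that do not appear in the table yield an empty list.
    """
    return sorted(AVAILABLE.get((feature_extraction, vocoder), set()))


def is_available(feature_extraction, vocoder, sample_rate):
    """Check whether a pre-trained model is listed for this combination."""
    return sample_rate in AVAILABLE.get((feature_extraction, vocoder), set())


if __name__ == "__main__":
    print(supported_sample_rates("contentvec", "bigvgan"))
    print(is_available("spin", "sifigan", 40000))
```

The dictionary has to be updated by hand whenever the table changes; it does not query the repository.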


*The training code is from [Applio](https://github.com/IAHispano/Applio).*

*Dedicated to advancing Chinese speech synthesis technology. These base models have been used for fine-tuning most models at [Convbased Studio](https://weights.chat/).*