File size: 1,084 Bytes
ecab193
edef1a2
 
 
 
 
8e847b5
edef1a2
 
 
 
 
 
 
ecab193
edef1a2
 
 
ecab193
edef1a2
 
 
87cf724
d93b2ad
87cf724
bfd505b
d93b2ad
87cf724
bfd505b
 
 
87cf724
d93b2ad
bfd505b
1f1c98f
bfd505b
 
87cf724
1f6cf4c
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
---

language: 
  - zh
  - en
tags:
  - speech-synthesis
  - speech-to-speech
  - voice-conversion
  - pytorch
  - audio
  - chinese-tts
  - multi-speaker
  - convolution
  - encoder-decoder
license: apache-2.0
datasets:
- vctk
library_name: pytorch
---


# Convbased

Github: [https://github.com/Convbased/Convbased-Studio](https://github.com/Convbased/Convbased-Studio)

This project focuses on training high-quality pre-trained models.


| Feature Extraction | Vocoder | Sample Rate 40k | Sample Rate 48k |
|-----------|--------|-----|-----|
| contentvec | hifigannsf | ❌ | βœ… |
| contentvec | sifigan | ❌ | βœ… |
| contentvec | bigvgan | βœ… | ❌ |
| spin | hifigannsf | ❌ | βœ… |
| spin | sifigan | ❌ | βœ… |
| spin-v2 | bigvgan | βœ… | ❌ |
| chinese-hubert-base | hifigannsf | ❌ | βœ… |


*Training code from [Applio](https://github.com/IAHispano/Applio).*

*Dedicated to advancing Chinese speech synthesis technology. These base models have been used for fine-tuning most models at [Convbased Studio](https://weights.chat/).*