Wataru commited on
Commit
3768f68
Β·
verified Β·
1 Parent(s): c7a5c60

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +45 -0
README.md CHANGED
@@ -1,3 +1,48 @@
1
  ---
2
  license: cc-by-nc-4.0
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: cc-by-nc-4.0
3
  ---
4
+
5
+ # UTDUSS vocodder model
6
+ In this repo, we provide model weight of the [descript audio codec](https://arxiv.org/abs/2306.06546) used for the [Interspeech2024 Speech Processing Using Discrete Speech Unit Challenge](https://www.wavlab.org/activities/2024/Interspeech2024-Discrete-Speech-Unit-Challenge/)
7
+ # Prerequesties
8
+
9
+ [official dac library](https://github.com/descriptinc/descript-audio-codec) which can be installed with the following command.
10
+ ```bash
11
+ pip install descript-audio-codec
12
+ ```
13
+
14
+ # Provided weights
15
+
16
+ ## Vocoder task
17
+ | model name on paper | model name on this repo |
18
+ |---|---|
19
+ |πŸ˜€ | expresso_16k_2code.pth|
20
+ |πŸ˜€ w/o hyper-parameter tuning| expresso_16k_2code_official.pth|
21
+ |πŸ˜€ w/o data exclusion| expresso_16k_2code_wo_data.pth|
22
+ |πŸ˜€ w/o matching sampling rate| expresso_24k_2code_ab.pth|
23
+
24
+ ## Acoustic +Vocoder (TTS) task
25
+ Please note that the weight for acoustic model is not provided.
26
+
27
+ | model name on paper | model name on this repo |
28
+ |---|---|
29
+ |πŸ˜€ | expresso_16k_2code.pth|
30
+ |πŸ˜€ w/o hyper-parameter tuning| expresso_16k_2code_official.pth|
31
+ |πŸ˜€ w/o data exclusion| expresso_16k_2code_wo_data.pth|
32
+ |πŸ˜€ w/o matching sampling rate| expresso_24k_2code_ab.pth|
33
+
34
+ # Sample code
35
+
36
+ ```python
37
+ import dac
38
+ import torch
39
+ from pathlib import Path
40
+ model_url = "https://huggingface.co/sarulab-speech/UTDUSS-Vocoder-Expresso/resolve/main/expresso_16k_2code.pth"
41
+ model_path = Path(f"/tmp/utduss/{model_url.split('/')[-1]}")
42
+ model_path.parent.mkdir(parents=True,exist_ok=True)
43
+ torch.hub.download_url_to_file(model_url,model_path)
44
+ model = dac.DAC.load(model_path)
45
+ ```
46
+
47
+ # Contributors
48
+ * [δΈ­η”° 亘](https://wataru-nakata.github.io/)