miscovery commited on
Commit
7f959ae
·
verified ·
1 Parent(s): 952201e

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +52 -0
README.md ADDED
@@ -0,0 +1,52 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - text-to-speech
4
+ - tts
5
+ - pytorch
6
+ language:
7
+ - ar
8
+ license: mit
9
+ ---
10
+
11
+ # Multi-Speaker TTS Model
12
+
13
+ Trained Text-to-Speech model with the following specs:
14
+
15
+ ## Model Details
16
+ - **Speakers:** 1
17
+ - **Mel Channels:** 80
18
+ - **Sample Rate:** 22050
19
+ - **Vocab Size:** 50000
20
+
21
+ ## Training Stats
22
+ - **Final Mel Loss:** 5.3586
23
+ - **Validation Mel Loss:** 5.4784
24
+ - **Training Epochs:** 105
25
+
26
+ ## Mel Normalization
27
+ - **Mean:** -4.8906
28
+ - **Std:** 4.9261
29
+
30
+ ## Usage
31
+
32
+ ```python
33
+ import torch
34
+ from transformers import AutoTokenizer
35
+
36
+ # Load model
37
+ checkpoint = torch.load('checkpoint_best.pt')
38
+ model = MultiSpeakerTTS(checkpoint['config'])
39
+ model.load_state_dict(checkpoint['model_state_dict'])
40
+ model.eval()
41
+
42
+ # Generate
43
+ tokenizer = AutoTokenizer.from_pretrained("miscovery/tokenizer_v3")
44
+ text = "Your text here"
45
+ tokens = tokenizer.encode(text)
46
+ # ... inference code ...
47
+ ```
48
+
49
+ ## Speaker Mapping
50
+ {
51
+ "mahmoud": 0
52
+ }