babybirdprd commited on
Commit
35665d2
·
1 Parent(s): e3a83a4

Initial LuxTTS model artifacts for luxtts-candle

Browse files
LICENSE ADDED
@@ -0,0 +1,201 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ Apache License
2
+ Version 2.0, January 2004
3
+ http://www.apache.org/licenses/
4
+
5
+ TERMS AND CONDITIONS FOR USE, REPRODUCTION, AND DISTRIBUTION
6
+
7
+ 1. Definitions.
8
+
9
+ "License" shall mean the terms and conditions for use, reproduction,
10
+ and distribution as defined by Sections 1 through 9 of this document.
11
+
12
+ "Licensor" shall mean the copyright owner or entity authorized by
13
+ the copyright owner that is granting the License.
14
+
15
+ "Legal Entity" shall mean the union of the acting entity and all
16
+ other entities that control, are controlled by, or are under common
17
+ control with that entity. For the purposes of this definition,
18
+ "control" means (i) the power, direct or indirect, to cause the
19
+ direction or management of such entity, whether by contract or
20
+ otherwise, or (ii) ownership of fifty percent (50%) or more of the
21
+ outstanding shares, or (iii) beneficial ownership of such entity.
22
+
23
+ "You" (or "Your") shall mean an individual or Legal Entity
24
+ exercising permissions granted by this License.
25
+
26
+ "Source" form shall mean the preferred form for making modifications,
27
+ including but not limited to software source code, documentation
28
+ source, and configuration files.
29
+
30
+ "Object" form shall mean any form resulting from mechanical
31
+ transformation or translation of a Source form, including but
32
+ not limited to compiled object code, generated documentation,
33
+ and conversions to other media types.
34
+
35
+ "Work" shall mean the work of authorship, whether in Source or
36
+ Object form, made available under the License, as indicated by a
37
+ copyright notice that is included in or attached to the work
38
+ (an example is provided in the Appendix below).
39
+
40
+ "Derivative Works" shall mean any work, whether in Source or Object
41
+ form, that is based on (or derived from) the Work and for which the
42
+ editorial revisions, annotations, elaborations, or other modifications
43
+ represent, as a whole, an original work of authorship. For the purposes
44
+ of this License, Derivative Works shall not include works that remain
45
+ separable from, or merely link (or bind by name) to the interfaces of,
46
+ the Work and Derivative Works thereof.
47
+
48
+ "Contribution" shall mean any work of authorship, including
49
+ the original version of the Work and any modifications or additions
50
+ to that Work or Derivative Works thereof, that is intentionally
51
+ submitted to Licensor for inclusion in the Work by the copyright owner
52
+ or by an individual or Legal Entity authorized to submit on behalf of
53
+ the copyright owner. For the purposes of this definition, "submitted"
54
+ means any form of electronic, verbal, or written communication sent
55
+ to the Licensor or its representatives, including but not limited to
56
+ communication on electronic mailing lists, source code control systems,
57
+ and issue tracking systems that are managed by, or on behalf of, the
58
+ Licensor for the purpose of discussing and improving the Work, but
59
+ excluding communication that is conspicuously marked or otherwise
60
+ designated in writing by the copyright owner as "Not a Contribution."
61
+
62
+ "Contributor" shall mean Licensor and any individual or Legal Entity
63
+ on behalf of whom a Contribution has been received by Licensor and
64
+ subsequently incorporated within the Work.
65
+
66
+ 2. Grant of Copyright License. Subject to the terms and conditions of
67
+ this License, each Contributor hereby grants to You a perpetual,
68
+ worldwide, non-exclusive, no-charge, royalty-free, irrevocable
69
+ copyright license to reproduce, prepare Derivative Works of,
70
+ publicly display, publicly perform, sublicense, and distribute the
71
+ Work and such Derivative Works in Source or Object form.
72
+
73
+ 3. Grant of Patent License. Subject to the terms and conditions of
74
+ this License, each Contributor hereby grants to You a perpetual,
75
+ worldwide, non-exclusive, no-charge, royalty-free, irrevocable
76
+ (except as stated in this section) patent license to make, have made,
77
+ use, offer to sell, sell, import, and otherwise transfer the Work,
78
+ where such license applies only to those patent claims licensable
79
+ by such Contributor that are necessarily infringed by their
80
+ Contribution(s) alone or by combination of their Contribution(s)
81
+ with the Work to which such Contribution(s) was submitted. If You
82
+ institute patent litigation against any entity (including a
83
+ cross-claim or counterclaim in a lawsuit) alleging that the Work
84
+ or a Contribution incorporated within the Work constitutes direct
85
+ or contributory patent infringement, then any patent licenses
86
+ granted to You under this License for that Work shall terminate
87
+ as of the date such litigation is filed.
88
+
89
+ 4. Redistribution. You may reproduce and distribute copies of the
90
+ Work or Derivative Works thereof in any medium, with or without
91
+ modifications, and in Source or Object form, provided that You
92
+ meet the following conditions:
93
+
94
+ (a) You must give any other recipients of the Work or
95
+ Derivative Works a copy of this License; and
96
+
97
+ (b) You must cause any modified files to carry prominent notices
98
+ stating that You changed the files; and
99
+
100
+ (c) You must retain, in the Source form of any Derivative Works
101
+ that You distribute, all copyright, patent, trademark, and
102
+ attribution notices from the Source form of the Work,
103
+ excluding those notices that do not pertain to any part of
104
+ the Derivative Works; and
105
+
106
+ (d) If the Work includes a "NOTICE" text file as part of its
107
+ distribution, then any Derivative Works that You distribute must
108
+ include a readable copy of the attribution notices contained
109
+ within such NOTICE file, excluding those notices that do not
110
+ pertain to any part of the Derivative Works, in at least one
111
+ of the following places: within a NOTICE text file distributed
112
+ as part of the Derivative Works; within the Source form or
113
+ documentation, if provided along with the Derivative Works; or,
114
+ within a display generated by the Derivative Works, if and
115
+ wherever such third-party notices normally appear. The contents
116
+ of the NOTICE file are for informational purposes only and
117
+ do not modify the License. You may add Your own attribution
118
+ notices within Derivative Works that You distribute, alongside
119
+ or as an addendum to the NOTICE text from the Work, provided
120
+ that such additional attribution notices cannot be construed
121
+ as modifying the License.
122
+
123
+ You may add Your own copyright statement to Your modifications and
124
+ may provide additional or different license terms and conditions
125
+ for use, reproduction, or distribution of Your modifications, or
126
+ for any such Derivative Works as a whole, provided Your use,
127
+ reproduction, and distribution of the Work otherwise complies with
128
+ the conditions stated in this License.
129
+
130
+ 5. Submission of Contributions. Unless You explicitly state otherwise,
131
+ any Contribution intentionally submitted for inclusion in the Work
132
+ by You to the Licensor shall be under the terms and conditions of
133
+ this License, without any additional terms or conditions.
134
+ Notwithstanding the above, nothing herein shall supersede or modify
135
+ the terms of any separate license agreement you may have executed
136
+ with Licensor regarding such Contributions.
137
+
138
+ 6. Trademarks. This License does not grant permission to use the trade
139
+ names, trademarks, service marks, or product names of the Licensor,
140
+ except as required for reasonable and customary use in describing the
141
+ origin of the Work and reproducing the content of the NOTICE file.
142
+
143
+ 7. Disclaimer of Warranty. Unless required by applicable law or
144
+ agreed to in writing, Licensor provides the Work (and each
145
+ Contributor provides its Contributions) on an "AS IS" BASIS,
146
+ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or
147
+ implied, including, without limitation, any warranties or conditions
148
+ of TITLE, NON-INFRINGEMENT, MERCHANTABILITY, or FITNESS FOR A
149
+ PARTICULAR PURPOSE. You are solely responsible for determining the
150
+ appropriateness of using or redistributing the Work and assume any
151
+ risks associated with Your exercise of permissions under this License.
152
+
153
+ 8. Limitation of Liability. In no event and under no legal theory,
154
+ whether in tort (including negligence), contract, or otherwise,
155
+ unless required by applicable law (such as deliberate and grossly
156
+ negligent acts) or agreed to in writing, shall any Contributor be
157
+ liable to You for damages, including any direct, indirect, special,
158
+ incidental, or consequential damages of any character arising as a
159
+ result of this License or out of the use or inability to use the
160
+ Work (including but not limited to damages for loss of goodwill,
161
+ work stoppage, computer failure or malfunction, or any and all
162
+ other commercial damages or losses), even if such Contributor
163
+ has been advised of the possibility of such damages.
164
+
165
+ 9. Accepting Warranty or Additional Liability. While redistributing
166
+ the Work or Derivative Works thereof, You may choose to offer,
167
+ and charge a fee for, acceptance of support, warranty, indemnity,
168
+ or other liability obligations and/or rights consistent with this
169
+ License. However, in accepting such obligations, You may act only
170
+ on Your own behalf and on Your sole responsibility, not on behalf
171
+ of any other Contributor, and only if You agree to indemnify,
172
+ defend, and hold each Contributor harmless for any liability
173
+ incurred by, or claims asserted against, such Contributor by reason
174
+ of your accepting any such warranty or additional liability.
175
+
176
+ END OF TERMS AND CONDITIONS
177
+
178
+ APPENDIX: How to apply the Apache License to your work.
179
+
180
+ To apply the Apache License to your work, attach the following
181
+ boilerplate notice, with the fields enclosed by brackets "[]"
182
+ replaced with your own identifying information. (Don't include
183
+ the brackets!) The text should be enclosed in the appropriate
184
+ comment syntax for the file format. We also recommend that a
185
+ file or class name and description of purpose be included on the
186
+ same "printed page" as the copyright notice for easier
187
+ identification within third-party archives.
188
+
189
+ Copyright [yyyy] [name of copyright owner]
190
+
191
+ Licensed under the Apache License, Version 2.0 (the "License");
192
+ you may not use this file except in compliance with the License.
193
+ You may obtain a copy of the License at
194
+
195
+ http://www.apache.org/licenses/LICENSE-2.0
196
+
197
+ Unless required by applicable law or agreed to in writing, software
198
+ distributed under the License is distributed on an "AS IS" BASIS,
199
+ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
200
+ See the License for the specific language governing permissions and
201
+ limitations under the License.
README.md ADDED
@@ -0,0 +1,33 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ language:
4
+ - en
5
+ pipeline_tag: text-to-speech
6
+ ---
7
+
8
+ # LuxTTS Candle Model Artifacts
9
+
10
+ This repository hosts model artifacts for **luxtts-candle** (Rust/Candle inference).
11
+
12
+ ## Included files
13
+
14
+ - `config.json`
15
+ - `tokens.txt`
16
+ - `model.pt`
17
+ - `text_encoder.onnx`
18
+ - `fm_decoder.onnx`
19
+ - `text_encoder_int8.onnx`
20
+ - `fm_decoder_int8.onnx`
21
+ - `vocoder/config.yaml`
22
+ - `vocoder/vocos.bin`
23
+ - `LICENSE`
24
+
25
+ ## Notes
26
+
27
+ - `luxtts-candle` currently requires the fp32 ONNX files (`text_encoder.onnx`, `fm_decoder.onnx`) plus `model.pt` and vocoder files.
28
+ - Int8 ONNX files are included for future compatibility work.
29
+ - The vocoder safetensors file is generated locally by `luxtts-candle` from `vocoder/vocos.bin` when needed.
30
+
31
+ ## License
32
+
33
+ Apache-2.0 (see `LICENSE`).
config.json ADDED
@@ -0,0 +1,26 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "model" : {
3
+ "fm_decoder_downsampling_factor" : [1,2,4,2,1],
4
+ "fm_decoder_num_layers" : [2,2,4,4,4],
5
+ "fm_decoder_cnn_module_kernel" : [31,15,7,15,31],
6
+ "fm_decoder_feedforward_dim" : 1536,
7
+ "fm_decoder_num_heads" : 4,
8
+ "fm_decoder_dim" : 512,
9
+ "text_encoder_num_layers" : 4,
10
+ "text_encoder_feedforward_dim" : 512,
11
+ "text_encoder_cnn_module_kernel" : 9,
12
+ "text_encoder_num_heads" : 4,
13
+ "text_encoder_dim" : 192,
14
+ "query_head_dim" : 32,
15
+ "value_head_dim" : 12,
16
+ "pos_head_dim" : 4,
17
+ "pos_dim" : 48,
18
+ "time_embed_dim" : 192,
19
+ "text_embed_dim" : 192,
20
+ "feat_dim": 100
21
+ },
22
+ "feature" : {
23
+ "sampling_rate": 24000,
24
+ "type": "vocos"
25
+ }
26
+ }
fm_decoder.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4510d4f5f049f14ef80207fca695e13c820e2cea61635f402954950bc62b1e3c
3
+ size 477534010
fm_decoder_int8.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3cc2e08a96610d7ea1b227398e97cdbbe0414499741d3aec0b8113db2a2ab251
3
+ size 124657100
model.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:745855037478eb888cfa7a3603c1aa9f663f22a72d94cc1c37787228ff422095
3
+ size 491318136
text_encoder.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:495eca2d5f8a911f5c361bcce5bd55cdd2508ccdd26ce3e9bf1d3c29eb974861
3
+ size 17633735
text_encoder_int8.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f2de9a761a85e5ddd125dee6e05bad1c7ee92c11b83b4d775dab216a6aa41379
3
+ size 5570211
tokens.txt ADDED
@@ -0,0 +1,360 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ _ 0
2
+ ^ 1
3
+ $ 2
4
+ 3
5
+ ! 4
6
+ ' 5
7
+ ( 6
8
+ ) 7
9
+ , 8
10
+ - 9
11
+ . 10
12
+ : 11
13
+ ; 12
14
+ ? 13
15
+ a 14
16
+ b 15
17
+ c 16
18
+ d 17
19
+ e 18
20
+ f 19
21
+ h 20
22
+ i 21
23
+ j 22
24
+ k 23
25
+ l 24
26
+ m 25
27
+ n 26
28
+ o 27
29
+ p 28
30
+ q 29
31
+ r 30
32
+ s 31
33
+ t 32
34
+ u 33
35
+ v 34
36
+ w 35
37
+ x 36
38
+ y 37
39
+ z 38
40
+ æ 39
41
+ ç 40
42
+ ð 41
43
+ ø 42
44
+ ħ 43
45
+ ŋ 44
46
+ œ 45
47
+ ǀ 46
48
+ ǁ 47
49
+ ǂ 48
50
+ ǃ 49
51
+ ɐ 50
52
+ ɑ 51
53
+ ɒ 52
54
+ ɓ 53
55
+ ɔ 54
56
+ ɕ 55
57
+ ɖ 56
58
+ ɗ 57
59
+ ɘ 58
60
+ ə 59
61
+ ɚ 60
62
+ ɛ 61
63
+ ɜ 62
64
+ ɞ 63
65
+ ɟ 64
66
+ ɠ 65
67
+ ɡ 66
68
+ ɢ 67
69
+ ɣ 68
70
+ ɤ 69
71
+ ɥ 70
72
+ ɦ 71
73
+ ɧ 72
74
+ ɨ 73
75
+ ɪ 74
76
+ ɫ 75
77
+ ɬ 76
78
+ ɭ 77
79
+ ɮ 78
80
+ ɯ 79
81
+ ɰ 80
82
+ ɱ 81
83
+ ɲ 82
84
+ ɳ 83
85
+ ɴ 84
86
+ ɵ 85
87
+ ɶ 86
88
+ ɸ 87
89
+ ɹ 88
90
+ ɺ 89
91
+ ɻ 90
92
+ ɽ 91
93
+ ɾ 92
94
+ ʀ 93
95
+ ʁ 94
96
+ ʂ 95
97
+ ʃ 96
98
+ ʄ 97
99
+ ʈ 98
100
+ ʉ 99
101
+ ʊ 100
102
+ ʋ 101
103
+ ʌ 102
104
+ ʍ 103
105
+ ʎ 104
106
+ ʏ 105
107
+ ʐ 106
108
+ ʑ 107
109
+ ʒ 108
110
+ ʔ 109
111
+ ʕ 110
112
+ ʘ 111
113
+ ʙ 112
114
+ ʛ 113
115
+ ʜ 114
116
+ ʝ 115
117
+ ʟ 116
118
+ ʡ 117
119
+ ʢ 118
120
+ ʲ 119
121
+ ˈ 120
122
+ ˌ 121
123
+ ː 122
124
+ ˑ 123
125
+ ˞ 124
126
+ β 125
127
+ θ 126
128
+ χ 127
129
+ ᵻ 128
130
+ ⱱ 129
131
+ 0 130
132
+ 1 131
133
+ 2 132
134
+ 3 133
135
+ 4 134
136
+ 5 135
137
+ 6 136
138
+ 7 137
139
+ 8 138
140
+ 9 139
141
+ ̧ 140
142
+ ̃ 141
143
+ ̪ 142
144
+ ̯ 143
145
+ ̩ 144
146
+ ʰ 145
147
+ ˤ 146
148
+ ε 147
149
+ ↓ 148
150
+ # 149
151
+ " 150
152
+ ↑ 151
153
+ ̺ 152
154
+ ̻ 153
155
+ g 154
156
+ ʦ 155
157
+ X 156
158
+ ̝ 157
159
+ ̊ 158
160
+ a1 159
161
+ a2 160
162
+ a3 161
163
+ a4 162
164
+ a5 163
165
+ ai1 164
166
+ ai2 165
167
+ ai3 166
168
+ ai4 167
169
+ ai5 168
170
+ an1 169
171
+ an2 170
172
+ an3 171
173
+ an4 172
174
+ an5 173
175
+ ang1 174
176
+ ang2 175
177
+ ang3 176
178
+ ang4 177
179
+ ang5 178
180
+ ao1 179
181
+ ao2 180
182
+ ao3 181
183
+ ao4 182
184
+ ao5 183
185
+ b0 184
186
+ c0 185
187
+ ch0 186
188
+ d0 187
189
+ e1 188
190
+ e2 189
191
+ e3 190
192
+ e4 191
193
+ e5 192
194
+ ei1 193
195
+ ei2 194
196
+ ei3 195
197
+ ei4 196
198
+ ei5 197
199
+ en1 198
200
+ en2 199
201
+ en3 200
202
+ en4 201
203
+ en5 202
204
+ eng1 203
205
+ eng2 204
206
+ eng3 205
207
+ eng4 206
208
+ eng5 207
209
+ er2 208
210
+ er3 209
211
+ er4 210
212
+ er5 211
213
+ f0 212
214
+ g0 213
215
+ g2 214
216
+ g3 215
217
+ g4 216
218
+ g5 217
219
+ h0 218
220
+ i1 219
221
+ i2 220
222
+ i3 221
223
+ i4 222
224
+ i5 223
225
+ ia1 224
226
+ ia2 225
227
+ ia3 226
228
+ ia4 227
229
+ ia5 228
230
+ ian1 229
231
+ ian2 230
232
+ ian3 231
233
+ ian4 232
234
+ ian5 233
235
+ iang1 234
236
+ iang2 235
237
+ iang3 236
238
+ iang4 237
239
+ iang5 238
240
+ iao1 239
241
+ iao2 240
242
+ iao3 241
243
+ iao4 242
244
+ iao5 243
245
+ ie1 244
246
+ ie2 245
247
+ ie3 246
248
+ ie4 247
249
+ ie5 248
250
+ in1 249
251
+ in2 250
252
+ in3 251
253
+ in4 252
254
+ in5 253
255
+ ing1 254
256
+ ing2 255
257
+ ing3 256
258
+ ing4 257
259
+ ing5 258
260
+ iong1 259
261
+ iong2 260
262
+ iong3 261
263
+ iong4 262
264
+ iu1 263
265
+ iu2 264
266
+ iu3 265
267
+ iu4 266
268
+ iu5 267
269
+ j0 268
270
+ k0 269
271
+ l0 270
272
+ m0 271
273
+ m1 272
274
+ m2 273
275
+ m4 274
276
+ m5 275
277
+ n0 276
278
+ n2 277
279
+ n3 278
280
+ n4 279
281
+ n5 280
282
+ ng5 281
283
+ o1 282
284
+ o2 283
285
+ o3 284
286
+ o4 285
287
+ o5 286
288
+ ong1 287
289
+ ong2 288
290
+ ong3 289
291
+ ong4 290
292
+ ong5 291
293
+ ou1 292
294
+ ou2 293
295
+ ou3 294
296
+ ou4 295
297
+ ou5 296
298
+ p0 297
299
+ q0 298
300
+ r0 299
301
+ s0 300
302
+ sh0 301
303
+ t0 302
304
+ u1 303
305
+ u2 304
306
+ u3 305
307
+ u4 306
308
+ u5 307
309
+ ua1 308
310
+ ua2 309
311
+ ua3 310
312
+ ua4 311
313
+ uai1 312
314
+ uai2 313
315
+ uai3 314
316
+ uai4 315
317
+ uai5 316
318
+ uan1 317
319
+ uan2 318
320
+ uan3 319
321
+ uan4 320
322
+ uan5 321
323
+ uang1 322
324
+ uang2 323
325
+ uang3 324
326
+ uang4 325
327
+ uang5 326
328
+ ue1 327
329
+ ue2 328
330
+ ue3 329
331
+ ue4 330
332
+ ui1 331
333
+ ui2 332
334
+ ui3 333
335
+ ui4 334
336
+ ui5 335
337
+ un1 336
338
+ un2 337
339
+ un3 338
340
+ un4 339
341
+ un5 340
342
+ uo1 341
343
+ uo2 342
344
+ uo3 343
345
+ uo4 344
346
+ uo5 345
347
+ v2 346
348
+ v3 347
349
+ v4 348
350
+ ve3 349
351
+ ve4 350
352
+ w0 351
353
+ x0 352
354
+ y0 353
355
+ z0 354
356
+ zh0 355
357
+ ê1 356
358
+ ê2 357
359
+ ê3 358
360
+ ê4 359
vocoder/config.yaml ADDED
@@ -0,0 +1,39 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ feature_extractor:
2
+ class_path: vocos.feature_extractors.MelSpectrogramFeatures
3
+ init_args:
4
+ sample_rate: 24000
5
+ n_fft: 1024
6
+ hop_length: 256
7
+ n_mels: 100
8
+ padding: center
9
+
10
+ backbone:
11
+ class_path: vocos.models.VocosBackbone
12
+ init_args:
13
+ input_channels: 100
14
+ dim: 512
15
+ intermediate_dim: 1536
16
+ num_layers: 8
17
+
18
+ head:
19
+ class_path: vocos.heads.ISTFTHead
20
+ init_args:
21
+ dim: 512
22
+ n_fft: 1024
23
+ hop_length: 256
24
+ padding: center
25
+
26
+ head_48k:
27
+ class_path: vocos.heads.ISTFTHead
28
+ init_args:
29
+ dim: 512
30
+ n_fft: 1024
31
+ hop_length: 256
32
+ padding: center
33
+
34
+ upsampler:
35
+ class_path: linacodec.vocoder.upsampler_block.UpSamplerBlock
36
+ init_args:
37
+ in_channels: 512
38
+ upsample_factors: [2, 1]
39
+ kernel_sizes: [8, 8]
vocoder/vocos.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:116b9875a0369d6a0156d752b4548121fe75fdc81d39943e81c46ac9bfa72d11
3
+ size 63972079