indiejoseph commited on
Commit
2f950a2
·
verified ·
1 Parent(s): 4c21da6

Upload model

Browse files
Files changed (3) hide show
  1. README.md +199 -0
  2. config.json +181 -730
  3. model.safetensors +2 -2
README.md ADDED
@@ -0,0 +1,199 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: transformers
3
+ tags: []
4
+ ---
5
+
6
+ # Model Card for Model ID
7
+
8
+ <!-- Provide a quick summary of what the model is/does. -->
9
+
10
+
11
+
12
+ ## Model Details
13
+
14
+ ### Model Description
15
+
16
+ <!-- Provide a longer summary of what this model is. -->
17
+
18
+ This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.
19
+
20
+ - **Developed by:** [More Information Needed]
21
+ - **Funded by [optional]:** [More Information Needed]
22
+ - **Shared by [optional]:** [More Information Needed]
23
+ - **Model type:** [More Information Needed]
24
+ - **Language(s) (NLP):** [More Information Needed]
25
+ - **License:** [More Information Needed]
26
+ - **Finetuned from model [optional]:** [More Information Needed]
27
+
28
+ ### Model Sources [optional]
29
+
30
+ <!-- Provide the basic links for the model. -->
31
+
32
+ - **Repository:** [More Information Needed]
33
+ - **Paper [optional]:** [More Information Needed]
34
+ - **Demo [optional]:** [More Information Needed]
35
+
36
+ ## Uses
37
+
38
+ <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
39
+
40
+ ### Direct Use
41
+
42
+ <!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
43
+
44
+ [More Information Needed]
45
+
46
+ ### Downstream Use [optional]
47
+
48
+ <!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
49
+
50
+ [More Information Needed]
51
+
52
+ ### Out-of-Scope Use
53
+
54
+ <!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
55
+
56
+ [More Information Needed]
57
+
58
+ ## Bias, Risks, and Limitations
59
+
60
+ <!-- This section is meant to convey both technical and sociotechnical limitations. -->
61
+
62
+ [More Information Needed]
63
+
64
+ ### Recommendations
65
+
66
+ <!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
67
+
68
+ Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
69
+
70
+ ## How to Get Started with the Model
71
+
72
+ Use the code below to get started with the model.
73
+
74
+ [More Information Needed]
75
+
76
+ ## Training Details
77
+
78
+ ### Training Data
79
+
80
+ <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
81
+
82
+ [More Information Needed]
83
+
84
+ ### Training Procedure
85
+
86
+ <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
87
+
88
+ #### Preprocessing [optional]
89
+
90
+ [More Information Needed]
91
+
92
+
93
+ #### Training Hyperparameters
94
+
95
+ - **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
96
+
97
+ #### Speeds, Sizes, Times [optional]
98
+
99
+ <!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
100
+
101
+ [More Information Needed]
102
+
103
+ ## Evaluation
104
+
105
+ <!-- This section describes the evaluation protocols and provides the results. -->
106
+
107
+ ### Testing Data, Factors & Metrics
108
+
109
+ #### Testing Data
110
+
111
+ <!-- This should link to a Dataset Card if possible. -->
112
+
113
+ [More Information Needed]
114
+
115
+ #### Factors
116
+
117
+ <!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
118
+
119
+ [More Information Needed]
120
+
121
+ #### Metrics
122
+
123
+ <!-- These are the evaluation metrics being used, ideally with a description of why. -->
124
+
125
+ [More Information Needed]
126
+
127
+ ### Results
128
+
129
+ [More Information Needed]
130
+
131
+ #### Summary
132
+
133
+
134
+
135
+ ## Model Examination [optional]
136
+
137
+ <!-- Relevant interpretability work for the model goes here -->
138
+
139
+ [More Information Needed]
140
+
141
+ ## Environmental Impact
142
+
143
+ <!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
144
+
145
+ Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
146
+
147
+ - **Hardware Type:** [More Information Needed]
148
+ - **Hours used:** [More Information Needed]
149
+ - **Cloud Provider:** [More Information Needed]
150
+ - **Compute Region:** [More Information Needed]
151
+ - **Carbon Emitted:** [More Information Needed]
152
+
153
+ ## Technical Specifications [optional]
154
+
155
+ ### Model Architecture and Objective
156
+
157
+ [More Information Needed]
158
+
159
+ ### Compute Infrastructure
160
+
161
+ [More Information Needed]
162
+
163
+ #### Hardware
164
+
165
+ [More Information Needed]
166
+
167
+ #### Software
168
+
169
+ [More Information Needed]
170
+
171
+ ## Citation [optional]
172
+
173
+ <!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
174
+
175
+ **BibTeX:**
176
+
177
+ [More Information Needed]
178
+
179
+ **APA:**
180
+
181
+ [More Information Needed]
182
+
183
+ ## Glossary [optional]
184
+
185
+ <!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
186
+
187
+ [More Information Needed]
188
+
189
+ ## More Information [optional]
190
+
191
+ [More Information Needed]
192
+
193
+ ## Model Card Authors [optional]
194
+
195
+ [More Information Needed]
196
+
197
+ ## Model Card Contact
198
+
199
+ [More Information Needed]
config.json CHANGED
@@ -3,741 +3,192 @@
3
  "CTCTransformerModel"
4
  ],
5
  "blank_id": 0,
6
- "hidden_size": 384,
 
 
 
 
 
7
  "id2label": {
8
- "0": "<pad>",
9
- "1": "|",
10
- "2": "AA0",
11
- "3": "AA1",
12
- "4": "AA2",
13
- "5": "AE0",
14
- "6": "AE1",
15
- "7": "AE2",
16
- "8": "AH0",
17
- "9": "AH1",
18
- "10": "AH2",
19
- "11": "AO0",
20
- "12": "AO1",
21
- "13": "AO2",
22
- "14": "AW0",
23
- "15": "AW1",
24
- "16": "AW2",
25
- "17": "AY0",
26
- "18": "AY1",
27
- "19": "AY2",
28
- "20": "B",
29
- "21": "CH",
30
- "22": "D",
31
- "23": "DH",
32
- "24": "EH0",
33
- "25": "EH1",
34
- "26": "EH2",
35
- "27": "ER0",
36
- "28": "ER1",
37
- "29": "ER2",
38
- "30": "EY0",
39
- "31": "EY1",
40
- "32": "EY2",
41
- "33": "F",
42
- "34": "G",
43
- "35": "HH",
44
- "36": "IH0",
45
- "37": "IH1",
46
- "38": "IH2",
47
- "39": "IY0",
48
- "40": "IY1",
49
- "41": "IY2",
50
- "42": "JH",
51
- "43": "K",
52
- "44": "L",
53
- "45": "M",
54
- "46": "N",
55
- "47": "NG",
56
- "48": "Ng",
57
- "49": "OW0",
58
- "50": "OW1",
59
- "51": "OW2",
60
- "52": "OY0",
61
- "53": "OY1",
62
- "54": "OY2",
63
- "55": "P",
64
- "56": "R",
65
- "57": "S",
66
- "58": "SH",
67
- "59": "T",
68
- "60": "TH",
69
- "61": "UH0",
70
- "62": "UH1",
71
- "63": "UH2",
72
- "64": "UW0",
73
- "65": "UW1",
74
- "66": "UW2",
75
- "67": "V",
76
- "68": "W",
77
- "69": "Y",
78
- "70": "Z",
79
- "71": "ZH",
80
- "72": "aa",
81
- "73": "aa_1",
82
- "74": "aa_2",
83
- "75": "aa_3",
84
- "76": "aa_4",
85
- "77": "aa_5",
86
- "78": "aa_6",
87
- "79": "aai",
88
- "80": "aai_1",
89
- "81": "aai_2",
90
- "82": "aai_3",
91
- "83": "aai_4",
92
- "84": "aai_5",
93
- "85": "aai_6",
94
- "86": "aak",
95
- "87": "aak_1",
96
- "88": "aak_2",
97
- "89": "aak_3",
98
- "90": "aak_6",
99
- "91": "aam",
100
- "92": "aam_1",
101
- "93": "aam_2",
102
- "94": "aam_3",
103
- "95": "aam_4",
104
- "96": "aam_5",
105
- "97": "aam_6",
106
- "98": "aan_1",
107
- "99": "aan_2",
108
- "100": "aan_3",
109
- "101": "aan_4",
110
- "102": "aan_5",
111
- "103": "aan_6",
112
- "104": "aang_1",
113
- "105": "aang_2",
114
- "106": "aang_3",
115
- "107": "aang_4",
116
- "108": "aang_5",
117
- "109": "aang_6",
118
- "110": "aap",
119
- "111": "aap_2",
120
- "112": "aap_3",
121
- "113": "aap_6",
122
- "114": "aat",
123
- "115": "aat_1",
124
- "116": "aat_2",
125
- "117": "aat_3",
126
- "118": "aat_6",
127
- "119": "aau",
128
- "120": "aau_1",
129
- "121": "aau_2",
130
- "122": "aau_3",
131
- "123": "aau_4",
132
- "124": "aau_5",
133
- "125": "aau_6",
134
- "126": "ai",
135
- "127": "ai_1",
136
- "128": "ai_2",
137
- "129": "ai_3",
138
- "130": "ai_4",
139
- "131": "ai_5",
140
- "132": "ai_6",
141
- "133": "ak",
142
- "134": "ak_1",
143
- "135": "ak_6",
144
- "136": "am",
145
- "137": "am_1",
146
- "138": "am_2",
147
- "139": "am_3",
148
- "140": "am_4",
149
- "141": "am_6",
150
- "142": "an_1",
151
- "143": "an_2",
152
- "144": "an_3",
153
- "145": "an_4",
154
- "146": "an_5",
155
- "147": "an_6",
156
- "148": "ang_1",
157
- "149": "ang_2",
158
- "150": "ang_3",
159
- "151": "ang_4",
160
- "152": "ang_6",
161
- "153": "ap_1",
162
- "154": "ap_2",
163
- "155": "ap_6",
164
- "156": "at_1",
165
- "157": "at_2",
166
- "158": "at_4",
167
- "159": "at_6",
168
- "160": "au",
169
- "161": "au_1",
170
- "162": "au_2",
171
- "163": "au_3",
172
- "164": "au_4",
173
- "165": "au_5",
174
- "166": "au_6",
175
- "167": "b",
176
- "168": "c",
177
- "169": "d",
178
- "170": "e_1",
179
- "171": "e_2",
180
- "172": "e_3",
181
- "173": "e_4",
182
- "174": "e_5",
183
- "175": "e_6",
184
- "176": "ei_1",
185
- "177": "ei_2",
186
- "178": "ei_3",
187
- "179": "ei_4",
188
- "180": "ei_5",
189
- "181": "ei_6",
190
- "182": "ek_1",
191
- "183": "ek_3",
192
- "184": "ek_6",
193
- "185": "eng_1",
194
- "186": "eng_2",
195
- "187": "eng_3",
196
- "188": "eng_4",
197
- "189": "eng_5",
198
- "190": "eng_6",
199
- "191": "eoi_1",
200
- "192": "eoi_2",
201
- "193": "eoi_3",
202
- "194": "eoi_4",
203
- "195": "eoi_5",
204
- "196": "eoi_6",
205
- "197": "eon_1",
206
- "198": "eon_2",
207
- "199": "eon_3",
208
- "200": "eon_4",
209
- "201": "eon_5",
210
- "202": "eon_6",
211
- "203": "eot_1",
212
- "204": "eot_2",
213
- "205": "eot_6",
214
- "206": "eu_6",
215
- "207": "f",
216
- "208": "g",
217
- "209": "gw",
218
- "210": "h",
219
- "211": "i_1",
220
- "212": "i_2",
221
- "213": "i_3",
222
- "214": "i_4",
223
- "215": "i_5",
224
- "216": "i_6",
225
- "217": "ik_1",
226
- "218": "ik_4",
227
- "219": "ik_6",
228
- "220": "im_1",
229
- "221": "im_2",
230
- "222": "im_3",
231
- "223": "im_4",
232
- "224": "im_5",
233
- "225": "im_6",
234
- "226": "in_1",
235
- "227": "in_2",
236
- "228": "in_3",
237
- "229": "in_4",
238
- "230": "in_5",
239
- "231": "in_6",
240
- "232": "ing_1",
241
- "233": "ing_2",
242
- "234": "ing_3",
243
- "235": "ing_4",
244
- "236": "ing_5",
245
- "237": "ing_6",
246
- "238": "ip_2",
247
- "239": "ip_3",
248
- "240": "ip_6",
249
- "241": "it_1",
250
- "242": "it_3",
251
- "243": "it_6",
252
- "244": "iu_1",
253
- "245": "iu_2",
254
- "246": "iu_3",
255
- "247": "iu_4",
256
- "248": "iu_5",
257
- "249": "iu_6",
258
- "250": "j",
259
- "251": "k",
260
- "252": "kw",
261
- "253": "l",
262
- "254": "m",
263
- "255": "n",
264
- "256": "ng",
265
- "257": "o",
266
- "258": "o_1",
267
- "259": "o_2",
268
- "260": "o_3",
269
- "261": "o_4",
270
- "262": "o_5",
271
- "263": "o_6",
272
- "264": "oe_1",
273
- "265": "oe_2",
274
- "266": "oe_3",
275
- "267": "oek_2",
276
- "268": "oek_3",
277
- "269": "oek_4",
278
- "270": "oek_6",
279
- "271": "oeng_1",
280
- "272": "oeng_2",
281
- "273": "oeng_3",
282
- "274": "oeng_4",
283
- "275": "oeng_5",
284
- "276": "oeng_6",
285
- "277": "oi",
286
- "278": "oi_1",
287
- "279": "oi_2",
288
- "280": "oi_3",
289
- "281": "oi_4",
290
- "282": "oi_5",
291
- "283": "oi_6",
292
- "284": "ok",
293
- "285": "ok_1",
294
- "286": "ok_2",
295
- "287": "ok_3",
296
- "288": "ok_6",
297
- "289": "on",
298
- "290": "on_1",
299
- "291": "on_2",
300
- "292": "on_3",
301
- "293": "on_4",
302
- "294": "on_6",
303
- "295": "ong_1",
304
- "296": "ong_2",
305
- "297": "ong_3",
306
- "298": "ong_4",
307
- "299": "ong_5",
308
- "300": "ong_6",
309
- "301": "ot_3",
310
- "302": "ou",
311
- "303": "ou_1",
312
- "304": "ou_2",
313
- "305": "ou_3",
314
- "306": "ou_4",
315
- "307": "ou_5",
316
- "308": "ou_6",
317
- "309": "p",
318
- "310": "s",
319
- "311": "t",
320
- "312": "u_1",
321
- "313": "u_2",
322
- "314": "u_3",
323
- "315": "u_4",
324
- "316": "u_5",
325
- "317": "u_6",
326
- "318": "ui_1",
327
- "319": "ui_2",
328
- "320": "ui_3",
329
- "321": "ui_4",
330
- "322": "ui_5",
331
- "323": "ui_6",
332
- "324": "uk",
333
- "325": "uk_1",
334
- "326": "uk_2",
335
- "327": "uk_6",
336
- "328": "un_1",
337
- "329": "un_2",
338
- "330": "un_3",
339
- "331": "un_4",
340
- "332": "un_5",
341
- "333": "un_6",
342
- "334": "ung",
343
- "335": "ung_1",
344
- "336": "ung_2",
345
- "337": "ung_3",
346
- "338": "ung_4",
347
- "339": "ung_5",
348
- "340": "ung_6",
349
- "341": "ut_1",
350
- "342": "ut_3",
351
- "343": "ut_6",
352
- "344": "w",
353
- "345": "yu_1",
354
- "346": "yu_2",
355
- "347": "yu_3",
356
- "348": "yu_4",
357
- "349": "yu_5",
358
- "350": "yu_6",
359
- "351": "yun_1",
360
- "352": "yun_2",
361
- "353": "yun_3",
362
- "354": "yun_4",
363
- "355": "yun_5",
364
- "356": "yun_6",
365
- "357": "yut_1",
366
- "358": "yut_2",
367
- "359": "yut_3",
368
- "360": "yut_4",
369
- "361": "yut_6",
370
- "362": "z"
371
  },
372
- "intermediate_size": 1024,
373
  "label2id": {
374
- "<pad>": 0,
375
- "AA0": 2,
376
- "AA1": 3,
377
- "AA2": 4,
378
- "AE0": 5,
379
- "AE1": 6,
380
- "AE2": 7,
381
- "AH0": 8,
382
- "AH1": 9,
383
- "AH2": 10,
384
- "AO0": 11,
385
- "AO1": 12,
386
- "AO2": 13,
387
- "AW0": 14,
388
- "AW1": 15,
389
- "AW2": 16,
390
- "AY0": 17,
391
- "AY1": 18,
392
- "AY2": 19,
393
- "B": 20,
394
- "CH": 21,
395
- "D": 22,
396
- "DH": 23,
397
- "EH0": 24,
398
- "EH1": 25,
399
- "EH2": 26,
400
- "ER0": 27,
401
- "ER1": 28,
402
- "ER2": 29,
403
- "EY0": 30,
404
- "EY1": 31,
405
- "EY2": 32,
406
- "F": 33,
407
- "G": 34,
408
- "HH": 35,
409
- "IH0": 36,
410
- "IH1": 37,
411
- "IH2": 38,
412
- "IY0": 39,
413
- "IY1": 40,
414
- "IY2": 41,
415
- "JH": 42,
416
- "K": 43,
417
- "L": 44,
418
- "M": 45,
419
- "N": 46,
420
- "NG": 47,
421
- "Ng": 48,
422
- "OW0": 49,
423
- "OW1": 50,
424
- "OW2": 51,
425
- "OY0": 52,
426
- "OY1": 53,
427
- "OY2": 54,
428
- "P": 55,
429
- "R": 56,
430
- "S": 57,
431
- "SH": 58,
432
- "T": 59,
433
- "TH": 60,
434
- "UH0": 61,
435
- "UH1": 62,
436
- "UH2": 63,
437
- "UW0": 64,
438
- "UW1": 65,
439
- "UW2": 66,
440
- "V": 67,
441
- "W": 68,
442
- "Y": 69,
443
- "Z": 70,
444
- "ZH": 71,
445
- "aa": 72,
446
- "aa_1": 73,
447
- "aa_2": 74,
448
- "aa_3": 75,
449
- "aa_4": 76,
450
- "aa_5": 77,
451
- "aa_6": 78,
452
- "aai": 79,
453
- "aai_1": 80,
454
- "aai_2": 81,
455
- "aai_3": 82,
456
- "aai_4": 83,
457
- "aai_5": 84,
458
- "aai_6": 85,
459
- "aak": 86,
460
- "aak_1": 87,
461
- "aak_2": 88,
462
- "aak_3": 89,
463
- "aak_6": 90,
464
- "aam": 91,
465
- "aam_1": 92,
466
- "aam_2": 93,
467
- "aam_3": 94,
468
- "aam_4": 95,
469
- "aam_5": 96,
470
- "aam_6": 97,
471
- "aan_1": 98,
472
- "aan_2": 99,
473
- "aan_3": 100,
474
- "aan_4": 101,
475
- "aan_5": 102,
476
- "aan_6": 103,
477
- "aang_1": 104,
478
- "aang_2": 105,
479
- "aang_3": 106,
480
- "aang_4": 107,
481
- "aang_5": 108,
482
- "aang_6": 109,
483
- "aap": 110,
484
- "aap_2": 111,
485
- "aap_3": 112,
486
- "aap_6": 113,
487
- "aat": 114,
488
- "aat_1": 115,
489
- "aat_2": 116,
490
- "aat_3": 117,
491
- "aat_6": 118,
492
- "aau": 119,
493
- "aau_1": 120,
494
- "aau_2": 121,
495
- "aau_3": 122,
496
- "aau_4": 123,
497
- "aau_5": 124,
498
- "aau_6": 125,
499
- "ai": 126,
500
- "ai_1": 127,
501
- "ai_2": 128,
502
- "ai_3": 129,
503
- "ai_4": 130,
504
- "ai_5": 131,
505
- "ai_6": 132,
506
- "ak": 133,
507
- "ak_1": 134,
508
- "ak_6": 135,
509
- "am": 136,
510
- "am_1": 137,
511
- "am_2": 138,
512
- "am_3": 139,
513
- "am_4": 140,
514
- "am_6": 141,
515
- "an_1": 142,
516
- "an_2": 143,
517
- "an_3": 144,
518
- "an_4": 145,
519
- "an_5": 146,
520
- "an_6": 147,
521
- "ang_1": 148,
522
- "ang_2": 149,
523
- "ang_3": 150,
524
- "ang_4": 151,
525
- "ang_6": 152,
526
- "ap_1": 153,
527
- "ap_2": 154,
528
- "ap_6": 155,
529
- "at_1": 156,
530
- "at_2": 157,
531
- "at_4": 158,
532
- "at_6": 159,
533
- "au": 160,
534
- "au_1": 161,
535
- "au_2": 162,
536
- "au_3": 163,
537
- "au_4": 164,
538
- "au_5": 165,
539
- "au_6": 166,
540
- "b": 167,
541
- "c": 168,
542
- "d": 169,
543
- "e_1": 170,
544
- "e_2": 171,
545
- "e_3": 172,
546
- "e_4": 173,
547
- "e_5": 174,
548
- "e_6": 175,
549
- "ei_1": 176,
550
- "ei_2": 177,
551
- "ei_3": 178,
552
- "ei_4": 179,
553
- "ei_5": 180,
554
- "ei_6": 181,
555
- "ek_1": 182,
556
- "ek_3": 183,
557
- "ek_6": 184,
558
- "eng_1": 185,
559
- "eng_2": 186,
560
- "eng_3": 187,
561
- "eng_4": 188,
562
- "eng_5": 189,
563
- "eng_6": 190,
564
- "eoi_1": 191,
565
- "eoi_2": 192,
566
- "eoi_3": 193,
567
- "eoi_4": 194,
568
- "eoi_5": 195,
569
- "eoi_6": 196,
570
- "eon_1": 197,
571
- "eon_2": 198,
572
- "eon_3": 199,
573
- "eon_4": 200,
574
- "eon_5": 201,
575
- "eon_6": 202,
576
- "eot_1": 203,
577
- "eot_2": 204,
578
- "eot_6": 205,
579
- "eu_6": 206,
580
- "f": 207,
581
- "g": 208,
582
- "gw": 209,
583
- "h": 210,
584
- "i_1": 211,
585
- "i_2": 212,
586
- "i_3": 213,
587
- "i_4": 214,
588
- "i_5": 215,
589
- "i_6": 216,
590
- "ik_1": 217,
591
- "ik_4": 218,
592
- "ik_6": 219,
593
- "im_1": 220,
594
- "im_2": 221,
595
- "im_3": 222,
596
- "im_4": 223,
597
- "im_5": 224,
598
- "im_6": 225,
599
- "in_1": 226,
600
- "in_2": 227,
601
- "in_3": 228,
602
- "in_4": 229,
603
- "in_5": 230,
604
- "in_6": 231,
605
- "ing_1": 232,
606
- "ing_2": 233,
607
- "ing_3": 234,
608
- "ing_4": 235,
609
- "ing_5": 236,
610
- "ing_6": 237,
611
- "ip_2": 238,
612
- "ip_3": 239,
613
- "ip_6": 240,
614
- "it_1": 241,
615
- "it_3": 242,
616
- "it_6": 243,
617
- "iu_1": 244,
618
- "iu_2": 245,
619
- "iu_3": 246,
620
- "iu_4": 247,
621
- "iu_5": 248,
622
- "iu_6": 249,
623
- "j": 250,
624
- "k": 251,
625
- "kw": 252,
626
- "l": 253,
627
- "m": 254,
628
- "n": 255,
629
- "ng": 256,
630
- "o": 257,
631
- "o_1": 258,
632
- "o_2": 259,
633
- "o_3": 260,
634
- "o_4": 261,
635
- "o_5": 262,
636
- "o_6": 263,
637
- "oe_1": 264,
638
- "oe_2": 265,
639
- "oe_3": 266,
640
- "oek_2": 267,
641
- "oek_3": 268,
642
- "oek_4": 269,
643
- "oek_6": 270,
644
- "oeng_1": 271,
645
- "oeng_2": 272,
646
- "oeng_3": 273,
647
- "oeng_4": 274,
648
- "oeng_5": 275,
649
- "oeng_6": 276,
650
- "oi": 277,
651
- "oi_1": 278,
652
- "oi_2": 279,
653
- "oi_3": 280,
654
- "oi_4": 281,
655
- "oi_5": 282,
656
- "oi_6": 283,
657
- "ok": 284,
658
- "ok_1": 285,
659
- "ok_2": 286,
660
- "ok_3": 287,
661
- "ok_6": 288,
662
- "on": 289,
663
- "on_1": 290,
664
- "on_2": 291,
665
- "on_3": 292,
666
- "on_4": 293,
667
- "on_6": 294,
668
- "ong_1": 295,
669
- "ong_2": 296,
670
- "ong_3": 297,
671
- "ong_4": 298,
672
- "ong_5": 299,
673
- "ong_6": 300,
674
- "ot_3": 301,
675
- "ou": 302,
676
- "ou_1": 303,
677
- "ou_2": 304,
678
- "ou_3": 305,
679
- "ou_4": 306,
680
- "ou_5": 307,
681
- "ou_6": 308,
682
- "p": 309,
683
- "s": 310,
684
- "t": 311,
685
- "u_1": 312,
686
- "u_2": 313,
687
- "u_3": 314,
688
- "u_4": 315,
689
- "u_5": 316,
690
- "u_6": 317,
691
- "ui_1": 318,
692
- "ui_2": 319,
693
- "ui_3": 320,
694
- "ui_4": 321,
695
- "ui_5": 322,
696
- "ui_6": 323,
697
- "uk": 324,
698
- "uk_1": 325,
699
- "uk_2": 326,
700
- "uk_6": 327,
701
- "un_1": 328,
702
- "un_2": 329,
703
- "un_3": 330,
704
- "un_4": 331,
705
- "un_5": 332,
706
- "un_6": 333,
707
- "ung": 334,
708
- "ung_1": 335,
709
- "ung_2": 336,
710
- "ung_3": 337,
711
- "ung_4": 338,
712
- "ung_5": 339,
713
- "ung_6": 340,
714
- "ut_1": 341,
715
- "ut_3": 342,
716
- "ut_6": 343,
717
- "w": 344,
718
- "yu_1": 345,
719
- "yu_2": 346,
720
- "yu_3": 347,
721
- "yu_4": 348,
722
- "yu_5": 349,
723
- "yu_6": 350,
724
- "yun_1": 351,
725
- "yun_2": 352,
726
- "yun_3": 353,
727
- "yun_4": 354,
728
- "yun_5": 355,
729
- "yun_6": 356,
730
- "yut_1": 357,
731
- "yut_2": 358,
732
- "yut_3": 359,
733
- "yut_4": 360,
734
- "yut_6": 361,
735
- "z": 362,
736
- "|": 1
737
  },
738
  "max_position_embeddings": 1024,
739
- "num_attention_heads": 6,
740
- "num_hidden_layers": 6,
 
 
741
  "torch_dtype": "float32",
742
  "transformers_version": "4.50.3",
743
  "vocab_size": 6561
 
3
  "CTCTransformerModel"
4
  ],
5
  "blank_id": 0,
6
+ "bos_token_id": 1,
7
+ "ctc_loss_reduction": "mean",
8
+ "ctc_zero_infinity": true,
9
+ "dropout": 0.1,
10
+ "eos_token_id": 2,
11
+ "hidden_size": 768,
12
  "id2label": {
13
+ "0": "[PAD]",
14
+ "1": "[UNK]",
15
+ "2": "|",
16
+ "3": "a1",
17
+ "4": "a2",
18
+ "5": "a3",
19
+ "6": "a4",
20
+ "7": "a5",
21
+ "8": "a6",
22
+ "9": "aa1",
23
+ "10": "aa2",
24
+ "11": "aa3",
25
+ "12": "aa4",
26
+ "13": "aa5",
27
+ "14": "aa6",
28
+ "15": "b",
29
+ "16": "c",
30
+ "17": "d",
31
+ "18": "e1",
32
+ "19": "e2",
33
+ "20": "e3",
34
+ "21": "e4",
35
+ "22": "e5",
36
+ "23": "e6",
37
+ "24": "eo1",
38
+ "25": "eo2",
39
+ "26": "eo3",
40
+ "27": "eo4",
41
+ "28": "eo5",
42
+ "29": "eo6",
43
+ "30": "f",
44
+ "31": "g",
45
+ "32": "gw",
46
+ "33": "h",
47
+ "34": "i",
48
+ "35": "i1",
49
+ "36": "i2",
50
+ "37": "i3",
51
+ "38": "i4",
52
+ "39": "i5",
53
+ "40": "i6",
54
+ "41": "j",
55
+ "42": "k",
56
+ "43": "kw",
57
+ "44": "l",
58
+ "45": "m",
59
+ "46": "m4",
60
+ "47": "m6",
61
+ "48": "n",
62
+ "49": "ng",
63
+ "50": "ng4",
64
+ "51": "ng5",
65
+ "52": "ng6",
66
+ "53": "o1",
67
+ "54": "o2",
68
+ "55": "o3",
69
+ "56": "o4",
70
+ "57": "o5",
71
+ "58": "o6",
72
+ "59": "oe1",
73
+ "60": "oe2",
74
+ "61": "oe3",
75
+ "62": "oe4",
76
+ "63": "oe5",
77
+ "64": "oe6",
78
+ "65": "p",
79
+ "66": "s",
80
+ "67": "t",
81
+ "68": "u",
82
+ "69": "u1",
83
+ "70": "u2",
84
+ "71": "u3",
85
+ "72": "u4",
86
+ "73": "u5",
87
+ "74": "u6",
88
+ "75": "w",
89
+ "76": "yu1",
90
+ "77": "yu2",
91
+ "78": "yu3",
92
+ "79": "yu4",
93
+ "80": "yu5",
94
+ "81": "yu6",
95
+ "82": "z",
96
+ "83": "<s>",
97
+ "84": "</s>"
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
98
  },
99
+ "intermediate_size": 3072,
100
  "label2id": {
101
+ "</s>": 84,
102
+ "<s>": 83,
103
+ "[PAD]": 0,
104
+ "[UNK]": 1,
105
+ "a1": 3,
106
+ "a2": 4,
107
+ "a3": 5,
108
+ "a4": 6,
109
+ "a5": 7,
110
+ "a6": 8,
111
+ "aa1": 9,
112
+ "aa2": 10,
113
+ "aa3": 11,
114
+ "aa4": 12,
115
+ "aa5": 13,
116
+ "aa6": 14,
117
+ "b": 15,
118
+ "c": 16,
119
+ "d": 17,
120
+ "e1": 18,
121
+ "e2": 19,
122
+ "e3": 20,
123
+ "e4": 21,
124
+ "e5": 22,
125
+ "e6": 23,
126
+ "eo1": 24,
127
+ "eo2": 25,
128
+ "eo3": 26,
129
+ "eo4": 27,
130
+ "eo5": 28,
131
+ "eo6": 29,
132
+ "f": 30,
133
+ "g": 31,
134
+ "gw": 32,
135
+ "h": 33,
136
+ "i": 34,
137
+ "i1": 35,
138
+ "i2": 36,
139
+ "i3": 37,
140
+ "i4": 38,
141
+ "i5": 39,
142
+ "i6": 40,
143
+ "j": 41,
144
+ "k": 42,
145
+ "kw": 43,
146
+ "l": 44,
147
+ "m": 45,
148
+ "m4": 46,
149
+ "m6": 47,
150
+ "n": 48,
151
+ "ng": 49,
152
+ "ng4": 50,
153
+ "ng5": 51,
154
+ "ng6": 52,
155
+ "o1": 53,
156
+ "o2": 54,
157
+ "o3": 55,
158
+ "o4": 56,
159
+ "o5": 57,
160
+ "o6": 58,
161
+ "oe1": 59,
162
+ "oe2": 60,
163
+ "oe3": 61,
164
+ "oe4": 62,
165
+ "oe5": 63,
166
+ "oe6": 64,
167
+ "p": 65,
168
+ "s": 66,
169
+ "t": 67,
170
+ "u": 68,
171
+ "u1": 69,
172
+ "u2": 70,
173
+ "u3": 71,
174
+ "u4": 72,
175
+ "u5": 73,
176
+ "u6": 74,
177
+ "w": 75,
178
+ "yu1": 76,
179
+ "yu2": 77,
180
+ "yu3": 78,
181
+ "yu4": 79,
182
+ "yu5": 80,
183
+ "yu6": 81,
184
+ "z": 82,
185
+ "|": 2
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
186
  },
187
  "max_position_embeddings": 1024,
188
+ "num_attention_heads": 12,
189
+ "num_hidden_layers": 12,
190
+ "pad_token_id": 0,
191
+ "src_pad_token_id": 6561,
192
  "torch_dtype": "float32",
193
  "transformers_version": "4.50.3",
194
  "vocab_size": 6561
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:47947574cf1d0161d5e51a17465e3c5d6500449f7c2d421d04b8f3c2d241a750
3
- size 43783580
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:17a876fa353c0598b740ef001d4aaec4f7cd9b93e5cff72f6f193d098f771633
3
+ size 360659380