mmcauliffe committed on
Commit
d379d5e
·
verified ·
1 Parent(s): d7a9988

Upload folder using huggingface_hub

.gitattributes CHANGED
@@ -33,3 +33,11 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ acoustic/final.alimdl filter=lfs diff=lfs merge=lfs -text
37
+ acoustic/final.mdl filter=lfs diff=lfs merge=lfs -text
38
+ acoustic/phone_lm.fst filter=lfs diff=lfs merge=lfs -text
39
+ acoustic/tree filter=lfs diff=lfs merge=lfs -text
40
+ g2p/english_india_mfa/model.fst filter=lfs diff=lfs merge=lfs -text
41
+ g2p/english_nigeria_mfa/model.fst filter=lfs diff=lfs merge=lfs -text
42
+ g2p/english_uk_mfa/model.fst filter=lfs diff=lfs merge=lfs -text
43
+ g2p/english_us_mfa/model.fst filter=lfs diff=lfs merge=lfs -text
LICENSE ADDED
@@ -0,0 +1,395 @@
1
+ Attribution 4.0 International
2
+
3
+ =======================================================================
4
+
5
+ Creative Commons Corporation ("Creative Commons") is not a law firm and
6
+ does not provide legal services or legal advice. Distribution of
7
+ Creative Commons public licenses does not create a lawyer-client or
8
+ other relationship. Creative Commons makes its licenses and related
9
+ information available on an "as-is" basis. Creative Commons gives no
10
+ warranties regarding its licenses, any material licensed under their
11
+ terms and conditions, or any related information. Creative Commons
12
+ disclaims all liability for damages resulting from their use to the
13
+ fullest extent possible.
14
+
15
+ Using Creative Commons Public Licenses
16
+
17
+ Creative Commons public licenses provide a standard set of terms and
18
+ conditions that creators and other rights holders may use to share
19
+ original works of authorship and other material subject to copyright
20
+ and certain other rights specified in the public license below. The
21
+ following considerations are for informational purposes only, are not
22
+ exhaustive, and do not form part of our licenses.
23
+
24
+ Considerations for licensors: Our public licenses are
25
+ intended for use by those authorized to give the public
26
+ permission to use material in ways otherwise restricted by
27
+ copyright and certain other rights. Our licenses are
28
+ irrevocable. Licensors should read and understand the terms
29
+ and conditions of the license they choose before applying it.
30
+ Licensors should also secure all rights necessary before
31
+ applying our licenses so that the public can reuse the
32
+ material as expected. Licensors should clearly mark any
33
+ material not subject to the license. This includes other CC-
34
+ licensed material, or material used under an exception or
35
+ limitation to copyright. More considerations for licensors:
36
+ wiki.creativecommons.org/Considerations_for_licensors
37
+
38
+ Considerations for the public: By using one of our public
39
+ licenses, a licensor grants the public permission to use the
40
+ licensed material under specified terms and conditions. If
41
+ the licensor's permission is not necessary for any reason--for
42
+ example, because of any applicable exception or limitation to
43
+ copyright--then that use is not regulated by the license. Our
44
+ licenses grant only permissions under copyright and certain
45
+ other rights that a licensor has authority to grant. Use of
46
+ the licensed material may still be restricted for other
47
+ reasons, including because others have copyright or other
48
+ rights in the material. A licensor may make special requests,
49
+ such as asking that all changes be marked or described.
50
+ Although not required by our licenses, you are encouraged to
51
+ respect those requests where reasonable. More_considerations
52
+ for the public:
53
+ wiki.creativecommons.org/Considerations_for_licensees
54
+
55
+ =======================================================================
56
+
57
+ Creative Commons Attribution 4.0 International Public License
58
+
59
+ By exercising the Licensed Rights (defined below), You accept and agree
60
+ to be bound by the terms and conditions of this Creative Commons
61
+ Attribution 4.0 International Public License ("Public License"). To the
62
+ extent this Public License may be interpreted as a contract, You are
63
+ granted the Licensed Rights in consideration of Your acceptance of
64
+ these terms and conditions, and the Licensor grants You such rights in
65
+ consideration of benefits the Licensor receives from making the
66
+ Licensed Material available under these terms and conditions.
67
+
68
+
69
+ Section 1 -- Definitions.
70
+
71
+ a. Adapted Material means material subject to Copyright and Similar
72
+ Rights that is derived from or based upon the Licensed Material
73
+ and in which the Licensed Material is translated, altered,
74
+ arranged, transformed, or otherwise modified in a manner requiring
75
+ permission under the Copyright and Similar Rights held by the
76
+ Licensor. For purposes of this Public License, where the Licensed
77
+ Material is a musical work, performance, or sound recording,
78
+ Adapted Material is always produced where the Licensed Material is
79
+ synched in timed relation with a moving image.
80
+
81
+ b. Adapter's License means the license You apply to Your Copyright
82
+ and Similar Rights in Your contributions to Adapted Material in
83
+ accordance with the terms and conditions of this Public License.
84
+
85
+ c. Copyright and Similar Rights means copyright and/or similar rights
86
+ closely related to copyright including, without limitation,
87
+ performance, broadcast, sound recording, and Sui Generis Database
88
+ Rights, without regard to how the rights are labeled or
89
+ categorized. For purposes of this Public License, the rights
90
+ specified in Section 2(b)(1)-(2) are not Copyright and Similar
91
+ Rights.
92
+
93
+ d. Effective Technological Measures means those measures that, in the
94
+ absence of proper authority, may not be circumvented under laws
95
+ fulfilling obligations under Article 11 of the WIPO Copyright
96
+ Treaty adopted on December 20, 1996, and/or similar international
97
+ agreements.
98
+
99
+ e. Exceptions and Limitations means fair use, fair dealing, and/or
100
+ any other exception or limitation to Copyright and Similar Rights
101
+ that applies to Your use of the Licensed Material.
102
+
103
+ f. Licensed Material means the artistic or literary work, database,
104
+ or other material to which the Licensor applied this Public
105
+ License.
106
+
107
+ g. Licensed Rights means the rights granted to You subject to the
108
+ terms and conditions of this Public License, which are limited to
109
+ all Copyright and Similar Rights that apply to Your use of the
110
+ Licensed Material and that the Licensor has authority to license.
111
+
112
+ h. Licensor means the individual(s) or entity(ies) granting rights
113
+ under this Public License.
114
+
115
+ i. Share means to provide material to the public by any means or
116
+ process that requires permission under the Licensed Rights, such
117
+ as reproduction, public display, public performance, distribution,
118
+ dissemination, communication, or importation, and to make material
119
+ available to the public including in ways that members of the
120
+ public may access the material from a place and at a time
121
+ individually chosen by them.
122
+
123
+ j. Sui Generis Database Rights means rights other than copyright
124
+ resulting from Directive 96/9/EC of the European Parliament and of
125
+ the Council of 11 March 1996 on the legal protection of databases,
126
+ as amended and/or succeeded, as well as other essentially
127
+ equivalent rights anywhere in the world.
128
+
129
+ k. You means the individual or entity exercising the Licensed Rights
130
+ under this Public License. Your has a corresponding meaning.
131
+
132
+
133
+ Section 2 -- Scope.
134
+
135
+ a. License grant.
136
+
137
+ 1. Subject to the terms and conditions of this Public License,
138
+ the Licensor hereby grants You a worldwide, royalty-free,
139
+ non-sublicensable, non-exclusive, irrevocable license to
140
+ exercise the Licensed Rights in the Licensed Material to:
141
+
142
+ a. reproduce and Share the Licensed Material, in whole or
143
+ in part; and
144
+
145
+ b. produce, reproduce, and Share Adapted Material.
146
+
147
+ 2. Exceptions and Limitations. For the avoidance of doubt, where
148
+ Exceptions and Limitations apply to Your use, this Public
149
+ License does not apply, and You do not need to comply with
150
+ its terms and conditions.
151
+
152
+ 3. Term. The term of this Public License is specified in Section
153
+ 6(a).
154
+
155
+ 4. Media and formats; technical modifications allowed. The
156
+ Licensor authorizes You to exercise the Licensed Rights in
157
+ all media and formats whether now known or hereafter created,
158
+ and to make technical modifications necessary to do so. The
159
+ Licensor waives and/or agrees not to assert any right or
160
+ authority to forbid You from making technical modifications
161
+ necessary to exercise the Licensed Rights, including
162
+ technical modifications necessary to circumvent Effective
163
+ Technological Measures. For purposes of this Public License,
164
+ simply making modifications authorized by this Section 2(a)
165
+ (4) never produces Adapted Material.
166
+
167
+ 5. Downstream recipients.
168
+
169
+ a. Offer from the Licensor -- Licensed Material. Every
170
+ recipient of the Licensed Material automatically
171
+ receives an offer from the Licensor to exercise the
172
+ Licensed Rights under the terms and conditions of this
173
+ Public License.
174
+
175
+ b. No downstream restrictions. You may not offer or impose
176
+ any additional or different terms or conditions on, or
177
+ apply any Effective Technological Measures to, the
178
+ Licensed Material if doing so restricts exercise of the
179
+ Licensed Rights by any recipient of the Licensed
180
+ Material.
181
+
182
+ 6. No endorsement. Nothing in this Public License constitutes or
183
+ may be construed as permission to assert or imply that You
184
+ are, or that Your use of the Licensed Material is, connected
185
+ with, or sponsored, endorsed, or granted official status by,
186
+ the Licensor or others designated to receive attribution as
187
+ provided in Section 3(a)(1)(A)(i).
188
+
189
+ b. Other rights.
190
+
191
+ 1. Moral rights, such as the right of integrity, are not
192
+ licensed under this Public License, nor are publicity,
193
+ privacy, and/or other similar personality rights; however, to
194
+ the extent possible, the Licensor waives and/or agrees not to
195
+ assert any such rights held by the Licensor to the limited
196
+ extent necessary to allow You to exercise the Licensed
197
+ Rights, but not otherwise.
198
+
199
+ 2. Patent and trademark rights are not licensed under this
200
+ Public License.
201
+
202
+ 3. To the extent possible, the Licensor waives any right to
203
+ collect royalties from You for the exercise of the Licensed
204
+ Rights, whether directly or through a collecting society
205
+ under any voluntary or waivable statutory or compulsory
206
+ licensing scheme. In all other cases the Licensor expressly
207
+ reserves any right to collect such royalties.
208
+
209
+
210
+ Section 3 -- License Conditions.
211
+
212
+ Your exercise of the Licensed Rights is expressly made subject to the
213
+ following conditions.
214
+
215
+ a. Attribution.
216
+
217
+ 1. If You Share the Licensed Material (including in modified
218
+ form), You must:
219
+
220
+ a. retain the following if it is supplied by the Licensor
221
+ with the Licensed Material:
222
+
223
+ i. identification of the creator(s) of the Licensed
224
+ Material and any others designated to receive
225
+ attribution, in any reasonable manner requested by
226
+ the Licensor (including by pseudonym if
227
+ designated);
228
+
229
+ ii. a copyright notice;
230
+
231
+ iii. a notice that refers to this Public License;
232
+
233
+ iv. a notice that refers to the disclaimer of
234
+ warranties;
235
+
236
+ v. a URI or hyperlink to the Licensed Material to the
237
+ extent reasonably practicable;
238
+
239
+ b. indicate if You modified the Licensed Material and
240
+ retain an indication of any previous modifications; and
241
+
242
+ c. indicate the Licensed Material is licensed under this
243
+ Public License, and include the text of, or the URI or
244
+ hyperlink to, this Public License.
245
+
246
+ 2. You may satisfy the conditions in Section 3(a)(1) in any
247
+ reasonable manner based on the medium, means, and context in
248
+ which You Share the Licensed Material. For example, it may be
249
+ reasonable to satisfy the conditions by providing a URI or
250
+ hyperlink to a resource that includes the required
251
+ information.
252
+
253
+ 3. If requested by the Licensor, You must remove any of the
254
+ information required by Section 3(a)(1)(A) to the extent
255
+ reasonably practicable.
256
+
257
+ 4. If You Share Adapted Material You produce, the Adapter's
258
+ License You apply must not prevent recipients of the Adapted
259
+ Material from complying with this Public License.
260
+
261
+
262
+ Section 4 -- Sui Generis Database Rights.
263
+
264
+ Where the Licensed Rights include Sui Generis Database Rights that
265
+ apply to Your use of the Licensed Material:
266
+
267
+ a. for the avoidance of doubt, Section 2(a)(1) grants You the right
268
+ to extract, reuse, reproduce, and Share all or a substantial
269
+ portion of the contents of the database;
270
+
271
+ b. if You include all or a substantial portion of the database
272
+ contents in a database in which You have Sui Generis Database
273
+ Rights, then the database in which You have Sui Generis Database
274
+ Rights (but not its individual contents) is Adapted Material; and
275
+
276
+ c. You must comply with the conditions in Section 3(a) if You Share
277
+ all or a substantial portion of the contents of the database.
278
+
279
+ For the avoidance of doubt, this Section 4 supplements and does not
280
+ replace Your obligations under this Public License where the Licensed
281
+ Rights include other Copyright and Similar Rights.
282
+
283
+
284
+ Section 5 -- Disclaimer of Warranties and Limitation of Liability.
285
+
286
+ a. UNLESS OTHERWISE SEPARATELY UNDERTAKEN BY THE LICENSOR, TO THE
287
+ EXTENT POSSIBLE, THE LICENSOR OFFERS THE LICENSED MATERIAL AS-IS
288
+ AND AS-AVAILABLE, AND MAKES NO REPRESENTATIONS OR WARRANTIES OF
289
+ ANY KIND CONCERNING THE LICENSED MATERIAL, WHETHER EXPRESS,
290
+ IMPLIED, STATUTORY, OR OTHER. THIS INCLUDES, WITHOUT LIMITATION,
291
+ WARRANTIES OF TITLE, MERCHANTABILITY, FITNESS FOR A PARTICULAR
292
+ PURPOSE, NON-INFRINGEMENT, ABSENCE OF LATENT OR OTHER DEFECTS,
293
+ ACCURACY, OR THE PRESENCE OR ABSENCE OF ERRORS, WHETHER OR NOT
294
+ KNOWN OR DISCOVERABLE. WHERE DISCLAIMERS OF WARRANTIES ARE NOT
295
+ ALLOWED IN FULL OR IN PART, THIS DISCLAIMER MAY NOT APPLY TO YOU.
296
+
297
+ b. TO THE EXTENT POSSIBLE, IN NO EVENT WILL THE LICENSOR BE LIABLE
298
+ TO YOU ON ANY LEGAL THEORY (INCLUDING, WITHOUT LIMITATION,
299
+ NEGLIGENCE) OR OTHERWISE FOR ANY DIRECT, SPECIAL, INDIRECT,
300
+ INCIDENTAL, CONSEQUENTIAL, PUNITIVE, EXEMPLARY, OR OTHER LOSSES,
301
+ COSTS, EXPENSES, OR DAMAGES ARISING OUT OF THIS PUBLIC LICENSE OR
302
+ USE OF THE LICENSED MATERIAL, EVEN IF THE LICENSOR HAS BEEN
303
+ ADVISED OF THE POSSIBILITY OF SUCH LOSSES, COSTS, EXPENSES, OR
304
+ DAMAGES. WHERE A LIMITATION OF LIABILITY IS NOT ALLOWED IN FULL OR
305
+ IN PART, THIS LIMITATION MAY NOT APPLY TO YOU.
306
+
307
+ c. The disclaimer of warranties and limitation of liability provided
308
+ above shall be interpreted in a manner that, to the extent
309
+ possible, most closely approximates an absolute disclaimer and
310
+ waiver of all liability.
311
+
312
+
313
+ Section 6 -- Term and Termination.
314
+
315
+ a. This Public License applies for the term of the Copyright and
316
+ Similar Rights licensed here. However, if You fail to comply with
317
+ this Public License, then Your rights under this Public License
318
+ terminate automatically.
319
+
320
+ b. Where Your right to use the Licensed Material has terminated under
321
+ Section 6(a), it reinstates:
322
+
323
+ 1. automatically as of the date the violation is cured, provided
324
+ it is cured within 30 days of Your discovery of the
325
+ violation; or
326
+
327
+ 2. upon express reinstatement by the Licensor.
328
+
329
+ For the avoidance of doubt, this Section 6(b) does not affect any
330
+ right the Licensor may have to seek remedies for Your violations
331
+ of this Public License.
332
+
333
+ c. For the avoidance of doubt, the Licensor may also offer the
334
+ Licensed Material under separate terms or conditions or stop
335
+ distributing the Licensed Material at any time; however, doing so
336
+ will not terminate this Public License.
337
+
338
+ d. Sections 1, 5, 6, 7, and 8 survive termination of this Public
339
+ License.
340
+
341
+
342
+ Section 7 -- Other Terms and Conditions.
343
+
344
+ a. The Licensor shall not be bound by any additional or different
345
+ terms or conditions communicated by You unless expressly agreed.
346
+
347
+ b. Any arrangements, understandings, or agreements regarding the
348
+ Licensed Material not stated herein are separate from and
349
+ independent of the terms and conditions of this Public License.
350
+
351
+
352
+ Section 8 -- Interpretation.
353
+
354
+ a. For the avoidance of doubt, this Public License does not, and
355
+ shall not be interpreted to, reduce, limit, restrict, or impose
356
+ conditions on any use of the Licensed Material that could lawfully
357
+ be made without permission under this Public License.
358
+
359
+ b. To the extent possible, if any provision of this Public License is
360
+ deemed unenforceable, it shall be automatically reformed to the
361
+ minimum extent necessary to make it enforceable. If the provision
362
+ cannot be reformed, it shall be severed from this Public License
363
+ without affecting the enforceability of the remaining terms and
364
+ conditions.
365
+
366
+ c. No term or condition of this Public License will be waived and no
367
+ failure to comply consented to unless expressly agreed to by the
368
+ Licensor.
369
+
370
+ d. Nothing in this Public License constitutes or may be interpreted
371
+ as a limitation upon, or waiver of, any privileges and immunities
372
+ that apply to the Licensor or You, including from the legal
373
+ processes of any jurisdiction or authority.
374
+
375
+
376
+ =======================================================================
377
+
378
+ Creative Commons is not a party to its public
379
+ licenses. Notwithstanding, Creative Commons may elect to apply one of
380
+ its public licenses to material it publishes and in those instances
381
+ will be considered the “Licensor.” The text of the Creative Commons
382
+ public licenses is dedicated to the public domain under the CC0 Public
383
+ Domain Dedication. Except for the limited purpose of indicating that
384
+ material is shared under a Creative Commons public license or as
385
+ otherwise permitted by the Creative Commons policies published at
386
+ creativecommons.org/policies, Creative Commons does not authorize the
387
+ use of the trademark "Creative Commons" or any other trademark or logo
388
+ of Creative Commons without its prior written consent including,
389
+ without limitation, in connection with any unauthorized modifications
390
+ to any of its public licenses or any other arrangements,
391
+ understandings, or agreements concerning use of licensed material. For
392
+ the avoidance of doubt, this paragraph does not form part of the
393
+ public licenses.
394
+
395
+ Creative Commons may be contacted at creativecommons.org.
README.md ADDED
@@ -0,0 +1,150 @@
1
+ # English MFA model
2
+
3
+ ## Model details
4
+
5
+ - **Maintainer:** [Montreal Corpus Tools](https://huggingface.co/MontrealCorpusTools)
6
+ - **Language:** [English](https://en.wikipedia.org/wiki/English_language)
7
+ - **Dialect:** N/A
8
+ - **Phone set:** [MFA](https://mfa-models.readthedocs.io/en/refactor/mfa_phone_set.html#english)
9
+ - **Features:** `MFCC`
10
+ - **Architecture:** `gmm-hmm`
11
+ - **Model version:** `v3.1.0`
12
+ - **Trained date:** `2024-06-12`
13
+ - **Compatible MFA version:** `v3.1.0`
14
+ - **License:** [CC BY 4.0](https://huggingface.co/MontrealCorpusTools/english_mfa/LICENSE)
15
+ - **Citation:**
16
+
17
+ ```bibtex
18
+ @techreport{mfa_english_mfa_acoustic_2024,
19
+ author={McAuliffe, Michael and Sonderegger, Morgan},
20
+ title={English MFA acoustic model v3.1.0},
21
+ address={\url{https://huggingface.co/MontrealCorpusTools/english_mfa}},
22
+ year={2024},
23
+ month={Jun},
24
+ }
25
+ ```
26
+
27
+ - If you have comments or questions about this model, you can check [previous MFA model discussion posts](https://github.com/MontrealCorpusTools/mfa-models/discussions?discussions_q=English+MFA+acoustic+model+v3.1.0) or create [a new one](https://github.com/MontrealCorpusTools/mfa-models/discussions/new).
28
+
29
+ ## Installation
30
+
31
+ Install from the [MFA command line](https://montreal-forced-aligner.readthedocs.io/en/latest/user_guide/models/index.html):
32
+
33
+ ```
34
+ mfa model download acoustic english_mfa
35
+ ```
36
+
37
+ Or download from [the release page](https://github.com/MontrealCorpusTools/mfa-models/releases/tag/acoustic-english_mfa-v3.1.0).
38
+
39
+ ## Intended use
40
+
41
+ This model is intended for forced alignment of [English](https://en.wikipedia.org/wiki/English_language) transcripts.
42
+
43
+ This model uses the [MFA](https://mfa-models.readthedocs.io/en/refactor/mfa_phone_set.html#english) phone set for English, and was trained with the pronunciation dictionaries above. Pronunciations can be added on top of the dictionary, as long as no additional phones are introduced.
44
+
45
+ ## Performance Factors
46
+
47
+ As forced alignment is a relatively well-constrained problem (given accurate transcripts), this model should be applicable to a range of recording conditions and speakers. However, it was trained on read speech in low-noise environments, so the further your data diverges from those conditions, the more likely you are to run into alignment issues. In that case, you may need to [increase the beam size of MFA](https://montreal-forced-aligner.readthedocs.io/en/latest/user_guide/configuration/#configuring-specific-commands) or try the other recommendations in the [troubleshooting section below](#troubleshooting-issues).
48
+
49
+ Please note as well that MFA does not use state-of-the-art ASR models for forced alignment. You may get better performance (especially on speech-to-text tasks) using other frameworks like [Coqui](https://coqui.ai/).
50
+
51
+ ## Metrics
52
+
53
+ Acoustic models are typically generated as one component of a larger ASR system where the metric is word error rate (WER). For forced alignment, there is typically not the same sort of gold standard measure for most languages.
54
+
55
+ As a rough approximation of the acoustic model quality, we evaluated it against the corpus it was trained on alongside a language model trained from the same data. The key caveat is that this is not a typical WER measure on held-out data, so it should not be taken as a hard measure of how well the acoustic model will generalize to your data; it is more of a sanity check that the training data quality was sufficiently high.
56
+
57
+ Using the pronunciation dictionaries and language models above:
58
+
59
+ - **WER:** `0%`
60
+ - **CER:** `0%`
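For readers unfamiliar with the two metrics: WER is conventionally the word-level edit distance between a reference and a hypothesis transcript divided by the number of reference words, and CER is the same ratio computed over characters. The sketch below illustrates that textbook definition only. It is not MFA's evaluation code, and the function names `edit_distance`, `wer`, and `cer` are made up for this example.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences.

    Counts the minimum number of substitutions, insertions, and
    deletions needed to turn `ref` into `hyp`.
    """
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # delete a reference item
                curr[j - 1] + 1,     # insert a hypothesis item
                prev[j - 1] + cost,  # match or substitute
            ))
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate: word-level edit distance / reference word count."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference, hypothesis):
    """Character error rate: character-level edit distance / reference length."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(list(reference), list(hypothesis)) / len(reference)
```

A value of `0.0` from either function means the hypothesis reproduces the reference exactly, which is what the `0%` figures above report for the training corpus.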
61
+
62
+ ## Ethical considerations
63
+
64
+ Deploying any Speech-to-Text model into any production setting has ethical implications. You should consider these implications before use.
65
+
66
+ ### Demographic Bias
67
+
68
+ You should assume every machine learning model has demographic bias unless proven otherwise. For STT models, it is often the case that transcription accuracy is better for men than it is for women. If you are using this model in production, you should acknowledge this as a potential issue.
69
+
70
+ ### Surveillance
71
+
72
+ Speech-to-Text technologies may be misused to invade the privacy of others by recording and mining information from private conversations. This kind of individual privacy is protected by law in many countries. You should not assume consent to record and analyze private speech.
73
+
74
+
75
+ ## Troubleshooting issues
76
+
77
+ Machine learning models (like this acoustic model) perform best on data that is similar to the data on which they were trained.
78
+
79
+ The primary sources of variability in forced alignment will be the applicability of the pronunciation dictionary and how similar the speech, demographics, and recording conditions are to the training data. If you encounter issues in alignment, there are a few avenues to improve performance:
80
+
81
+ 1. [Increase the beam size of MFA](https://montreal-forced-aligner.readthedocs.io/en/latest/user_guide/configuration/#configuring-specific-commands)
82
+
83
+ * MFA defaults to a narrow beam to ensure quick alignment and also as a way to detect potential issues in your dataset, but depending on your data, you might benefit from boosting the beam to 100 or higher.
84
+
85
+ 2. Add pronunciations to the pronunciation dictionary
86
+
87
+ * This model was trained on a particular dialect/style, and so adding pronunciations more representative of the variety spoken in your dataset will help alignment.
88
+
89
+ 3. Check the quality of your data
90
+
91
+ * MFA includes a [validator utility](https://montreal-forced-aligner.readthedocs.io/en/latest/user_guide/data_validation.html), which aims to detect issues in the dataset.
92
+ * Use MFA's [anchor utility](https://montreal-forced-aligner.readthedocs.io/en/latest/user_guide/workflows/anchor.html) to visually inspect your data as MFA sees it and correct issues in transcription or OOV items.
93
+
94
+ 4. Adapt the model to your data
95
+
96
+ * MFA has an [adaptation command](https://montreal-forced-aligner.readthedocs.io/en/latest/user_guide/workflows/adapt_acoustic_model.html) to adapt some of the model to your data based on an initial alignment, and then run another alignment with the adapted model.
97
+
98
+ ## Training data
99
+
100
+ This model was trained on the following corpora:
101
+
102
+ * [Common Voice English v17](https://datacollective.mozillafoundation.org/datasets/cmj8u3p1w0075nxxbe8bedl00):
103
+ * **Hours:** `2322.80`
104
+ * **Speakers:** `71,160`
105
+ * **Utterances:** `1,625,987`
106
+
107
+ * [LibriSpeech English](https://openslr.org/12/):
108
+ * **Hours:** `982.10`
109
+ * **Speakers:** `2,484`
110
+ * **Utterances:** `292,367`
111
+
112
+ * [Corpus of Regional African American Language](https://oraal.github.io/coraal):
113
+ * **Hours:** `124.31`
114
+ * **Speakers:** `193`
115
+ * **Utterances:** `236,792`
116
+
117
+ * [Google Nigerian English](https://openslr.org/70/):
118
+ * **Hours:** `5.77`
119
+ * **Speakers:** `31`
120
+ * **Utterances:** `3,359`
121
+
122
+ * [Google UK and Ireland English](https://openslr.org/83/):
123
+ * **Hours:** `31.29`
124
+ * **Speakers:** `120`
125
+ * **Utterances:** `17,877`
126
+
127
+ * [NCHLT English](https://repo.sadilar.org/items/d944b028-6a86-4edf-a7d9-7f5e21544a41):
128
+ * **Hours:** `56.43`
129
+ * **Speakers:** `210`
130
+ * **Utterances:** `77,412`
131
+
132
+ * [ARU English corpus](https://datacat.liverpool.ac.uk/681/):
133
+ * **Hours:** `7.13`
134
+ * **Speakers:** `12`
135
+ * **Utterances:** `8,640`
136
+
137
+ * [ICE-Nigeria](https://sourceforge.net/projects/ice-nigeria/):
138
+ * **Hours:** `52.86`
139
+ * **Speakers:** `1,276`
140
+ * **Utterances:** `113,664`
141
+
142
+ * [A Scripted Pakistani English Daily-use Speech Corpus](https://magichub.com/datasets/pakistani-english-scripted-speech-corpus-daily-use-sentence/):
143
+ * **Hours:** `4.00`
144
+ * **Speakers:** `7`
145
+ * **Utterances:** `2,191`
146
+
147
+ * [L2-ARCTIC](https://psi.engr.tamu.edu/l2-arctic-corpus/):
148
+ * **Hours:** `27.51`
149
+ * **Speakers:** `24`
150
+ * **Utterances:** `27,042`
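To get a feel for how the corpora above are weighted, the hours can be totalled and each corpus expressed as a share of the whole. The sketch below uses only the hour figures listed in the README; the variable names are illustrative, and the shares say nothing about how MFA weights data during training.

```python
# Hours per corpus, copied from the training data list above.
CORPUS_HOURS = {
    "Common Voice English v17": 2322.80,
    "LibriSpeech English": 982.10,
    "Corpus of Regional African American Language": 124.31,
    "Google Nigerian English": 5.77,
    "Google UK and Ireland English": 31.29,
    "NCHLT English": 56.43,
    "ARU English corpus": 7.13,
    "ICE-Nigeria": 52.86,
    "A Scripted Pakistani English Daily-use Speech Corpus": 4.00,
    "L2-ARCTIC": 27.51,
}

total_hours = sum(CORPUS_HOURS.values())

# Fraction of the total contributed by each corpus.
shares = {name: hours / total_hours for name, hours in CORPUS_HOURS.items()}

# Print the corpora from largest to smallest share.
for name, share in sorted(shares.items(), key=lambda item: item[1], reverse=True):
    print(f"{name}: {CORPUS_HOURS[name]:.2f} h ({share:.1%})")
print(f"Total: {total_hours:.2f} h")
```

The total comes to 3614.20 hours, with Common Voice and LibriSpeech together accounting for roughly nine tenths of it.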
acoustic/final.alimdl ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1780c629972e968a2fc4a7808c0076945df62865a0acb6ad09a02feed8d74edf
3
+ size 50106787
acoustic/final.mdl ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1d3138688bfab925e108dcc9519d9efe53584aa935b41eb130450cb8ac3d48e8
3
+ size 50106787
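The two files above are Git LFS pointers rather than the models themselves: each records a spec version, a SHA-256 object id, and the size in bytes of the real file. As a minimal sketch, a pointer can be parsed and a downloaded file checked against it as shown below. `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, and any path passed to `matches_pointer` is up to the caller.

```python
import hashlib
from pathlib import Path

# Pointer contents for acoustic/final.mdl, copied from the diff above.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:1d3138688bfab925e108dcc9519d9efe53584aa935b41eb130450cb8ac3d48e8
size 50106787
"""


def parse_lfs_pointer(text):
    """Parse the `key value` lines of a Git LFS pointer into a dict."""
    fields = dict(
        line.split(" ", 1) for line in text.splitlines() if line.strip()
    )
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer):
    """Return True if the file at `path` has the size and hash the pointer records."""
    path = Path(path)
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # Hash in 1 MiB chunks so large model files are not read into memory at once.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


pointer = parse_lfs_pointer(POINTER_TEXT)
```

Note that `final.alimdl` and `final.mdl` report the same size but different object ids, so a size check alone would not tell them apart; the hash comparison does.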
acoustic/graphemes.txt ADDED
@@ -0,0 +1,48 @@
1
+ <eps> 0
2
+ <space> 1
3
+ [bracketed] 2
4
+ <cutoff> 3
5
+ [laughter] 4
6
+ ' 5
7
+ - 6
8
+ / 7
9
+ [ 8
10
+ ] 9
11
+ a 10
12
+ b 11
13
+ c 12
14
+ d 13
15
+ e 14
16
+ f 15
17
+ g 16
18
+ h 17
19
+ i 18
20
+ j 19
21
+ k 20
22
+ l 21
23
+ m 22
24
+ n 23
25
+ o 24
26
+ p 25
27
+ q 26
28
+ r 27
29
+ s 28
30
+ t 29
31
+ u 30
32
+ v 31
33
+ w 32
34
+ x 33
35
+ y 34
36
+ z 35
37
+ à 36
38
+ é 37
39
+ í 38
40
+ ô 39
41
+ ü 40
42
+ ‑ 41
43
+ ‘ 42
44
+ ’ 43
45
+ <unk> 44
46
+ #0 45
47
+ <s> 46
48
+ </s> 47
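`graphemes.txt` has the shape of a symbol table: one symbol per line followed by its integer id. A minimal sketch for loading such a table and mapping a word's characters to ids is given below. The sample is an excerpt of the entries above, the helper names are hypothetical, and falling back to `<unk>` for unseen characters is an assumption made for illustration; the commit does not show how MFA itself consumes this file.

```python
# Excerpt of acoustic/graphemes.txt from the diff above.
SAMPLE = """\
<eps> 0
<space> 1
' 5
a 10
c 12
e 14
f 15
é 37
<unk> 44
#0 45
"""


def parse_symbol_table(text):
    """Map each symbol to its integer id, one `symbol id` pair per line."""
    table = {}
    for line in text.splitlines():
        if not line.strip():
            continue
        # Split on the last run of whitespace so the id is always the final field.
        symbol, index = line.rsplit(None, 1)
        table[symbol] = int(index)
    return table


def encode(word, table):
    """Convert a word to grapheme ids, using <unk> for characters not in the table."""
    unknown = table["<unk>"]
    return [table.get(char, unknown) for char in word]


table = parse_symbol_table(SAMPLE)
ids = encode("café", table)
```

With the excerpt above, `encode("café", table)` yields `[12, 10, 15, 37]`, and a character missing from the table, such as a digit, maps to `44`.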
acoustic/lda.mat ADDED
Binary file (14.6 kB).
 
acoustic/meta.json ADDED
@@ -0,0 +1,487 @@
1
+ {
2
+ "phones": [
3
+ "a",
4
+ "aj",
5
+ "aw",
6
+ "aː",
7
+ "b",
8
+ "bʲ",
9
+ "c",
10
+ "cʰ",
11
+ "cʷ",
12
+ "d",
13
+ "dʒ",
14
+ "dʲ",
15
+ "d̪",
16
+ "e",
17
+ "ej",
18
+ "eː",
19
+ "f",
20
+ "fʲ",
21
+ "fʷ",
22
+ "h",
23
+ "i",
24
+ "iː",
25
+ "j",
26
+ "k",
27
+ "kp",
28
+ "kʰ",
29
+ "kʷ",
30
+ "l",
31
+ "m",
32
+ "mʲ",
33
+ "m̩",
34
+ "n",
35
+ "n̩",
36
+ "o",
37
+ "ow",
38
+ "oː",
39
+ "p",
40
+ "pʰ",
41
+ "pʲ",
42
+ "pʷ",
43
+ "s",
44
+ "t",
45
+ "tʃ",
46
+ "tʰ",
47
+ "tʲ",
48
+ "tʷ",
49
+ "t̪",
50
+ "u",
51
+ "uː",
52
+ "v",
53
+ "vʲ",
54
+ "vʷ",
55
+ "w",
56
+ "z",
57
+ "æ",
58
+ "ç",
59
+ "ð",
60
+ "ŋ",
61
+ "ɐ",
62
+ "ɑ",
63
+ "ɑː",
64
+ "ɒ",
65
+ "ɒː",
66
+ "ɔ",
67
+ "ɔj",
68
+ "ɖ",
69
+ "ə",
70
+ "əw",
71
+ "ɚ",
72
+ "ɛ",
73
+ "ɛː",
74
+ "ɜ",
75
+ "ɜː",
76
+ "ɝ",
77
+ "ɟ",
78
+ "ɟʷ",
79
+ "ɡ",
80
+ "ɡb",
81
+ "ɡʷ",
82
+ "ɪ",
83
+ "ɫ",
84
+ "ɫ̩",
85
+ "ɱ",
86
+ "ɲ",
87
+ "ɹ",
88
+ "ɾ",
89
+ "ɾʲ",
90
+ "ɾ̃",
91
+ "ʃ",
92
+ "ʈ",
93
+ "ʈʲ",
94
+ "ʈʷ",
95
+ "ʉ",
96
+ "ʉː",
97
+ "ʊ",
98
+ "ʋ",
99
+ "ʎ",
100
+ "ʒ",
101
+ "ʔ",
102
+ "θ"
103
+ ],
104
+ "phone_mapping": {
105
+ "<eps>": 0,
106
+ "sil": 1,
107
+ "spn": 2,
108
+ "a": 3,
109
+ "aj": 4,
110
+ "aw": 5,
111
+ "aː": 6,
112
+ "b": 7,
113
+ "bʲ": 8,
114
+ "c": 9,
115
+ "cʰ": 10,
116
+ "cʷ": 11,
117
+ "d": 12,
118
+ "dʒ": 13,
119
+ "dʲ": 14,
120
+ "d̪": 15,
121
+ "e": 16,
122
+ "ej": 17,
123
+ "eː": 18,
124
+ "f": 19,
125
+ "fʲ": 20,
126
+ "fʷ": 21,
127
+ "h": 22,
128
+ "i": 23,
129
+ "iː": 24,
130
+ "j": 25,
131
+ "k": 26,
132
+ "kp": 27,
133
+ "kʰ": 28,
134
+ "kʷ": 29,
135
+ "l": 30,
136
+ "m": 31,
137
+ "mʲ": 32,
138
+ "m̩": 33,
139
+ "n": 34,
140
+ "n̩": 35,
141
+ "o": 36,
142
+ "ow": 37,
143
+ "oː": 38,
144
+ "p": 39,
145
+ "pʰ": 40,
146
+ "pʲ": 41,
147
+ "pʷ": 42,
148
+ "s": 43,
149
+ "t": 44,
150
+ "tʃ": 45,
151
+ "tʰ": 46,
152
+ "tʲ": 47,
153
+ "tʷ": 48,
154
+ "t̪": 49,
155
+ "u": 50,
156
+ "uː": 51,
157
+ "v": 52,
158
+ "vʲ": 53,
159
+ "vʷ": 54,
160
+ "w": 55,
161
+ "z": 56,
162
+ "æ": 57,
163
+ "ç": 58,
164
+ "ð": 59,
165
+ "ŋ": 60,
166
+ "ɐ": 61,
167
+ "ɑ": 62,
168
+ "ɑː": 63,
169
+ "ɒ": 64,
170
+ "ɒː": 65,
171
+ "ɔ": 66,
172
+ "ɔj": 67,
173
+ "ɖ": 68,
174
+ "ə": 69,
175
+ "əw": 70,
176
+ "ɚ": 71,
177
+ "ɛ": 72,
178
+ "ɛː": 73,
179
+ "ɜ": 74,
180
+ "ɜː": 75,
181
+ "ɝ": 76,
182
+ "ɟ": 77,
183
+ "ɟʷ": 78,
184
+ "ɡ": 79,
185
+ "ɡb": 80,
186
+ "ɡʷ": 81,
187
+ "ɪ": 82,
188
+ "ɫ": 83,
189
+ "ɫ̩": 84,
190
+ "ɱ": 85,
191
+ "ɲ": 86,
192
+ "ɹ": 87,
193
+ "ɾ": 88,
194
+ "ɾʲ": 89,
195
+ "ɾ̃": 90,
196
+ "ʃ": 91,
197
+ "ʈ": 92,
198
+ "ʈʲ": 93,
199
+ "ʈʷ": 94,
200
+ "ʉ": 95,
201
+ "ʉː": 96,
202
+ "ʊ": 97,
203
+ "ʋ": 98,
204
+ "ʎ": 99,
205
+ "ʒ": 100,
206
+ "ʔ": 101,
207
+ "θ": 102
208
+ },
209
+ "phone_groups": {
210
+ "0": [
211
+ "kp",
212
+ "p",
213
+ "pʰ",
214
+ "pʲ",
215
+ "pʷ"
216
+ ],
217
+ "1": [
218
+ "b",
219
+ "bʲ",
220
+ "ɡb"
221
+ ],
222
+ "2": [
223
+ "f",
224
+ "fʲ",
225
+ "fʷ"
226
+ ],
227
+ "3": [
228
+ "v",
229
+ "vʲ",
230
+ "vʷ"
231
+ ],
232
+ "4": [
233
+ "θ"
234
+ ],
235
+ "5": [
236
+ "t̪"
237
+ ],
238
+ "6": [
239
+ "ð"
240
+ ],
241
+ "7": [
242
+ "d̪"
243
+ ],
244
+ "8": [
245
+ "t",
246
+ "tʰ",
247
+ "tʲ",
248
+ "tʷ",
249
+ "ʈ",
250
+ "ʈʲ",
251
+ "ʈʷ"
252
+ ],
253
+ "9": [
254
+ "ʔ"
255
+ ],
256
+ "10": [
257
+ "d",
258
+ "dʲ",
259
+ "ɖ"
260
+ ],
261
+ "11": [
262
+ "ɾ",
263
+ "ɾʲ"
264
+ ],
265
+ "12": [
266
+ "tʃ"
267
+ ],
268
+ "13": [
269
+ "dʒ"
270
+ ],
271
+ "14": [
272
+ "ʃ"
273
+ ],
274
+ "15": [
275
+ "ʒ"
276
+ ],
277
+ "16": [
278
+ "s"
279
+ ],
280
+ "17": [
281
+ "z"
282
+ ],
283
+ "18": [
284
+ "ɹ"
285
+ ],
286
+ "19": [
287
+ "m",
288
+ "m̩"
289
+ ],
290
+ "20": [
291
+ "mʲ"
292
+ ],
293
+ "21": [
294
+ "ɱ"
295
+ ],
296
+ "22": [
297
+ "n",
298
+ "n̩"
299
+ ],
300
+ "23": [
301
+ "ɲ"
302
+ ],
303
+ "24": [
304
+ "ɾ̃"
305
+ ],
306
+ "25": [
307
+ "ŋ"
308
+ ],
309
+ "26": [
310
+ "l"
311
+ ],
312
+ "27": [
313
+ "ɫ",
314
+ "ɫ̩"
315
+ ],
316
+ "28": [
317
+ "ʎ"
318
+ ],
319
+ "29": [
320
+ "ɟ",
321
+ "ɟʷ",
322
+ "ɡ",
323
+ "ɡʷ"
324
+ ],
325
+ "30": [
326
+ "c",
327
+ "cʰ",
328
+ "cʷ"
329
+ ],
330
+ "31": [
331
+ "k",
332
+ "kʰ",
333
+ "kʷ"
334
+ ],
335
+ "32": [
336
+ "ç"
337
+ ],
338
+ "33": [
339
+ "h"
340
+ ],
341
+ "34": [
342
+ "ɐ",
343
+ "ə"
344
+ ],
345
+ "35": [
346
+ "ɜ",
347
+ "ɜː"
348
+ ],
349
+ "36": [
350
+ "ɚ",
351
+ "ɝ"
352
+ ],
353
+ "37": [
354
+ "ʊ"
355
+ ],
356
+ "38": [
357
+ "ɪ"
358
+ ],
359
+ "39": [
360
+ "ɑ",
361
+ "ɑː"
362
+ ],
363
+ "40": [
364
+ "ɒ",
365
+ "ɒː",
366
+ "ɔ"
367
+ ],
368
+ "41": [
369
+ "a",
370
+ "aː"
371
+ ],
372
+ "42": [
373
+ "æ"
374
+ ],
375
+ "43": [
376
+ "aj"
377
+ ],
378
+ "44": [
379
+ "aw"
380
+ ],
381
+ "45": [
382
+ "i",
383
+ "iː"
384
+ ],
385
+ "46": [
386
+ "j"
387
+ ],
388
+ "47": [
389
+ "ɛ",
390
+ "ɛː"
391
+ ],
392
+ "48": [
393
+ "e",
394
+ "ej",
395
+ "eː"
396
+ ],
397
+ "49": [
398
+ "ʉ",
399
+ "ʉː"
400
+ ],
401
+ "50": [
402
+ "u",
403
+ "uː"
404
+ ],
405
+ "51": [
406
+ "w"
407
+ ],
408
+ "52": [
409
+ "ʋ"
410
+ ],
411
+ "53": [
412
+ "ɔj"
413
+ ],
414
+ "54": [
415
+ "o",
416
+ "ow",
417
+ "oː",
418
+ "əw"
419
+ ]
420
+ },
421
+ "version": "3.1.0",
422
+ "architecture": "gmm-hmm",
423
+ "train_date": "2024-06-12 12:16:18.584033",
424
+ "training": {
425
+ "audio_duration": 12862940.052134357,
426
+ "num_speakers": 75018,
427
+ "num_utterances": 2374755,
428
+ "num_oovs": 0,
429
+ "average_log_likelihood": -0.08382050453507844
430
+ },
431
+ "dictionaries": {
432
+ "names": [
433
+ "default",
434
+ "english_india_mfa",
435
+ "english_nigeria_mfa",
436
+ "english_uk_mfa",
437
+ "english_us_mfa",
438
+ "nonnative"
439
+ ],
440
+ "default": "default",
441
+ "silence_word": "<eps>",
442
+ "use_g2p": false,
443
+ "oov_word": "<unk>",
444
+ "bracketed_word": "[bracketed]",
445
+ "laughter_word": "[laughter]",
446
+ "clitic_marker": "'",
447
+ "position_dependent_phones": false
448
+ },
449
+ "language": "unknown",
450
+ "features": {
451
+ "type": "mfcc",
452
+ "use_energy": true,
453
+ "frame_shift": 10,
454
+ "frame_length": 25,
455
+ "snip_edges": false,
456
+ "low_frequency": 20,
457
+ "high_frequency": 7800,
458
+ "sample_frequency": 16000,
459
+ "dither": 0.0001,
460
+ "energy_floor": 1.0,
461
+ "num_coefficients": 13,
462
+ "num_mel_bins": 23,
463
+ "cepstral_lifter": 22,
464
+ "preemphasis_coefficient": 0.97,
465
+ "uses_cmvn": true,
466
+ "uses_deltas": true,
467
+ "uses_voiced": false,
468
+ "uses_splices": false,
469
+ "uses_speaker_adaptation": true,
470
+ "use_pitch": false,
471
+ "use_voicing": false,
472
+ "min_f0": 50,
473
+ "max_f0": 800,
474
+ "delta_pitch": 0.005,
475
+ "penalty_factor": 0.1,
476
+ "silence_weight": 0.0,
477
+ "splice_left_context": 3,
478
+ "splice_right_context": 3
479
+ },
480
+ "oov_phone": "spn",
481
+ "optional_silence_phone": "sil",
482
+ "phone_set_type": "UNKNOWN",
483
+ "silence_probability": 0.17,
484
+ "initial_silence_probability": 0.17,
485
+ "final_silence_correction": 0.99,
486
+ "final_non_silence_correction": 0.2966666666666667
487
+ }
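A reviewer could sanity-check meta.json with a few lines of Python. The sketch below is hedged: `check_meta` is a hypothetical helper, the embedded JSON is a reduced excerpt of the file above, and it assumes `audio_duration` is given in seconds (the file does not state the unit).

```python
import json


def check_meta(meta):
    """Return a list of consistency problems found in an acoustic-model meta dict."""
    problems = []
    phones = meta["phones"]
    mapping = meta["phone_mapping"]
    # Symbols expected in the mapping that are not ordinary phones.
    non_phone = {"<eps>", meta["optional_silence_phone"], meta["oov_phone"]}

    missing = [p for p in phones if p not in mapping]
    if missing:
        problems.append(f"phones without an ID: {missing}")

    extra = set(mapping) - set(phones) - non_phone
    if extra:
        problems.append(f"mapped symbols not listed as phones: {sorted(extra)}")

    ids = sorted(mapping.values())
    if ids != list(range(len(ids))):
        problems.append("phone IDs are not contiguous from 0")

    # Every phone should appear in exactly one phone group.
    grouped = [p for group in meta["phone_groups"].values() for p in group]
    if sorted(grouped) != sorted(phones):
        problems.append("phone_groups do not partition the phone list")

    return problems


# Reduced excerpt of meta.json (values copied from the file above).
meta = json.loads("""
{
  "phones": ["a", "aj", "aw", "aː"],
  "phone_mapping": {"<eps>": 0, "sil": 1, "spn": 2, "a": 3, "aj": 4, "aw": 5, "aː": 6},
  "phone_groups": {"41": ["a", "aː"], "43": ["aj"], "44": ["aw"]},
  "oov_phone": "spn",
  "optional_silence_phone": "sil",
  "training": {"audio_duration": 12862940.052134357}
}
""")

problems = check_meta(meta)
# Assumption: audio_duration is in seconds.
hours = meta["training"]["audio_duration"] / 3600
```

Under the seconds assumption, the reported `audio_duration` corresponds to roughly 3,573 hours of training audio.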
acoustic/phone_lm.fst ADDED

Git LFS Details

  • SHA256: b60ab3b7da03b1751fb3f5ff87124cbe7c8eec462ebf261ee4cbdd430cb2f8ba
  • Pointer size: 131 Bytes
  • Size of remote file: 519 kB
acoustic/phone_pdf.counts ADDED
The diff for this file is too large to render.
 
acoustic/phones.txt ADDED
@@ -0,0 +1,103 @@
1
+ <eps> 0
2
+ sil 1
3
+ spn 2
4
+ a 3
5
+ aj 4
6
+ aw 5
7
+ aː 6
8
+ b 7
9
+ bʲ 8
10
+ c 9
11
+ cʰ 10
12
+ cʷ 11
13
+ d 12
14
+ dʒ 13
15
+ dʲ 14
16
+ d̪ 15
17
+ e 16
18
+ ej 17
19
+ eː 18
20
+ f 19
21
+ fʲ 20
22
+ fʷ 21
23
+ h 22
24
+ i 23
25
+ iː 24
26
+ j 25
27
+ k 26
28
+ kp 27
29
+ kʰ 28
30
+ kʷ 29
31
+ l 30
32
+ m 31
33
+ mʲ 32
34
+ m̩ 33
35
+ n 34
36
+ n̩ 35
37
+ o 36
38
+ ow 37
39
+ oː 38
40
+ p 39
41
+ pʰ 40
42
+ pʲ 41
43
+ pʷ 42
44
+ s 43
45
+ t 44
46
+ tʃ 45
47
+ tʰ 46
48
+ tʲ 47
49
+ tʷ 48
50
+ t̪ 49
51
+ u 50
52
+ uː 51
53
+ v 52
54
+ vʲ 53
55
+ vʷ 54
56
+ w 55
57
+ z 56
58
+ æ 57
59
+ ç 58
60
+ ð 59
61
+ ŋ 60
62
+ ɐ 61
63
+ ɑ 62
64
+ ɑː 63
65
+ ɒ 64
66
+ ɒː 65
67
+ ɔ 66
68
+ ɔj 67
69
+ ɖ 68
70
+ ə 69
71
+ əw 70
72
+ ɚ 71
73
+ ɛ 72
74
+ ɛː 73
75
+ ɜ 74
76
+ ɜː 75
77
+ ɝ 76
78
+ ɟ 77
79
+ ɟʷ 78
80
+ ɡ 79
81
+ ɡb 80
82
+ ɡʷ 81
83
+ ɪ 82
84
+ ɫ 83
85
+ ɫ̩ 84
86
+ ɱ 85
87
+ ɲ 86
88
+ ɹ 87
89
+ ɾ 88
90
+ ɾʲ 89
91
+ ɾ̃ 90
92
+ ʃ 91
93
+ ʈ 92
94
+ ʈʲ 93
95
+ ʈʷ 94
96
+ ʉ 95
97
+ ʉː 96
98
+ ʊ 97
99
+ ʋ 98
100
+ ʎ 99
101
+ ʒ 100
102
+ ʔ 101
103
+ θ 102
acoustic/rules.yaml ADDED
@@ -0,0 +1,1679 @@
1
+ dialects:
2
+ india:
3
+ - following_context: ə|ɚ
4
+ non_silence_before_correction: -0.09
5
+ preceding_context: ''
6
+ probability: 0.01
7
+ replacement: w
8
+ segment: '[ʉu]'
9
+ silence_after_probability: 1.18
10
+ silence_before_correction: 0.08
11
+ - following_context: $
12
+ non_silence_before_correction: -0.01
13
+ preceding_context: (ʊ|ɔj|ɝ|ɛ|ej|ɜ|a|u|o|ow|æ|aw|əw|aj|ɐ|ɪ|ə|ɔ|e|ɚ|ɑ|ʉ|ɒ|i)ː?
14
+ probability: 0.01
15
+ replacement: ''
16
+ segment: '[tʈ]'
17
+ silence_after_probability: 1.64
18
+ silence_before_correction: 0.01
19
+ - following_context: '[tʈpkcsʃf][ʲʷ]?'
20
+ non_silence_before_correction: 0.07
21
+ preceding_context: ''
22
+ probability: 0.16
23
+ replacement: s
24
+ segment: z
25
+ silence_after_probability: 0.91
26
+ silence_before_correction: -0.14
27
+ - following_context: '[tʈpkcsʃf][ʲʷ]?'
28
+ non_silence_before_correction: 0.0
29
+ preceding_context: ''
30
+ probability: 0.21
31
+ replacement: p
32
+ segment: b
33
+ silence_after_probability: 1.07
34
+ silence_before_correction: 0.0
35
+ - following_context: '[tʈpkcsʃf][ʲʷ]?'
36
+ non_silence_before_correction: -0.05
37
+ preceding_context: ''
38
+ probability: 0.06
39
+ replacement: tʃ
40
+ segment: dʒ
41
+ silence_after_probability: 4.67
42
+ silence_before_correction: 0.32
43
+ - following_context: '[vʋf]ʲ?'
44
+ non_silence_before_correction: -0.06
45
+ preceding_context: ''
46
+ probability: 0.01
47
+ replacement: ɱ
48
+ segment: m
49
+ silence_after_probability: 1.83
50
+ silence_before_correction: 0.04
51
+ - following_context: '[vʋf]ʲ?'
52
+ non_silence_before_correction: -0.02
53
+ preceding_context: ''
54
+ probability: 0.01
55
+ replacement: ɱ
56
+ segment: n
57
+ silence_after_probability: 2.36
58
+ silence_before_correction: -0.02
59
+ - following_context: '[tʈdɖ]$'
60
+ non_silence_before_correction: 0.0
61
+ preceding_context: m
62
+ probability: 0.11
63
+ replacement: ''
64
+ segment: '[pb]'
65
+ silence_after_probability: 2.0
66
+ silence_before_correction: 0.01
67
+ - following_context: '[sz]$'
68
+ non_silence_before_correction: -0.05
69
+ preceding_context: n
70
+ probability: 0.12
71
+ replacement: ''
72
+ segment: '[tʈdɖ]'
73
+ silence_after_probability: 0.69
74
+ silence_before_correction: 0.08
75
+ - following_context: '[s]$'
76
+ non_silence_before_correction: 0.14
77
+ preceding_context: ''
78
+ probability: 0.03
79
+ replacement: k
80
+ segment: '[sʃ] k'
81
+ silence_after_probability: 1.83
82
+ silence_before_correction: -0.2
83
+ - following_context: $
84
+ non_silence_before_correction: 0.0
85
+ preceding_context: ''
86
+ probability: 0.02
87
+ replacement: k s
88
+ segment: s k
89
+ silence_after_probability: 1.22
90
+ silence_before_correction: -0.11
91
+ - following_context: '[dɖʈtcɟɡk][ʲʷ]?'
92
+ non_silence_before_correction: 0.0
93
+ preceding_context: ''
94
+ probability: 0.05
95
+ replacement: ''
96
+ segment: p
97
+ silence_after_probability: 1.18
98
+ silence_before_correction: 0.01
99
+ - following_context: '[pbcɟɡk][ʲʷ]?'
100
+ non_silence_before_correction: -0.07
101
+ preceding_context: ''
102
+ probability: 0.01
103
+ replacement: ''
104
+ segment: '[tʈ]'
105
+ silence_after_probability: 1.11
106
+ silence_before_correction: 0.08
107
+ - following_context: '[dɖtʈcɟɡk][ʲʷ]?'
108
+ non_silence_before_correction: 0.01
109
+ preceding_context: ''
110
+ probability: 0.11
111
+ replacement: ''
112
+ segment: b
113
+ silence_after_probability: 1.1
114
+ silence_before_correction: -0.06
115
+ - following_context: '[dɖtʈpb][ʲʷ]?'
116
+ non_silence_before_correction: 0.02
117
+ preceding_context: ''
118
+ probability: 0.06
119
+ replacement: ''
120
+ segment: k
121
+ silence_after_probability: 1.13
122
+ silence_before_correction: -0.04
123
+ - following_context: '[dɖtʈpb][ʲʷ]?'
124
+ non_silence_before_correction: -0.01
125
+ preceding_context: ''
126
+ probability: 0.01
127
+ replacement: ''
128
+ segment: ɡ
129
+ silence_after_probability: 1.77
130
+ silence_before_correction: 0.01
131
+ - following_context: ([tʈpkc][ʲʷ]?)? ɹ
132
+ non_silence_before_correction: -0.02
133
+ preceding_context: ''
134
+ probability: 0.03
135
+ replacement: ʃ
136
+ segment: s
137
+ silence_after_probability: 1.28
138
+ silence_before_correction: 0.02
139
+ - following_context: ɹ
140
+ non_silence_before_correction: 0.0
141
+ preceding_context: ''
142
+ probability: 0.05
143
+ replacement: tʃ
144
+ segment: '[tʈ][ʲʷ]?'
145
+ silence_after_probability: 1.26
146
+ silence_before_correction: -0.02
147
+ - following_context: ''
148
+ non_silence_before_correction: 0.02
149
+ preceding_context: ''
150
+ probability: 0.03
151
+ replacement: tʃ
152
+ segment: '[tʈ][ʲʷ]? ɹ'
153
+ silence_after_probability: 1.08
154
+ silence_before_correction: -0.03
155
+ - following_context: ''
156
+ non_silence_before_correction: 0.0
157
+ preceding_context: ''
158
+ probability: 0.01
159
+ replacement: dʒ
160
+ segment: '[dɖ][ʲʷ]? ɹ'
161
+ silence_after_probability: 0.92
162
+ silence_before_correction: 0.01
163
+ - following_context: ɹ
164
+ non_silence_before_correction: -0.09
165
+ preceding_context: ''
166
+ probability: 0.03
167
+ replacement: dʒ
168
+ segment: '[dɖ][ʲʷ]?'
169
+ silence_after_probability: 1.11
170
+ silence_before_correction: 0.13
171
+ - following_context: ə n
172
+ non_silence_before_correction: 0.04
173
+ preceding_context: ''
174
+ probability: 0.01
175
+ replacement: ʔ
176
+ segment: '[tʈ]'
177
+ silence_after_probability: 0.57
178
+ silence_before_correction: -0.09
179
+ - following_context: $
180
+ non_silence_before_correction: 0.01
181
+ preceding_context: ɪ
182
+ probability: 0.21
183
+ replacement: n
184
+ segment: ŋ
185
+ silence_after_probability: 1.23
186
+ silence_before_correction: -0.03
187
+ - following_context: z$
188
+ non_silence_before_correction: -0.07
189
+ preceding_context: ɪ
190
+ probability: 0.2
191
+ replacement: n
192
+ segment: ŋ
193
+ silence_after_probability: 1.41
194
+ silence_before_correction: 0.09
195
+ - following_context: ''
196
+ non_silence_before_correction: 0.05
197
+ preceding_context: ''
198
+ probability: 0.03
199
+ replacement: l ə
200
+ segment: ə l ə
201
+ silence_after_probability: 0.98
202
+ silence_before_correction: -0.07
203
+ - following_context: ''
204
+ non_silence_before_correction: 0.03
205
+ preceding_context: ''
206
+ probability: 0.2
207
+ replacement: n ə
208
+ segment: ə n ə
209
+ silence_after_probability: 0.69
210
+ silence_before_correction: 0.05
211
+ - following_context: ''
212
+ non_silence_before_correction: 0.01
213
+ preceding_context: ''
214
+ probability: 0.03
215
+ replacement: m ə
216
+ segment: ə m ə
217
+ silence_after_probability: 1.0
218
+ silence_before_correction: -0.04
219
+ - following_context: ''
220
+ non_silence_before_correction: 0.02
221
+ preceding_context: ''
222
+ probability: 0.18
223
+ replacement: ɹ ə
224
+ segment: ə ɹ ə
225
+ silence_after_probability: 0.64
226
+ silence_before_correction: -0.06
227
+ - following_context: ''
228
+ non_silence_before_correction: 0.0
229
+ preceding_context: ''
230
+ probability: 0.12
231
+ replacement: ɾ
232
+ segment: ɹ
233
+ silence_after_probability: 1.15
234
+ silence_before_correction: -0.01
235
+ - following_context: .*(ʊ|ɔj|ɝ|ɛ|ej|ɜ|a|u|o|ow|æ|aw|əw|aj|ɐ|ɪ|ə|ɔ|e|ɚ|ɑ|ʉ|ɒ|i)ː?
236
+ non_silence_before_correction: -0.02
237
+ preceding_context: ^
238
+ probability: 0.01
239
+ replacement: ''
240
+ segment: ə
241
+ silence_after_probability: 1.27
242
+ silence_before_correction: -0.01
243
+ - following_context: $
244
+ non_silence_before_correction: 0.01
245
+ preceding_context: '[sʃn]'
246
+ probability: 0.06
247
+ replacement: ''
248
+ segment: '[tʈ]'
249
+ silence_after_probability: 0.61
250
+ silence_before_correction: -0.03
251
+ - following_context: $
252
+ non_silence_before_correction: 0.03
253
+ preceding_context: '[zʒn]'
254
+ probability: 0.19
255
+ replacement: ''
256
+ segment: '[dɖ]'
257
+ silence_after_probability: 0.38
258
+ silence_before_correction: -0.06
259
+ - following_context: ''
260
+ non_silence_before_correction: 0.02
261
+ preceding_context: n
262
+ probability: 0.02
263
+ replacement: ''
264
+ segment: '[dɖ]'
265
+ silence_after_probability: 1.25
266
+ silence_before_correction: -0.04
267
+ - following_context: ə|ɚ
268
+ non_silence_before_correction: 0.05
269
+ preceding_context: ''
270
+ probability: 0.01
271
+ replacement: j
272
+ segment: i
273
+ silence_after_probability: 1.3
274
+ silence_before_correction: -0.11
275
+ - following_context: ''
276
+ non_silence_before_correction: 0.0
277
+ preceding_context: ''
278
+ probability: 0.02
279
+ replacement: a
280
+ segment: ɒ
281
+ silence_after_probability: 1.29
282
+ silence_before_correction: -0.03
283
+ - following_context: ''
284
+ non_silence_before_correction: -0.02
285
+ preceding_context: ''
286
+ probability: 0.01
287
+ replacement: a
288
+ segment: ɑ
289
+ silence_after_probability: 1.45
290
+ silence_before_correction: 0.03
291
+ - following_context: ''
292
+ non_silence_before_correction: -0.02
293
+ preceding_context: ''
294
+ probability: 0.03
295
+ replacement: aː
296
+ segment: ɒː
297
+ silence_after_probability: 1.5
298
+ silence_before_correction: 0.02
299
+ - following_context: ''
300
+ non_silence_before_correction: 0.02
301
+ preceding_context: ''
302
+ probability: 0.12
303
+ replacement: aː
304
+ segment: ɑː
305
+ silence_after_probability: 1.03
306
+ silence_before_correction: -0.05
307
+ - following_context: ''
308
+ non_silence_before_correction: 0.0
309
+ preceding_context: ''
310
+ probability: 0.03
311
+ replacement: dʒ
312
+ segment: z
313
+ silence_after_probability: 2.04
314
+ silence_before_correction: -0.01
315
+ - following_context: ''
316
+ non_silence_before_correction: -0.04
317
+ preceding_context: ''
318
+ probability: 0.03
319
+ replacement: dʒ
320
+ segment: ʒ
321
+ silence_after_probability: 1.74
322
+ silence_before_correction: 0.04
323
+ - following_context: ''
324
+ non_silence_before_correction: -0.03
325
+ preceding_context: ''
326
+ probability: 0.03
327
+ replacement: z
328
+ segment: ʒ
329
+ silence_after_probability: 1.08
330
+ silence_before_correction: 0.03
331
+ - following_context: ''
332
+ non_silence_before_correction: -0.01
333
+ preceding_context: ''
334
+ probability: 0.19
335
+ replacement: ʃ
336
+ segment: ʒ
337
+ silence_after_probability: 1.15
338
+ silence_before_correction: 0.01
339
+ nigeria:
340
+ - following_context: $
341
+ non_silence_before_correction: 0.03
342
+ preceding_context: (ʊ|ɔj|ɝ|ɛ|ej|ɜ|a|u|o|ow|æ|aw|əw|aj|ɐ|ɪ|ə|ɔ|e|ɚ|ɑ|ʉ|ɒ|i)ː?
343
+ probability: 0.06
344
+ replacement: ''
345
+ segment: '[tʈ]'
346
+ silence_after_probability: 0.36
347
+ silence_before_correction: -0.09
348
+ - following_context: ''
349
+ non_silence_before_correction: 0.04
350
+ preceding_context: ''
351
+ probability: 0.25
352
+ replacement: d̪
353
+ segment: ð
354
+ silence_after_probability: 0.73
355
+ silence_before_correction: -0.22
356
+ - following_context: ''
357
+ non_silence_before_correction: 0.04
358
+ preceding_context: ''
359
+ probability: 0.21
360
+ replacement: t̪
361
+ segment: θ
362
+ silence_after_probability: 0.45
363
+ silence_before_correction: -0.15
364
+ - following_context: '[tʈpkcsʃf][ʲʷ]?'
365
+ non_silence_before_correction: 0.03
366
+ preceding_context: ''
367
+ probability: 0.06
368
+ replacement: s
369
+ segment: z
370
+ silence_after_probability: 0.57
371
+ silence_before_correction: -0.03
372
+ - following_context: '[tʈpkcsʃf][ʲʷ]?'
373
+ non_silence_before_correction: 0.03
374
+ preceding_context: ''
375
+ probability: 0.06
376
+ replacement: t
377
+ segment: d
378
+ silence_after_probability: 0.71
379
+ silence_before_correction: -0.04
380
+ - following_context: '[tʈpkcsʃf][ʲʷ]?'
381
+ non_silence_before_correction: 0.0
382
+ preceding_context: ''
383
+ probability: 0.04
384
+ replacement: p
385
+ segment: b
386
+ silence_after_probability: 0.34
387
+ silence_before_correction: 0.03
388
+ - following_context: '[vʋf]ʲ?'
389
+ non_silence_before_correction: 0.04
390
+ preceding_context: ''
391
+ probability: 0.01
392
+ replacement: ɱ
393
+ segment: m
394
+ silence_after_probability: 1.22
395
+ silence_before_correction: -0.07
396
+ - following_context: '[vʋf]ʲ?'
397
+ non_silence_before_correction: -0.01
398
+ preceding_context: ''
399
+ probability: 0.01
400
+ replacement: ɱ
401
+ segment: n
402
+ silence_after_probability: 0.49
403
+ silence_before_correction: 0.01
404
+ - following_context: '[tʈdɖ]$'
405
+ non_silence_before_correction: -0.01
406
+ preceding_context: m
407
+ probability: 0.05
408
+ replacement: ''
409
+ segment: '[pb]'
410
+ silence_after_probability: 0.8
411
+ silence_before_correction: 0.04
412
+ - following_context: '[sz]$'
413
+ non_silence_before_correction: -0.02
414
+ preceding_context: n
415
+ probability: 0.08
416
+ replacement: ''
417
+ segment: '[tʈdɖ]'
418
+ silence_after_probability: 0.12
419
+ silence_before_correction: 0.02
420
+ - following_context: '[s]$'
421
+ non_silence_before_correction: 0.02
422
+ preceding_context: ''
423
+ probability: 0.03
424
+ replacement: ''
425
+ segment: '[sʃ] t'
426
+ silence_after_probability: 0.26
427
+ silence_before_correction: -0.08
428
+ - following_context: $
429
+ non_silence_before_correction: 0.05
430
+ preceding_context: ''
431
+ probability: 0.01
432
+ replacement: k s
433
+ segment: s k
434
+ silence_after_probability: 0.98
435
+ silence_before_correction: -0.15
436
+ - following_context: '[dɖʈtcɟɡk][ʲʷ]?'
437
+ non_silence_before_correction: 0.04
438
+ preceding_context: ''
439
+ probability: 0.03
440
+ replacement: ''
441
+ segment: p
442
+ silence_after_probability: 0.61
443
+ silence_before_correction: -0.04
444
+ - following_context: '[pbcɟɡk][ʲʷ]?'
445
+ non_silence_before_correction: -0.1
446
+ preceding_context: ''
447
+ probability: 0.02
448
+ replacement: ''
449
+ segment: '[tʈ]'
450
+ silence_after_probability: 0.44
451
+ silence_before_correction: 0.24
452
+ - following_context: '[pbcɟɡk][ʲʷ]?'
453
+ non_silence_before_correction: 0.12
454
+ preceding_context: ''
455
+ probability: 0.01
456
+ replacement: ''
457
+ segment: d
458
+ silence_after_probability: 1.68
459
+ silence_before_correction: -0.18
460
+ - following_context: '[dɖtʈcɟɡk][ʲʷ]?'
461
+ non_silence_before_correction: 0.02
462
+ preceding_context: ''
463
+ probability: 0.03
464
+ replacement: ''
465
+ segment: b
466
+ silence_after_probability: 0.61
467
+ silence_before_correction: -0.04
468
+ - following_context: '[dɖtʈpb][ʲʷ]?'
469
+ non_silence_before_correction: 0.03
470
+ preceding_context: ''
471
+ probability: 0.05
472
+ replacement: ''
473
+ segment: k
474
+ silence_after_probability: 0.23
475
+ silence_before_correction: -0.05
476
+ - following_context: ([tʈpkc][ʲʷ]?)? ɹ
477
+ non_silence_before_correction: 0.04
478
+ preceding_context: ''
479
+ probability: 0.01
480
+ replacement: ʃ
481
+ segment: s
482
+ silence_after_probability: 0.64
483
+ silence_before_correction: -0.06
484
+ - following_context: ɹ
485
+ non_silence_before_correction: 0.01
486
+ preceding_context: ''
487
+ probability: 0.01
488
+ replacement: tʃ
489
+ segment: '[tʈ][ʲʷ]?'
490
+ silence_after_probability: 0.34
491
+ silence_before_correction: -0.02
492
+ - following_context: ''
493
+ non_silence_before_correction: 0.01
494
+ preceding_context: ''
495
+ probability: 0.03
496
+ replacement: tʃ
497
+ segment: '[tʈ][ʲʷ]? ɹ'
498
+ silence_after_probability: 0.34
499
+ silence_before_correction: -0.03
500
+ - following_context: ''
501
+ non_silence_before_correction: 0.06
502
+ preceding_context: ''
503
+ probability: 0.01
504
+ replacement: dʒ
505
+ segment: '[dɖ][ʲʷ]? ɹ'
506
+ silence_after_probability: 0.75
507
+ silence_before_correction: -0.09
508
+ - following_context: ɹ
509
+ non_silence_before_correction: -0.06
510
+ preceding_context: ''
511
+ probability: 0.01
512
+ replacement: dʒ
513
+ segment: '[dɖ][ʲʷ]?'
514
+ silence_after_probability: 0.92
515
+ silence_before_correction: 0.13
516
+ - following_context: $
517
+ non_silence_before_correction: 0.02
518
+ preceding_context: ''
519
+ probability: 0.04
520
+ replacement: p s
521
+ segment: b z
522
+ silence_after_probability: 0.58
523
+ silence_before_correction: -0.07
524
+ - following_context: $
525
+ non_silence_before_correction: -0.02
526
+ preceding_context: ''
527
+ probability: 0.05
528
+ replacement: t s
529
+ segment: d z
530
+ silence_after_probability: 0.38
531
+ silence_before_correction: 0.05
532
+ - following_context: $
533
+ non_silence_before_correction: 0.0
534
+ preceding_context: ''
535
+ probability: 0.02
536
+ replacement: k s
537
+ segment: ɡ z
538
+ silence_after_probability: 0.91
539
+ silence_before_correction: -0.03
540
+ - following_context: $
541
+ non_silence_before_correction: 0.0
542
+ preceding_context: ''
543
+ probability: 0.05
544
+ replacement: s
545
+ segment: z
546
+ silence_after_probability: 0.58
547
+ silence_before_correction: -0.01
548
+ - following_context: ''
549
+ non_silence_before_correction: -0.01
550
+ preceding_context: ^
551
+ probability: 0.04
552
+ replacement: ''
553
+ segment: ç
554
+ silence_after_probability: 0.67
555
+ silence_before_correction: -0.14
556
+ - following_context: ''
557
+ non_silence_before_correction: -0.03
558
+ preceding_context: ^
559
+ probability: 0.03
560
+ replacement: ''
561
+ segment: h
562
+ silence_after_probability: 0.31
563
+ silence_before_correction: 0.04
564
+ - following_context: $
565
+ non_silence_before_correction: 0.02
566
+ preceding_context: ŋ
567
+ probability: 0.35
568
+ replacement: ''
569
+ segment: ɡ
570
+ silence_after_probability: 0.12
571
+ silence_before_correction: -0.04
572
+ - following_context: $
573
+ non_silence_before_correction: 0.02
574
+ preceding_context: '[sʃn]'
575
+ probability: 0.11
576
+ replacement: ''
577
+ segment: '[tʈ]'
578
+ silence_after_probability: 0.15
579
+ silence_before_correction: -0.05
580
+ - following_context: $
581
+ non_silence_before_correction: 0.01
582
+ preceding_context: '[zʒn]'
583
+ probability: 0.07
584
+ replacement: ''
585
+ segment: '[dɖ]'
586
+ silence_after_probability: 0.21
587
+ silence_before_correction: -0.03
588
+ - following_context: ''
589
+ non_silence_before_correction: 0.03
590
+ preceding_context: n
591
+ probability: 0.01
592
+ replacement: ''
593
+ segment: '[dɖ]'
594
+ silence_after_probability: 0.25
595
+ silence_before_correction: -0.07
596
+ nonnative:
597
+ - following_context: ə|ɚ
598
+ non_silence_before_correction: -0.07
599
+ preceding_context: ''
600
+ probability: 0.02
601
+ replacement: w
602
+ segment: '[ʉu]'
603
+ silence_after_probability: 0.94
604
+ silence_before_correction: 0.1
605
+ - following_context: $
606
+ non_silence_before_correction: -0.01
607
+ preceding_context: (ʊ|ɔj|ɝ|ɛ|ej|ɜ|a|u|o|ow|æ|aw|əw|aj|ɐ|ɪ|ə|ɔ|e|ɚ|ɑ|ʉ|ɒ|i)ː?
608
+ probability: 0.02
609
+ replacement: ''
610
+ segment: '[tʈ]'
611
+ silence_after_probability: 0.76
612
+ silence_before_correction: 0.04
613
+ - following_context: ''
614
+ non_silence_before_correction: -0.16
615
+ preceding_context: ''
616
+ probability: 0.47
617
+ replacement: d̪
618
+ segment: ð
619
+ silence_after_probability: 1.36
620
+ silence_before_correction: 0.59
621
+ - following_context: ''
622
+ non_silence_before_correction: -0.04
623
+ preceding_context: ''
624
+ probability: 0.04
625
+ replacement: t̪
626
+ segment: θ
627
+ silence_after_probability: 2.03
628
+ silence_before_correction: 0.1
629
+ - following_context: '[tʈpkcsʃf][ʲʷ]?'
630
+ non_silence_before_correction: 0.08
631
+ preceding_context: ''
632
+ probability: 0.15
633
+ replacement: s
634
+ segment: z
635
+ silence_after_probability: 1.75
636
+ silence_before_correction: -0.09
637
+ - following_context: '[tʈpkcsʃf][ʲʷ]?'
638
+ non_silence_before_correction: 0.01
639
+ preceding_context: ''
640
+ probability: 0.31
641
+ replacement: t
642
+ segment: d
643
+ silence_after_probability: 1.66
644
+ silence_before_correction: 0.1
645
+ - following_context: '[tʈpkcsʃf][ʲʷ]?'
646
+ non_silence_before_correction: -0.04
647
+ preceding_context: ''
648
+ probability: 0.22
649
+ replacement: p
650
+ segment: b
651
+ silence_after_probability: 1.39
652
+ silence_before_correction: 0.08
653
+ - following_context: '[tpkcsʃf][ʲʷ]?'
654
+ non_silence_before_correction: 0.03
655
+ preceding_context: ''
656
+ probability: 0.5
657
+ replacement: k
658
+ segment: ɡ
659
+ silence_after_probability: 2.5
660
+ silence_before_correction: -0.07
661
+ - following_context: '[vʋf]ʲ?'
662
+ non_silence_before_correction: 0.15
663
+ preceding_context: ''
664
+ probability: 0.01
665
+ replacement: ɱ
666
+ segment: m
667
+ silence_after_probability: 2.41
668
+ silence_before_correction: -0.2
669
+ - following_context: '[vʋf]ʲ?'
670
+ non_silence_before_correction: -0.06
671
+ preceding_context: ''
672
+ probability: 0.01
673
+ replacement: ɱ
674
+ segment: n
675
+ silence_after_probability: 1.46
676
+ silence_before_correction: 0.06
677
+ - following_context: '[tʈdɖ]$'
678
+ non_silence_before_correction: 0.09
679
+ preceding_context: m
680
+ probability: 0.2
681
+ replacement: ''
682
+ segment: '[pb]'
683
+ silence_after_probability: 2.0
684
+ silence_before_correction: -0.17
685
+ - following_context: '[sz]$'
686
+ non_silence_before_correction: -0.03
687
+ preceding_context: n
688
+ probability: 0.11
689
+ replacement: ''
690
+ segment: '[tʈdɖ]'
691
+ silence_after_probability: 0.81
692
+ silence_before_correction: 0.08
693
+ - following_context: '[s]$'
694
+ non_silence_before_correction: 0.03
695
+ preceding_context: ''
696
+ probability: 0.04
697
+ replacement: ''
698
+ segment: '[sʃ] t'
699
+ silence_after_probability: 0.91
700
+ silence_before_correction: -0.04
701
+ - following_context: '[dɖʈtcɟɡk][ʲʷ]?'
702
+ non_silence_before_correction: 0.02
703
+ preceding_context: ''
704
+ probability: 0.04
705
+ replacement: ''
706
+ segment: p
707
+ silence_after_probability: 1.24
708
+ silence_before_correction: -0.03
709
+ - following_context: '[pbcɟɡk][ʲʷ]?'
710
+ non_silence_before_correction: 0.03
711
+ preceding_context: ''
712
+ probability: 0.03
713
+ replacement: ''
714
+ segment: '[tʈ]'
715
+ silence_after_probability: 0.89
716
+ silence_before_correction: -0.09
717
+ - following_context: '[dɖtʈcɟɡk][ʲʷ]?'
718
+ non_silence_before_correction: 0.07
719
+ preceding_context: ''
720
+ probability: 0.06
721
+ replacement: ''
722
+ segment: b
723
+ silence_after_probability: 1.48
724
+ silence_before_correction: -0.11
725
+ - following_context: '[dɖtʈpb][ʲʷ]?'
726
+ non_silence_before_correction: 0.0
727
+ preceding_context: ''
728
+ probability: 0.02
729
+ replacement: ''
730
+ segment: k
731
+ silence_after_probability: 1.21
732
+ silence_before_correction: -0.01
733
+ - following_context: ([tʈpkc][ʲʷ]?)? ɹ
734
+ non_silence_before_correction: 0.05
735
+ preceding_context: ''
736
+ probability: 0.01
737
+ replacement: ʃ
738
+ segment: s
739
+ silence_after_probability: 1.51
740
+ silence_before_correction: -0.06
741
+ - following_context: ɹ
742
+ non_silence_before_correction: 0.01
743
+ preceding_context: ''
744
+ probability: 0.12
745
+ replacement: tʃ
746
+ segment: '[tʈ][ʲʷ]?'
747
+ silence_after_probability: 1.34
748
+ silence_before_correction: -0.02
749
+ - following_context: ''
750
+ non_silence_before_correction: 0.02
751
+ preceding_context: ''
752
+ probability: 0.01
753
+ replacement: tʃ
754
+ segment: '[tʈ][ʲʷ]? ɹ'
755
+ silence_after_probability: 1.39
756
+ silence_before_correction: -0.04
757
+ - following_context: ''
758
+ non_silence_before_correction: 0.0
759
+ preceding_context: ''
760
+ probability: 0.01
761
+ replacement: dʒ
762
+ segment: '[dɖ][ʲʷ]? ɹ'
763
+ silence_after_probability: 0.86
764
+ silence_before_correction: 0.01
765
+ - following_context: ɹ
766
+ non_silence_before_correction: -0.11
767
+ preceding_context: ''
768
+ probability: 0.05
769
+ replacement: dʒ
770
+ segment: '[dɖ][ʲʷ]?'
771
+ silence_after_probability: 1.47
772
+ silence_before_correction: 0.17
773
+ - following_context: ə n
774
+ non_silence_before_correction: 0.0
775
+ preceding_context: ''
776
+ probability: 0.01
777
+ replacement: ʔ
778
+ segment: '[tʈ]'
779
+ silence_after_probability: 1.59
780
+ silence_before_correction: 0.02
781
+ - following_context: $
782
+ non_silence_before_correction: 0.01
783
+ preceding_context: ɪ
784
+ probability: 0.15
785
+ replacement: n
786
+ segment: ŋ
787
+ silence_after_probability: 1.23
788
+ silence_before_correction: -0.02
789
+ - following_context: z$
790
+ non_silence_before_correction: -0.01
791
+ preceding_context: ɪ
792
+ probability: 0.07
793
+ replacement: n
794
+ segment: ŋ
795
+ silence_after_probability: 1.12
796
+ silence_before_correction: 0.0
797
+ - following_context: ''
798
+ non_silence_before_correction: 0.08
799
+ preceding_context: ''
800
+ probability: 0.03
801
+ replacement: l ə
802
+ segment: ə l ə
803
+ silence_after_probability: 1.29
804
+ silence_before_correction: -0.15
805
+ - following_context: ''
806
+ non_silence_before_correction: 0.06
807
+ preceding_context: ''
808
+ probability: 0.04
809
+ replacement: n ə
810
+ segment: ə n ə
811
+ silence_after_probability: 0.81
812
+ silence_before_correction: -0.13
813
+ - following_context: ''
814
+ non_silence_before_correction: 0.16
815
+ preceding_context: ''
816
+ probability: 0.03
817
+ replacement: m ə
818
+ segment: ə m ə
819
+ silence_after_probability: 0.8
820
+ silence_before_correction: -0.24
821
+ - following_context: ''
822
+ non_silence_before_correction: 0.02
823
+ preceding_context: ''
824
+ probability: 0.11
825
+ replacement: ɹ ə
826
+ segment: ə ɹ ə
827
+ silence_after_probability: 0.9
828
+ silence_before_correction: -0.04
829
+ - following_context: ''
830
+ non_silence_before_correction: -0.05
831
+ preceding_context: ''
832
+ probability: 0.13
833
+ replacement: z
834
+ segment: ð
835
+ silence_after_probability: 1.73
836
+ silence_before_correction: 0.07
837
+ - following_context: ''
838
+ non_silence_before_correction: -0.05
839
+ preceding_context: ''
840
+ probability: 0.01
841
+ replacement: s
842
+ segment: θ
843
+ silence_after_probability: 1.62
844
+ silence_before_correction: 0.05
845
+ - following_context: $
846
+ non_silence_before_correction: 0.0
847
+ preceding_context: ''
848
+ probability: 0.07
849
+ replacement: s
850
+ segment: z
851
+ silence_after_probability: 1.38
852
+ silence_before_correction: 0.01
853
+ - following_context: $
854
+ non_silence_before_correction: -0.01
855
+ preceding_context: ''
856
+ probability: 0.2
857
+ replacement: f
858
+ segment: v
859
+ silence_after_probability: 2.0
860
+ silence_before_correction: 0.01
861
+ - following_context: ''
862
+ non_silence_before_correction: -0.09
863
+ preceding_context: ''
864
+ probability: 0.06
865
+ replacement: ʋ
866
+ segment: '[vw]ʲ'
867
+ silence_after_probability: 2.14
868
+ silence_before_correction: 0.14
869
+ - following_context: ''
870
+ non_silence_before_correction: -0.02
871
+ preceding_context: ''
872
+ probability: 0.02
873
+ replacement: i
874
+ segment: ɪ
875
+ silence_after_probability: 1.48
876
+ silence_before_correction: 0.01
877
+ - following_context: ''
878
+ non_silence_before_correction: -0.05
879
+ preceding_context: ''
880
+ probability: 0.05
881
+ replacement: u
882
+ segment: ʊ
883
+ silence_after_probability: 2.17
884
+ silence_before_correction: 0.1
885
+ - following_context: ''
886
+ non_silence_before_correction: 0.03
887
+ preceding_context: ^
888
+ probability: 0.02
889
+ replacement: ''
890
+ segment: '[hç]'
891
+ silence_after_probability: 1.3
892
+ silence_before_correction: -0.11
893
+ - following_context: ''
894
+ non_silence_before_correction: 0.01
895
+ preceding_context: ''
896
+ probability: 0.02
897
+ replacement: ɹ
898
+ segment: '[ʎɫl]'
899
+ silence_after_probability: 1.28
900
+ silence_before_correction: -0.02
901
+ - following_context: .*(ʊ|ɔj|ɝ|ɛ|ej|ɜ|a|u|o|ow|æ|aw|əw|aj|ɐ|ɪ|ə|ɔ|e|ɚ|ɑ|ʉ|ɒ|i)ː?
902
+ non_silence_before_correction: 0.08
903
+ preceding_context: ^
904
+ probability: 0.01
905
+ replacement: ''
906
+ segment: ə
907
+ silence_after_probability: 1.55
908
+ silence_before_correction: -0.16
909
+ - following_context: $
910
+ non_silence_before_correction: 0.02
911
+ preceding_context: '[sʃn]'
912
+ probability: 0.06
913
+ replacement: ''
914
+ segment: '[tʈ]'
915
+ silence_after_probability: 0.45
916
+ silence_before_correction: -0.05
917
+ - following_context: $
918
+ non_silence_before_correction: 0.02
919
+ preceding_context: '[zʒn]'
920
+ probability: 0.06
921
+ replacement: ''
922
+ segment: '[dɖ]'
923
+ silence_after_probability: 0.71
924
+ silence_before_correction: -0.05
925
+ - following_context: ''
926
+ non_silence_before_correction: 0.03
927
+ preceding_context: n
928
+ probability: 0.02
929
+ replacement: ''
930
+ segment: '[dɖ]'
931
+ silence_after_probability: 0.92
932
+ silence_before_correction: -0.07
933
+ - following_context: ə|ɚ
934
+ non_silence_before_correction: 0.0
935
+ preceding_context: ''
936
+ probability: 0.01
937
+ replacement: j
938
+ segment: i
939
+ silence_after_probability: 1.68
940
+ silence_before_correction: -0.02
941
+ uk:
942
+ - following_context: $
943
+ non_silence_before_correction: 0.03
944
+ preceding_context: (ʊ|ɔj|ɝ|ɛ|ej|ɜ|a|u|o|ow|æ|aw|əw|aj|ɐ|ɪ|ə|ɔ|e|ɚ|ɑ|ʉ|ɒ|i)ː?
945
+ probability: 0.03
946
+ replacement: ''
947
+ segment: '[tʈ]'
948
+ silence_after_probability: 0.36
949
+ silence_before_correction: 0.03
950
+ - following_context: ''
951
+ non_silence_before_correction: -0.16
952
+ preceding_context: ''
953
+ probability: 0.49
954
+ replacement: d̪
955
+ segment: ð
956
+ silence_after_probability: 0.55
957
+ silence_before_correction: 1.05
958
+ - following_context: ''
959
+ non_silence_before_correction: -0.03
960
+ preceding_context: ''
961
+ probability: 0.12
962
+ replacement: t̪
963
+ segment: θ
964
+ silence_after_probability: 0.86
965
+ silence_before_correction: 0.12
966
+ - following_context: '[tʈpkcsʃf][ʲʷ]?'
967
+ non_silence_before_correction: 0.03
968
+ preceding_context: ''
969
+ probability: 0.05
970
+ replacement: s
971
+ segment: z
972
+ silence_after_probability: 2.25
973
+ silence_before_correction: -0.05
974
+ - following_context: '[tʈpkcsʃf][ʲʷ]?'
975
+ non_silence_before_correction: -0.07
976
+ preceding_context: ''
977
+ probability: 0.07
978
+ replacement: p
979
+ segment: b
980
+ silence_after_probability: 1.59
981
+ silence_before_correction: 0.2
982
+ - following_context: '[tpkcsʃf][ʲʷ]?'
983
+ non_silence_before_correction: 0.06
984
+ preceding_context: ''
985
+ probability: 0.01
986
+ replacement: k
987
+ segment: ɡ
988
+ silence_after_probability: 4.12
989
+ silence_before_correction: -0.1
990
+ - following_context: '[tʈdɖ]$'
991
+ non_silence_before_correction: 0.16
992
+ preceding_context: m
993
+ probability: 0.08
994
+ replacement: ''
995
+ segment: '[pb]'
996
+ silence_after_probability: 1.16
997
+ silence_before_correction: -0.48
998
+ - following_context: '[sz]$'
999
+ non_silence_before_correction: -0.02
1000
+ preceding_context: n
1001
+ probability: 0.05
1002
+ replacement: ''
1003
+ segment: '[tʈdɖ]'
1004
+ silence_after_probability: 0.31
1005
+ silence_before_correction: 0.05
1006
+ - following_context: '[s]$'
1007
+ non_silence_before_correction: -0.13
1008
+ preceding_context: ''
1009
+ probability: 0.05
1010
+ replacement: ''
1011
+ segment: '[sʃ] t'
1012
+ silence_after_probability: 0.6
1013
+ silence_before_correction: 0.45
1014
+ - following_context: $
1015
+ non_silence_before_correction: 0.09
1016
+ preceding_context: ''
1017
+ probability: 0.01
1018
+ replacement: k s
1019
+ segment: s k
1020
+ silence_after_probability: 2.41
1021
+ silence_before_correction: -0.19
1022
+ - following_context: '[dɖʈtcɟɡk][ʲʷ]?'
1023
+ non_silence_before_correction: 0.0
1024
+ preceding_context: ''
1025
+ probability: 0.01
1026
+ replacement: ''
1027
+ segment: p
1028
+ silence_after_probability: 0.92
1029
+ silence_before_correction: 0.06
1030
+ - following_context: '[pbcɟɡk][ʲʷ]?'
1031
+ non_silence_before_correction: -0.24
1032
+ preceding_context: ''
1033
+ probability: 0.01
1034
+ replacement: ''
1035
+ segment: d
1036
+ silence_after_probability: 2.47
1037
+ silence_before_correction: 0.35
1038
+ - following_context: '[dɖtʈpb][ʲʷ]?'
1039
+ non_silence_before_correction: -0.01
1040
+ preceding_context: ''
1041
+ probability: 0.02
1042
+ replacement: ''
1043
+ segment: k
1044
+ silence_after_probability: 0.85
1045
+ silence_before_correction: 0.02
1046
+ - following_context: '[dɖtʈpb][ʲʷ]?'
1047
+ non_silence_before_correction: 0.13
1048
+ preceding_context: ''
1049
+ probability: 0.01
1050
+ replacement: ''
1051
+ segment: ɡ
1052
+ silence_after_probability: 1.43
1053
+ silence_before_correction: -0.19
1054
+ - following_context: ([tʈpkc][ʲʷ]?)? ɹ
1055
+ non_silence_before_correction: -0.04
1056
+ preceding_context: ''
1057
+ probability: 0.02
1058
+ replacement: ʃ
1059
+ segment: s
1060
+ silence_after_probability: 1.0
1061
+ silence_before_correction: 0.15
1062
+ - following_context: ɹ
1063
+ non_silence_before_correction: -0.01
1064
+ preceding_context: ''
1065
+ probability: 0.12
1066
+ replacement: tʃ
1067
+ segment: '[tʈ][ʲʷ]?'
1068
+ silence_after_probability: 1.13
1069
+ silence_before_correction: 0.06
1070
+ - following_context: ''
1071
+ non_silence_before_correction: -0.01
1072
+ preceding_context: ''
1073
+ probability: 0.01
1074
+ replacement: tʃ
1075
+ segment: '[tʈ][ʲʷ]? ɹ'
1076
+ silence_after_probability: 0.84
1077
+ silence_before_correction: 0.04
1078
+ - following_context: ''
1079
+ non_silence_before_correction: 0.04
1080
+ preceding_context: ''
1081
+ probability: 0.01
1082
+ replacement: dʒ
1083
+ segment: '[dɖ][ʲʷ]? ɹ'
1084
+ silence_after_probability: 1.08
1085
+ silence_before_correction: -0.07
1086
+ - following_context: ɹ
1087
+ non_silence_before_correction: -0.18
1088
+ preceding_context: ''
1089
+ probability: 0.03
1090
+ replacement: dʒ
1091
+ segment: '[dɖ][ʲʷ]?'
1092
+ silence_after_probability: 1.28
1093
+ silence_before_correction: 0.35
1094
+ - following_context: ə n
1095
+ non_silence_before_correction: 0.03
1096
+ preceding_context: ''
1097
+ probability: 0.02
1098
+ replacement: ʔ
1099
+ segment: '[tʈ]'
1100
+ silence_after_probability: 0.97
1101
+ silence_before_correction: -0.05
1102
+ - following_context: $
1103
+ non_silence_before_correction: -0.01
1104
+ preceding_context: ɪ
1105
+ probability: 0.15
1106
+ replacement: n
1107
+ segment: ŋ
1108
+ silence_after_probability: 1.0
1109
+ silence_before_correction: 0.03
1110
+ - following_context: z$
1111
+ non_silence_before_correction: -0.07
1112
+ preceding_context: ɪ
1113
+ probability: 0.1
1114
+ replacement: n
1115
+ segment: ŋ
1116
+ silence_after_probability: 1.02
1117
+ silence_before_correction: 0.15
1118
+ - following_context: ''
1119
+ non_silence_before_correction: 0.03
1120
+ preceding_context: ''
1121
+ probability: 0.02
1122
+ replacement: l ə
1123
+ segment: ə l ə
1124
+ silence_after_probability: 0.98
1125
+ silence_before_correction: -0.03
1126
+ - following_context: ''
1127
+ non_silence_before_correction: -0.12
1128
+ preceding_context: ''
1129
+ probability: 0.12
1130
+ replacement: n ə
1131
+ segment: ə n ə
1132
+ silence_after_probability: 0.69
1133
+ silence_before_correction: 0.93
1134
+ - following_context: ''
1135
+ non_silence_before_correction: -0.52
1136
+ preceding_context: ''
1137
+ probability: 0.01
1138
+ replacement: m ə
1139
+ segment: ə m ə
1140
+ silence_after_probability: 1.36
1141
+ silence_before_correction: 1.15
1142
+ - following_context: ''
1143
+ non_silence_before_correction: -0.02
1144
+ preceding_context: ''
1145
+ probability: 0.15
1146
+ replacement: ɹ ə
1147
+ segment: ə ɹ ə
1148
+ silence_after_probability: 0.52
1149
+ silence_before_correction: 0.19
1150
+ - following_context: ''
1151
+ non_silence_before_correction: -0.04
1152
+ preceding_context: (ʊ|ɔj|ɝ|ɛ|ej|ɜ|a|u|o|ow|æ|aw|əw|aj|ɐ|ɪ|ə|ɔ|e|ɚ|ɑ|ʉ|ɒ|i)ː?
1153
+ probability: 0.04
1154
+ replacement: ʔ
1155
+ segment: t[ʲʷ]?
1156
+ silence_after_probability: 2.25
1157
+ silence_before_correction: 0.15
1158
+ - following_context: ʉː?
1159
+ non_silence_before_correction: 0.02
1160
+ preceding_context: ''
1161
+ probability: 0.46
1162
+ replacement: tʃ
1163
+ segment: tʲ
1164
+ silence_after_probability: 0.79
1165
+ silence_before_correction: -0.02
1166
+ - following_context: ʉː?
1167
+ non_silence_before_correction: 0.02
1168
+ preceding_context: ''
1169
+ probability: 0.17
1170
+ replacement: dʒ
1171
+ segment: dʲ
1172
+ silence_after_probability: 0.63
1173
+ silence_before_correction: 0.01
1174
+ - following_context: ''
1175
+ non_silence_before_correction: 0.09
1176
+ preceding_context: ^
1177
+ probability: 0.04
1178
+ replacement: ''
1179
+ segment: ç
1180
+ silence_after_probability: 0.61
1181
+ silence_before_correction: -0.39
1182
+ - following_context: ''
1183
+ non_silence_before_correction: -0.01
1184
+ preceding_context: ''
1185
+ probability: 0.01
1186
+ replacement: ʔ n̩
1187
+ segment: t ə n
1188
+ silence_after_probability: 0.88
1189
+ silence_before_correction: -0.08
1190
+ - following_context: '[^ʊɔɝaɔɛɜeuoæɐɪəɚɑʉɒi].*'
1191
+ non_silence_before_correction: 0.0
1192
+ preceding_context: ''
1193
+ probability: 0.03
1194
+ replacement: n̩
1195
+ segment: ə n
1196
+ silence_after_probability: 0.62
1197
+ silence_before_correction: 0.14
1198
+ - following_context: $
1199
+ non_silence_before_correction: 0.05
1200
+ preceding_context: ''
1201
+ probability: 0.03
1202
+ replacement: n̩
1203
+ segment: ə n
1204
+ silence_after_probability: 0.94
1205
+ silence_before_correction: -0.12
1206
+ - following_context: $
1207
+ non_silence_before_correction: 0.12
1208
+ preceding_context: ''
1209
+ probability: 0.05
1210
+ replacement: m̩
1211
+ segment: ə m
1212
+ silence_after_probability: 0.09
1213
+ silence_before_correction: -0.21
1214
+ - following_context: '[^ʊɔɝaɔɛɜeuoæɐɪəɚɑʉɒi].*'
1215
+ non_silence_before_correction: 0.01
1216
+ preceding_context: ''
1217
+ probability: 0.01
1218
+ replacement: m̩
1219
+ segment: ə m
1220
+ silence_after_probability: 1.36
1221
+ silence_before_correction: -0.03
1222
+ - following_context: $
1223
+ non_silence_before_correction: -0.05
1224
+ preceding_context: ''
1225
+ probability: 0.2
1226
+ replacement: ɫ̩
1227
+ segment: ə ɫ
1228
+ silence_after_probability: 0.48
1229
+ silence_before_correction: 0.24
1230
+ - following_context: '[^ʊɔɝaɔɛɜeuoæɐɪəɚɑʉɒi].*'
1231
+ non_silence_before_correction: -0.01
1232
+ preceding_context: ''
1233
+ probability: 0.08
1234
+ replacement: ɫ̩
1235
+ segment: ə ɫ
1236
+ silence_after_probability: 1.09
1237
+ silence_before_correction: 0.02
1238
+ - following_context: .*(ʊ|ɔj|ɝ|ɛ|ej|ɜ|a|u|o|ow|æ|aw|əw|aj|ɐ|ɪ|ə|ɔ|e|ɚ|ɑ|ʉ|ɒ|i)ː?
1239
+ non_silence_before_correction: 0.03
1240
+ preceding_context: ^
1241
+ probability: 0.01
1242
+ replacement: ''
1243
+ segment: ə
1244
+ silence_after_probability: 1.03
1245
+ silence_before_correction: -0.04
1246
+ - following_context: $
1247
+ non_silence_before_correction: -0.01
1248
+ preceding_context: '[sʃn]'
1249
+ probability: 0.06
1250
+ replacement: ''
1251
+ segment: '[tʈ]'
1252
+ silence_after_probability: 0.15
1253
+ silence_before_correction: 0.08
1254
+ - following_context: $
1255
+ non_silence_before_correction: 0.02
1256
+ preceding_context: '[zʒn]'
1257
+ probability: 0.09
1258
+ replacement: ''
1259
+ segment: '[dɖ]'
1260
+ silence_after_probability: 0.09
1261
+ silence_before_correction: -0.02
1262
+ - following_context: ''
1263
+ non_silence_before_correction: -0.01
1264
+ preceding_context: n
1265
+ probability: 0.01
1266
+ replacement: ''
1267
+ segment: '[dɖ]'
1268
+ silence_after_probability: 0.86
1269
+ silence_before_correction: 0.03
1270
+ - following_context: ə|ɚ
1271
+ non_silence_before_correction: -0.02
1272
+ preceding_context: ''
1273
+ probability: 0.01
1274
+ replacement: j
1275
+ segment: i
1276
+ silence_after_probability: 0.9
1277
+ silence_before_correction: -0.01
1278
+ - following_context: ə|ɚ
1279
+ non_silence_before_correction: 0.01
1280
+ preceding_context: ''
1281
+ probability: 0.01
1282
+ replacement: w
1283
+ segment: '[ʉu]'
1284
+ silence_after_probability: 0.38
1285
+ silence_before_correction: 0.11
1286
+ us:
1287
+ - following_context: ə|ɚ
1288
+ non_silence_before_correction: 0.06
1289
+ preceding_context: ''
1290
+ probability: 0.01
1291
+ replacement: w
1292
+ segment: '[ʉu]'
1293
+ silence_after_probability: 1.97
1294
+ silence_before_correction: -0.1
1295
+ - following_context: $
1296
+ non_silence_before_correction: 0.04
1297
+ preceding_context: (ʊ|ɔj|ɝ|ɛ|ej|ɜ|a|u|o|ow|æ|aw|əw|aj|ɐ|ɪ|ə|ɔ|e|ɚ|ɑ|ʉ|ɒ|i)ː?
1298
+ probability: 0.1
1299
+ replacement: ''
1300
+ segment: '[tʈ]'
1301
+ silence_after_probability: 0.24
1302
+ silence_before_correction: -0.09
1303
+ - following_context: ''
1304
+ non_silence_before_correction: 0.0
1305
+ preceding_context: ''
1306
+ probability: 0.37
1307
+ replacement: d̪
1308
+ segment: ð
1309
+ silence_after_probability: 0.45
1310
+ silence_before_correction: 0.39
1311
+ - following_context: ''
1312
+ non_silence_before_correction: 0.05
1313
+ preceding_context: ''
1314
+ probability: 0.19
1315
+ replacement: t̪
1316
+ segment: θ
1317
+ silence_after_probability: 0.38
1318
+ silence_before_correction: -0.09
1319
+ - following_context: '[tʈpkcsʃf][ʲʷ]?'
1320
+ non_silence_before_correction: 0.07
1321
+ preceding_context: ''
1322
+ probability: 0.01
1323
+ replacement: t
1324
+ segment: d
1325
+ silence_after_probability: 0.61
1326
+ silence_before_correction: -0.13
1327
+ - following_context: '[tʈpkcsʃf][ʲʷ]?'
1328
+ non_silence_before_correction: -0.09
1329
+ preceding_context: ''
1330
+ probability: 0.01
1331
+ replacement: p
1332
+ segment: b
1333
+ silence_after_probability: 1.83
1334
+ silence_before_correction: 0.09
1335
+ - following_context: '[tʈpkcsʃf][ʲʷ]?'
1336
+ non_silence_before_correction: 0.11
1337
+ preceding_context: ''
1338
+ probability: 0.13
1339
+ replacement: tʃ
1340
+ segment: dʒ
1341
+ silence_after_probability: 0.8
1342
+ silence_before_correction: -0.44
1343
+ - following_context: '[vʋf]ʲ?'
1344
+ non_silence_before_correction: -0.08
1345
+ preceding_context: ''
1346
+ probability: 0.01
1347
+ replacement: ɱ
1348
+ segment: n
1349
+ silence_after_probability: 1.28
1350
+ silence_before_correction: 0.06
1351
+ - following_context: '[tʈdɖ]$'
1352
+ non_silence_before_correction: 0.0
1353
+ preceding_context: m
1354
+ probability: 0.02
1355
+ replacement: ''
1356
+ segment: '[pb]'
1357
+ silence_after_probability: 2.0
1358
+ silence_before_correction: 0.0
1359
+ - following_context: '[sz]$'
1360
+ non_silence_before_correction: 0.02
1361
+ preceding_context: n
1362
+ probability: 0.01
1363
+ replacement: ''
1364
+ segment: '[tʈdɖ]'
1365
+ silence_after_probability: 0.12
1366
+ silence_before_correction: -0.02
1367
+ - following_context: '[s]$'
1368
+ non_silence_before_correction: -0.08
1369
+ preceding_context: ''
1370
+ probability: 0.01
1371
+ replacement: ''
1372
+ segment: '[sʃ] t'
1373
+ silence_after_probability: 0.51
1374
+ silence_before_correction: 0.16
1375
+ - following_context: '[dɖʈtcɟɡk][ʲʷ]?'
1376
+ non_silence_before_correction: 0.02
1377
+ preceding_context: ''
1378
+ probability: 0.01
1379
+ replacement: ''
1380
+ segment: p
1381
+ silence_after_probability: 0.29
1382
+ silence_before_correction: -0.06
1383
+ - following_context: '[pbcɟɡk][ʲʷ]?'
1384
+ non_silence_before_correction: 0.07
1385
+ preceding_context: ''
1386
+ probability: 0.05
1387
+ replacement: ''
1388
+ segment: '[tʈ]'
1389
+ silence_after_probability: 0.42
1390
+ silence_before_correction: -0.17
1391
+ - following_context: '[dɖtʈpb][ʲʷ]?'
1392
+ non_silence_before_correction: 0.0
1393
+ preceding_context: ''
1394
+ probability: 0.02
1395
+ replacement: ''
1396
+ segment: k
1397
+ silence_after_probability: 0.33
1398
+ silence_before_correction: 0.0
1399
+ - following_context: ([tʈpkc][ʲʷ]?)? ɹ
1400
+ non_silence_before_correction: 0.02
1401
+ preceding_context: ''
1402
+ probability: 0.03
1403
+ replacement: ʃ
1404
+ segment: s
1405
+ silence_after_probability: 0.41
1406
+ silence_before_correction: -0.05
1407
+ - following_context: ɹ
1408
+ non_silence_before_correction: 0.03
1409
+ preceding_context: ''
1410
+ probability: 0.04
1411
+ replacement: tʃ
1412
+ segment: '[tʈ][ʲʷ]?'
1413
+ silence_after_probability: 0.61
1414
+ silence_before_correction: -0.08
1415
+ - following_context: ''
1416
+ non_silence_before_correction: 0.02
1417
+ preceding_context: ''
1418
+ probability: 0.02
1419
+ replacement: tʃ
1420
+ segment: '[tʈ][ʲʷ]? ɹ'
1421
+ silence_after_probability: 0.24
1422
+ silence_before_correction: -0.03
1423
+ - following_context: ''
1424
+ non_silence_before_correction: 0.01
1425
+ preceding_context: ''
1426
+ probability: 0.02
1427
+ replacement: dʒ
1428
+ segment: '[dɖ][ʲʷ]? ɹ'
1429
+ silence_after_probability: 0.17
1430
+ silence_before_correction: 0.01
1431
+ - following_context: ɹ
1432
+ non_silence_before_correction: -0.03
1433
+ preceding_context: ''
1434
+ probability: 0.02
1435
+ replacement: dʒ
1436
+ segment: '[dɖ][ʲʷ]?'
1437
+ silence_after_probability: 0.44
1438
+ silence_before_correction: 0.1
1439
+ - following_context: $
1440
+ non_silence_before_correction: 0.02
1441
+ preceding_context: ɪ
1442
+ probability: 0.19
1443
+ replacement: n
1444
+ segment: ŋ
1445
+ silence_after_probability: 0.42
1446
+ silence_before_correction: -0.04
1447
+ - following_context: z$
1448
+ non_silence_before_correction: -0.03
1449
+ preceding_context: ɪ
1450
+ probability: 0.19
1451
+ replacement: n
1452
+ segment: ŋ
1453
+ silence_after_probability: 0.39
1454
+ silence_before_correction: 0.1
1455
+ - following_context: ''
1456
+ non_silence_before_correction: -0.04
1457
+ preceding_context: ''
1458
+ probability: 0.01
1459
+ replacement: l ə
1460
+ segment: ə l ə
1461
+ silence_after_probability: 1.19
1462
+ silence_before_correction: 0.1
1463
+ - following_context: ''
1464
+ non_silence_before_correction: 0.1
1465
+ preceding_context: ''
1466
+ probability: 0.03
1467
+ replacement: n ə
1468
+ segment: ə n ə
1469
+ silence_after_probability: 0.25
1470
+ silence_before_correction: -0.25
1471
+ - following_context: ''
1472
+ non_silence_before_correction: 0.0
1473
+ preceding_context: ''
1474
+ probability: 0.01
1475
+ replacement: m ə
1476
+ segment: ə m ə
1477
+ silence_after_probability: 0.82
1478
+ silence_before_correction: 0.09
1479
+ - following_context: ''
1480
+ non_silence_before_correction: 0.02
1481
+ preceding_context: ''
1482
+ probability: 0.05
1483
+ replacement: ɹ ə
1484
+ segment: ə ɹ ə
1485
+ silence_after_probability: 0.29
1486
+ silence_before_correction: -0.05
1487
+ - following_context: .*(ʊ|ɔj|ɝ|ɛ|ej|ɜ|a|u|o|ow|æ|aw|əw|aj|ɐ|ɪ|ə|ɔ|e|ɚ|ɑ|ʉ|ɒ|i)ː?
1488
+ non_silence_before_correction: 0.07
1489
+ preceding_context: ^
1490
+ probability: 0.04
1491
+ replacement: ''
1492
+ segment: ə
1493
+ silence_after_probability: 0.39
1494
+ silence_before_correction: -0.17
1495
+ - following_context: $
1496
+ non_silence_before_correction: 0.04
1497
+ preceding_context: '[sʃn]'
1498
+ probability: 0.07
1499
+ replacement: ''
1500
+ segment: '[tʈ]'
1501
+ silence_after_probability: 0.15
1502
+ silence_before_correction: -0.08
1503
+ - following_context: $
1504
+ non_silence_before_correction: 0.05
1505
+ preceding_context: '[zʒn]'
1506
+ probability: 0.06
1507
+ replacement: ''
1508
+ segment: '[dɖ]'
1509
+ silence_after_probability: 0.21
1510
+ silence_before_correction: -0.12
1511
+ - following_context: ''
1512
+ non_silence_before_correction: 0.0
1513
+ preceding_context: n
1514
+ probability: 0.01
1515
+ replacement: ''
1516
+ segment: '[dɖ]'
1517
+ silence_after_probability: 0.42
1518
+ silence_before_correction: 0.03
1519
+ - following_context: ə|ɚ
1520
+ non_silence_before_correction: 0.03
1521
+ preceding_context: ''
1522
+ probability: 0.01
1523
+ replacement: j
1524
+ segment: i
1525
+ silence_after_probability: 0.85
1526
+ silence_before_correction: -0.06
1527
+ - following_context: (ɪ|ə|ɚ|i)
1528
+ non_silence_before_correction: 0.03
1529
+ preceding_context: (ʊ|ɔj|ɝ|ɛ|ej|ɜ|a|u|o|ow|æ|aw|əw|aj|ɐ|ɪ|ə|ɔ|e|ɚ|ɑ|ʉ|ɒ|i)ː?
1530
+ probability: 0.1
1531
+ replacement: ɾ
1532
+ segment: '[td]'
1533
+ silence_after_probability: 0.26
1534
+ silence_before_correction: -0.09
1535
+ - following_context: (ɪ|ə|ɚ|i)
1536
+ non_silence_before_correction: 0.01
1537
+ preceding_context: ɹ
1538
+ probability: 0.05
1539
+ replacement: ɾ
1540
+ segment: '[td]'
1541
+ silence_after_probability: 0.33
1542
+ silence_before_correction: 0.02
1543
+ - following_context: (ɪ|ə|ɚ|i)
1544
+ non_silence_before_correction: 0.0
1545
+ preceding_context: (ʊ|ɔj|ɝ|ɛ|ej|ɜ|a|u|o|ow|æ|aw|əw|aj|ɐ|ɪ|ə|ɔ|e|ɚ|ɑ|ʉ|ɒ|i)ː?
1546
+ probability: 0.06
1547
+ replacement: ɾʲ
1548
+ segment: '[td]ʲ'
1549
+ silence_after_probability: 0.3
1550
+ silence_before_correction: -0.01
1551
+ - following_context: (ɪ|ə|ɚ|i)
1552
+ non_silence_before_correction: -0.02
1553
+ preceding_context: ɹ
1554
+ probability: 0.19
1555
+ replacement: ɾʲ
1556
+ segment: '[td]ʲ'
1557
+ silence_after_probability: 0.27
1558
+ silence_before_correction: 0.16
1559
+ - following_context: (ɪ|ə|ɚ|i)
1560
+ non_silence_before_correction: 0.13
1561
+ preceding_context: (ɫ|ɫ̩)
1562
+ probability: 0.02
1563
+ replacement: ɾʲ
1564
+ segment: '[td]ʲ'
1565
+ silence_after_probability: 1.07
1566
+ silence_before_correction: -0.21
1567
+ - following_context: (ɪ|ə|ɚ|i)
1568
+ non_silence_before_correction: 0.05
1569
+ preceding_context: (ʊ|ɔj|ɝ|ɛ|ej|ɜ|a|u|o|ow|æ|aw|əw|aj|ɐ|ɪ|ə|ɔ|e|ɚ|ɑ|ʉ|ɒ|i)ː?
1570
+ probability: 0.01
1571
+ replacement: ɾ̃
1572
+ segment: (ɲ|n)
1573
+ silence_after_probability: 0.51
1574
+ silence_before_correction: -0.11
1575
+ - following_context: (ɪ|ə|ɚ|i)
1576
+ non_silence_before_correction: 0.04
1577
+ preceding_context: (ʊ|ɔj|ɝ|ɛ|ej|ɜ|a|u|o|ow|æ|aw|əw|aj|ɐ|ɪ|ə|ɔ|e|ɚ|ɑ|ʉ|ɒ|i)ː?
1578
+ probability: 0.01
1579
+ replacement: ɾ̃
1580
+ segment: (ɲ|n) [td][ʲʷ]?
1581
+ silence_after_probability: 0.4
1582
+ silence_before_correction: -0.1
1583
+ - following_context: $
1584
+ non_silence_before_correction: 0.1
1585
+ preceding_context: ''
1586
+ probability: 0.01
1587
+ replacement: ɑː
1588
+ segment: ɒː
1589
+ silence_after_probability: 0.25
1590
+ silence_before_correction: -0.3
1591
+ - following_context: '[^ɹ]'
1592
+ non_silence_before_correction: 0.05
1593
+ preceding_context: ''
1594
+ probability: 0.01
1595
+ replacement: ɑː
1596
+ segment: ɒː
1597
+ silence_after_probability: 0.56
1598
+ silence_before_correction: -0.1
1599
+ - following_context: $
1600
+ non_silence_before_correction: -0.06
1601
+ preceding_context: ''
1602
+ probability: 0.03
1603
+ replacement: ɑ
1604
+ segment: ɒ
1605
+ silence_after_probability: 1.0
1606
+ silence_before_correction: -0.03
1607
+ - following_context: '[^ɹ]'
1608
+ non_silence_before_correction: 0.04
1609
+ preceding_context: ''
1610
+ probability: 0.01
1611
+ replacement: ɑ
1612
+ segment: ɒ
1613
+ silence_after_probability: 0.29
1614
+ silence_before_correction: -0.11
1615
+ - following_context: $
1616
+ non_silence_before_correction: 0.01
1617
+ preceding_context: (ʊ|ɔj|ɝ|ɛ|ej|ɜ|a|u|o|ow|æ|aw|əw|aj|ɐ|ɪ|ə|ɔ|e|ɚ|ɑ|ʉ|ɒ|i)ː?
1618
+ probability: 0.04
1619
+ replacement: ɾ
1620
+ segment: d
1621
+ silence_after_probability: 0.41
1622
+ silence_before_correction: -0.04
1623
+ - following_context: ''
1624
+ non_silence_before_correction: 0.07
1625
+ preceding_context: ''
1626
+ probability: 0.01
1627
+ replacement: ʔ n̩
1628
+ segment: t ə n
1629
+ silence_after_probability: 0.88
1630
+ silence_before_correction: -0.15
1631
+ - following_context: '[^ʊɔɝaɔɛɜeuoæɐɪəɚɑʉɒi].*'
1632
+ non_silence_before_correction: 0.02
1633
+ preceding_context: ''
1634
+ probability: 0.01
1635
+ replacement: n̩
1636
+ segment: ə n
1637
+ silence_after_probability: 0.24
1638
+ silence_before_correction: -0.05
1639
+ - following_context: $
1640
+ non_silence_before_correction: 0.05
1641
+ preceding_context: ''
1642
+ probability: 0.05
1643
+ replacement: n̩
1644
+ segment: ə n
1645
+ silence_after_probability: 0.35
1646
+ silence_before_correction: -0.12
1647
+ - following_context: $
1648
+ non_silence_before_correction: 0.09
1649
+ preceding_context: ''
1650
+ probability: 0.06
1651
+ replacement: m̩
1652
+ segment: ə m
1653
+ silence_after_probability: 0.24
1654
+ silence_before_correction: -0.25
1655
+ - following_context: '[^ʊɔɝaɔɛɜeuoæɐɪəɚɑʉɒi].*'
1656
+ non_silence_before_correction: 0.03
1657
+ preceding_context: ''
1658
+ probability: 0.01
1659
+ replacement: m̩
1660
+ segment: ə m
1661
+ silence_after_probability: 0.64
1662
+ silence_before_correction: -0.02
1663
+ - following_context: $
1664
+ non_silence_before_correction: 0.05
1665
+ preceding_context: ''
1666
+ probability: 0.17
1667
+ replacement: ɫ̩
1668
+ segment: ə ɫ
1669
+ silence_after_probability: 0.26
1670
+ silence_before_correction: -0.12
1671
+ - following_context: '[^ʊɔɝaɔɛɜeuoæɐɪəɚɑʉɒi].*'
1672
+ non_silence_before_correction: 0.01
1673
+ preceding_context: ''
1674
+ probability: 0.03
1675
+ replacement: ɫ̩
1676
+ segment: ə ɫ
1677
+ silence_after_probability: 0.58
1678
+ silence_before_correction: -0.03
1679
+ rules: []
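One plausible reading of the rule entries above, sketched in Python. The field names (`segment`, `replacement`, `preceding_context`, `following_context`) come from the listing. The matching semantics are an assumption, not the aligner's confirmed behavior: an empty context matches anything, `$` means the end of the word, and any other context is a regular expression tested against the neighboring phone. The helper names `context_ok` and `apply_rule` are hypothetical, and the probability and silence fields are ignored.

```python
import re

# Two rules copied from the listing above; only the fields used here are kept.
VOWEL = "(ʊ|ɔj|ɝ|ɛ|ej|ɜ|a|u|o|ow|æ|aw|əw|aj|ɐ|ɪ|ə|ɔ|e|ɚ|ɑ|ʉ|ɒ|i)ː?"
FLAP_D = {"preceding_context": VOWEL, "following_context": "$",
          "segment": "d", "replacement": "ɾ"}
SYLLABIC_N = {"preceding_context": "", "following_context": "$",
              "segment": "ə n", "replacement": "n̩"}


def context_ok(phones, start, length, rule):
    """Check both contexts under the assumed semantics described above."""
    pre, post = rule["preceding_context"], rule["following_context"]
    if pre:
        # A non-empty preceding context needs a previous phone that matches fully.
        if start == 0 or not re.fullmatch(pre, phones[start - 1]):
            return False
    end = start + length
    if post == "$":
        return end == len(phones)
    if post:
        # Any other following context is matched against the start of the next phone.
        return end < len(phones) and re.match(post, phones[end]) is not None
    return True


def apply_rule(phones, rule):
    """Return a new phone list with every matching segment replaced."""
    segment = rule["segment"].split()
    replacement = rule["replacement"].split()
    out, i = [], 0
    while i < len(phones):
        if phones[i:i + len(segment)] == segment and context_ok(phones, i, len(segment), rule):
            out.extend(replacement)
            i += len(segment)
        else:
            out.append(phones[i])
            i += 1
    return out


flapped = apply_rule(["b", "ɛ", "d"], FLAP_D)
reduced = apply_rule(["b", "ɐ", "t", "ə", "n"], SYLLABIC_N)
```

Working on a list of phones rather than a joined string keeps a single-character segment such as `d` from matching inside a multi-character phone such as `dʒ`.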
acoustic/tree ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:71d2741b42fe55707ca41d908d6157bc94c6171623b156f71274b13ecb6dade7
3
+ size 468787
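The three added lines form a Git LFS pointer: the repository stores this small text stub while the real file lives in LFS storage. A minimal sketch of reading such a pointer, assuming the `key value` layout shown above; the function name `parse_lfs_pointer` is hypothetical.

```python
def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        # Each line is "<key> <value>", split on the first space only.
        key, _, value = line.partition(" ")
        fields[key] = value
    algorithm, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algorithm": algorithm,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }


POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:71d2741b42fe55707ca41d908d6157bc94c6171623b156f71274b13ecb6dade7
size 468787
"""

pointer = parse_lfs_pointer(POINTER)
```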
dictionary/english_india_mfa.dict ADDED
The diff for this file is too large to render. See raw diff
 
dictionary/english_nigeria_mfa.dict ADDED
The diff for this file is too large to render. See raw diff
 
dictionary/english_uk_mfa.dict ADDED
The diff for this file is too large to render. See raw diff
 
dictionary/english_us_mfa.dict ADDED
The diff for this file is too large to render. See raw diff
 
g2p/english_india_mfa/graphemes.sym ADDED
@@ -0,0 +1,280 @@
1
+ <eps> 0
2
+ | 1
3
+ _ 2
4
+ a 3
5
+ s|s 4
6
+ '|d 5
7
+ '|e 6
8
+ m 7
9
+ ' 8
10
+ e 9
11
+ l|l 10
12
+ '|m 11
13
+ r|e 12
14
+ '|s 13
15
+ u|n 14
16
+ v|e 15
17
+ '|v 16
18
+ d 17
19
+ r 18
20
+ g 19
21
+ v 20
22
+ w|e 21
23
+ s 22
24
+ o|m 23
25
+ c|a 24
26
+ p 25
27
+ c 26
28
+ b 27
29
+ c|h 28
30
+ e|n 29
31
+ h 30
32
+ j 31
33
+ l 32
34
+ a|c 33
35
+ u 34
36
+ k 35
37
+ a|l 36
38
+ o|r 37
39
+ l|i 38
40
+ i 39
41
+ s|t 40
42
+ a|m 41
43
+ a|n 42
44
+ l|o 43
45
+ a|r 44
46
+ w|o 45
47
+ f 46
48
+ h|u 47
49
+ o|n 48
50
+ r|o 49
51
+ n|i 50
52
+ i|c 51
53
+ v|o 52
54
+ e|l 53
55
+ t|h 54
56
+ b|a 55
57
+ c|i 56
58
+ n|a 57
59
+ t|e 58
60
+ c|k 59
61
+ w 60
62
+ c|o 61
63
+ f|t 62
64
+ t 63
65
+ c|e 64
66
+ j|o 65
67
+ o 66
68
+ n|e 67
69
+ m|p 68
70
+ n|d 69
71
+ e|d 70
72
+ l|y 71
73
+ e|r 72
74
+ n|g 73
75
+ m|e 74
76
+ n|t 75
77
+ s|e 76
78
+ s|h 77
79
+ l|e 78
80
+ t|i 79
81
+ z|a 80
82
+ b|b 81
83
+ s|i 82
84
+ e|s 83
85
+ v|i 84
86
+ y 85
87
+ i|e 86
88
+ d|e 87
89
+ d|a 88
90
+ r|i 89
91
+ d|i 90
92
+ d|o 91
93
+ m|i 92
94
+ n 93
95
+ o|u 94
96
+ d|u 95
97
+ u|c 96
98
+ u|l 97
99
+ l|a 98
100
+ z|i 99
101
+ z 100
102
+ b|e 101
103
+ g|e 102
104
+ m|o 103
105
+ k|i 104
106
+ b|i 105
107
+ c|y 106
108
+ r|a 107
109
+ w|y 108
110
+ e|t 109
111
+ h|i 110
112
+ k|a 111
113
+ h|o 112
114
+ i|n 113
115
+ g|a 114
116
+ i|l 115
117
+ u|r 116
118
+ j|e 117
119
+ c|t 118
120
+ j|u 119
121
+ t|a 120
122
+ z|e 121
123
+ b|l 122
124
+ m|a 123
125
+ b|o 124
126
+ o|o 125
127
+ g|i 126
128
+ t|o 127
129
+ u|m 128
130
+ h|a 129
131
+ m|s 130
132
+ f|f 131
133
+ b|r 132
134
+ d|g 133
135
+ a|d 134
136
+ z|z 135
137
+ s|c 136
138
+ i|s 137
139
+ i|a 138
140
+ o|l 139
141
+ q|u 140
142
+ a|t 141
143
+ t|r 142
144
+ i|t 143
145
+ b|u 144
146
+ f|e 145
147
+ f|u 146
148
+ u|t 147
149
+ w|a 148
150
+ b|y 149
151
+ m|y 150
152
+ c|u 151
153
+ u|s 152
154
+ p|h 153
155
+ r|s 154
156
+ u|a 155
157
+ p|i 156
158
+ c|c 157
159
+ m|m 158
160
+ c|r 159
161
+ m|u 160
162
+ x 161
163
+ g|l 162
164
+ v|a 163
165
+ n|o 164
166
+ z|o 165
167
+ f|i 166
168
+ p|o 167
169
+ k|e 168
170
+ n|s 169
171
+ n|c 170
172
+ i|d 171
173
+ d|d 172
174
+ h|e 173
175
+ g|h 174
176
+ j|a 175
177
+ f|o 176
178
+ w|n 177
179
+ d|r 178
180
+ p|e 179
181
+ w|s 180
182
+ d|y 181
183
+ e|a 182
184
+ p|y 183
185
+ p|a 184
186
+ g|o 185
187
+ o|p 186
188
+ f|a 187
189
+ f|l 188
190
+ f|r 189
191
+ g|r 190
192
+ p|p 191
193
+ g|u 192
194
+ h|l 193
195
+ '|t 194
196
+ i|o 195
197
+ j|i 196
198
+ s|a 197
199
+ k|h 198
200
+ m|b 199
201
+ k|k 200
202
+ k|n 201
203
+ k|o 202
204
+ k|s 203
205
+ k|u 204
206
+ u|i 205
207
+ h|y 206
208
+ w|i 207
209
+ x|i 208
210
+ k|y 209
211
+ y|s 210
212
+ p|u 211
213
+ o|t 212
214
+ f|y 213
215
+ c|l 214
216
+ w|h 215
217
+ p|l 216
218
+ p|r 217
219
+ o|s 218
220
+ a|s 219
221
+ w|r 220
222
+ h|r 221
223
+ h|n 222
224
+ w|f 223
225
+ w|k 224
226
+ k|w 225
227
+ w|l 226
228
+ z|u 227
229
+ z|y 228
230
+ k|l 229
231
+ u|e 230
232
+ b|d 231
233
+ '|n 232
234
+ h|m 233
235
+ w|d 234
236
+ v|u 235
237
+ q|l 236
238
+ v|y 237
239
+ q|i 238
240
+ z|h 239
241
+ m|n 240
242
+ k|r 241
243
+ q 242
244
+ p|s 243
245
+ ô 244
246
+ x|s 245
247
+ c|s 246
248
+ w|m 247
249
+ w|t 248
250
+ '|a 249
251
+ b|t 250
252
+ x|c 251
253
+ p|t 252
254
+ w|b 253
255
+ v|s 254
256
+ b|s 255
257
+ b|j 256
258
+ '|r 257
259
+ x|h 258
260
+ x|u 259
261
+ d|s 260
262
+ '|c 261
263
+ q|a 262
264
+ '|l 263
265
+ k|b 264
266
+ m|c 265
267
+ h|t 266
268
+ x|y 267
269
+ x|o 268
270
+ '|i 269
271
+ h|w 270
272
+ w|u 271
273
+ '|o 272
274
+ d|m 273
275
+ f|s 274
276
+ x|a 275
277
+ x|e 276
278
+ '|k 277
279
+ ü|r 278
280
+ ü 279
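Each line of the symbol table above pairs a symbol with an integer index, and multi-letter symbols such as `s|s` join their graphemes with the `|` separator that `meta.json` records as `sequence_separator`. A minimal sketch of loading such a table, assuming whitespace between symbol and index; `parse_symbol_table` and `split_cluster` are hypothetical helper names.

```python
def parse_symbol_table(text):
    """Map each symbol to its integer index."""
    table = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        # Split on the last run of whitespace so the symbol is kept whole.
        symbol, index = line.rsplit(None, 1)
        table[symbol] = int(index)
    return table


def split_cluster(symbol, separator="|"):
    """Split a joined symbol into graphemes; the bare separator stays as-is."""
    if symbol == separator:
        return [symbol]
    return symbol.split(separator)


SAMPLE = """<eps> 0
| 1
_ 2
a 3
s|s 4
'|d 5
"""

symbols = parse_symbol_table(SAMPLE)
```

The bare `|` entry (index 1) is the separator itself, so it needs the special case in `split_cluster`.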
g2p/english_india_mfa/meta.json ADDED
@@ -0,0 +1 @@
1
+ {"version": "2.2.6.dev1+g6874b58.d20230320", "architecture": "phonetisaurus", "train_date": "2023-05-07 11:50:08.461087", "phones": ["a", "aj", "aw", "b", "b\u02b2", "c", "c\u02b7", "d\u0292", "d\u032a", "e\u02d0", "f", "f\u02b2", "h", "i", "i\u02d0", "j", "k", "k\u02b7", "l", "m", "m\u02b2", "n", "o\u02d0", "p", "p\u02b2", "p\u02b7", "s", "t\u0283", "t\u032a", "z", "\u00e7", "\u014b", "\u0251", "\u0251\u02d0", "\u0252", "\u0252\u02d0", "\u0254j", "\u0256", "\u0259", "\u025b", "\u025b\u02d0", "\u025c", "\u025c\u02d0", "\u025f", "\u025f\u02b7", "\u0261", "\u0261\u02b7", "\u026a", "\u0272", "\u0279", "\u027e", "\u0283", "\u0288", "\u0288\u02b2", "\u0288\u02b7", "\u0289", "\u0289\u02d0", "\u028a", "\u028b", "\u028e", "\u0292"], "graphemes": ["\u00fc", "e", "v", "g", "n", "x", "f", "l", "\u00f4", "m", "w", "t", "z", "u", "p", "i", "'", "o", "b", "h", "y", "r", "c", "k", "s", "q", "d", "j", "a"], "grapheme_order": 2, "phone_order": 2, "sequence_separator": "|", "evaluation": {"num_words": 7493, "word_error_rate": null, "phone_error_rate": null}, "training": {"num_words": 67432, "num_graphemes": 29, "num_phones": 61}}
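The metadata above repeats some information: `training.num_graphemes` and `training.num_phones` should equal the lengths of the `graphemes` and `phones` lists. A small sketch of that consistency check, run on a shortened, made-up sample that uses the same keys (the real file is much longer); `check_g2p_meta` is a hypothetical helper, not part of any tool.

```python
import json


def check_g2p_meta(meta):
    """Return a list of inconsistencies found in a G2P meta.json dict."""
    problems = []
    training = meta["training"]
    if training["num_graphemes"] != len(meta["graphemes"]):
        problems.append("num_graphemes does not match the graphemes list")
    if training["num_phones"] != len(meta["phones"]):
        problems.append("num_phones does not match the phones list")
    for key in ("graphemes", "phones"):
        if len(set(meta[key])) != len(meta[key]):
            problems.append(key + " contains duplicates")
    return problems


# Shortened illustrative sample; the \u escapes decode to IPA and accented letters.
SAMPLE = r"""
{"architecture": "phonetisaurus",
 "phones": ["a", "b\u02b2", "\u0251\u02d0"],
 "graphemes": ["\u00fc", "e", "'"],
 "sequence_separator": "|",
 "training": {"num_words": 3, "num_graphemes": 3, "num_phones": 3}}
"""

meta = json.loads(SAMPLE)
problems = check_g2p_meta(meta)
```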
g2p/english_india_mfa/model.fst ADDED

Git LFS Details

  • SHA256: 4caf7deb712242ee3b69f1a3be85b3f29333275c8bf088017a09d137efe6bdba
  • Pointer size: 133 Bytes
  • Size of remote file: 37.6 MB
g2p/english_india_mfa/phones.sym ADDED
@@ -0,0 +1,62 @@
1
+ <eps> 0
2
+ a 1
3
+ s 2
4
+ ɖ 3
5
+ ʈ 4
6
+ ə 5
7
+ ɪ 6
8
+ m 7
9
+ ɛ 8
10
+ l 9
11
+ z 10
12
+ n 11
13
+ ʋ 12
14
+ ɒ 13
15
+ ɑ 14
16
+ ɑː 15
17
+ eː 16
18
+ iː 17
19
+ dʒ 18
20
+ ɒː 19
21
+ k 20
22
+ p 21
23
+ bʲ 22
24
+ tʃ 23
25
+ ʊ 24
26
+ c 25
27
+ j 26
28
+ b 27
29
+ ɡ 28
30
+ ʎ 29
31
+ f 30
32
+ ɹ 31
33
+ h 32
34
+ ʉː 33
35
+ ɛː 34
36
+ ɲ 35
37
+ oː 36
38
+ t̪ 37
39
+ i 38
40
+ aj 39
41
+ ɾ 40
42
+ ŋ 41
43
+ ʃ 42
44
+ ʈʲ 43
45
+ mʲ 44
46
+ ʈʷ 45
47
+ ʉ 46
48
+ ɜː 47
49
+ aw 48
50
+ ʒ 49
51
+ ɜ 50
52
+ ɟ 51
53
+ pʲ 52
54
+ ɔj 53
55
+ fʲ 54
56
+ cʷ 55
57
+ d̪ 56
58
+ ç 57
59
+ ɡʷ 58
60
+ ɟʷ 59
61
+ pʷ 60
62
+ kʷ 61
g2p/english_nigeria_mfa/graphemes.sym ADDED
@@ -0,0 +1,379 @@
1
+ <eps> 0
2
+ | 1
3
+ _ 2
4
+ '|d 3
5
+ ' 4
6
+ e|m 5
7
+ l|l 6
8
+ '|m 7
9
+ r|e 8
10
+ '|s 9
11
+ u|n 10
12
+ '|v 11
13
+ e 12
14
+ v|e 13
15
+ -|d 14
16
+ a 15
17
+ d 16
18
+ a|r 17
19
+ g 18
20
+ v 19
21
+ a|w 20
22
+ s|o 21
23
+ m|e 22
24
+ b 23
25
+ h 24
26
+ c|a 25
27
+ p 26
28
+ c 27
29
+ c|h 28
30
+ e|n 29
31
+ s 30
32
+ j 31
33
+ l 32
34
+ m 33
35
+ c|u 34
36
+ u 35
37
+ k 36
38
+ l|i 37
39
+ i 38
40
+ a|l 39
41
+ s|t 40
42
+ a|m 41
43
+ v|a 42
44
+ r|k 43
45
+ w|o 44
46
+ f 45
47
+ h|u 46
48
+ o|n 47
49
+ o 48
50
+ n|i 49
51
+ r|o 50
52
+ v|o 51
53
+ e|l 52
54
+ t|h 53
55
+ e|i 54
56
+ a|b 55
57
+ b|a 56
58
+ c|i 57
59
+ n|a 58
60
+ t|e 59
61
+ c|k 60
62
+ w|a 61
63
+ r|d 62
64
+ c|o 63
65
+ a|c 64
66
+ u|s 65
67
+ f|t 66
68
+ t 67
69
+ l|o 68
70
+ n|e 69
71
+ p|e 70
72
+ a|n 71
73
+ e|d 72
74
+ e|r 73
75
+ n|g 74
76
+ s|e 75
77
+ n|t 76
78
+ s|h 77
79
+ l|y 78
80
+ s|s 79
81
+ l|e 80
82
+ s|i 81
83
+ t|i 82
84
+ a|t 83
85
+ t|o 84
86
+ i|r 85
87
+ b|b 86
88
+ e|s 87
89
+ e|y 88
90
+ o|t 89
91
+ t|s 90
92
+ h|i 91
93
+ v|i 92
94
+ y 93
95
+ d|e 94
96
+ r|i 95
97
+ d|i 96
98
+ d|o 97
99
+ n 98
100
+ m|i 99
101
+ o|u 100
102
+ d|u 101
103
+ c|e 102
104
+ c|t 103
105
+ e|e 104
106
+ l|a 105
107
+ r 106
108
+ a|u 107
109
+ b|e 108
110
+ g|g 109
111
+ m|o 110
112
+ r|r 111
113
+ r|a 112
114
+ r|y 113
115
+ w|y 114
116
+ t|t 115
117
+ e|t 116
118
+ o|r 117
119
+ k|a 118
120
+ h|o 119
121
+ i|n 120
122
+ c|y 121
123
+ b|i 122
124
+ t|y 123
125
+ m|b 124
126
+ g|e 125
127
+ u|r 126
128
+ j|e 127
129
+ j|u 128
130
+ g|a 129
131
+ n|c 130
132
+ z|e 131
133
+ g|u 132
134
+ o|d 133
135
+ l|u 134
136
+ b|l 135
137
+ m|a 136
138
+ b|o 137
139
+ o|o 138
140
+ d|a 139
141
+ g|i 140
142
+ u|m 141
143
+ n|d 142
144
+ h|a 143
145
+ b|r 144
146
+ e|u 145
147
+ o|i 146
148
+ d|g 147
149
+ s|c 148
150
+ a|e 149
151
+ i|l 150
152
+ i|s 151
153
+ o|l 152
154
+ p|t 153
155
+ q|u 154
156
+ t|u 155
157
+ r|u 156
158
+ b|u 157
159
+ j|a 158
160
+ u|l 159
161
+ f|e 160
162
+ a|g 161
163
+ z|z 162
164
+ l|t 163
165
+ b|y 164
166
+ m|y 165
167
+ a|i 166
168
+ g|y 167
169
+ p|h 168
170
+ r|p 169
171
+ t|a 170
172
+ s|a 171
173
+ p|i 172
174
+ t|r 173
175
+ c|l 174
176
+ m|s 175
177
+ c|c 176
178
+ z|a 177
179
+ m|m 178
180
+ m|p 179
181
+ n|y 180
182
+ u|p 181
183
+ c|r 182
184
+ m|u 183
185
+ x 184
186
+ o|m 185
187
+ l|d 186
188
+ n|o 187
189
+ z|o 188
190
+ f|i 189
191
+ i|e 190
192
+ d|s 191
193
+ o|w 192
194
+ o|e 193
195
+ r|n 194
196
+ o|c 195
197
+ o|k 196
198
+ p|o 197
199
+ p|u 198
200
+ n|s 199
201
+ a|d 200
202
+ d|d 201
203
+ u|c 202
204
+ j|i 203
205
+ g|b 204
206
+ p|a 205
207
+ s|p 206
208
+ w|u 207
209
+ y|e 208
210
+ h|e 209
211
+ r|t 210
212
+ g|h 211
213
+ k|i 212
214
+ j|o 213
215
+ i|p 214
216
+ o|g 215
217
+ i|c 216
218
+ d|r 217
219
+ d|v 218
220
+ k|e 219
221
+ d|y 220
222
+ p|y 221
223
+ g|o 222
224
+ o|p 223
225
+ e|o 224
226
+ f|a 225
227
+ f|f 226
228
+ r|s 227
229
+ e|c 228
230
+ z|i 229
231
+ k|p 230
232
+ f|l 231
233
+ f|o 232
234
+ f|r 233
235
+ n|n 234
236
+ '|t 235
237
+ p|r 236
238
+ g|l 237
239
+ e|a 238
240
+ g|r 239
241
+ p|p 240
242
+ u|e 241
243
+ a|p 242
244
+ l|s 243
245
+ r|l 244
246
+ r|m 245
247
+ y|s 246
248
+ y|o 247
249
+ y|i 248
250
+ k|h 249
251
+ y|a 250
252
+ w 251
253
+ k|k 252
254
+ k|n 253
255
+ k|o 254
256
+ k|u 255
257
+ k|w 256
258
+ u|t 257
259
+ w|i 258
260
+ f|u 259
261
+ g|n 260
262
+ y|u 261
263
+ a|y 262
264
+ y|w 263
265
+ n|u 264
266
+ z 265
267
+ é|t 266
268
+ é|p 267
269
+ é 268
270
+ i|a 269
271
+ y|l 270
272
+ n|k 271
273
+ v|y 272
274
+ w|s 273
275
+ n|f 274
276
+ u|i 275
277
+ h|y 276
278
+ w|e 277
279
+ x|i 278
280
+ w|h 279
281
+ i|d 280
282
+ a|k 281
283
+ p|l 282
284
+ g|m 283
285
+ o|v 284
286
+ i|m 285
287
+ o|a 286
288
+ i|v 287
289
+ r|c 288
290
+ a|s 289
291
+ i|g 290
292
+ k|s 291
293
+ u|a 292
294
+ s|u 293
295
+ y|n 294
296
+ i|o 295
297
+ h|w 296
298
+ m|n 297
299
+ a|v 298
300
+ w|r 299
301
+ a|z 300
302
+ z|u 301
303
+ z|y 302
304
+ '|l 303
305
+ u|g 304
306
+ k|y 305
307
+ e|f 306
308
+ b|d 307
309
+ f|y 308
310
+ a|f 309
311
+ i|z 310
312
+ u|d 311
313
+ h|l 312
314
+ v|v 313
315
+ k|l 314
316
+ u|b 315
317
+ y|d 316
318
+ v|u 317
319
+ i|t 318
320
+ e|g 319
321
+ y|r 320
322
+ r|b 321
323
+ e|p 322
324
+ y|m 323
325
+ z|h 324
326
+ q|l 325
327
+ e|v 326
328
+ e|x 327
329
+ r|g 328
330
+ q 329
331
+ p|s 330
332
+ x|s 331
333
+ w|k 332
334
+ w|m 333
335
+ w|t 334
336
+ q|i 335
337
+ w|b 336
338
+ o|b 337
339
+ b|s 338
340
+ b|j 339
341
+ x|e 340
342
+ x|h 341
343
+ o|s 342
344
+ x|t 343
345
+ é|e 344
346
+ '|c 345
347
+ q|a 346
348
+ o|f 347
349
+ w|d 348
350
+ y|c 349
351
+ d|w 350
352
+ j|j 351
353
+ d|n 352
354
+ '|n 353
355
+ i|b 354
356
+ w|l 355
357
+ y|g 356
358
+ h|r 357
359
+ k|r 358
360
+ -|k 359
361
+ -|f 360
362
+ '|a 361
363
+ h|m 362
364
+ x|y 363
365
+ '|r 364
366
+ p|n 365
367
+ w|n 366
368
+ x|o 367
369
+ '|i 368
370
+ h|b 369
371
+ é|g 370
372
+ c|s 371
373
+ x|u 372
374
+ f|s 373
375
+ x|a 374
376
+ '|k 375
377
+ y|p 376
378
+ ü 377
379
+ - 378
g2p/english_nigeria_mfa/meta.json ADDED
@@ -0,0 +1 @@
1
+ {"version": "2.2.6.dev1+g6874b58.d20230320", "architecture": "phonetisaurus", "train_date": "2023-05-07 11:33:01.252545", "phones": ["a", "aj", "aw", "a\u02d0", "b", "b\u02b2", "c", "c\u02b0", "c\u02b7", "d", "d\u0292", "d\u02b2", "e", "f", "f\u02b2", "h", "i", "i\u02d0", "j", "k", "kp", "k\u02b0", "k\u02b7", "l", "m", "m\u02b2", "n", "o", "p", "p\u02b0", "p\u02b2", "p\u02b7", "s", "t", "t\u0283", "t\u02b0", "t\u02b2", "t\u02b7", "u", "u\u02d0", "v", "v\u02b2", "w", "z", "\u00e7", "\u00f0", "\u014b", "\u0254", "\u0254j", "\u025b", "\u025b\u02d0", "\u025c", "\u025c\u02d0", "\u025f", "\u025f\u02b7", "\u0261", "\u0261b", "\u0261\u02b7", "\u026b", "\u0272", "\u0279", "\u0283", "\u028a", "\u028e", "\u03b8"], "graphemes": ["\u00fc", "e", "v", "g", "n", "x", "f", "l", "-", "m", "w", "t", "z", "u", "p", "i", "\u00e9", "'", "o", "b", "h", "y", "r", "c", "k", "s", "q", "d", "j", "a"], "grapheme_order": 2, "phone_order": 2, "sequence_separator": "|", "evaluation": {"num_words": 5633, "word_error_rate": null, "phone_error_rate": null}, "training": {"num_words": 50705, "num_graphemes": 30, "num_phones": 65}}
g2p/english_nigeria_mfa/model.fst ADDED

Git LFS Details

  • SHA256: 05815898dc747f7f62bdcd3cc06b9901230920b28e330f3770a8367809a8ff1d
  • Pointer size: 133 Bytes
  • Size of remote file: 26.1 MB
g2p/english_nigeria_mfa/phones.sym ADDED
@@ -0,0 +1,67 @@
1
+ <eps> 0
2
+ d 1
3
+ t 2
4
+ ɛ 3
5
+ m 4
6
+ ɫ 5
7
+ i 6
8
+ a 7
9
+ z 8
10
+ s 9
11
+ ɔ 10
12
+ n 11
13
+ v 12
14
+ spn 13
15
+ e 14
16
+ dʲ 15
17
+ iː 16
18
+ dʒ 17
19
+ vʲ 18
20
+ bʲ 19
21
+ tʃ 20
22
+ kʰ 21
23
+ p 22
24
+ pʰ 23
25
+ k 24
26
+ ʊ 25
27
+ cʰ 26
28
+ j 27
29
+ ʎ 28
30
+ w 29
31
+ f 30
32
+ ɹ 31
33
+ h 32
34
+ uː 33
35
+ ɛː 34
36
+ u 35
37
+ ɲ 36
38
+ o 37
39
+ ɡ 38
40
+ θ 39
41
+ b 40
42
+ l 41
43
+ pʲ 42
44
+ ŋ 43
45
+ ʃ 44
46
+ tʲ 45
47
+ tʰ 46
48
+ tʷ 47
49
+ mʲ 48
50
+ c 49
51
+ ç 50
52
+ aj 51
53
+ ɟ 52
54
+ aː 53
55
+ aw 54
56
+ kʷ 55
57
+ ɜ 56
58
+ ɔj 57
59
+ fʲ 58
60
+ cʷ 59
61
+ ɡb 60
62
+ ð 61
63
+ kp 62
64
+ ɟʷ 63
65
+ ɡʷ 64
66
+ pʷ 65
67
+ ɜː 66
g2p/english_uk_mfa/graphemes.sym ADDED
@@ -0,0 +1,288 @@
1
+ <eps> 0
2
+ | 1
3
+ _ 2
4
+ a|n 3
5
+ n 4
6
+ '|e 5
7
+ m 6
8
+ ' 7
9
+ l|l 8
10
+ '|m 9
11
+ r|e 10
12
+ '|s 11
13
+ u|n 12
14
+ v|e 13
15
+ '|v 14
16
+ e 15
17
+ a 16
18
+ d 17
19
+ r 18
20
+ g 19
21
+ v 20
22
+ w|e 21
23
+ s 22
24
+ o|m 23
25
+ b 24
26
+ h 25
27
+ c 26
28
+ p 27
29
+ c|h 28
30
+ j 29
31
+ l 30
32
+ c|u 31
33
+ u 32
34
+ k 33
35
+ a|l 34
36
+ o|r 35
37
+ l|i 36
38
+ i 37
39
+ s|t 38
40
+ a|m 39
41
+ l|o 40
42
+ a|r 41
43
+ w|o 42
44
+ f 43
45
+ h|u 44
46
+ o|n 45
47
+ r|o 46
48
+ n|i 47
49
+ i|c 48
50
+ v|o 49
51
+ e|l 50
52
+ t|h 51
53
+ b|a 52
54
+ c|a 53
55
+ c|i 54
56
+ n|a 55
57
+ t|e 56
58
+ c|k 57
59
+ w 58
60
+ c|o 59
61
+ f|t 60
62
+ t 61
63
+ j|o 62
64
+ n|e 63
65
+ m|p 64
66
+ n|d 65
67
+ e|d 66
68
+ l|y 67
69
+ e|r 68
70
+ o 69
71
+ n|g 70
72
+ m|e 71
73
+ n|t 72
74
+ s|e 73
75
+ s|h 74
76
+ s|s 75
77
+ l|e 76
78
+ t|i 77
79
+ i|s 78
80
+ a|t 79
81
+ t|o 80
82
+ z|a 81
83
+ b|b 82
84
+ s|i 83
85
+ e|s 84
86
+ v|i 85
87
+ y 86
88
+ i|e 87
89
+ t|s 88
90
+ h|i 89
91
+ d|e 90
92
+ d|a 91
93
+ l|a 92
94
+ r|i 93
95
+ d|i 94
96
+ d|o 95
97
+ m|i 96
98
+ o|u 97
99
+ d|u 98
100
+ c|e 99
101
+ u|c 100
102
+ u|l 101
103
+ z|i 102
104
+ z 103
105
+ b|e 104
106
+ e|c 105
107
+ g|e 106
108
+ e|n 107
109
+ k|i 108
110
+ b|i 109
111
+ g|a 110
112
+ r|a 111
113
+ w|y 112
114
+ e|t 113
115
+ k|a 114
116
+ h|o 115
117
+ i|n 116
118
+ c|y 117
119
+ i|l 118
120
+ u|r 119
121
+ j|a 120
122
+ j|e 121
123
+ c|t 122
124
+ b|j 123
125
+ j|u 124
126
+ n|c 125
127
+ b|l 126
128
+ a|c 127
129
+ t|a 128
130
+ z|e 129
131
+ g|u 130
132
+ o|o 131
133
+ m|a 132
134
+ b|o 133
135
+ g|i 134
136
+ u|m 135
137
+ h|a 136
138
+ m|s 137
139
+ i|d 138
140
+ m|o 139
141
+ f|f 140
142
+ b|r 141
143
+ d|g 142
144
+ p|t 143
145
+ z|z 144
146
+ s|c 145
147
+ i|a 146
148
+ o|l 147
149
+ q|u 148
150
+ t|r 149
151
+ i|t 150
152
+ d|l 151
153
+ b|u 152
154
+ f|e 153
155
+ f|u 154
156
+ u|t 155
157
+ w|a 156
158
+ b|y 157
159
+ m|y 158
160
+ u|s 159
161
+ p|h 160
162
+ r|s 161
163
+ u|a 162
164
+ p|i 163
165
+ c|c 164
166
+ m|m 165
167
+ d|s 166
168
+ c|r 167
169
+ u|e 168
170
+ m|u 169
171
+ g|l 170
172
+ v|a 171
173
+ v|u 172
174
+ n|o 173
175
+ z|o 174
176
+ f|i 175
177
+ p|o 176
178
+ k|e 177
179
+ n|s 178
180
+ d|d 179
181
+ m|b 180
182
+ o|p 181
183
+ h|e 182
184
+ g|h 183
185
+ x 184
186
+ f|o 185
187
+ w|n 186
188
+ d|r 187
189
+ w|s 188
190
+ r|t 189
191
+ p|e 190
192
+ d|y 191
193
+ e|a 192
194
+ p|y 193
195
+ p|a 194
196
+ g|o 195
197
+ f|a 196
198
+ f|l 197
199
+ f|r 198
200
+ '|t 199
201
+ g|r 200
202
+ p|r 201
203
+ p|p 202
204
+ s|a 203
205
+ h|l 204
206
+ i|o 205
207
+ y|s 206
208
+ j|i 207
209
+ k|h 208
210
+ k|k 209
211
+ k|n 210
212
+ k|s 211
213
+ k|u 212
214
+ h|y 213
215
+ o|t 214
216
+ w|i 215
217
+ k|y 216
218
+ p|u 217
219
+ f|y 218
220
+ c|l 219
221
+ a|s 220
222
+ v|y 221
223
+ k|o 222
224
+ x|i 223
225
+ w|h 224
226
+ p|l 225
227
+ u|i 226
228
+ o|s 227
229
+ w|r 228
230
+ w|l 229
231
+ h|r 230
232
+ h|n 231
233
+ w|f 232
234
+ w|k 233
235
+ k|w 234
236
+ z|u 235
237
+ z|y 236
238
+ z|s 237
239
+ k|l 238
240
+ b|d 239
241
+ d|w 240
242
+ '|n 241
243
+ h|m 242
244
+ w|d 243
245
+ z|h 244
246
+ q|l 245
247
+ w|u 246
248
+ q|i 247
249
+ m|n 248
250
+ q 249
251
+ k|r 250
252
+ p|s 251
253
+ x|s 252
254
+ c|s 253
255
+ w|m 254
256
+ w|t 255
257
+ ô 256
258
+ '|a 257
259
+ b|t 258
260
+ x|c 259
261
+ v|l 260
262
+ w|b 261
263
+ v|s 262
264
+ b|s 263
265
+ x|e 264
266
+ x|h 265
267
+ x|u 266
268
+ '|c 267
269
+ q|a 268
270
+ '|p 269
271
+ '|d 270
272
+ '|l 271
273
+ m|c 272
274
+ h|t 273
275
+ x|y 274
276
+ x|o 275
277
+ '|i 276
278
+ h|b 277
279
+ h|w 278
280
+ z|w 279
281
+ '|o 280
282
+ '|r 281
283
+ d|m 282
284
+ f|s 283
285
+ x|a 284
286
+ '|k 285
287
+ ü|r 286
288
+ ü 287
g2p/english_uk_mfa/meta.json ADDED
@@ -0,0 +1 @@
1
+ {"version": "2.2.6.dev1+g6874b58.d20230320", "architecture": "phonetisaurus", "train_date": "2023-05-07 12:25:49.314352", "phones": ["a", "aj", "aw", "b", "b\u02b2", "c", "c\u02b0", "c\u02b7", "d", "d\u0292", "d\u02b2", "e", "ej", "f", "f\u02b2", "f\u02b7", "h", "i", "i\u02d0", "j", "k", "k\u02b0", "k\u02b7", "l", "m", "m\u02b2", "n", "p", "p\u02b0", "p\u02b2", "p\u02b7", "s", "t", "t\u0283", "t\u02b0", "t\u02b2", "t\u02b7", "v", "v\u02b2", "v\u02b7", "w", "z", "\u00e7", "\u00f0", "\u014b", "\u0250", "\u0251", "\u0251\u02d0", "\u0252", "\u0252\u02d0", "\u0254j", "\u0259", "\u0259w", "\u025b", "\u025b\u02d0", "\u025c", "\u025c\u02d0", "\u025f", "\u025f\u02b7", "\u0261", "\u0261\u02b7", "\u026a", "\u026b", "\u0272", "\u0279", "\u0283", "\u0289", "\u0289\u02d0", "\u028a", "\u028e", "\u0292", "\u03b8"], "graphemes": ["\u00fc", "e", "v", "g", "n", "x", "f", "l", "\u00f4", "m", "w", "t", "z", "u", "p", "i", "'", "o", "b", "h", "y", "r", "c", "k", "s", "q", "d", "j", "a"], "grapheme_order": 2, "phone_order": 2, "sequence_separator": "|", "evaluation": {"num_words": 7497, "word_error_rate": null, "phone_error_rate": null}, "training": {"num_words": 67478, "num_graphemes": 29, "num_phones": 72}}
g2p/english_uk_mfa/model.fst ADDED

Git LFS Details

  • SHA256: b80168903f21c0b23aa001e91a98cce29499b16db24a9a13d54ad751b6b5ea5f
  • Pointer size: 133 Bytes
  • Size of remote file: 39.5 MB
g2p/english_uk_mfa/phones.sym ADDED
@@ -0,0 +1,73 @@
1
+ <eps> 0
2
+ a 1
3
+ n 2
4
+ ə 3
5
+ m 4
6
+ ɛ 5
7
+ ɪ 6
8
+ ɫ 7
9
+ s 8
10
+ z 9
11
+ ɐ 10
12
+ v 11
13
+ ɑ 12
14
+ ɑː 13
15
+ ɒ 14
16
+ ej 15
17
+ dʲ 16
18
+ iː 17
19
+ dʒ 18
20
+ vʲ 19
21
+ ɒː 20
22
+ bʲ 21
23
+ tʃ 22
24
+ pʰ 23
25
+ k 24
26
+ kʰ 25
27
+ ʊ 26
28
+ cʰ 27
29
+ j 28
30
+ b 29
31
+ ɡ 30
32
+ ʎ 31
33
+ t 32
34
+ l 33
35
+ d 34
36
+ w 35
37
+ f 36
38
+ ɹ 37
39
+ h 38
40
+ ʉː 39
41
+ ɛː 40
42
+ ɲ 41
43
+ əw 42
44
+ θ 43
45
+ i 44
46
+ aj 45
47
+ p 46
48
+ ŋ 47
49
+ ʃ 48
50
+ tʲ 49
51
+ tʷ 50
52
+ mʲ 51
53
+ tʰ 52
54
+ c 53
55
+ ɟ 54
56
+ ʉ 55
57
+ aw 56
58
+ kʷ 57
59
+ ɜː 58
60
+ ʒ 59
61
+ ɜ 60
62
+ pʲ 61
63
+ ɔj 62
64
+ fʲ 63
65
+ cʷ 64
66
+ ç 65
67
+ ɡʷ 66
68
+ ɟʷ 67
69
+ ð 68
70
+ pʷ 69
71
+ vʷ 70
72
+ e 71
73
+ fʷ 72
g2p/english_us_mfa/graphemes.sym ADDED
@@ -0,0 +1,354 @@
1
+ <eps> 0
2
+ | 1
3
+ _ 2
4
+ '|a 3
5
+ '|d 4
6
+ ' 5
7
+ e|m 6
8
+ i|n 7
9
+ l|l 8
10
+ '|m 9
11
+ '|n 10
12
+ r|e 11
13
+ '|r 12
14
+ e 13
15
+ '|s 14
16
+ u|n 15
17
+ v|e 16
18
+ '|v 17
19
+ a 18
20
+ d 19
21
+ r 20
22
+ g 21
23
+ v 22
24
+ a|w 23
25
+ s|o 24
26
+ m|e 25
27
+ b 26
28
+ h 27
29
+ a|b 28
30
+ o 29
31
+ m 30
32
+ y|c 31
33
+ c|a 32
34
+ p 33
35
+ c 34
36
+ c|h 35
37
+ e|n 36
38
+ j 37
39
+ l 38
40
+ u 39
41
+ c|u 40
42
+ k 41
43
+ a|l 42
44
+ o|r 43
45
+ s|t 44
46
+ a|m 45
47
+ a|n 46
48
+ l|o 47
49
+ a|r 48
50
+ w|o 49
51
+ f 50
52
+ h|u 51
53
+ s 52
54
+ o|n 53
55
+ i|c 54
56
+ i 55
57
+ t|e 56
58
+ v|o 57
59
+ e|l 58
60
+ t|h 59
61
+ e|i 60
62
+ b|a 61
63
+ d|e 62
64
+ a|c 63
65
+ v|i 64
66
+ x|i 65
67
+ c|i 66
68
+ s|c 67
69
+ u|s 68
70
+ c|k 69
71
+ w 70
72
+ c|o 71
73
+ r|i 72
74
+ t|i 73
75
+ n 74
76
+ t 75
77
+ l|i 76
78
+ u|l 77
79
+ a|d 78
80
+ d|d 79
81
+ a|i 80
82
+ c|e 81
83
+ e|r 82
84
+ s|s 83
85
+ k|a 84
86
+ n|a 85
87
+ n|e 86
88
+ m|p 87
89
+ p|e 88
90
+ e|d 89
91
+ l|y 90
92
+ e|e 91
93
+ d|o 92
94
+ n|i 93
95
+ n|g 94
96
+ n|t 95
97
+ a|p 96
98
+ r|o 97
99
+ n|o 98
100
+ s|i 99
101
+ o|g 100
102
+ e|s 101
103
+ l|a 102
104
+ s|e 103
105
+ s|h 104
106
+ e|v 105
107
+ l|e 106
108
+ i|a 107
109
+ a|s 108
110
+ d|i 109
111
+ z|e 110
112
+ a|t 111
113
+ j|o 112
114
+ u|r 113
115
+ o|i 114
116
+ x 115
117
+ y|a 116
118
+ z|a 117
119
+ b|b 118
120
+ c|y 119
121
+ b|e 120
122
+ e|y 121
123
+ b|o 122
124
+ t|s 123
125
+ i|p 124
126
+ t|t 125
127
+ y 126
128
+ t|u 127
129
+ z 128
130
+ i|m 129
131
+ o|u 130
132
+ m|b 131
133
+ t|o 132
134
+ o|m 133
135
+ m|i 134
136
+ g|i 135
137
+ g|e 136
138
+ h|y 137
139
+ e|c 138
140
+ o|t 139
141
+ t|y 140
142
+ p|y 141
143
+ r|a 142
144
+ v|a 143
145
+ d|u 144
146
+ n|s 145
147
+ c|t 146
148
+ u|c 147
149
+ e|a 148
150
+ u|m 149
151
+ g|o 150
152
+ e|g 151
153
+ m|o 152
154
+ s|k 153
155
+ m|u 154
156
+ t|r 155
157
+ b|r 156
158
+ k|i 157
159
+ k|u 158
160
+ t|a 159
161
+ c|r 160
162
+ b|i 161
163
+ b|y 162
164
+ i|r 163
165
+ f|o 164
166
+ g|a 165
167
+ n|n 166
168
+ w|e 167
169
+ w|y 168
170
+ p|o 169
171
+ e|t 170
172
+ f|a 171
173
+ h|e 172
174
+ h|i 173
175
+ h|o 174
176
+ i|d 175
177
+ j|a 176
178
+ i|e 177
179
+ i|l 178
180
+ p|h 179
181
+ j|e 180
182
+ j|u 181
183
+ d|g 182
184
+ n|c 183
185
+ k|h 184
186
+ z|i 185
187
+ b|l 186
188
+ a|u 187
189
+ p|s 188
190
+ g|u 189
191
+ o|o 190
192
+ o|w 191
193
+ l|u 192
194
+ g|r 193
195
+ m|a 194
196
+ o|b 195
197
+ c|c 196
198
+ s|p 197
199
+ d|a 198
200
+ g|h 199
201
+ n|d 200
202
+ o|v 201
203
+ z|z 202
204
+ i|o 203
205
+ h|a 204
206
+ z|o 205
207
+ o|a 206
208
+ o|k 207
209
+ p|t 208
210
+ k|e 209
211
+ i|s 210
212
+ o|p 211
213
+ o|l 212
214
+ b|u 213
215
+ q|u 214
216
+ s|u 215
217
+ f|e 216
218
+ a|g 217
219
+ u|t 218
220
+ l|t 219
221
+ w|a 220
222
+ p|u 221
223
+ i|b 222
224
+ p|i 223
225
+ c|l 224
226
+ d|s 225
227
+ l|d 226
228
+ g|l 227
229
+ y|n 228
230
+ v|u 229
231
+ f|i 230
232
+ y|l 231
233
+ i|t 232
234
+ y|m 233
235
+ e|p 234
236
+ d|h 235
237
+ l|s 236
238
+ d|r 237
239
+ f|t 238
240
+ d|y 239
241
+ y|p 240
242
+ f|f 241
243
+ a|y 242
244
+ a|f 243
245
+ f|l 244
246
+ f|r 245
247
+ '|t 246
248
+ p|l 247
249
+ p|r 248
250
+ i|z 249
251
+ p|p 250
252
+ h|l 251
253
+ k|o 252
254
+ s|l 253
255
+ y|o 254
256
+ j|i 255
257
+ a|k 256
258
+ k|s 257
259
+ k|k 258
260
+ k|n 259
261
+ w|i 260
262
+ x|e 261
263
+ f|u 262
264
+ n|k 263
265
+ i|g 264
266
+ y|s 265
267
+ p|a 266
268
+ n|u 267
269
+ s|a 268
270
+ v|y 269
271
+ i|v 270
272
+ o|s 271
273
+ o|e 272
274
+ f|y 273
275
+ y|g 274
276
+ w|s 275
277
+ n|f 276
278
+ y|i 277
279
+ t|l 278
280
+ t|w 279
281
+ w|h 280
282
+ q|a 281
283
+ q 282
284
+ o|c 283
285
+ k|w 284
286
+ w|r 285
287
+ r|s 286
288
+ o|f 287
289
+ s|y 288
290
+ h|w 289
291
+ h|n 290
292
+ a|v 291
293
+ -|g 292
294
+ y|e 293
295
+ y|u 294
296
+ z|u 295
297
+ z|y 296
298
+ '|l 297
299
+ '|y 298
300
+ h|r 299
301
+ k|y 300
302
+ k|l 301
303
+ v|v 302
304
+ o|d 303
305
+ i|f 304
306
+ x|s 305
307
+ y|d 306
308
+ y|r 307
309
+ z|h 308
310
+ q|l 309
311
+ d|l 310
312
+ w|u 311
313
+ q|i 312
314
+ e|f 313
315
+ k|r 314
316
+ w|n 315
317
+ d|n 316
318
+ w|k 317
319
+ w|m 318
320
+ c|s 319
321
+ ô 320
322
+ v|l 321
323
+ w|d 322
324
+ x|x 323
325
+ e|x 324
326
+ x|c 325
327
+ x|h 326
328
+ x|t 327
329
+ '|c 328
330
+ b|s 329
331
+ '|p 330
332
+ x|u 331
333
+ y|w 332
334
+ h|m 333
335
+ -|j 334
336
+ w|l 335
337
+ '|e 336
338
+ '|o 337
339
+ -|m 338
340
+ w|b 339
341
+ w|f 340
342
+ w|t 341
343
+ x|y 342
344
+ x|a 343
345
+ x|o 344
346
+ '|i 345
347
+ h|b 346
348
+ d|w 347
349
+ -|t 348
350
+ -|h 349
351
+ -|z 350
352
+ ü|r 351
353
+ - 352
354
+ ü 353
g2p/english_us_mfa/meta.json ADDED
@@ -0,0 +1 @@
1
+ {"version": "2.2.6.dev1+g6874b58.d20230320", "architecture": "phonetisaurus", "train_date": "2023-05-07 12:08:58.048522", "phones": ["aj", "aw", "b", "b\u02b2", "c", "c\u02b0", "c\u02b7", "d", "d\u0292", "d\u02b2", "ej", "f", "f\u02b2", "h", "i", "i\u02d0", "j", "k", "k\u02b0", "k\u02b7", "l", "m", "m\u02b2", "n", "ow", "p", "p\u02b0", "p\u02b2", "p\u02b7", "s", "t", "t\u0283", "t\u02b0", "t\u02b2", "t\u02b7", "v", "v\u02b2", "w", "z", "\u00e6", "\u00e7", "\u00f0", "\u014b", "\u0250", "\u0251", "\u0251\u02d0", "\u0252", "\u0252\u02d0", "\u0254j", "\u0259", "\u025a", "\u025b", "\u025d", "\u025f", "\u025f\u02b7", "\u0261", "\u0261\u02b7", "\u026a", "\u026b", "\u0272", "\u0279", "\u0283", "\u0289", "\u0289\u02d0", "\u028a", "\u028e", "\u0292", "\u0294", "\u03b8"], "graphemes": ["\u00fc", "e", "v", "g", "n", "x", "f", "l", "\u00f4", "-", "m", "w", "t", "z", "u", "p", "i", "'", "o", "b", "h", "y", "r", "c", "k", "s", "q", "d", "j", "a"], "grapheme_order": 2, "phone_order": 2, "sequence_separator": "|", "evaluation": {"num_words": 7904, "word_error_rate": null, "phone_error_rate": null}, "training": {"num_words": 71153, "num_graphemes": 30, "num_phones": 69}}
g2p/english_us_mfa/model.fst ADDED

Git LFS Details

  • SHA256: 0da3ed70916a4b541d00cfc7e5dbc2d21086df2f6ace4b84be7d0bda2557ed06
  • Pointer size: 133 Bytes
  • Size of remote file: 37.7 MB
g2p/english_us_mfa/phones.sym ADDED
@@ -0,0 +1,70 @@
1
+ <eps> 0
2
+ ə 1
3
+ d 2
4
+ t 3
5
+ ɪ 4
6
+ m 5
7
+ ɛ 6
8
+ n 7
9
+ ɫ 8
10
+ ɚ 9
11
+ ɹ 10
12
+ ʔ 11
13
+ s 12
14
+ z 13
15
+ ɐ 14
16
+ v 15
17
+ ej 16
18
+ ɑː 17
19
+ dʲ 18
20
+ iː 19
21
+ ɑ 20
22
+ dʒ 21
23
+ vʲ 22
24
+ ɒː 23
25
+ bʲ 24
26
+ tʃ 25
27
+ æ 26
28
+ b 27
29
+ ow 28
30
+ aj 29
31
+ cʰ 30
32
+ p 31
33
+ pʰ 32
34
+ k 33
35
+ j 34
36
+ ʊ 35
37
+ kʰ 36
38
+ ɒ 37
39
+ ɡ 38
40
+ l 39
41
+ w 40
42
+ f 41
43
+ h 42
44
+ ʉ 43
45
+ ɲ 44
46
+ pʲ 45
47
+ i 46
48
+ θ 47
49
+ ʉː 48
50
+ tʲ 49
51
+ ʃ 50
52
+ c 51
53
+ tʰ 52
54
+ ʎ 53
55
+ ŋ 54
56
+ ʒ 55
57
+ mʲ 56
58
+ ç 57
59
+ ɝ 58
60
+ ɔj 59
61
+ aw 60
62
+ fʲ 61
63
+ ɟ 62
64
+ kʷ 63
65
+ cʷ 64
66
+ ð 65
67
+ ɟʷ 66
68
+ ɡʷ 67
69
+ tʷ 68
70
+ pʷ 69