File size: 1,025 Bytes
bacba0c 62ba88f bacba0c 62ba88f bacba0c | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 | ---
license: cc-by-sa-4.0
datasets:
- Wikidepia/openslr_enhanced
language:
- id
- jv
---
# Montreal Forced Aligner (MFA) for Indonesia-Javanese
This repository contains MFA model for Indonesia-Javanese language. This model primarily trained on Javanese ASR dataset (https://www.openslr.org/35/), that are enhanced using DeepFilterNet2 to remove unwanted noise. Lexicon contained in this repository comes from Google's language-resource [Javanese Lexicon](https://github.com/google/language-resources/blob/master/jv/data/lexicon.tsv).
While this model is only trained on Javanese language, you can also use this to align Indonesian speech. You might need to add Indonesian lexicon to the dictionary file.
## Example Usage
To align:
```bash
mfa align --g2p_model_path g2p_jv.zip audio_dir lexicon_jv.dict acoustic_model.zip aligned_dir
```
## Resources
- https://montreal-forced-aligner.readthedocs.io/en/latest/first_steps/alignment_example.html
- https://mfa-models.readthedocs.io/en/latest/
## License
CC-BY-SA-4.0
|