---
license: cc-by-sa-4.0
datasets:
- Wikidepia/openslr_enhanced
language:
- id
- jv
---

# Montreal Forced Aligner (MFA) for Indonesian-Javanese

This repository contains an MFA model for Indonesian and Javanese. The model was trained primarily on the Javanese ASR dataset (https://www.openslr.org/35/), which was enhanced with DeepFilterNet2 to remove unwanted noise.

The lexicon in this repository comes from the [Javanese Lexicon](https://github.com/google/language-resources/blob/master/jv/data/lexicon.tsv) in Google's language-resources project.

Although the model was trained only on Javanese, you can also use it to align Indonesian speech. You may need to add an Indonesian lexicon to the dictionary file.

## Example Usage

To align:

```bash
mfa align --g2p_model_path g2p_jv.zip audio_dir lexicon_jv.dict acoustic_model.zip aligned_dir
```

## Resources

- https://montreal-forced-aligner.readthedocs.io/en/latest/first_steps/alignment_example.html
- https://mfa-models.readthedocs.io/en/latest/

## License

CC-BY-SA-4.0
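
## Adding Indonesian Entries to the Dictionary

The note above about adding an Indonesian lexicon can be sketched as a simple merge of two dictionary files. This is a minimal sketch, not part of this repository: `merge_lexicons`, `lexicon_id.dict`, and `lexicon_jv_id.dict` are hypothetical names, and it assumes both files use the plain MFA dictionary layout of one entry per line (a word followed by whitespace and its space-separated phones). The Indonesian entries presumably need to use the same phone symbols as the Javanese lexicon so that they match the acoustic model.

```bash
#!/usr/bin/env bash
# Hypothetical helper: merge a base MFA dictionary with extra entries.
# Usage: merge_lexicons BASE.dict EXTRA.dict OUT.dict
merge_lexicons() {
    local base="$1" extra="$2" out="$3"
    # awk 'NF' keeps only non-blank lines from both files;
    # sort -u then removes exact duplicate entries.
    awk 'NF' "$base" "$extra" | LC_ALL=C sort -u > "$out"
}

# Example (lexicon_id.dict is a placeholder for your own Indonesian lexicon):
# merge_lexicons lexicon_jv.dict lexicon_id.dict lexicon_jv_id.dict
# mfa align --g2p_model_path g2p_jv.zip audio_dir lexicon_jv_id.dict acoustic_model.zip aligned_dir
```

Only exact duplicate lines are dropped, so a word that has different pronunciations in the two files keeps both entries.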