Update README.md

README.md CHANGED

@@ -22,12 +22,12 @@ The training recipe was based on wsj recipe in [espnet](https://github.com/espnet)
 
 <!-- Provide a longer summary of what this model is. -->
 
-This model is Hybrid CTC/Attention model with pre-trained HuBERT
+This model is a Hybrid CTC/Attention model with a pre-trained HuBERT encoder.
 
-The model pre-trained on Thai-central, Khummuang, Korat, and Pattani and fine-tuned on Khummuang, Korat, and Pattani.
-
-you can demo on colab with [this link](https://colab.research.google.com/drive/1stltGdpG9OV-sCl9QgkvEXZV7fGB2Ixe?usp=sharing). (Please note that you cannot inference >4 seconds of audio with free Google colab.)
+The model was pre-trained on Thai-Central, Khummuang, Korat, and Pattani, and fine-tuned on Khummuang, Korat, and Pattani (Experiment 3 in the paper).
 
+We provide demo code for running inference with this model architecture on Colab [here](https://colab.research.google.com/drive/1stltGdpG9OV-sCl9QgkvEXZV7fGB2Ixe?usp=sharing).
+(The code is for Thai-Central; please select the correct model accordingly.)
 
 ## Evaluation
 
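The hybrid CTC/Attention decoding this README describes combines the CTC and attention-decoder scores for each beam-search hypothesis. A minimal sketch of that interpolation, assuming the standard weighted log-probability combination; the `joint_score` name, the 0.3 weight, and the toy hypothesis scores are illustrative, not taken from this model card:

```python
def joint_score(ctc_logprob: float, att_logprob: float, ctc_weight: float = 0.3) -> float:
    """Interpolate CTC and attention-decoder log-probabilities for one
    hypothesis: score = w * ctc + (1 - w) * att.
    The 0.3 default weight is an illustrative assumption."""
    return ctc_weight * ctc_logprob + (1.0 - ctc_weight) * att_logprob

# Example: rank two toy hypotheses under the joint score.
hyps = {"sawasdee": (-2.0, -1.0), "sawadee": (-1.5, -3.0)}
best = max(hyps, key=lambda h: joint_score(*hyps[h]))
print(best)  # "sawasdee": joint score -1.3 beats -2.55
```

In practice the CTC score keeps the beam search monotonic with the audio while the attention decoder provides a stronger language-modeling signal, which is why the two are blended rather than used alone.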