awong-dev's picture
File-tuned on common voice 25 yue dataset. Full run with punctuation removed before CER calculation. LR 3e-6, 3 epochs, cosine schedule
b11baa4 verified