F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching
Paper • 2410.06885 • Published • 47
F5-TTS finetune on all formosan data (ithuan, fb ilrdf dict, klokah) without samples only one word, using ipa as input.
Only contains ithuan ami and trv part.
g2p from this repo.
please refer source repo
Base model
SWivid/F5-TTS