Create README.md

Files changed (1) hide show

README.md ADDED Viewed


1	+ This is a fine tuning on the Whisper Base model for using it as a CMU Voice-To-Text model. No conversion is needed, the model outputs the ARPABET symbols space separated instead of plain english text. It was trained using a datasest with over 100 hours of meeting recordings of conversational english.
2	+
3	+ The model is still in development. Got a WER of 26, which is pretty good for just 100 hours of audio. But it could improve if more training data is provided. Nevertheless, this is the base version of Whisper which has limited precision. Probably something like small or medium will be enough for this kind of application.