cpercivati committed on
Commit 05e5b1c · verified · 1 Parent(s): 79210d0

Create README.md

Files changed (1)
  1. README.md +3 -0
README.md ADDED
@@ -0,0 +1,3 @@
+ This is a fine-tune of the Whisper Base model for use as a CMU-style voice-to-text model. No conversion is needed: the model outputs space-separated ARPABET symbols instead of plain English text. It was trained on a dataset of over 100 hours of conversational English meeting recordings.
+
+ The model is still in development. It reached a WER of 26, which is pretty good for just 100 hours of audio, and it could improve with more training data. Keep in mind that this is the base version of Whisper, which has limited precision. A larger variant such as small or medium will probably be enough for this kind of application.
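
As a rough illustration of the WER figure mentioned in the README, here is a minimal sketch of how an error rate could be computed over space-separated ARPABET output. It assumes each ARPABET symbol counts as one "word"; the README does not say how its WER was actually measured, and the function names and example transcripts below are hypothetical.

```python
def tokenize(transcript: str) -> list[str]:
    """Split a space-separated ARPABET transcript into symbols."""
    return transcript.split()


def edit_distance(ref: list[str], hyp: list[str]) -> int:
    """Levenshtein distance between two token sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # substitution or match
            ))
        prev = curr
    return prev[-1]


def error_rate(reference: str, hypothesis: str) -> float:
    """Token-level error rate as a percentage of the reference length."""
    ref, hyp = tokenize(reference), tokenize(hypothesis)
    if not ref:
        raise ValueError("reference transcript is empty")
    return 100.0 * edit_distance(ref, hyp) / len(ref)


# Hypothetical transcripts: one substituted symbol out of four.
print(error_rate("HH AH0 L OW1", "HH EH0 L OW1"))  # 25.0
```

Because the output is already tokenized by spaces, no phoneme conversion step is needed before scoring; a score computed this way is closer to a phoneme error rate than to a conventional word-level WER.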