mozilla-foundation/common_voice_17_0
Updated • 6.05k • 18
This is a fine-tuned model is using the OpenAI/whisper-small - 242M parameters.
The fine-tuned model uses a native Telugu dataset with a set of different audio files with various dialects/accents of the telugu language.
Being a small version, the model takes 1 hour and 15 minutes to train.
Steps to download this model locally on your respective system:
Base model
openai/whisper-small