Instructions to use mitchelldehaven/whisper-large-v2-ru with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use mitchelldehaven/whisper-large-v2-ru with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("automatic-speech-recognition", model="mitchelldehaven/whisper-large-v2-ru")# Load model directly from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq processor = AutoProcessor.from_pretrained("mitchelldehaven/whisper-large-v2-ru") model = AutoModelForSpeechSeq2Seq.from_pretrained("mitchelldehaven/whisper-large-v2-ru") - Notebooks
- Google Colab
- Kaggle
Whisper model finetuned using audio data from Open STT Russian Dataset (https://github.com/snakers4/open_stt).
There is a differences in tokenization of source data (in our data normalization process, we replace punctucation with "" rather than Whisper's " "). This mismatch leads to a slight degradation on CommonVoice.
- Downloads last month
- 38
Evaluation results
- WER on mozilla-foundation/common_voice_11_0test set self-reported7.730