metadata
base_model: openai/whisper-tiny
datasets:
- common_voice_17_0
language: ba
library_name: transformers
license: apache-2.0
model-index:
- name: Finetuned openai/whisper-tiny on Bashkir
  results:
  - task:
      type: automatic-speech-recognition
      name: Speech-to-Text
    dataset:
      name: Common Voice (Bashkir)
      type: common_voice
    metrics:
    - type: wer
      value: 102.544
This model is openai/whisper-tiny finetuned on 133675 Bashkir training audio samples from mozilla-foundation/common_voice_17_0.
This model was created from the Mozilla.ai Blueprint: speech-to-text-finetune.
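
A minimal sketch of how a checkpoint like this one might be loaded for transcription with the `transformers` library named in the metadata. The repository id is a hypothetical placeholder (this card does not state where the model is hosted), and the helper names `build_transcriber` and `transcribe` are illustrative only. Passing `language` through `generate_kwargs` is an assumption about how you may want to steer decoding, not something this card specifies. Reading audio from a file path typically also requires `ffmpeg` to be installed.

```python
# Hypothetical placeholder: replace with the actual Hub repo id or a local path.
MODEL_ID = "<your-namespace>/<this-model-repo>"


def build_transcriber(model_id=MODEL_ID):
    """Create an automatic-speech-recognition pipeline for the given checkpoint."""
    # Imported lazily so the module can be loaded without transformers installed.
    from transformers import pipeline

    return pipeline("automatic-speech-recognition", model=model_id)


def transcribe(audio_path, transcriber):
    """Transcribe one audio file and return the recognized text."""
    result = transcriber(
        audio_path,
        # Assumption: force Bashkir transcription instead of language auto-detection.
        generate_kwargs={"language": "bashkir", "task": "transcribe"},
    )
    return result["text"]


if __name__ == "__main__":
    asr = build_transcriber()
    print(transcribe("sample.wav", asr))  # "sample.wav" is a hypothetical file
```

Separating pipeline construction from the call keeps the model load (the slow step) out of the per-file path, so one transcriber can be reused across many recordings.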
Evaluation results on 14513 audio samples of Bashkir:
Baseline model (before finetuning) on Bashkir:
- Word Error Rate (Normalized): 150.765
- Word Error Rate (Orthographic): 127.801
- Character Error Rate (Normalized): 116.224
- Character Error Rate (Orthographic): 115.431
- Loss: 5.831
Finetuned model (after finetuning) on Bashkir:
- Word Error Rate (Normalized): 102.544
- Word Error Rate (Orthographic): 103.049
- Character Error Rate (Normalized): 89.277
- Character Error Rate (Orthographic): 89.293
- Loss: 1.441
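
Error rates above 100 can look like a mistake, but they are possible: word and character error rates divide the number of edits (substitutions, deletions, insertions) by the length of the reference, so a hypothesis with many inserted tokens can accumulate more errors than the reference has tokens. The sketch below illustrates this with a plain Levenshtein distance. It is not the evaluation code used to produce the numbers above, it skips whatever text normalization the "Normalized" metrics apply, and the example sentences and function names are made up for illustration.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two token sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(
                min(
                    prev[j] + 1,         # deletion of a reference token
                    curr[j - 1] + 1,     # insertion of a hypothesis token
                    prev[j - 1] + cost,  # substitution, or match when cost is 0
                )
            )
        prev = curr
    return prev[-1]


def error_rate(reference, hypothesis, unit="word"):
    """Error rate in percent; unit is 'word' (WER) or 'char' (CER)."""
    ref = reference.split() if unit == "word" else list(reference)
    hyp = hypothesis.split() if unit == "word" else list(hypothesis)
    if not ref:
        raise ValueError("reference must not be empty")
    return 100.0 * edit_distance(ref, hyp) / len(ref)


def relative_reduction(before, after):
    """Relative reduction in percent when a metric drops from before to after."""
    return 100.0 * (before - after) / before


if __name__ == "__main__":
    # Three inserted words against a two-word reference: 3 / 2 = 150%.
    print(error_rate("hello world", "hello there big wide world"))
    # Normalized WER values reported above: baseline 150.765, finetuned 102.544.
    print(round(relative_reduction(150.765, 102.544), 2))
```

Applied to the reported normalized WER, the drop from 150.765 to 102.544 is a relative reduction of about 32%.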