| base_model: openai/whisper-tiny | |
| datasets: | |
| - common_voice_17_0 | |
| language: ba | |
| library_name: transformers | |
| license: apache-2.0 | |
| model-index: | |
| - name: Finetuned openai/whisper-tiny on Bashkir | |
| results: | |
| - task: | |
| type: automatic-speech-recognition | |
| name: Speech-to-Text | |
| dataset: | |
| name: Common Voice (Bashkir) | |
| type: common_voice | |
| metrics: | |
| - type: wer | |
| value: 102.544 | |
| # Finetuned openai/whisper-tiny on 133675 Bashkir training audio samples from mozilla-foundation/common_voice_17_0. | |
| This model was created from the Mozilla.ai Blueprint: | |
| [speech-to-text-finetune](https://github.com/mozilla-ai/speech-to-text-finetune). | |
| ## Evaluation results on 14513 audio samples of Bashkir: | |
| ### Baseline model (before finetuning) on Bashkir | |
| - Word Error Rate (Normalized): 150.765 | |
| - Word Error Rate (Orthographic): 127.801 | |
| - Character Error Rate (Normalized): 116.224 | |
| - Character Error Rate (Orthographic): 115.431 | |
| - Loss: 5.831 | |
| ### Finetuned model (after finetuning) on Bashkir | |
| - Word Error Rate (Normalized): 102.544 | |
| - Word Error Rate (Orthographic): 103.049 | |
| - Character Error Rate (Normalized): 89.277 | |
| - Character Error Rate (Orthographic): 89.293 | |
| - Loss: 1.441 | |