whisper-tiny-ba / README.md
stdbug's picture
Upload README.md with huggingface_hub
37b942c verified
metadata
base_model: openai/whisper-tiny
datasets:
  - common_voice_17_0
language: ba
library_name: transformers
license: apache-2.0
model-index:
  - name: Finetuned openai/whisper-tiny on Bashkir
    results:
      - task:
          type: automatic-speech-recognition
          name: Speech-to-Text
        dataset:
          name: Common Voice (Bashkir)
          type: common_voice
        metrics:
          - type: wer
            value: 102.544

Finetuned openai/whisper-tiny on 133675 Bashkir training audio samples from mozilla-foundation/common_voice_17_0.

This model was created from the Mozilla.ai Blueprint: speech-to-text-finetune.

Evaluation results on 14513 audio samples of Bashkir:

Baseline model (before finetuning) on Bashkir

  • Word Error Rate (Normalized): 150.765
  • Word Error Rate (Orthographic): 127.801
  • Character Error Rate (Normalized): 116.224
  • Character Error Rate (Orthographic): 115.431
  • Loss: 5.831

Finetuned model (after finetuning) on Bashkir

  • Word Error Rate (Normalized): 102.544
  • Word Error Rate (Orthographic): 103.049
  • Character Error Rate (Normalized): 89.277
  • Character Error Rate (Orthographic): 89.293
  • Loss: 1.441