Update README.md
README.md
CHANGED
|
@@ -1,12 +1,44 @@
|
|
| 1 |
---
|
| 2 |
-
|
| 3 |
-
- ur
|
| 4 |
license: apache-2.0
|
| 5 |
base_model: openai/whisper-tiny
|
| 6 |
tags:
|
| 7 |
-
-
|
| 8 |
datasets:
|
| 9 |
- mozilla-foundation/common_voice_17_0
|
| 10 |
pipeline_tag: automatic-speech-recognition
|
| 11 |
---
|
| 12 |
|
|
@@ -18,6 +50,8 @@ should probably proofread and complete it, then remove this comment. -->
|
|
| 18 |
|
| 19 |
This model is a fine-tuned version of [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny) on the Common Voice 17.0 dataset.
|
| 20 |
|
| 21 |
## Model description
|
| 22 |
|
| 23 |
Whisper Tiny Urdu ASR Model
|
|
@@ -32,6 +66,17 @@ Despite being the smallest variant in its family, this model achieves Good perfo
|
|
| 32 |
|
| 33 |
This model is particularly suited to edge devices with limited computational resources. It can also be converted with the CTranslate2 library into a format that faster-whisper can load, which allows faster inference on low-power hardware.
|
| 34 |
|
| 35 |
### Training hyperparameters
|
| 36 |
|
| 37 |
The following hyperparameters were used during training:
|
|
|
|
| 1 |
---
|
| 2 |
+
library_name: transformers
|
|
|
|
| 3 |
license: apache-2.0
|
| 4 |
base_model: openai/whisper-tiny
|
| 5 |
tags:
|
| 6 |
+
- automatic-speech-recognition
|
| 7 |
+
- whisper
|
| 8 |
+
- urdu
|
| 9 |
+
- mozilla-foundation/common_voice_17_0
|
| 10 |
datasets:
|
| 11 |
- mozilla-foundation/common_voice_17_0
|
| 12 |
+
metrics:
|
| 13 |
+
- wer
|
| 14 |
+
- cer
|
| 15 |
+
- bleu
|
| 16 |
+
- chrf
|
| 17 |
+
model-index:
|
| 18 |
+
- name: whisper-tiny-urdu
|
| 19 |
+
results:
|
| 20 |
+
- task:
|
| 21 |
+
type: automatic-speech-recognition
|
| 22 |
+
name: Automatic Speech Recognition
|
| 23 |
+
dataset:
|
| 24 |
+
name: CSALT Voice Dataset
|
| 25 |
+
type: urdu-asr/csalt-voice
|
| 26 |
+
split: validation
|
| 27 |
+
metrics:
|
| 28 |
+
- type: wer
|
| 29 |
+
value: 64.961
|
| 30 |
+
name: WER
|
| 31 |
+
- type: cer
|
| 32 |
+
value: 42.488
|
| 33 |
+
name: CER
|
| 34 |
+
- type: bleu
|
| 35 |
+
value: 16.710
|
| 36 |
+
name: BLEU
|
| 37 |
+
- type: chrf
|
| 38 |
+
value: 43.545
|
| 39 |
+
name: ChrF
|
| 40 |
+
language:
|
| 41 |
+
- ur
|
| 42 |
pipeline_tag: automatic-speech-recognition
|
| 43 |
---
|
| 44 |
|
|
|
|
| 50 |
|
| 51 |
This model is a fine-tuned version of [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny) on the Common Voice 17.0 dataset.
|
| 52 |
|
| 53 |
+
>👉 Review the testing script: [Testing Urdu Whisper tiny](https://www.kaggle.com/code/kingabzpro/testing-urdu-whisper-tiny?scriptVersionId=250300591)
|
| 54 |
+
|
| 55 |
## Model description
|
| 56 |
|
| 57 |
Whisper Tiny Urdu ASR Model
|
|
|
|
| 66 |
|
| 67 |
This model is particularly suited to edge devices with limited computational resources. It can also be converted with the CTranslate2 library into a format that faster-whisper can load, which allows faster inference on low-power hardware.
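
The conversion mentioned above could look roughly like the sketch below, which wraps CTranslate2's `ct2-transformers-converter` command-line tool. The repo id `your-namespace/whisper-tiny-urdu`, the output directory name, and the `int8` quantization choice are placeholders rather than values from this model card, and the sketch assumes the model repo ships `tokenizer.json` and `preprocessor_config.json`.

```python
import shlex
import subprocess

# Hypothetical repo id: this card only names the model "whisper-tiny-urdu".
MODEL_ID = "your-namespace/whisper-tiny-urdu"


def build_convert_command(model_id, output_dir, quantization="int8"):
    """Build the argument list for CTranslate2's Transformers converter CLI."""
    return [
        "ct2-transformers-converter",
        "--model", model_id,
        "--output_dir", output_dir,
        # faster-whisper expects the tokenizer and feature-extractor configs
        # next to the converted weights (assumed to exist in the repo).
        "--copy_files", "tokenizer.json", "preprocessor_config.json",
        "--quantization", quantization,
    ]


def convert(model_id=MODEL_ID, output_dir="whisper-tiny-urdu-ct2"):
    """Run the conversion; needs `pip install ctranslate2 transformers`."""
    cmd = build_convert_command(model_id, output_dir)
    print("Running:", shlex.join(cmd))
    subprocess.run(cmd, check=True)
```

Calling `convert()` writes the converted model to the output directory, which can then be passed to `faster_whisper.WhisperModel` in place of a model name. Whether `int8` or `float16` is the better quantization depends on the target device, so treat the default here as a starting point.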
|
| 68 |
|
| 69 |
+
## Evaluation
|
| 70 |
+
|
| 71 |
+
Urdu ASR evaluation on the validation split of [urdu-asr/csalt-voice](https://huggingface.co/datasets/urdu-asr/csalt-voice).
|
| 72 |
+
|
| 73 |
+
| Metric | Value | Description |
|
| 74 |
+
|--------|----------|------------------------------------|
|
| 75 |
+
| **WER** | 64.961% | Word Error Rate (lower is better) |
|
| 76 |
+
| **CER** | 42.488% | Character Error Rate (lower is better) |
|
| 77 |
+
| **BLEU** | 16.710 | BLEU score (higher is better) |
|
| 78 |
+
| **ChrF** | 43.545 | Character n-gram F-score (higher is better) |
|
| 79 |
+
|
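
For readers unfamiliar with the two error rates in the table, here is a minimal pure-Python sketch of how corpus-level WER and CER are commonly computed from an edit distance. It is not the evaluation script used for the numbers above: the function names are illustrative, no text normalization is applied, and CER here counts spaces as characters, so results from other tools may differ.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (row-by-row DP)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion: reference unit missing
                curr[j - 1] + 1,     # insertion: extra hypothesis unit
                prev[j - 1] + cost,  # substitution or exact match
            ))
        prev = curr
    return prev[-1]


def error_rate(references, hypotheses, unit="word"):
    """Corpus-level WER (unit='word') or CER (unit='char'), in percent."""
    split = (lambda s: s.split()) if unit == "word" else list
    errors = total = 0
    for ref, hyp in zip(references, hypotheses):
        ref_units, hyp_units = split(ref), split(hyp)
        errors += edit_distance(ref_units, hyp_units)
        total += len(ref_units)
    return 100.0 * errors / total


if __name__ == "__main__":
    refs = ["the cat sat"]
    hyps = ["the cat sit"]
    print(f"WER: {error_rate(refs, hyps, 'word'):.3f}%")
    print(f"CER: {error_rate(refs, hyps, 'char'):.3f}%")
```

Errors are summed over the whole corpus before dividing by the total reference length, which is the usual convention and weights long utterances more than averaging per-utterance rates would.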
| 80 |
### Training hyperparameters
|
| 81 |
|
| 82 |
The following hyperparameters were used during training:
|