kingabzpro commited on
Commit
cdb8a91
·
verified ·
1 Parent(s): cc16ad3

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +48 -3
README.md CHANGED
@@ -1,12 +1,44 @@
1
  ---
2
- language:
3
- - ur
4
  license: apache-2.0
5
  base_model: openai/whisper-tiny
6
  tags:
7
- - generated_from_trainer
 
 
 
8
  datasets:
9
  - mozilla-foundation/common_voice_17_0
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
10
  pipeline_tag: automatic-speech-recognition
11
  ---
12
 
@@ -18,6 +50,8 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  This model is a fine-tuned version of [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny) on the Common Voice 17.0 dataset.
20
 
 
 
21
  ## Model description
22
 
23
  Whisper Tiny Urdu ASR Model
@@ -32,6 +66,17 @@ Despite being the smallest variant in its family, this model achieves Good perfo
32
 
33
  This model is particularly suited for applications on edge devices with limited computational resources. Additionally, it can be converted to a FasterWhisper model using the CTranslate2 library, allowing for even faster inference on devices with lower processing power.
34
 
 
 
 
 
 
 
 
 
 
 
 
35
  ### Training hyperparameters
36
 
37
  The following hyperparameters were used during training:
 
1
  ---
2
+ library_name: transformers
 
3
  license: apache-2.0
4
  base_model: openai/whisper-tiny
5
  tags:
6
+ - automatic-speech-recognition
7
+ - whisper
8
+ - urdu
9
+ - mozilla-foundation/common_voice_17_0
10
  datasets:
11
  - mozilla-foundation/common_voice_17_0
12
+ metrics:
13
+ - wer
14
+ - cer
15
+ - bleu
16
+ - chrf
17
+ model-index:
18
+ - name: whisper-tiny-urdu
19
+ results:
20
+ - task:
21
+ type: automatic-speech-recognition
22
+ name: Automatic Speech Recognition
23
+ dataset:
24
+ name: CSALT Voice Dataset
25
+ type: urdu-asr/csalt-voice
26
+ split: validation
27
+ metrics:
28
+ - type: wer
29
+ value: 64.961
30
+ name: WER
31
+ - type: cer
32
+ value: 42.488
33
+ name: CER
34
+ - type: bleu
35
+ value: 16.710
36
+ name: BLEU
37
+ - type: chrf
38
+ value: 43.545
39
+ name: ChrF
40
+ language:
41
+ - ur
42
  pipeline_tag: automatic-speech-recognition
43
  ---
44
 
 
50
 
51
  This model is a fine-tuned version of [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny) on the Common Voice 17.0 dataset.
52
 
53
+ >👉 Review the testing script: [Testing Urdu Whisper tiny](https://www.kaggle.com/code/kingabzpro/testing-urdu-whisper-tiny?scriptVersionId=250300591)
54
+
55
  ## Model description
56
 
57
  Whisper Tiny Urdu ASR Model
 
66
 
67
  This model is particularly suited for applications on edge devices with limited computational resources. Additionally, it can be converted to a FasterWhisper model using the CTranslate2 library, allowing for even faster inference on devices with lower processing power.
68
 
69
+ ## Evaluation
70
+
71
+ Urdu ASR Evaluation on [urdu-asr/csalt-voice](https://huggingface.co/datasets/urdu-asr/csalt-voice) (Validation Split).
72
+
73
+ | Metric | Value | Description |
74
+ |--------|----------|------------------------------------|
75
+ | **WER** | 64.961% | Word Error Rate (lower is better) |
76
+ | **CER** | 42.488% | Character Error Rate |
77
+ | **BLEU** | 16.710% | BLEU Score (higher is better) |
78
+ | **ChrF** | 43.545 | Character n-gram F-score |
79
+
80
  ### Training hyperparameters
81
 
82
  The following hyperparameters were used during training: