Update README.md
Browse files
README.md
CHANGED
|
@@ -80,10 +80,10 @@ should probably proofread and complete it, then remove this comment. -->
|
|
| 80 |
|
| 81 |
# Whisper Medium ID - FLEURS-CV-LBV - Augmented
|
| 82 |
|
| 83 |
-
This model is a fine-tuned version of [openai/whisper-medium](https://huggingface.co/openai/whisper-medium) on the:
|
| 84 |
- [mozilla-foundation/common_voice_11_0](https://huggingface.co/datasets/mozilla-foundation/common_voice_11_0)
|
| 85 |
- [google/fleurs](https://huggingface.co/datasets/google/fleurs)
|
| 86 |
-
- [indonesian-nlp/librivox-indonesia](https://huggingface.co/datasets/indonesian-nlp/librivox-indonesia)
|
| 87 |
|
| 88 |
It achieves the following results on the evaluation set (Common Voice 11.0):
|
| 89 |
- Loss: 0.2788
|
|
@@ -100,10 +100,20 @@ More information needed
|
|
| 100 |
|
| 101 |
## Training and evaluation data
|
| 102 |
|
| 103 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 104 |
|
| 105 |
## Training procedure
|
| 106 |
|
|
|
|
|
|
|
| 107 |
### Training hyperparameters
|
| 108 |
|
| 109 |
The following hyperparameters were used during training:
|
|
|
|
| 80 |
|
| 81 |
# Whisper Medium ID - FLEURS-CV-LBV - Augmented
|
| 82 |
|
| 83 |
+
This model is a fine-tuned version of [openai/whisper-medium](https://huggingface.co/openai/whisper-medium) on the following datasets:
|
| 84 |
- [mozilla-foundation/common_voice_11_0](https://huggingface.co/datasets/mozilla-foundation/common_voice_11_0)
|
| 85 |
- [google/fleurs](https://huggingface.co/datasets/google/fleurs)
|
| 86 |
+
- [indonesian-nlp/librivox-indonesia](https://huggingface.co/datasets/indonesian-nlp/librivox-indonesia)
|
| 87 |
|
| 88 |
It achieves the following results on the evaluation set (Common Voice 11.0):
|
| 89 |
- Loss: 0.2788
|
|
|
|
| 100 |
|
| 101 |
## Training and evaluation data
|
| 102 |
|
| 103 |
+
Training:
|
| 104 |
+
- [mozilla-foundation/common_voice_11_0](https://huggingface.co/datasets/mozilla-foundation/common_voice_11_0) (train+validation)
|
| 105 |
+
- [google/fleurs](https://huggingface.co/datasets/google/fleurs) (train+validation)
|
| 106 |
+
- [indonesian-nlp/librivox-indonesia](https://huggingface.co/datasets/indonesian-nlp/librivox-indonesia) (train)
|
| 107 |
+
|
| 108 |
+
Evaluation:
|
| 109 |
+
- [mozilla-foundation/common_voice_11_0](https://huggingface.co/datasets/mozilla-foundation/common_voice_11_0) (test)
|
| 110 |
+
- [google/fleurs](https://huggingface.co/datasets/google/fleurs) (test)
|
| 111 |
+
- [indonesian-nlp/librivox-indonesia](https://huggingface.co/datasets/indonesian-nlp/librivox-indonesia) (test)
|
| 112 |
|
| 113 |
## Training procedure
|
| 114 |
|
| 115 |
+
Datasets were augmented on-the-fly using [audiomentations](https://github.com/iver56/audiomentations) via PitchShift, AddGaussianNoise and TimeStretch transformations at `p=0.3`.
|
| 116 |
+
|
| 117 |
### Training hyperparameters
|
| 118 |
|
| 119 |
The following hyperparameters were used during training:
|