mapau/whisper-small-hr

Automatic Speech Recognition

Generated from Trainer

Eval Results (legacy)

mapau committed on Jun 7, 2024

Commit 20da83b · verified · 1 parent: 90fa0cb

End of training

Files changed (2)

README.md +12 -11
model.safetensors +1 -1

README.md CHANGED

@@ -24,7 +24,7 @@ model-index:
     metrics:
     - name: Wer
       type: wer
-      value: 25.56675062972292
+      value: 25.440806045340054
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -34,8 +34,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the parlaSmall_subset dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.5479
-- Wer: 25.5668
+- Loss: 0.5739
+- Wer: 25.4408
 ## Model description
@@ -55,24 +55,25 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
-- train_batch_size: 1
+- train_batch_size: 2
 - eval_batch_size: 8
 - seed: 42
-- gradient_accumulation_steps: 16
+- gradient_accumulation_steps: 8
 - total_train_batch_size: 16
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
-- training_steps: 3000
+- training_steps: 4000
 - mixed_precision_training: Native AMP
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Wer     |
-|:-------------:|:-----:|:----:|:---------------:|:-------:|
-| 0.0003        | 32.52 | 1000 | 0.5035          | 26.3224 |
-| 0.0001        | 65.04 | 2000 | 0.5380          | 25.8186 |
-| 0.0001        | 97.56 | 3000 | 0.5479          | 25.5668 |
+| Training Loss | Epoch  | Step | Validation Loss | Wer     |
+|:-------------:|:------:|:----:|:---------------:|:-------:|
+| 0.0003        | 32.52  | 1000 | 0.5073          | 25.0630 |
+| 0.0001        | 65.04  | 2000 | 0.5470          | 25.5668 |
+| 0.0001        | 97.56  | 3000 | 0.5668          | 25.0630 |
+| 0.0           | 130.08 | 4000 | 0.5739          | 25.4408 |
 ### Framework versions
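
The hyperparameter change in this commit swaps `train_batch_size` and `gradient_accumulation_steps` (1 × 16 becomes 2 × 8) while `total_train_batch_size` stays at 16, and extends `training_steps` from 3000 to 4000. The arithmetic can be sketched as below. This is a minimal illustration, not the Trainer's own code: the helper names `effective_batch_size` and `linear_lr` are hypothetical, a single device is assumed because the card lists no device count, and the schedule is assumed to be the usual linear warmup followed by linear decay to zero that `lr_scheduler_type: linear` refers to.

```python
def effective_batch_size(per_device_batch, grad_accum_steps, num_devices=1):
    """Samples contributing to one optimizer step.

    num_devices=1 is an assumption; the model card does not state it.
    """
    return per_device_batch * grad_accum_steps * num_devices


def linear_lr(step, base_lr=1e-5, warmup_steps=500, total_steps=4000):
    """Learning rate at a given optimizer step.

    Assumed shape: linear ramp from 0 to base_lr over warmup_steps,
    then linear decay back to 0 at total_steps.
    """
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Both runs in the diff use the same effective batch size.
old_run = effective_batch_size(per_device_batch=1, grad_accum_steps=16)
new_run = effective_batch_size(per_device_batch=2, grad_accum_steps=8)
print(old_run, new_run)  # 16 16

# Learning rate at a few points of the 4000-step run.
for step in (0, 250, 500, 2250, 4000):
    print(step, linear_lr(step))
```

Under these assumptions the two runs take optimizer steps over the same number of samples, so the larger per-device batch mainly trades memory for fewer accumulation passes. Extending `total_steps` from 3000 to 4000 also stretches the decay phase, so the learning rate at any given step after warmup is higher than in the earlier run.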

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:aca1d7e4c78a89c7bd393a96793059b60b5233e0b9d6ba0bdda86cef635d93fe
+oid sha256:21785a90c2ff94068d372db58168b2776ecf312b8fb8d1912318f300170d3578
 size 966995080
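
The `model.safetensors` entry is a Git LFS pointer rather than the weights themselves: only the `oid` changes, while `size` stays at 966995080 bytes. The following sketch shows one way such a pointer could be parsed and a downloaded file checked against it. The helper names `parse_lfs_pointer` and `file_matches_pointer` are hypothetical, and the check assumes the `sha256` algorithm named in the pointer.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def file_matches_pointer(path, pointer, chunk_size=1 << 20):
    """Stream a file and compare its size and SHA-256 digest to the pointer.

    Reading in chunks avoids loading a ~1 GB checkpoint into memory.
    """
    hasher = hashlib.sha256()
    size = 0
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]


old = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:aca1d7e4c78a89c7bd393a96793059b60b5233e0b9d6ba0bdda86cef635d93fe
size 966995080
""")
new = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:21785a90c2ff94068d372db58168b2776ecf312b8fb8d1912318f300170d3578
size 966995080
""")

# Same byte size, different content hash.
print(old["size"] == new["size"], old["digest"] == new["digest"])  # True False
```

After downloading the checkpoint, `file_matches_pointer("model.safetensors", new)` would be expected to return `True` only for the file from this commit, assuming the local path points at the real weights rather than at the pointer file.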