gglabs/stt-turbo-multilingual-v0.0.10

Files changed (4)

README.md +21 -11
adapter_model.safetensors +1 -1
runs/May23_11-15-26_ubuntu/events.out.tfevents.1747998928.ubuntu.2088417.0 +3 -0
training_args.bin +1 -1

README.md CHANGED

@@ -20,8 +20,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [openai/whisper-large-v3-turbo](https://huggingface.co/openai/whisper-large-v3-turbo) on the custom_multilingual dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3849
-- Wer: 25.9259
 ## Model description
@@ -40,22 +40,32 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.001
-- train_batch_size: 4
-- eval_batch_size: 4
 - seed: 42
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- lr_scheduler_warmup_steps: 5
-- training_steps: 20
 - mixed_precision_training: Native AMP
 ### Training results
-| Training Loss | Epoch  | Step | Validation Loss | Wer     |
-|:-------------:|:------:|:----:|:---------------:|:-------:|
-| 0.6954        | 0.1754 | 10   | 0.4176          | 33.3333 |
-| 0.2358        | 0.3509 | 20   | 0.3849          | 25.9259 |
 ### Framework versions

 This model is a fine-tuned version of [openai/whisper-large-v3-turbo](https://huggingface.co/openai/whisper-large-v3-turbo) on the custom_multilingual dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.3980
+- Wer: 15.3116
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 0.0005
+- train_batch_size: 16
+- eval_batch_size: 32
 - seed: 42
+- gradient_accumulation_steps: 4
+- total_train_batch_size: 64
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 200
+- training_steps: 5000
 - mixed_precision_training: Native AMP
 ### Training results
+| Training Loss | Epoch   | Step | Validation Loss | Wer     |
+|:-------------:|:-------:|:----:|:---------------:|:-------:|
+| 0.3199        | 1.0971  | 500  | 0.3644          | 18.0913 |
+| 0.2659        | 2.1942  | 1000 | 0.3579          | 17.1415 |
+| 0.2321        | 3.2913  | 1500 | 0.3579          | 16.6319 |
+| 0.2101        | 4.3884  | 2000 | 0.3635          | 16.3539 |
+| 0.1901        | 5.4855  | 2500 | 0.3677          | 15.7470 |
+| 0.1673        | 6.5826  | 3000 | 0.3748          | 15.4413 |
+| 0.146         | 7.6796  | 3500 | 0.3813          | 15.4691 |
+| 0.13          | 8.7767  | 4000 | 0.3879          | 15.6266 |
+| 0.1206        | 9.8738  | 4500 | 0.3950          | 15.4459 |
+| 0.1091        | 10.9709 | 5000 | 0.3980          | 15.3116 |
 ### Framework versions
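
A quick sanity check on the updated README values, as a minimal sketch. It assumes the usual Trainer convention that the total train batch size is the per-device batch size times the gradient accumulation steps times the number of devices; the device count (`num_devices = 1`) is an assumption, since the README does not state it. The rows are copied from the new training results table.

```python
# Values copied from the updated README in this commit.
train_batch_size = 16
gradient_accumulation_steps = 4
num_devices = 1  # assumption: single device; not stated in the README

# Effective batch size per optimizer step under the assumed convention.
total_train_batch_size = train_batch_size * gradient_accumulation_steps * num_devices

# (step, validation_loss, wer) rows from the new training results table.
rows = [
    (500, 0.3644, 18.0913),
    (1000, 0.3579, 17.1415),
    (1500, 0.3579, 16.6319),
    (2000, 0.3635, 16.3539),
    (2500, 0.3677, 15.7470),
    (3000, 0.3748, 15.4413),
    (3500, 0.3813, 15.4691),
    (4000, 0.3879, 15.6266),
    (4500, 0.3950, 15.4459),
    (5000, 0.3980, 15.3116),
]

# min() returns the first minimal element, so ties resolve to the earliest step.
best_loss_row = min(rows, key=lambda r: r[1])
best_wer_row = min(rows, key=lambda r: r[2])

print("total_train_batch_size:", total_train_batch_size)
print("lowest validation loss at step", best_loss_row[0])
print("lowest WER at step", best_wer_row[0])
```

Under these assumptions the computed total matches the `total_train_batch_size: 64` line in the README. The table also shows validation loss bottoming out early (steps 1000 and 1500) while WER keeps improving through step 5000, so which checkpoint counts as "best" depends on the metric chosen.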

adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9e250308d65adb63715682609aad8380931d6a6e3eed668fc2cf884641d6895d
 size 78677456

 version https://git-lfs.github.com/spec/v1
+oid sha256:30c0d6d7a16b36b1af07c24d22195c702e8f39cb3baf65c0cdde8080ffa2f2d3
 size 78677456
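
The two sides above are Git LFS pointer files, not the weights themselves. A small sketch of how such pointers could be compared follows; `parse_lfs_pointer` is a hypothetical helper written for illustration, not part of any Git LFS tooling, and it assumes each pointer line is a key, a single space, then the value.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Split each 'key value' line of a Git LFS pointer into a dict entry."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    return fields


# Old and new pointers for adapter_model.safetensors, copied from the diff.
old = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:9e250308d65adb63715682609aad8380931d6a6e3eed668fc2cf884641d6895d\n"
    "size 78677456\n"
)
new = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:30c0d6d7a16b36b1af07c24d22195c702e8f39cb3baf65c0cdde8080ffa2f2d3\n"
    "size 78677456\n"
)

# The content hash changed, while the byte size stayed the same.
content_changed = old["oid"] != new["oid"]
same_size = int(old["size"]) == int(new["size"])
print("content changed:", content_changed, "| same size:", same_size)
```

An identical size with a different `oid` is what one would expect when the adapter keeps the same tensor shapes and only the weight values change, though the diff alone does not confirm that.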

runs/May23_11-15-26_ubuntu/events.out.tfevents.1747998928.ubuntu.2088417.0 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:103209f726da1817ae0c4cf7a1ba70022df77533b9f8beb9958eb0d1d9f078c8
+size 20470

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:cbf1d1b72d537e72b57798bbbbc8dd3efa369137a34584d722d696526f8e4e06
 size 5496

 version https://git-lfs.github.com/spec/v1
+oid sha256:8373683a2465563929a973e2ec6b94a5075a0414524cfa447c6b53e234a32892
 size 5496