Model save

Browse files

Files changed (5) hide show

README.md +23 -18
final_model/model.safetensors +1 -1
final_model/training_args.bin +1 -1
model.safetensors +1 -1
runs/Jan24_23-42-43_b212ad9c366f/events.out.tfevents.1769303596.b212ad9c366f.38.4 +3 -0

README.md CHANGED Viewed

@@ -18,9 +18,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [microsoft/speecht5_asr](https://huggingface.co/microsoft/speecht5_asr) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1951
-- Wer Ortho: 63.3333
-- Wer: 62.4672
 ## Model description
@@ -39,7 +39,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 1e-05
 - train_batch_size: 2
 - eval_batch_size: 8
 - seed: 42
@@ -47,24 +47,29 @@ The following hyperparameters were used during training:
 - total_train_batch_size: 16
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: cosine
-- lr_scheduler_warmup_steps: 100
-- training_steps: 1000
 - mixed_precision_training: Native AMP
 ### Training results
-| Training Loss | Epoch  | Step | Validation Loss | Wer Ortho | Wer      |
-|:-------------:|:------:|:----:|:---------------:|:---------:|:--------:|
-| 1.7381        | 0.3731 | 100  | 1.4437          | 371.3889  | 213.9108 |
-| 0.8437        | 0.7463 | 200  | 0.5686          | 80.5556   | 81.6273  |
-| 0.4461        | 1.1194 | 300  | 0.3668          | 76.1111   | 77.4278  |
-| 0.3753        | 1.4925 | 400  | 0.2760          | 72.7778   | 74.0157  |
-| 0.3416        | 1.8657 | 500  | 0.2392          | 80.8333   | 84.7769  |
-| 0.2656        | 2.2388 | 600  | 0.2138          | 67.7778   | 67.9790  |
-| 0.2706        | 2.6119 | 700  | 0.2085          | 74.7222   | 77.1654  |
-| 0.2509        | 2.9851 | 800  | 0.1995          | 63.0556   | 62.2047  |
-| 0.2314        | 3.3582 | 900  | 0.1949          | 62.5      | 61.6798  |
-| 0.2806        | 3.7313 | 1000 | 0.1951          | 63.3333   | 62.4672  |
 ### Framework versions

 This model is a fine-tuned version of [microsoft/speecht5_asr](https://huggingface.co/microsoft/speecht5_asr) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.2003
+- Wer Ortho: 62.9526
+- Wer: 59.7855
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 5e-06
 - train_batch_size: 2
 - eval_batch_size: 8
 - seed: 42
 - total_train_batch_size: 16
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: cosine
+- lr_scheduler_warmup_steps: 50
+- training_steps: 1500
 - mixed_precision_training: Native AMP
 ### Training results
+| Training Loss | Epoch  | Step | Validation Loss | Wer      | Wer Ortho |
+|:-------------:|:------:|:----:|:---------------:|:--------:|:---------:|
+| 1.7381        | 0.3731 | 100  | 1.4437          | 213.9108 | 371.3889  |
+| 0.8437        | 0.7463 | 200  | 0.5686          | 81.6273  | 80.5556   |
+| 0.4461        | 1.1194 | 300  | 0.3668          | 77.4278  | 76.1111   |
+| 0.3753        | 1.4925 | 400  | 0.2760          | 74.0157  | 72.7778   |
+| 0.3416        | 1.8657 | 500  | 0.2392          | 84.7769  | 80.8333   |
+| 0.2656        | 2.2388 | 600  | 0.2138          | 67.9790  | 67.7778   |
+| 0.2706        | 2.6119 | 700  | 0.2085          | 77.1654  | 74.7222   |
+| 0.2509        | 2.9851 | 800  | 0.1995          | 62.2047  | 63.0556   |
+| 0.2314        | 3.3582 | 900  | 0.1949          | 61.6798  | 62.5      |
+| 0.2806        | 3.7313 | 1000 | 0.1951          | 62.4672  | 63.3333   |
+| 0.2254        | 4.1045 | 1100 | 0.1912          | 68.6111  | 69.2913   |
+| 0.2674        | 4.4776 | 1200 | 0.1863          | 68.6111  | 69.8163   |
+| 0.301         | 4.8507 | 1300 | 0.1862          | 67.5     | 67.9790   |
+| 0.2354        | 5.2239 | 1400 | 0.1850          | 61.1111  | 59.8425   |
+| 0.2349        | 5.5970 | 1500 | 0.1851          | 67.2222  | 67.7165   |
 ### Framework versions

final_model/model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3dc61cb1446fcbb6002384682ffdd95b1f127033e00738521480a2388356e233
 size 604711248

 version https://git-lfs.github.com/spec/v1
+oid sha256:e97f0f2105b52dd2fb6ccb67620876f0077649b8cfc24a74306dada9954c2c30
 size 604711248

final_model/training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e1870232b40870feeeb9e828e86f936d1b7f61afd2c72c99e07956df5e914e39
 size 5432

 version https://git-lfs.github.com/spec/v1
+oid sha256:e0a2743349e0bef163b7e3dc33ca78b1ee4ab8fa27db594b0364ec3235b2d068
 size 5432

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:545df90da3bca7fa4d96468a34d09d50aa7365598f7822a78be7c1faf444f325
 size 604711248

 version https://git-lfs.github.com/spec/v1
+oid sha256:e97f0f2105b52dd2fb6ccb67620876f0077649b8cfc24a74306dada9954c2c30
 size 604711248

runs/Jan24_23-42-43_b212ad9c366f/events.out.tfevents.1769303596.b212ad9c366f.38.4 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:c66d60c3cfc5c2e316b508605b82f4b21680db3cc1aef03f4ca396150a3d7e8c
+size 459