sil-ai
/

mzr-chapter-audio-dataset-force-aligned-speecht5

@@ -5,18 +5,18 @@ base_model: microsoft/speecht5_tts
 tags:
 - generated_from_trainer
 model-index:
-- name: ykv-chapter-audio-dataset-force-aligned-speecht5
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# ykv-chapter-audio-dataset-force-aligned-speecht5
 This model is a fine-tuned version of [microsoft/speecht5_tts](https://huggingface.co/microsoft/speecht5_tts) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0562
 ## Model description
@@ -51,46 +51,46 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch    | Step  | Validation Loss |
 |:-------------:|:--------:|:-----:|:---------------:|
-| 0.0899        | 12.5016  | 1000  | 0.0604          |
-| 0.0742        | 25.0     | 2000  | 0.0540          |
-| 0.073         | 37.5016  | 3000  | 0.0535          |
-| 0.0733        | 50.0     | 4000  | 0.0561          |
-| 0.0677        | 62.5016  | 5000  | 0.0530          |
-| 0.0621        | 75.0     | 6000  | 0.0522          |
-| 0.0652        | 87.5016  | 7000  | 0.0542          |
-| 0.0609        | 100.0    | 8000  | 0.0527          |
-| 0.0606        | 112.5016 | 9000  | 0.0530          |
-| 0.059         | 125.0    | 10000 | 0.0528          |
-| 0.0542        | 137.5016 | 11000 | 0.0529          |
-| 0.0536        | 150.0    | 12000 | 0.0530          |
-| 0.0543        | 162.5016 | 13000 | 0.0535          |
-| 0.0547        | 175.0    | 14000 | 0.0539          |
-| 0.0533        | 187.5016 | 15000 | 0.0539          |
-| 0.0523        | 200.0    | 16000 | 0.0553          |
-| 0.0506        | 212.5016 | 17000 | 0.0545          |
-| 0.05          | 225.0    | 18000 | 0.0554          |
-| 0.0509        | 237.5016 | 19000 | 0.0544          |
-| 0.0474        | 250.0    | 20000 | 0.0550          |
-| 0.0468        | 262.5016 | 21000 | 0.0548          |
-| 0.0483        | 275.0    | 22000 | 0.0558          |
-| 0.0477        | 287.5016 | 23000 | 0.0553          |
-| 0.0471        | 300.0    | 24000 | 0.0553          |
-| 0.0459        | 312.5016 | 25000 | 0.0559          |
-| 0.0474        | 325.0    | 26000 | 0.0555          |
-| 0.0452        | 337.5016 | 27000 | 0.0561          |
-| 0.0445        | 350.0    | 28000 | 0.0558          |
-| 0.0438        | 362.5016 | 29000 | 0.0560          |
-| 0.0452        | 375.0    | 30000 | 0.0563          |
-| 0.0438        | 387.5016 | 31000 | 0.0560          |
-| 0.0437        | 400.0    | 32000 | 0.0563          |
-| 0.0436        | 412.5016 | 33000 | 0.0563          |
-| 0.0449        | 425.0    | 34000 | 0.0567          |
-| 0.0434        | 437.5016 | 35000 | 0.0564          |
-| 0.0448        | 450.0    | 36000 | 0.0563          |
-| 0.0421        | 462.5016 | 37000 | 0.0562          |
-| 0.0438        | 475.0    | 38000 | 0.0562          |
-| 0.0429        | 487.5016 | 39000 | 0.0562          |
-| 0.043         | 500.0    | 40000 | 0.0562          |
 ### Framework versions

 tags:
 - generated_from_trainer
 model-index:
+- name: mzr-chapter-audio-dataset-force-aligned-speecht5
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# mzr-chapter-audio-dataset-force-aligned-speecht5
 This model is a fine-tuned version of [microsoft/speecht5_tts](https://huggingface.co/microsoft/speecht5_tts) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0448
 ## Model description
 | Training Loss | Epoch    | Step  | Validation Loss |
 |:-------------:|:--------:|:-----:|:---------------:|
+| 0.0626        | 12.5016  | 1000  | 0.0466          |
+| 0.0517        | 25.0     | 2000  | 0.0436          |
+| 0.0524        | 37.5016  | 3000  | 0.0428          |
+| 0.0501        | 50.0     | 4000  | 0.0423          |
+| 0.0464        | 62.5016  | 5000  | 0.0408          |
+| 0.0422        | 75.0     | 6000  | 0.0421          |
+| 0.0479        | 87.5016  | 7000  | 0.0416          |
+| 0.0434        | 100.0    | 8000  | 0.0425          |
+| 0.0421        | 112.5016 | 9000  | 0.0416          |
+| 0.0408        | 125.0    | 10000 | 0.0424          |
+| 0.0376        | 137.5016 | 11000 | 0.0438          |
+| 0.0371        | 150.0    | 12000 | 0.0419          |
+| 0.0377        | 162.5016 | 13000 | 0.0429          |
+| 0.0377        | 175.0    | 14000 | 0.0422          |
+| 0.0371        | 187.5016 | 15000 | 0.0427          |
+| 0.0362        | 200.0    | 16000 | 0.0437          |
+| 0.036         | 212.5016 | 17000 | 0.0438          |
+| 0.0349        | 225.0    | 18000 | 0.0435          |
+| 0.0356        | 237.5016 | 19000 | 0.0438          |
+| 0.034         | 250.0    | 20000 | 0.0434          |
+| 0.033         | 262.5016 | 21000 | 0.0437          |
+| 0.0335        | 275.0    | 22000 | 0.0443          |
+| 0.0329        | 287.5016 | 23000 | 0.0445          |
+| 0.0332        | 300.0    | 24000 | 0.0448          |
+| 0.0324        | 312.5016 | 25000 | 0.0449          |
+| 0.0329        | 325.0    | 26000 | 0.0442          |
+| 0.0317        | 337.5016 | 27000 | 0.0445          |
+| 0.0311        | 350.0    | 28000 | 0.0443          |
+| 0.0304        | 362.5016 | 29000 | 0.0448          |
+| 0.0313        | 375.0    | 30000 | 0.0443          |
+| 0.0308        | 387.5016 | 31000 | 0.0450          |
+| 0.0312        | 400.0    | 32000 | 0.0447          |
+| 0.0307        | 412.5016 | 33000 | 0.0448          |
+| 0.0312        | 425.0    | 34000 | 0.0448          |
+| 0.0304        | 437.5016 | 35000 | 0.0446          |
+| 0.0313        | 450.0    | 36000 | 0.0448          |
+| 0.0298        | 462.5016 | 37000 | 0.0446          |
+| 0.0307        | 475.0    | 38000 | 0.0447          |
+| 0.0302        | 487.5016 | 39000 | 0.0449          |
+| 0.0303        | 500.0    | 40000 | 0.0448          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:17646d2688c94bdfee98b208bb954fc42b215f01643055d5f38cd4b4237e85ce
 size 577899912

 version https://git-lfs.github.com/spec/v1
+oid sha256:5bedb9e5c55c2e4802cd8b8645ebaf64710394fa16b3f645ce52b7c71ce9aa2d
 size 577899912