sil-ai
/

mzr-chapter-audio-dataset-force-aligned-speecht5

+---
+library_name: transformers
+license: mit
+base_model: microsoft/speecht5_tts
+tags:
+- generated_from_trainer
+model-index:
+- name: ykv-chapter-audio-dataset-force-aligned-speecht5
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# ykv-chapter-audio-dataset-force-aligned-speecht5
+This model is a fine-tuned version of [microsoft/speecht5_tts](https://huggingface.co/microsoft/speecht5_tts) on the None dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.0562
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 0.0001
+- train_batch_size: 8
+- eval_batch_size: 8
+- seed: 3407
+- gradient_accumulation_steps: 4
+- total_train_batch_size: 32
+- optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
+- lr_scheduler_type: cosine
+- lr_scheduler_warmup_steps: 4000
+- training_steps: 40000
+- mixed_precision_training: Native AMP
+### Training results
+| Training Loss | Epoch    | Step  | Validation Loss |
+|:-------------:|:--------:|:-----:|:---------------:|
+| 0.0899        | 12.5016  | 1000  | 0.0604          |
+| 0.0742        | 25.0     | 2000  | 0.0540          |
+| 0.073         | 37.5016  | 3000  | 0.0535          |
+| 0.0733        | 50.0     | 4000  | 0.0561          |
+| 0.0677        | 62.5016  | 5000  | 0.0530          |
+| 0.0621        | 75.0     | 6000  | 0.0522          |
+| 0.0652        | 87.5016  | 7000  | 0.0542          |
+| 0.0609        | 100.0    | 8000  | 0.0527          |
+| 0.0606        | 112.5016 | 9000  | 0.0530          |
+| 0.059         | 125.0    | 10000 | 0.0528          |
+| 0.0542        | 137.5016 | 11000 | 0.0529          |
+| 0.0536        | 150.0    | 12000 | 0.0530          |
+| 0.0543        | 162.5016 | 13000 | 0.0535          |
+| 0.0547        | 175.0    | 14000 | 0.0539          |
+| 0.0533        | 187.5016 | 15000 | 0.0539          |
+| 0.0523        | 200.0    | 16000 | 0.0553          |
+| 0.0506        | 212.5016 | 17000 | 0.0545          |
+| 0.05          | 225.0    | 18000 | 0.0554          |
+| 0.0509        | 237.5016 | 19000 | 0.0544          |
+| 0.0474        | 250.0    | 20000 | 0.0550          |
+| 0.0468        | 262.5016 | 21000 | 0.0548          |
+| 0.0483        | 275.0    | 22000 | 0.0558          |
+| 0.0477        | 287.5016 | 23000 | 0.0553          |
+| 0.0471        | 300.0    | 24000 | 0.0553          |
+| 0.0459        | 312.5016 | 25000 | 0.0559          |
+| 0.0474        | 325.0    | 26000 | 0.0555          |
+| 0.0452        | 337.5016 | 27000 | 0.0561          |
+| 0.0445        | 350.0    | 28000 | 0.0558          |
+| 0.0438        | 362.5016 | 29000 | 0.0560          |
+| 0.0452        | 375.0    | 30000 | 0.0563          |
+| 0.0438        | 387.5016 | 31000 | 0.0560          |
+| 0.0437        | 400.0    | 32000 | 0.0563          |
+| 0.0436        | 412.5016 | 33000 | 0.0563          |
+| 0.0449        | 425.0    | 34000 | 0.0567          |
+| 0.0434        | 437.5016 | 35000 | 0.0564          |
+| 0.0448        | 450.0    | 36000 | 0.0563          |
+| 0.0421        | 462.5016 | 37000 | 0.0562          |
+| 0.0438        | 475.0    | 38000 | 0.0562          |
+| 0.0429        | 487.5016 | 39000 | 0.0562          |
+| 0.043         | 500.0    | 40000 | 0.0562          |
+### Framework versions
+- Transformers 4.57.1
+- Pytorch 2.8.0+cu128
+- Datasets 4.2.0
+- Tokenizers 0.22.1

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:43e94d7968644b656268e3bdf1d23bd1f33f89284dcdf9dc59561df9e487badf
 size 577899912

 version https://git-lfs.github.com/spec/v1
+oid sha256:f9f8c503b2e02faa3670df636b8c49a13dd7558e2544d76d9ad24ad4b2f52979
 size 577899912