This model is a fine-tuned version of microsoft/speecht5_tts on the LJ Speech dataset (keithito/lj_speech). On the evaluation set, it reaches a validation loss of 0.3717 at the final logged step (step 1500, epoch 4.58).
The hyperparameters used during training are not listed. Training and validation loss were logged every 100 steps:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 0.4684 | 0.31 | 100 | 0.4141 |
| 0.4496 | 0.61 | 200 | 0.4108 |
| 0.4334 | 0.92 | 300 | 0.3955 |
| 0.4245 | 1.22 | 400 | 0.3921 |
| 0.4225 | 1.53 | 500 | 0.3892 |
| 0.4207 | 1.83 | 600 | 0.3858 |
| 0.4151 | 2.14 | 700 | 0.3820 |
| 0.4136 | 2.44 | 800 | 0.3803 |
| 0.4105 | 2.75 | 900 | 0.3782 |
| 0.4083 | 3.05 | 1000 | 0.3763 |
| 0.4046 | 3.36 | 1100 | 0.3764 |
| 0.4012 | 3.66 | 1200 | 0.3748 |
| 0.4004 | 3.97 | 1300 | 0.3733 |
| 0.3998 | 4.27 | 1400 | 0.3726 |
| 0.4013 | 4.58 | 1500 | 0.3717 |
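The log above can be summarized with a few lines of arithmetic. The following is a minimal sketch, not part of the original training script: the rows are copied from the table, and the helper names (`best_checkpoint`, `steps_per_epoch`, `regressions`, `val_loss_drop`) are hypothetical, introduced only for illustration.

```python
# Training log copied from the table above:
# (step, epoch, training_loss, validation_loss)
LOG = [
    (100, 0.31, 0.4684, 0.4141),
    (200, 0.61, 0.4496, 0.4108),
    (300, 0.92, 0.4334, 0.3955),
    (400, 1.22, 0.4245, 0.3921),
    (500, 1.53, 0.4225, 0.3892),
    (600, 1.83, 0.4207, 0.3858),
    (700, 2.14, 0.4151, 0.3820),
    (800, 2.44, 0.4136, 0.3803),
    (900, 2.75, 0.4105, 0.3782),
    (1000, 3.05, 0.4083, 0.3763),
    (1100, 3.36, 0.4046, 0.3764),
    (1200, 3.66, 0.4012, 0.3748),
    (1300, 3.97, 0.4004, 0.3733),
    (1400, 4.27, 0.3998, 0.3726),
    (1500, 4.58, 0.4013, 0.3717),
]


def best_checkpoint(log):
    """Return the logged row with the lowest validation loss."""
    return min(log, key=lambda row: row[3])


def steps_per_epoch(log):
    """Estimate steps per epoch from the last logged row.

    The epoch column is rounded to two decimals, so this is approximate.
    """
    step, epoch, _, _ = log[-1]
    return step / epoch


def regressions(log):
    """Return the steps at which validation loss rose relative to the previous row."""
    return [cur[0] for prev, cur in zip(log, log[1:]) if cur[3] > prev[3]]


def val_loss_drop(log, start_step, end_step):
    """Return how much validation loss fell between two logged steps."""
    by_step = {row[0]: row[3] for row in log}
    return by_step[start_step] - by_step[end_step]


if __name__ == "__main__":
    step, epoch, _, val = best_checkpoint(LOG)
    print(f"lowest validation loss: {val:.4f} at step {step} (epoch {epoch})")
    print(f"approx. steps per epoch: {steps_per_epoch(LOG):.1f}")
    print(f"steps where validation loss rose: {regressions(LOG)}")
    print(f"drop, steps 100-500:   {val_loss_drop(LOG, 100, 500):.4f}")
    print(f"drop, steps 1100-1500: {val_loss_drop(LOG, 1100, 1500):.4f}")
```

On this log, the lowest validation loss is the final row (step 1500), and the only increase is a 0.0001 uptick at step 1100. The drop over the last 400 steps is much smaller than over the first 400, which suggests the validation loss was flattening by the end of the run.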
Base model: microsoft/speecht5_tts