keithito/lj_speech
Updated • 1.42k • 62
How to use ezrab/speecht5_finetuned_lj_speech with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-to-speech", model="ezrab/speecht5_finetuned_lj_speech") # Load model directly
from transformers import AutoProcessor, AutoModelForTextToSpectrogram
processor = AutoProcessor.from_pretrained("ezrab/speecht5_finetuned_lj_speech")
model = AutoModelForTextToSpectrogram.from_pretrained("ezrab/speecht5_finetuned_lj_speech")This model is a fine-tuned version of microsoft/speecht5_tts on the keithito/lj_speech dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 0.4519 | 1.3569 | 500 | 0.4035 |
| 0.4307 | 2.7137 | 1000 | 0.3897 |
| 0.4243 | 4.0706 | 1500 | 0.3842 |
| 0.4154 | 5.4274 | 2000 | 0.3814 |
| 0.4158 | 6.7843 | 2500 | 0.3793 |
| 0.409 | 8.1411 | 3000 | 0.3783 |
| 0.4112 | 9.4980 | 3500 | 0.3774 |
| 0.4135 | 10.8548 | 4000 | 0.3772 |
Base model
microsoft/speecht5_tts