How to use Dulence/speecht5_tts_voxpopuli_hr with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-to-audio", model="Dulence/speecht5_tts_voxpopuli_hr")

# Load model directly
from transformers import AutoProcessor, AutoModelForTextToSpectrogram

processor = AutoProcessor.from_pretrained("Dulence/speecht5_tts_voxpopuli_hr")
model = AutoModelForTextToSpectrogram.from_pretrained("Dulence/speecht5_tts_voxpopuli_hr")

This model is a fine-tuned version of microsoft/speecht5_tts on the Croatian (hr) subset of the VoxPopuli dataset (facebook/voxpopuli). On the evaluation set it reaches a validation loss of 0.4304 at the final logged checkpoint (step 4000).
More information needed
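The loading snippet above stops once the processor and model are instantiated. As a minimal sketch of the remaining steps, the code below turns text into a waveform and saves it as a WAV file. Several things here are assumptions rather than details from this card: the vocoder checkpoint `microsoft/speecht5_hifigan`, the 16 kHz sample rate, the caller-supplied speaker embedding (SpeechT5 expects an x-vector of shape `(1, 512)`), and the hypothetical helper names `synthesize` and `write_wav`. It also assumes the model repository ships processor files, as the loading snippet implies.

```python
import struct
import wave

MODEL_ID = "Dulence/speecht5_tts_voxpopuli_hr"
VOCODER_ID = "microsoft/speecht5_hifigan"  # assumption: not named in this card
SAMPLE_RATE = 16_000  # assumption: output rate of the vocoder above


def synthesize(text, speaker_embedding):
    """Return the generated waveform for `text` as a list of floats.

    `speaker_embedding` is assumed to be a torch tensor of shape (1, 512).
    """
    # Imported lazily so write_wav() stays usable without these packages.
    import torch
    from transformers import (
        AutoModelForTextToSpectrogram,
        AutoProcessor,
        SpeechT5HifiGan,
    )

    processor = AutoProcessor.from_pretrained(MODEL_ID)
    model = AutoModelForTextToSpectrogram.from_pretrained(MODEL_ID)
    vocoder = SpeechT5HifiGan.from_pretrained(VOCODER_ID)

    inputs = processor(text=text, return_tensors="pt")
    with torch.no_grad():
        # The vocoder converts the predicted spectrogram into audio samples.
        speech = model.generate_speech(
            inputs["input_ids"], speaker_embedding, vocoder=vocoder
        )
    return speech.tolist()


def write_wav(path, samples, sample_rate=SAMPLE_RATE):
    """Write float samples in [-1, 1] as 16-bit mono PCM."""
    clipped = [max(-1.0, min(1.0, float(s))) for s in samples]
    pcm = struct.pack(
        f"<{len(clipped)}h", *(int(round(s * 32767)) for s in clipped)
    )
    with wave.open(path, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)  # two bytes per sample = 16-bit audio
        wav.setframerate(sample_rate)
        wav.writeframes(pcm)
```

Given an x-vector tensor `emb`, a call such as `write_wav("out.wav", synthesize("Dobar dan.", emb))` would produce a playable file. Output quality depends heavily on the speaker embedding, so a random vector is unlikely to sound natural.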
The hyperparameters used during training are not included here. Training and validation loss were logged every 1000 steps:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 0.4915 | 3.24 | 1000 | 0.4504 |
| 0.4757 | 6.49 | 2000 | 0.4366 |
| 0.4653 | 9.73 | 3000 | 0.4318 |
| 0.4636 | 12.98 | 4000 | 0.4304 |
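Two things can be read off the table with simple arithmetic: the approximate number of optimizer steps per epoch (step divided by epoch) and how much the validation loss improves between checkpoints. The sketch below does both; the rows are copied from the table, while the function names are hypothetical and chosen only for illustration.

```python
# Rows copied from the table: (training loss, epoch, step, validation loss)
LOG = [
    (0.4915, 3.24, 1000, 0.4504),
    (0.4757, 6.49, 2000, 0.4366),
    (0.4653, 9.73, 3000, 0.4318),
    (0.4636, 12.98, 4000, 0.4304),
]


def steps_per_epoch(log):
    """Estimate steps per epoch at each checkpoint as step / epoch."""
    return [step / epoch for _, epoch, step, _ in log]


def validation_deltas(log):
    """Change in validation loss between consecutive checkpoints."""
    losses = [row[3] for row in log]
    return [round(later - earlier, 4) for earlier, later in zip(losses, losses[1:])]


if __name__ == "__main__":
    print([round(s, 1) for s in steps_per_epoch(LOG)])
    print(validation_deltas(LOG))
```

Each checkpoint gives an estimate of roughly 308 steps per epoch, and the validation loss drops by about 0.0138, 0.0048, and 0.0014 between successive checkpoints, so the gains shrink steadily over the run.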