How to use ruisp/speecht5_finetuned_voxpopuli_it with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-to-speech", model="ruisp/speecht5_finetuned_voxpopuli_it")

# Load model directly
from transformers import AutoProcessor, AutoModelForTextToSpectrogram
processor = AutoProcessor.from_pretrained("ruisp/speecht5_finetuned_voxpopuli_it")
model = AutoModelForTextToSpectrogram.from_pretrained("ruisp/speecht5_finetuned_voxpopuli_it")

This model is a fine-tuned version of microsoft/speecht5_tts on the facebook/voxpopuli dataset. It achieves the following results on the evaluation set:
Speaker embeddings are produced by the speechbrain/spkrec-xvect-voxceleb model.
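The loading snippet above stops before any audio is produced. A minimal sketch of the remaining steps follows, assuming the usual SpeechT5 workflow: an x-vector from speechbrain/spkrec-xvect-voxceleb conditions the model, and a HiFi-GAN vocoder turns the predicted spectrogram into a 16 kHz waveform. The vocoder checkpoint `microsoft/speecht5_hifigan` is an assumption (the card does not name one), as are the helper names, the reference file, and the `speechbrain.inference.speaker` import path (SpeechBrain 1.0 or later; older releases expose the same class under `speechbrain.pretrained`).

```python
SAMPLING_RATE = 16_000  # SpeechT5 and the x-vector encoder both expect 16 kHz audio


def load_tts(model_id="ruisp/speecht5_finetuned_voxpopuli_it",
             vocoder_id="microsoft/speecht5_hifigan"):
    """Load processor, acoustic model, and vocoder (vocoder choice is an assumption)."""
    from transformers import SpeechT5ForTextToSpeech, SpeechT5HifiGan, SpeechT5Processor

    processor = SpeechT5Processor.from_pretrained(model_id)
    model = SpeechT5ForTextToSpeech.from_pretrained(model_id)
    vocoder = SpeechT5HifiGan.from_pretrained(vocoder_id)
    return processor, model, vocoder


def xvector_from_waveform(waveform):
    """Return a (1, 512) L2-normalised x-vector for a mono 16 kHz waveform."""
    import torch
    from speechbrain.inference.speaker import EncoderClassifier

    encoder = EncoderClassifier.from_hparams(source="speechbrain/spkrec-xvect-voxceleb")
    wav = torch.as_tensor(waveform, dtype=torch.float32).unsqueeze(0)  # (1, time)
    with torch.no_grad():
        emb = encoder.encode_batch(wav)                   # (1, 1, 512)
        emb = torch.nn.functional.normalize(emb, dim=2)   # unit length per vector
    return emb.squeeze(1)                                 # (1, 512)


def synthesize(text, speaker_embedding, processor, model, vocoder):
    """Generate a 1-D waveform tensor for `text` in the given speaker's voice."""
    inputs = processor(text=text, return_tensors="pt")
    return model.generate_speech(inputs["input_ids"], speaker_embedding, vocoder=vocoder)


def main(reference_wav, out_wav="out.wav"):
    """Clone the voice in `reference_wav` (hypothetical mono 16 kHz file) onto Italian text."""
    import soundfile as sf

    reference, _ = sf.read(reference_wav)
    processor, model, vocoder = load_tts()
    embedding = xvector_from_waveform(reference)
    speech = synthesize("Buongiorno a tutti.", embedding, processor, model, vocoder)
    sf.write(out_wav, speech.numpy(), samplerate=SAMPLING_RATE)
```

Calling `main("reference_16k.wav")` with a short mono recording would write the synthesized audio to `out.wav`. The imports sit inside the functions so that the heavy dependencies and model downloads are only triggered when a function is actually called.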
More information needed
test_size=0.15
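A `test_size` of 0.15 means that 15% of the examples are held out for evaluation and the remaining 85% are used for training. The card does not say which tool produced the split; the name matches the `test_size` argument of `datasets.Dataset.train_test_split`, which is presumably what was used. The plain-Python sketch below only illustrates the arithmetic of such a split; the function name and seed are hypothetical.

```python
import math
import random


def holdout_split(items, test_size=0.15, seed=0):
    """Shuffle `items` and hold out a `test_size` fraction for evaluation."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    n_test = math.ceil(test_size * len(shuffled))  # round the held-out count up
    return shuffled[n_test:], shuffled[:n_test]


train, test = holdout_split(range(1000))
print(len(train), len(test))
```

The two parts are disjoint and together cover every input item, so no evaluation example leaks into training.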
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 0.6118 | 1.94 | 300 | 0.5508 |
| 0.5729 | 3.89 | 600 | 0.5204 |
| 0.563 | 5.83 | 900 | 0.5126 |
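Two quantities can be read off the table with simple arithmetic: roughly how many optimizer steps make up one epoch, and how much the validation loss improved between the first and last logged evaluation. The short sketch below computes both from the logged rows; it derives nothing the table does not already imply, and the variable names are illustrative.

```python
# (training loss, epoch, step, validation loss), copied from the table above
rows = [
    (0.6118, 1.94, 300, 0.5508),
    (0.5729, 3.89, 600, 0.5204),
    (0.5630, 5.83, 900, 0.5126),
]

# Steps divided by epochs elapsed gives the approximate steps per epoch.
steps_per_epoch = [step / epoch for _, epoch, step, _ in rows]

# Relative improvement in validation loss from the first to the last row.
first_val, last_val = rows[0][3], rows[-1][3]
relative_drop = (first_val - last_val) / first_val

print([round(s, 1) for s in steps_per_epoch])
print(f"validation loss drop: {relative_drop:.1%}")
```

All three rows give between 154 and 155 steps per epoch, so the logged epoch values are consistent with a fixed number of steps per pass over the training split.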