facebook/voxpopuli
Viewer • Updated • 1.26M • 18.8k • 152
How to use EYEDOL/speek with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-to-speech", model="EYEDOL/speek") # Load model directly
from transformers import AutoProcessor, AutoModelForTextToSpectrogram
processor = AutoProcessor.from_pretrained("EYEDOL/speek")
model = AutoModelForTextToSpectrogram.from_pretrained("EYEDOL/speek")# Load model directly
from transformers import AutoProcessor, AutoModelForTextToSpectrogram
processor = AutoProcessor.from_pretrained("EYEDOL/speek")
model = AutoModelForTextToSpectrogram.from_pretrained("EYEDOL/speek")This model is a fine-tuned version of microsoft/speecht5_tts on the VoxPopuli dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 0.3986 | 3.0488 | 1000 | 0.3671 |
| 0.3848 | 6.0976 | 2000 | 0.3597 |
| 0.3821 | 9.1463 | 3000 | 0.3544 |
| 0.3813 | 12.1951 | 4000 | 0.3553 |
Base model
microsoft/speecht5_tts
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-to-speech", model="EYEDOL/speek")