mozilla-foundation/common_voice_17_0
How to use anirxudh/speecht5_tts_mal with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-to-audio", model="anirxudh/speecht5_tts_mal")

# Load model directly
from transformers import AutoProcessor, AutoModelForTextToSpectrogram
processor = AutoProcessor.from_pretrained("anirxudh/speecht5_tts_mal")
model = AutoModelForTextToSpectrogram.from_pretrained("anirxudh/speecht5_tts_mal")

This model is a fine-tuned version of microsoft/speecht5_tts on the Common Voice dataset. On the evaluation set it reaches a validation loss of 0.4338 at the last logged step (4000).
Model description: More information needed
Intended uses & limitations: More information needed
Training and evaluation data: More information needed
The training hyperparameters are not listed. Training results, logged every 1,000 steps:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 1.0775 | 7.5758 | 1000 | 0.4753 |
| 0.9791 | 15.1515 | 2000 | 0.4433 |
| 0.9368 | 22.7273 | 3000 | 0.4349 |
| 0.9392 | 30.3030 | 4000 | 0.4338 |
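The table can be read a little more closely with some arithmetic. The sketch below is only an illustration: the rows are copied from the table, and the variable names are made up for this example. It estimates how many optimizer steps make up one epoch and how much the validation loss changes between checkpoints.

```python
# Rows copied from the training-results table: (step, epoch, train_loss, val_loss)
log = [
    (1000, 7.5758, 1.0775, 0.4753),
    (2000, 15.1515, 0.9791, 0.4433),
    (3000, 22.7273, 0.9368, 0.4349),
    (4000, 30.3030, 0.9392, 0.4338),
]

# Steps per epoch implied by each row (step count divided by fractional epoch).
steps_per_epoch = [round(step / epoch) for step, epoch, _, _ in log]

# Change in validation loss from one checkpoint to the next.
val_losses = [row[3] for row in log]
val_deltas = [round(b - a, 4) for a, b in zip(val_losses, val_losses[1:])]

print(steps_per_epoch)  # [132, 132, 132, 132]
print(val_deltas)       # [-0.032, -0.0084, -0.0011]
```

Each row implies about 132 steps per epoch. The validation-loss improvement shrinks from 0.032 to 0.0011 per 1,000 steps, and the training loss rises slightly between steps 3000 and 4000 (0.9368 to 0.9392), which suggests the run had largely plateaued by the last checkpoint.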
Base model
microsoft/speecht5_tts
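Loading the processor and model is not enough to produce audio: SpeechT5 checkpoints also expect a 512-dimensional speaker embedding and a vocoder. The following is a minimal sketch of one way to synthesize speech, not the author's documented procedure. It assumes this checkpoint behaves like the base microsoft/speecht5_tts model, that the repository ships processor files, and that microsoft/speecht5_hifigan is a suitable vocoder. The helper names `as_xvector` and `synthesize` are hypothetical, and the caller must supply the speaker embedding.

```python
import math

XVECTOR_DIM = 512  # speaker-embedding size used by SpeechT5


def as_xvector(values):
    """Return an L2-normalized embedding shaped (1, 512) as nested lists.

    Normalizing here is an assumption of this sketch; it is harmless if the
    embedding is already unit length.
    """
    values = [float(v) for v in values]
    if len(values) != XVECTOR_DIM:
        raise ValueError(f"expected {XVECTOR_DIM} values, got {len(values)}")
    norm = math.sqrt(sum(v * v for v in values))
    if norm == 0.0:
        raise ValueError("speaker embedding must not be all zeros")
    return [[v / norm for v in values]]


def synthesize(text, speaker_embedding,
               model_id="anirxudh/speecht5_tts_mal",
               vocoder_id="microsoft/speecht5_hifigan"):
    """Generate a waveform for `text`; returns (samples, sampling_rate)."""
    # Heavy imports are kept local so the helper above works without them.
    import torch
    from transformers import (
        SpeechT5ForTextToSpeech,
        SpeechT5HifiGan,
        SpeechT5Processor,
    )

    processor = SpeechT5Processor.from_pretrained(model_id)
    model = SpeechT5ForTextToSpeech.from_pretrained(model_id)
    vocoder = SpeechT5HifiGan.from_pretrained(vocoder_id)

    inputs = processor(text=text, return_tensors="pt")
    embedding = torch.tensor(as_xvector(speaker_embedding))  # shape (1, 512)

    with torch.no_grad():
        speech = model.generate_speech(
            inputs["input_ids"], embedding, vocoder=vocoder
        )
    # SpeechT5 with the HiFi-GAN vocoder produces 16 kHz audio.
    return speech.numpy(), 16000
```

A call such as `samples, rate = synthesize("...", my_xvector)` downloads both checkpoints on first use; the returned samples can then be written to a WAV file with any audio library at the returned sampling rate.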