HAMMALE
/

speecht5-darija

Model card Files Files and versions

Metrics Training metrics Community

HAMMALE commited on Apr 27, 2025

Commit

a10cca3

·

verified ·

1 Parent(s): 2f99e8a

Add model README

Files changed (1) hide show

README.md +50 -0

README.md ADDED Viewed

	@@ -0,0 +1,50 @@

+# Moroccan Darija Text-to-Speech Model
+This model is a fine-tuned version of SpeechT5 for Moroccan Darija Text-to-Speech synthesis.
+## Model Details
+- **Base Model**: Microsoft SpeechT5
+- **Fine-tuned on**: DODa audio dataset
+- **Languages**: Moroccan Darija (Latin script)
+- **Features**: Multiple voice support (male/female)
+- **Release Date**: April 2025
+## Usage
+```python
+from transformers import SpeechT5Processor, SpeechT5ForTextToSpeech, SpeechT5HifiGan
+import torch
+import soundfile as sf
+# Load models
+processor = SpeechT5Processor.from_pretrained("HAMMALE/speecht5-darija")
+model = SpeechT5ForTextToSpeech.from_pretrained("HAMMALE/speecht5-darija")
+vocoder = SpeechT5HifiGan.from_pretrained("microsoft/speecht5_hifigan")
+# Load speaker embedding (replace with your own speaker embedding)
+speaker_embedding = torch.randn(1, 512)  # Example embedding
+# Process text
+text = "Salam, kifach nta lyoum?"
+inputs = processor(text=text, return_tensors="pt")
+# Generate speech
+speech = model.generate_speech(inputs["input_ids"], speaker_embedding, vocoder=vocoder)
+# Save audio file
+sf.write("output.wav", speech.numpy(), 16000)
+```
+## Demo
+A live demo is available at [Hugging Face Spaces](https://huggingface.co/spaces/HAMMALE/darija-tts-demo)
+## License
+This model is available under the MIT License.
+## Acknowledgments
+- The [DODa audio dataset](https://huggingface.co/datasets/atlasia/DODa-audio-dataset) creators
+- Microsoft Research for the SpeechT5 model architecture