Amebo Premium Voice - Hausa TTS
ποΈ Natural Nigerian Hausa Text-to-Speech
Amebo Premium Voice is a high-quality Hausa TTS model built on Meta's MMS-TTS, enhanced with warmth processing for natural, clear speech synthesis.
Features
- β Native Hausa pronunciation - Correct sounds for Ι, Ι, Ζ, etc.
- β Fast inference - ~100ms latency after warmup
- β Lightweight - 36MB model
- β Warmth processing - Natural, warm voice quality
- β Production ready - Perfect for call centers & voice apps
Usage
Python API
from transformers import VitsModel, AutoTokenizer
import torch
# Load model
model = VitsModel.from_pretrained("facebook/mms-tts-hau")
tokenizer = AutoTokenizer.from_pretrained("facebook/mms-tts-hau")
# Generate speech
text = "Sannu da zuwa. Ina kwana?"
inputs = tokenizer(text, return_tensors="pt")
with torch.no_grad():
output = model(**inputs).waveform
# Save audio
import soundfile as sf
sf.write("output.wav", output.squeeze().numpy(), 16000)
Inference API
import requests
API_URL = "https://api-inference.huggingface.co/models/YOUR_USERNAME/amebo-premium-voice"
headers = {"Authorization": "Bearer YOUR_HF_TOKEN"}
response = requests.post(API_URL, headers=headers, json={
"inputs": "Sannu da zuwa cikin aikin mu.",
"parameters": {
"warmth": 0.3,
"presence": 0.2
}
})
Parameters
| Parameter | Default | Range | Description |
|---|---|---|---|
| warmth | 0.3 | 0.0-1.0 | Voice warmth (low-mid boost) |
| presence | 0.2 | 0.0-1.0 | Voice clarity (high-mid boost) |
Performance
| Metric | Value |
|---|---|
| Model Size | 36 MB |
| Sample Rate | 16 kHz |
| Latency (GPU) | ~100ms |
| Latency (CPU) | ~500ms |
Supported Text
- Standard Hausa text
- Special characters: Ι, Ι, Ζ, Ζ΄
- Numbers and punctuation
License
Apache 2.0 (based on Meta MMS-TTS)
Credits
- Base model: Meta MMS-TTS
- Warmth processing: Amebo AI
- Training data: NaijaVoices dataset
Made with β€οΈ for Nigerian Hausa speakers