Text-to-Speech
Transformers
Safetensors
Bambara
vits
text-to-audio
mms
multilingual
Open-Source
Mali
Bambara
Eval Results (legacy)
Instructions to use sudoping01/bambara-tts with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use sudoping01/bambara-tts with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-to-speech", model="sudoping01/bambara-tts")# Load model directly from transformers import AutoTokenizer, AutoModelForTextToWaveform tokenizer = AutoTokenizer.from_pretrained("sudoping01/bambara-tts") model = AutoModelForTextToWaveform.from_pretrained("sudoping01/bambara-tts") - Notebooks
- Google Colab
- Kaggle
Bambara TTS
Text-to-speech synthesis model for Bambara (Bamanankan), a language spoken by over 14 million people primarily in Mali.
Technical Specifications
- Architecture: VITS (Variational Inference with adversarial learning for end-to-end TTS)
- Base Model: Facebook/Meta MMS
- Size: 145 MB
- Format: PyTorch
- Sampling Rate: 16kHz
- Language: Bambara (bm-ML)
- Performance: Optimized for CPU (4GB RAM recommended)
Installation
pip install transformers torch soundfile
Usage
from transformers import VitsModel, AutoTokenizer
import torch
# Load model and tokenizer
model = VitsModel.from_pretrained("sudoping01/bambara-tts")
tokenizer = AutoTokenizer.from_pretrained("sudoping01/bambara-tts")
# Prepare text and generate speech
text = "An filɛ ni ye yɔrɔ minna ni an ye an sigi ka a layɛ yala an bɛ ka baara min kɛ ɛsike a kɛlen don ka Ɲɛ wa ?"
inputs = tokenizer(text, return_tensors="pt")
with torch.no_grad():
output = model(**inputs).waveform
# Save output
waveform = output.squeeze().cpu().numpy()
sample_rate = model.config.sampling_rate
import soundfile as sf
sf.write("bambara_output.wav", waveform, sample_rate)
Limitations
- Limited handling of loanwords and code-switching with French
- Variable performance across regional dialects
- Requires standard orthography
- Limited prosody and emotional expression
License
CC BY-NC 4.0 (Attribution-NonCommercial)
- Non-commercial use only
- Attribution required for model authors and Meta
- Use must respect Bambara language and culture
References
@misc{bambara-tts,
author = {sudoping01},
title = {Text-to-Speech Model for Bambara},
year = {2025},
publisher = {HuggingFace},
howpublished = {\url{https://huggingface.co/sudoping01/bambara-tts}}
}
- Downloads last month
- 30
Model tree for sudoping01/bambara-tts
Base model
facebook/mms-ttsSpaces using sudoping01/bambara-tts 2
Evaluation results
- Subjective Qualityself-reportedN/A