You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Configuration Parsing Warning:Invalid JSON for config file config.json

VITS Mooré (Mooré Text-to-Speech)

This is a Text-to-Speech (TTS) model for the Mooré (Moore) language, a Gur language spoken in Burkina Faso by over 8 million people. The model is based on the VITS (Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech) architecture and trained using the Coqui TTS framework.

Model Details

Developed by: Soumana Dama — Full stack developper & AI engineer-Founder & Lead AI Engineer at Scoinvestigator AI
Model type: VITS (Text-to-Speech)
Language(s) (NLP): Mooré (ISO 639-3: mos)
Framework: Coqui TTS / PyTorch
Parameters: ~28.6 Million (Generator) / ~74 Million (Total with Discriminator)
License: Apache 2.0
Developer Contacts:
- LinkedIn: Soumana Dama on LinkedIn
- Scoinvestigator AI: scoinvestigator.com
- GitHub: Damasoumana1
- Portfolio: soumanadama.netlify.app
- Email: soumanadama93@gmail.com

Uses

Direct Use

This model is designed to synthesize spoken Mooré audio from written Mooré text. It can be integrated into educational tools, voice assistants, accessibility software for the visually impaired, and content creation platforms.

Out-of-Scope Use

This model is trained on a specific Mooré dialect and voice. It is not intended for voice cloning without consent or generating deepfakes.

Training Details

Training Data

The model was trained on a custom, carefully curated Mooré speech corpus consisting of high-quality audio and their corresponding text transcriptions.

Training Procedure & Parameters

Architecture: VITS (End-to-End TTS)
Framework: Coqui TTS
Hardware: Google Colab (Tesla T4 / A100 GPU)
Training Steps: > 22,000 steps (~9,400+ epochs)
Sample Rate: 16,000 Hz
Metrics (at step 22k):
- loss_gen (Generator Loss): ~35.8
- loss_disc (Discriminator Loss): ~2.4

How to Get Started with the Model

To use this model, you will need to install the TTS library from Coqui.

Installation

pip install TTS

Inference in Python

Download the checkpoint_*.pth and config.json files from this repository and use the following Python code to generate audio:

import torch
from TTS.api import TTS

# Paths to your downloaded files
model_path = "checkpoint_22000.pth" # Replace with the latest checkpoint name
config_path = "config.json"

# Initialize TTS
# The model will automatically use GPU if available
tts = TTS(model_path=model_path, config_path=config_path, progress_bar=False)

# Text to synthesize
text = "A wilga a nin-buiidã a tʋʋm-kãsemsã."

# Generate audio and save to a file
tts.tts_to_file(text=text, file_path="output_moore.wav")

print("Audio generated successfully!")

Note on Special Characters

Ensure that the input text matches the orthography and character set used during training. Special characters not present in the vocabulary might be discarded by the phonemizer/tokenizer.

Downloads last month: 33

Collection including Dama12/vits-tts-moore

African Language AI - Moore & Dioula

Collection

Open-source speech and language technologies for African languages, including Mooré and Dioula datasets, STT and TTS developed by Soumana Dama. • 9 items • Updated 2 days ago