You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

Configuration Parsing Warning:Invalid JSON for config file config.json

VITS Mooré (Mooré Text-to-Speech)

This is a Text-to-Speech (TTS) model for the Mooré (Moore) language, a Gur language spoken in Burkina Faso by over 8 million people. The model is based on the VITS (Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech) architecture and trained using the Coqui TTS framework.

Model Details


Uses

Direct Use

This model is designed to synthesize spoken Mooré audio from written Mooré text. It can be integrated into educational tools, voice assistants, accessibility software for the visually impaired, and content creation platforms.

Out-of-Scope Use

This model is trained on a specific Mooré dialect and voice. It is not intended for voice cloning without consent or generating deepfakes.


Training Details

Training Data

The model was trained on a custom, carefully curated Mooré speech corpus consisting of high-quality audio and their corresponding text transcriptions.

Training Procedure & Parameters

  • Architecture: VITS (End-to-End TTS)
  • Framework: Coqui TTS
  • Hardware: Google Colab (Tesla T4 / A100 GPU)
  • Training Steps: > 22,000 steps (~9,400+ epochs)
  • Sample Rate: 16,000 Hz
  • Metrics (at step 22k):
    • loss_gen (Generator Loss): ~35.8
    • loss_disc (Discriminator Loss): ~2.4

How to Get Started with the Model

To use this model, you will need to install the TTS library from Coqui.

Installation

pip install TTS

Inference in Python

Download the checkpoint_*.pth and config.json files from this repository and use the following Python code to generate audio:

import torch
from TTS.api import TTS

# Paths to your downloaded files
model_path = "checkpoint_22000.pth" # Replace with the latest checkpoint name
config_path = "config.json"

# Initialize TTS
# The model will automatically use GPU if available
tts = TTS(model_path=model_path, config_path=config_path, progress_bar=False)

# Text to synthesize
text = "A wilga a nin-buiidã a tʋʋm-kãsemsã."

# Generate audio and save to a file
tts.tts_to_file(text=text, file_path="output_moore.wav")

print("Audio generated successfully!")

Note on Special Characters

Ensure that the input text matches the orthography and character set used during training. Special characters not present in the vocabulary might be discarded by the phonemizer/tokenizer.


Downloads last month
33
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including Dama12/vits-tts-moore