README / README.md
rashid0784's picture
Update README.md
e4b29ef verified
# Uganda Open Source AI Lab (USOAL)
**USOAL** is an open initiative created by passionate machine learning researchers in Uganda to share exciting models and findings from the local research community.
We aim to contribute to the growth of African language technologies by building open, high-quality models for speech, language, and other AI applications.
---
## Table of Contents
- [Uganda Text-to-Speech (TTS)](#1-uganda-text-to-speech-tts)
- [Uganda Text Generation Model](#2-uganda-text-generation-advanced)
---
## 1. Uganda Text-to-Speech (TTS)
A collection of fine-tuned Orpheus 3B models that generate natural-sounding speech in multiple Ugandan languages including English, Luganda, Runyankole, Teso, and Acholi. These Models were built on top of open-sourced datasets from SunBird AI, Yogera and Mozilla's Common Voice Dataset.
#### 🔉 Audio Examples (With Prompts)
| Language | Voice | Prompt | Audio Sample |
|------------|-------------|------------------------------------------------------------------------------------------|--------------|
| English | Christopher | Hello I can speak in English as Christopher, one of the voices I can speak. | <audio controls src="https://github.com/Uganda-lang/Ganda-TTS/raw/refs/heads/main/Example/English/christopher.wav"></audio> |
| English | Barbara | Or as Barbara, this is one of my female voices. Pretty cool, right? | <audio controls src="https://github.com/Uganda-lang/Ganda-TTS/raw/refs/heads/main/Example/English/barbara.wav"></audio> |
| English | Mary | I can also speak as Mary as well. | <audio controls src="https://github.com/Uganda-lang/Ganda-TTS/raw/refs/heads/main/Example/English/mary.wav"></audio> |
| English | James | Or I can speak as James, as you can see. | <audio controls src="https://github.com/Uganda-lang/Ganda-TTS/raw/refs/heads/main/Example/English/james.wav"></audio> |
| English | Jessica | This is my other voice called Jessica. I have more voices like Jennifer, Susan, Linda, Patricia, and Elizabeth, which I’ll share when they’re ready. | <audio controls src="https://github.com/Uganda-lang/Ganda-TTS/raw/refs/heads/main/Example/English/jessica.wav"></audio> |
| Luganda | Christopher | Nsobola okwo’geranga Christopher nga wowulila kati. | <audio controls src="https://github.com/Uganda-lang/Ganda-TTS/raw/refs/heads/main/Example/Luganda/christopher.wav"></audio> |
| Luganda | Charles | Oba neenjogela nga Charles wenti. | <audio controls src="https://github.com/Uganda-lang/Ganda-TTS/raw/refs/heads/main/Example/Luganda/charles.wav"></audio> |
| Luganda | Sandra | Nina neddoboozi lya Sandra bweliti. | <audio controls src="https://github.com/Uganda-lang/Ganda-TTS/raw/refs/heads/main/Example/Luganda/sandra.wav"></audio> |
| Luganda | Michelle | Nsobola ogwogella bwenti mulino eddoboozi. | <audio controls src="https://github.com/Uganda-lang/Ganda-TTS/raw/refs/heads/main/Example/Luganda/michelle.wav"></audio> |
| Luganda | Daniel | Oba nemulino elye’kisajja nga woowulira. | <audio controls src="https://github.com/Uganda-lang/Ganda-TTS/raw/refs/heads/main/Example/Luganda/daniel.wav"></audio> |
| Runyankole | Christopher | Nimbasa kugamba nka Christopher omwiraka eri. | <audio controls src="https://github.com/Uganda-lang/Ganda-TTS/raw/refs/heads/main/Example/Runyankole/christopher.wav"></audio> |
| Runyankole | Patricia | Bimwe ebirikugambwa aha reediyo nibihwera abantu kumanya obutare burungi bw’amasharuura gaabo. | <audio controls src="https://github.com/Uganda-lang/Ganda-TTS/raw/refs/heads/main/Example/Runyankole/patricia.wav"></audio> |
| Runyankole | Elizabeth | Omu disiturikiti ya Kayunga emisiri erikukira obwngi ekashangwa erimu ebicoori ebiine oburwaire. | <audio controls src="https://github.com/Uganda-lang/Ganda-TTS/raw/refs/heads/main/Example/Runyankole/elizabeth.wav"></audio> |
| Runyankole | Michelle | Nimbasa kugamba nka Michelle omwiraka eri. | <audio controls src="https://github.com/Uganda-lang/Ganda-TTS/raw/refs/heads/main/Example/Runyankole/michelle.wav"></audio> |
| Runyankole | James | Uganda eteire amaani aha buhingi n’oburiisa. | <audio controls src="https://github.com/Uganda-lang/Ganda-TTS/raw/refs/heads/main/Example/Runyankole/james.wav"></audio> |
| Teso | Christopher | Epedorete akoriok aimedaun ejok kanejaas aicoreta nu itikitikere adeka. | <audio controls src="https://github.com/Uganda-lang/Ganda-TTS/raw/refs/heads/main/Example/Teso/christopher.wav"></audio> |
| Teso | Jessica | Akoru ikorion luegelegela nes ingarakini itunganan. | <audio controls src="https://github.com/Uganda-lang/Ganda-TTS/raw/refs/heads/main/Example/Teso/jessica.wav"></audio> |
| Teso | James | Iraasit yen emunaara aticepak ikur enyamitos. | <audio controls src="https://github.com/Uganda-lang/Ganda-TTS/raw/refs/heads/main/Example/Teso/james.wav"></audio> |
| Teso | Daniel | Aipagisanar nes ewai ecie lo ibwaikinet iboro toma aswam. | <audio controls src="https://github.com/Uganda-lang/Ganda-TTS/raw/refs/heads/main/Example/Teso/daniel.wav"></audio> |
| Teso | Barbara | Isisianakinete isomeroi kwana asiomak eipone lo isubusaere. | <audio controls src="https://github.com/Uganda-lang/Ganda-TTS/raw/refs/heads/main/Example/Teso/barbara.wav"></audio> |
| Acholi | Mark | Uganda tye ka keme ki lok me pur. | <audio controls src="https://github.com/Uganda-lang/Ganda-TTS/raw/refs/heads/main/Example/Acholi/mark.wav"></audio> |
| Acholi | Barbara | Lupur twero nongo kony ma dit ka gunongo ngec me gengo onyo cango two ma balo jami ma i poto. | <audio controls src="https://github.com/Uganda-lang/Ganda-TTS/raw/refs/heads/main/Example/Acholi/barbara.wav"></audio> |
| Acholi | Michelle | Gum madwong me timo biacara tye i te yub ma pe jenge i kom gamente. | <audio controls src="https://github.com/Uganda-lang/Ganda-TTS/raw/refs/heads/main/Example/Acholi/mitchelle.wav"></audio> |
| Acholi | James | Ler ma pe gidodo ma woto ka yenyo cam i dye poto obalo cam weng ma tye i poto. | <audio controls src="https://github.com/Uganda-lang/Ganda-TTS/raw/refs/heads/main/Example/Acholi/james.wav"></audio> |
---
### 🛠️ How it Works
These TTS models are built using a two-stage architecture:
1. **Audio Token Generation**
Uses SNAC (Structured Neural Audio Codec) to convert text into audio tokens.
2. **TTS Model Fine-Tuning**
Fine-tuned versions of the Orpheus 3B model convert the audio tokens into realistic speech in multiple Ugandan languages.
> ⚠️ Note: Some non-English outputs may sound lower in quality due to SNAC not being pretrained on local African phonetics.
The Github repository shares more details about the models
📦 Code: [GitHub Repository for Uganda TTS](https://github.com/Uganda-lang/Ganda-TTS)
---
## 2. Uganda Text Generation (Advanced)
We are also training powerful models that can understand and generate text in low-resource Ugandan languages.
📦 Code: [GitHub Repository for Uganda Text Generation](https://github.com/Uganda-lang/llm-ug-qwen3-14B)