|
|
--- |
|
|
language: |
|
|
- sw |
|
|
base_model: |
|
|
- google-t5/t5-small |
|
|
pipeline_tag: translation |
|
|
--- |
|
|
|
|
|
|
|
|
## Overview |
|
|
|
|
|
THiNK’s **Luo–Swahili Translation Model** is a fine-tuned T5-small designed to translate between the Luo (dav) and Swahili (swa) languages. It was trained on a parallel corpus of approximately 29,300 Luo–Swahili sentence pairs drawn from the larger “Kenyan Low-Resource Language Data” dataset. thinkKenya—also known as Tech Innovators Network Kenya—is a community-driven technology initiative founded in 2019 to support digital transformation and applied open innovation across Kenya, with a special focus on African local languages. |
|
|
|
|
|
## Model Details |
|
|
|
|
|
* **Model name**: [thinkKenya/luo\_swa\_translation\_model](https://huggingface.co/thinkKenya/luo_swa_translation_model) |
|
|
* **Architecture**: T5-small (≈60.5 M parameters; base checkpoint: [google/t5-small](https://huggingface.co/google/t5-small)) |
|
|
* **Framework**: Hugging Face Transformers (weights in [safetensors format](https://huggingface.co/thinkKenya/luo_swa_translation_model/tree/main)) |
|
|
* **Tensor type**: fp32 |
|
|
* **Training status**: In progress (latest reported step: 146,600) |
|
|
|
|
|
## Dataset |
|
|
|
|
|
* **Dataset name**: [thinkKenya/kenyan-low-resource-language-data](https://huggingface.co/datasets/thinkKenya/kenyan-low-resource-language-data) |
|
|
* **Task**: Translation (parallel text, Parquet format) |
|
|
* **Languages**: Luo (ISO `dav`) ↔ Swahili (ISO `swa`) |
|
|
* **Subset used**: `luo_swa` (≈29.3 k total examples; train split: 21.3 k; test split: 5.33 k) |
|
|
* **License**: [CC BY 4.0](https://creativecommons.org/licenses/by/4.0/) |
|
|
|
|
|
## Organization: Tech Innovators Network Kenya (thinkKenya) |
|
|
|
|
|
* **Website**: [think.ke](https://think.ke) |
|
|
* **Hugging Face Org**: [thinkKenya](https://huggingface.co/thinkKenya) |
|
|
* **Founded**: 2019 |
|
|
* **Mission**: To accelerate digital transformation and applied open innovation in Kenya, with special emphasis on building AI solutions for African local languages. |
|
|
|
|
|
## Training Configuration |
|
|
|
|
|
| Component | Details | |
|
|
| ----------------- | ----------------------------------------------------------------------------------------------------- | |
|
|
| Model weights | [`model.safetensors`](https://huggingface.co/thinkKenya/luo_swa_translation_model/tree/main) (242 MB) | |
|
|
| Tokenizer files | `tokenizer.json`, `special_tokens_map.json`, `tokenizer_config.json` | |
|
|
| Config file | `config.json` | |
|
|
| Training args | `training_args.bin` | |
|
|
| Software versions | `transformers` ≥ 4.x, `datasets` ≥ 2.x | |
|
|
|
|
|
## Example Usage |
|
|
|
|
|
```python |
|
|
from transformers import T5Tokenizer, T5ForConditionalGeneration |
|
|
|
|
|
tokenizer = T5Tokenizer.from_pretrained("thinkKenya/luo_swa_translation_model") |
|
|
model = T5ForConditionalGeneration.from_pretrained("thinkKenya/luo_swa_translation_model") |
|
|
|
|
|
input_text = "translate Luo to Swahili: Wuki ghwa choki" |
|
|
inputs = tokenizer(input_text, return_tensors="pt") |
|
|
outputs = model.generate(**inputs) |
|
|
print(tokenizer.decode(outputs[0], skip_special_tokens=True)) |
|
|
``` |
|
|
|
|
|
## Limitations |
|
|
|
|
|
* **Ongoing fine-tuning**: Outputs may still be unstable until training completes. |
|
|
* **Domain coverage**: Trained on conversational and narrative sentences—performance may drop on highly specialized or out-of-domain text. |
|
|
* **No public benchmarks yet**: Users are encouraged to evaluate with their own BLEU/ROUGE metrics. |
|
|
|
|
|
Thought for a couple of seconds |
|
|
|
|
|
|
|
|
Below is the **License** section added to the model card, specifying the CC BY 4.0 terms and the required attribution format. |
|
|
|
|
|
--- |
|
|
|
|
|
## License |
|
|
|
|
|
This model and the underlying dataset are released under the [Creative Commons Attribution 4.0 International License (CC BY 4.0)](https://creativecommons.org/licenses/by/4.0/). |
|
|
|
|
|
You are free to: |
|
|
|
|
|
* **Share** — copy and redistribute the material in any medium or format |
|
|
* **Adapt** — remix, transform, and build upon the material for any purpose, even commercially |
|
|
|
|
|
**Under the following terms:** |
|
|
|
|
|
* **Attribution** — You must give appropriate credit, provide a link to the license, and indicate if changes were made. |
|
|
* **No additional restrictions** — You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits. |
|
|
|
|
|
**Suggested attribution for this model:** |
|
|
|
|
|
> “Luo–Swahili Translation Model, thinkKenya (Tech Innovators Network Kenya), CC BY 4.0, [https://huggingface.co/thinkKenya/luo\_swa\_translation\_model”](https://huggingface.co/thinkKenya/luo_swa_translation_model”) |
|
|
|
|
|
|
|
|
## Citation |
|
|
|
|
|
```bibtex |
|
|
@misc{luo_swa_translation_model, |
|
|
title = {Luo–Swahili Translation Model}, |
|
|
author = {thinkKenya (Tech Innovators Network Kenya)}, |
|
|
year = {2024}, |
|
|
publisher = {Hugging Face}, |
|
|
howpublished = {\url{https://huggingface.co/thinkKenya/luo_swa_translation_model}}, |
|
|
} |
|
|
``` |
|
|
|
|
|
|