Ethio-ASR Logo

arXiv ๐Ÿ“– [ preprint ]

โš’๏ธ Model Description

Ethio-ASR is a suite of multilingual Automatic Speech Recognition (ASR) models that support five Ethiopian languages: Amharic, Tigrinya, Afaan Oromo, Sidama, and Wolaytta. The ASR model in this repo is based on the wav2vec2โ€‘bert-2.0 pre-trained model by fine-tuning it on the WAXAL Speech Dataset.

  • Developed by: Ethio-ASR Team
  • Task: Speech Recognition (ASR) and Language Identification (LID)
  • Languages: Amharic, Tigrinya, Afaan Oromo, Sidama, and Wolaytta
  • License: CC-BY-4.0
  • Finetuned from: facebook/w2v-bert-2.0

๐Ÿ“ˆ Evaluation on WAXAL Test Set

๐Ÿ“Œ ASR model in this HF repo

Model # Params Amharic Tigrinya Oromo Wolaytta Sidaama Avg.
Ethio-ASR (afrihubert) 92M 30.95 42.42 27.57 40.44 34.02 35.08
Ethio-ASR (mms-300) 300M 30.19 41.62 26.41 39.10 32.66 33.99
Ethio-ASR (mms-1b) 1B 26.14 37.63 23.69 37.51 31.02 31.20
Ethio-ASR (w2v-bert-2.0) ๐Ÿ“Œ 600M 22.92 35.22 24.44 38.19 31.65 30.48

๐Ÿš€ Examples

Language Audio Human Transcription ASR Transcription
1 Oromo Suuraan asii gaditti argaa jirru kun lafa gurgurtaa kuduraa fi muduraa dha. Kuduraa fi muduraan nyaataaf kan baay'ee namatti toluudha. Nyaachuudhaaf illee kuduraa fi muduraan baay'ee filataamaa dha. Kanaaf, kuduraa fi muduraa kana baay'een jaalladha. Baay'ee nyaachuuf illee baay'een fedha guddaa qaba. [ORM] suurran asii gaditti argaa jirru kun lafa gurgurtaa kuduraafi muduraadha. kuduraaf muduraa nyaataaf kan baay'ee namatti toludha. nyaachuudhaaf illee kuduraaf muduraan baay'ee filatamaadha. kanaaf kuduraaf muduraa kan baay'een jaalladha baay'ee nyaachuuf illee baay'een fedha guddaa qaba.
2 Amharic แˆฐแ‹Žแ‰ฝ แ‰คแ‰ณแ‰ธแ‹แŠ• แˆˆแˆ›แˆตแŒŒแŒฅ แ‹จแ‰ฐแˆˆแ‹ซแ‹ฉ แŒจแˆญแ‰†แ‰ฝแŠ• แ‹ญแŒ แ‰€แˆ›แˆ‰แข แŒจแˆญแ‰†แ‰ฝ แ‰คแ‰ถแ‰ฝแŠ• แˆˆแˆ›แˆตแŒŒแŒฅ แ‹จแˆšแŒ แ‰…แˆ™แ‰ต แ‹จแ‰ฐแˆˆแ‹ซแ‹ฉ แ‹จแ‰…แˆญแ… แ‰ แˆ˜แˆตแŒ แ‰ตแฃ แŠฅแŠ•แ‹ฐ แ‹šแŒแ‹›แŒ แ‰…แˆญแŒฝ แ‰ แˆ›แˆตแ‹ฐแˆญแŒแฃ แŠฅแŠ•แ‹ฐ แ‰ขแˆซแ‰ขแˆฎ แАแŒˆแˆฎแ‰ฝแŠ• แ‰ แˆ˜แˆตแˆซแ‰ตแฃ แŒแ‹ตแŒแ‹ณ แˆ‹แ‹ญ แ‰ แˆ˜แˆˆแŒ แแฃ แ‰ แˆ›แŠ•แŒ แˆแŒ แˆ แˆˆแ‰คแ‰ฑ แ‹แ‰ แ‰ต แ‹ญแˆฐแŒฃแˆ‰แข [AMH] แˆฐแˆฐแ‹Žแ‰ฝ แ‰คแ‰ณแ‰ธแ‹แŠ• แˆˆแˆ›แˆตแŒŒแŒฅ แ‹จแ‰ฐแˆˆแ‹ซแ‹ฉ แŒจแˆญแ‰†แ‰ฝแŠ• แ‹ญแŒ แ‰€แˆ›แˆ‰แข แŒจแˆญแ‰†แ‰ฝ แ‰คแ‰ถแ‰ฝแŠ• แˆˆแˆ›แˆตแŒŒแŒฅ แ‹จแˆšแŒ แ‰…แˆ™แ‰ต แ‹จแ‰ฐแˆˆแ‹ซแ‹จ แ‰…แˆญแŒฝ แ‰ แˆ˜แˆตแŒ แ‰ต แŠฅแŠ•แ‹ฐ แ‹šแŒแ‹›แŒ แ‰…แˆญแŒฝ แ‰ แˆ›แˆตแ‹ฐแˆจแŒ แŠฅแŠ•แ‹ฐ แ‰ขแˆซ แ‰ขแˆฎ แАแŒˆแˆฎแ‰ฝแŠ• แ‰ แˆ˜แˆตแˆซแ‰ต แŒแ‹ตแŒแ‹ณ แˆ‹แ‹ญ แ‰ แˆ˜แˆˆแŒ แ แ‰ แˆ›แŠ•แŒ แˆแŒ แˆ แˆˆแ‰คแ‰ฑ แ‹แ‰ แ‰ต แ‹ญแˆฐแŒฃแˆ‰แข
3 Wolaytta Issi heeraani asaa naatussi dumma dumma ayfiyan be'iyobati de'oososna. Hegeetuppekka meretaanne merettaa heeraara gayttiyabata be'iyaba gidikko issi heeraani mitatta woykko dumma dumma adil'e ciishshata be'iyode keehippe lo'oosonanne ubbakka ufaysoosona. [WAL] issi heeran asaa naatussi dumma dumma ayfiyan be'iyobati de'oosona. hegeetuppekka meretaanne meretaa heeraara gayttiyaabata be'iyaaba gidikko issi heeran mittata woykko dumma dumma adile ciishshabata be'iyode keehippe lo'oosonanne ubbakka ufayssoosona.
4 Tigrinya แŠฅแ‹™แ‹ญ แŠฅแŠ•แˆญแŠฅแ‹ฎ แ‹˜แˆˆแŠ“ แˆแˆตแˆŠ แŠ“แ‹ญ แ‹ฐแˆ˜แˆซ แˆแˆตแˆŠ แŠฅแŠ•แ‰ตแŠจแ‹แŠ• แŠฃแ‰ฅ แ‰ฃแ‹“แˆ แˆ˜แˆตแ‰€แˆ แ‹แŠญแ‰ แˆญ แ‰ฅแ‹“แˆ แŠฅแ‹ฉแข แˆƒแ‹ญแˆ›แŠ–แ‰ณแ‹Š แ‰ฅแ‹“แˆแŠ• แ‰ฃแˆ…แˆ‹แ‹Š แˆ‚แ‹ฐแ‰ต แ‹˜แˆˆแ‹Ž แŒˆแ‹ญแˆฉ แ‹แŠซแ‹จแ‹ต แŠฅแ‹ฉแข แŠซแ‰ฅ แ‹แ‰ฐแˆแˆ‹แˆˆแ‹จ แ‹“แ‹ญแАแ‰ต แˆ“แˆฐแˆซแ‰ต แ‹•แŠ•แ€แ‹ญแ‰ตแ‰ณแ‰ต แ‰ฅแˆแŠฅแŠซแ‰ฅ แ‹แ‰ƒแ€แˆ แ‹“แ‰ฅแ‹ช แ‰ฃแˆ…แˆ‹แ‹ŠแŠ• แˆƒแ‹ญแˆ›แŠ–แ‰ณแ‹แŠ• แ‹แŠฎแА แ‰ฅแ‹“แˆ แŠฅแ‹ฉแข [TIR] แŠฅแ‹š แŠฅแŠ•แˆชแŠฆ แ‹˜แˆˆแŠ“ แˆแˆตแˆŠ แŠ“แ‹ญ แ‹ฐแˆ˜แˆซ แˆแˆตแˆŠ แŠฅแŠ•แ‰ตแŠธแ‹แŠ• แŠฃแ‰ฅ แ‰ แ‹“แˆ แˆ˜แˆตแ‰€แˆ แ‹แŠฝแ‰ แˆญ แ‰ แ‹“แˆ แŠฅแ‹ฉแข แˆƒแ‹ญแˆ›แŠ–แ‰ณแ‹Š แ‰ แ‹“แˆแŠ• แ‰ฃแˆ…แˆ‹แ‹แŠ• แˆ‚แ‹ฐแ‰ต แ‹˜แˆˆแ‹Ž แ‹แŠซแ‹จแ‹ต แŠฅแ‹ฉแข แŠซแ‰ฅ แ‹แ‰ฐแˆแˆ‹แˆˆแ‹ฉ แ‹“แ‹ญแАแ‰ต แˆ“แˆฐแˆซแ‰ต แ‹•แŠ•แŒจแ‹ญแ‰ตแ‰ณแ‰ต แ‰ฅแˆแŠฅแŠซแ‰ฅ แ‹แ‰ƒแ€แˆ แ‹“แ‰ฅแ‹ญ แ‰ฃแˆ…แˆ‹แ‹ŠแŠ• แˆƒแ‹ญแˆ›แŠ–แ‰ณแ‹แŠ• แ‹แŠฎแА แ‰ แ‹“แˆ แŠฅแ‹ฉแข
5 Oromo Fakkii kanarraa kan arginu hangafa Oromoo kan taโ€™e godina Booranaa keessatti uffata naannoo godina Booranaatiin faayamanii abbootiin bokkuu isaanii qabatanii, haadholiin immoo siinqee isaanii qabatanii kan dhaabbachaa jiranidha. [ORM] fakkii kanarraa kan arginu hangafa oromoo kan ta'e godina booranaa keessatti uffata naannoo godina booranaatiin faayamanii abbootiin bokkuu isaanii qabatanii haadholiin immoo siiqee isaanii qabatanii kan dhaabbachaa jiranidha.
6 Sidaama Daganna maate yaa mayyaate? Daganna maate yee su'ma fushshihu ayeeti? Daganna maate hiiko heedhanno? Daganna maate yinihu mayiraati? Daganna maate ayira horo uyitanno? [SID] daganna maate yaa mayyaate? daganna maate yee su'ma fushshihu ayeeti? daganna maate hiikko heedhanno? daganna maate yinihu mayraati? daganna maate ayira horo uyitanno

๐ŸŽง Direct Use

from transformers import AutoModelForCTC, AutoProcessor
import torchaudio, torch

processor = AutoProcessor.from_pretrained("badrex/Ethio-ASR-multilingual-600M")
model = AutoModelForCTC.from_pretrained("badrex/Ethio-ASR-multilingual-600M")

audio, sr = torchaudio.load("audio.wav")
inputs = processor(audio.squeeze(), sampling_rate=sr, return_tensors="pt")

with torch.no_grad():
    logits = model(**inputs).logits

pred_ids = torch.argmax(logits, dim=-1)
transcription = processor.batch_decode(pred_ids)[0]

print(transcription)

๐Ÿ”ง Downstream Use

  • Voice assistants
  • Accessibility tools
  • Research baselines

๐Ÿšซ Outโ€‘ofโ€‘Scope Use

  • Languages outside Amharic, Tigrinya, Afaan Oromo, Sidama, and Wolaytta.
  • Highโ€‘stakes deployments without human review
  • Noisy audio without speech enhancement

โš ๏ธ Risks & Limitations

Performance might vary across dialects, genders, ages, and recording quality.

๐Ÿ“Œ Citation

@misc{ethio_asr_2026,
  author = {
    Abdullah, Badr M. and
    Azime, Israel Abebe and
    Tonja, Atnafu Lambebo and
    Alabi, Jesujoba O. and
    Alemu, Abel Mulat and
    Hagos, Eyob G. and
    Balcha, Bontu Fufa and
    Nerea, Mulubrhan A. and
    Yadeta, Debela Desalegn and
    Marilign, Dagnachew Mekonnen and
    Fentahun, Amanuel Temesgen and
    Kebede, Tadesse and
    Gebru, Israel D. and
    Woldeyohannis, Michael Melese and
    Sewunetie, Walelign Tewabe and
    Mรถbius, Bernd and
    Klakow, Dietrich
  },
  title = {Ethio-ASR: Joint Multilingual Speech Recognition and Language Identification for Ethiopian Languages},
  year = {2026},
  howpublished = {\url{https://huggingface.co/badrex/Ethio-ASR-multilingual-600M}}
}
Downloads last month
684
Safetensors
Model size
0.6B params
Tensor type
F32
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for badrex/Ethio-ASR-multilingual-600M

Finetuned
(469)
this model

Dataset used to train badrex/Ethio-ASR-multilingual-600M

Spaces using badrex/Ethio-ASR-multilingual-600M 3

Collection including badrex/Ethio-ASR-multilingual-600M

Paper for badrex/Ethio-ASR-multilingual-600M