google/WaxalNLP
Viewer • Updated • 2.56M • 34.7k • 224
How to use badrex/Ethio-ASR-sidaama with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="badrex/Ethio-ASR-sidaama") # Load model directly
from transformers import AutoProcessor, AutoModelForCTC
processor = AutoProcessor.from_pretrained("badrex/Ethio-ASR-sidaama")
model = AutoModelForCTC.from_pretrained("badrex/Ethio-ASR-sidaama")
Ethio-ASR is a suite of Automatic Speech Recognition (ASR) models for Ethiopian languages. This repo contains a monolingual Sidaama ASR model based on wav2vec2‑bert-2.0, fine-tuned on the Sidaama subset of the WAXAL Speech Dataset.
📌 ASR model in this HF repo
| Model | # Params | Sidaama WER (↓) |
|---|---|---|
| Ethio-ASR (afrihubert) | 94M | 34.02 |
| Ethio-ASR (mms-300) | 300M | 32.66 |
| Ethio-ASR (mms-1b) | 1B | 31.02 |
| Ethio-ASR (w2v-bert-2.0) | 600M | 31.65 |
| Monolingual SFT (w2v-bert-2.0) 📌 | 600M | 30.04 |
from transformers import AutoModelForCTC, AutoProcessor
import torchaudio, torch
processor = AutoProcessor.from_pretrained("badrex/Ethio-ASR-sidaama")
model = AutoModelForCTC.from_pretrained("badrex/Ethio-ASR-sidaama")
audio, sr = torchaudio.load("audio.wav")
inputs = processor(audio.squeeze(), sampling_rate=sr, return_tensors="pt")
with torch.no_grad():
logits = model(**inputs).logits
pred_ids = torch.argmax(logits, dim=-1)
transcription = processor.batch_decode(pred_ids)[0]
print(transcription)
Performance might vary across dialects, genders, ages, and recording quality.
@misc{ethio_asr_2026,
author = {
Abdullah, Badr M. and
Azime, Israel Abebe and
Tonja, Atnafu Lambebo and
Alabi, Jesujoba O. and
Alemu, Abel Mulat and
Hagos, Eyob G. and
Balcha, Bontu Fufa and
Nerea, Mulubrhan A. and
Yadeta, Debela Desalegn and
Marilign, Dagnachew Mekonnen and
Fentahun, Amanuel Temesgen and
Kebede, Tadesse and
Gebru, Israel D. and
Woldeyohannis, Michael Melese and
Sewunetie, Walelign Tewabe and
Möbius, Bernd and
Klakow, Dietrich
},
title = {Ethio-ASR: Joint Multilingual Speech Recognition and Language Identification for Ethiopian Languages},
year = {2026},
howpublished = {\url{https://huggingface.co/badrex/Ethio-ASR-multilingual-600M}}
}
Base model
facebook/w2v-bert-2.0