CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation Paper • 2505.24456 • Published May 30, 2025
AfroXLMR-Social: Adapting Pre-trained Language Models for African Languages Social Media Text Paper • 2503.18247 • Published Mar 24, 2025
Afri-MCQA: Multimodal Cultural Question Answering for African Languages Paper • 2601.05699 • Published Jan 9 • 3
Accept or Deny? Evaluating LLM Fairness and Performance in Loan Approval across Table-to-Text Serialization Approaches Paper • 2508.21512 • Published Aug 29, 2025
Ethio-ASR: Joint Multilingual Speech Recognition and Language Identification for Ethiopian Languages Paper • 2603.23654 • Published Mar 24
AfrIFact: Cultural Information Retrieval, Evidence Extraction and Fact Checking for African Languages Paper • 2604.00706 • Published Apr 1
EthioLLM: Multilingual Large Language Models for Ethiopian Languages with Task Evaluation Paper • 2403.13737 • Published Mar 20, 2024
The Esethu Framework: Reimagining Sustainable Dataset Governance and Curation for Low-Resource Languages Paper • 2502.15916 • Published Feb 21, 2025 • 1
MasakhaNEWS: News Topic Classification for African languages Paper • 2304.09972 • Published Apr 19, 2023
CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation Paper • 2505.24456 • Published May 30, 2025
A Case Against Implicit Standards: Homophone Normalization in Machine Translation for Languages that use the Ge'ez Script Paper • 2507.15142 • Published Jul 20, 2025
Afri-MCQA: Multimodal Cultural Question Answering for African Languages Paper • 2601.05699 • Published Jan 9 • 3
Afri-MCQA: Multimodal Cultural Question Answering for African Languages Paper • 2601.05699 • Published Jan 9 • 3
MasakhaNER: Named Entity Recognition for African Languages Paper • 2103.11811 • Published Mar 22, 2021
Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets Paper • 2103.12028 • Published Mar 22, 2021 • 3
Enhancing Amharic-LLaMA: Integrating Task Specific and Generative Datasets Paper • 2402.08015 • Published Feb 12, 2024 • 1
CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark Paper • 2406.05967 • Published Jun 10, 2024 • 6
EthioLLM: Multilingual Large Language Models for Ethiopian Languages with Task Evaluation Paper • 2403.13737 • Published Mar 20, 2024
Uhura: A Benchmark for Evaluating Scientific Question Answering and Truthfulness in Low-Resource African Languages Paper • 2412.00948 • Published Dec 1, 2024