Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
madoss
's Collections
OCR - Models
Machine Translation
Low Res NLP
MT Quality Estimation
Language ID
Synthetic Data Gen
Tokenization
African Languages Datasets
Audio
MT Models
SLM
LLMs Distillation
IE and Entity Linking
NL2SQL Models
Text to sql papers
African Languages Datasets
updated
Apr 5
Upvote
-
google/WaxalNLP
Viewer
•
Updated
11 days ago
•
2.56M
•
35.3k
•
224
openlanguagedata/flores_plus
Viewer
•
Updated
Mar 10
•
893k
•
11.5k
•
134
McGill-NLP/african_celtic_dataset
Viewer
•
Updated
23 days ago
•
57.5k
•
212
•
1
HPLT/HPLT3.0
Updated
Nov 14, 2025
•
141
•
19
google/smol
Viewer
•
Updated
18 days ago
•
842k
•
3.81k
•
108
CohereLabs/Global-MMLU
Viewer
•
Updated
Aug 14, 2025
•
602k
•
42.3k
•
156
allenai/c4
Viewer
•
Updated
Jan 9, 2024
•
10.4B
•
814k
•
572
cis-lmu/Glot500
Viewer
•
Updated
Dec 10, 2025
•
1.23B
•
27.7k
•
43
facebook/omnilingual-asr-corpus
Viewer
•
Updated
Nov 14, 2025
•
548k
•
5.33k
•
201
UBC-NLP/SimbaBench_dataset
Viewer
•
Updated
Feb 27
•
748k
•
2.09k
lelapa/Inkuba-Mono
Viewer
•
Updated
Sep 5, 2024
•
68.8M
•
13
•
14
UBC-NLP/afroscope-data
Viewer
•
Updated
Feb 9
•
18.9M
•
3.38k
27Group/InstructLR_Generate_Datasets
Viewer
•
Updated
Sep 10, 2025
•
417k
•
27
facebook/bouquet
Updated
3 days ago
•
1.39k
•
36
allenai/nllb
Updated
Sep 29, 2022
•
936
•
108
masakhane/mafand
Viewer
•
Updated
Sep 11, 2023
•
143k
•
4.23k
•
15
Upvote
-
Share collection
View history
Collection guide
Browse collections