Aletheia-ng/pidgin-corpus-synth
Viewer
•
Updated
•
57.1k
•
69
Aletheia-ng/yoruba-corpus-synth
Viewer
•
Updated
•
20.2k
•
23
Aletheia-ng/nigerian-pidgin-corpus-synth
Updated
•
2
Aletheia-ng/pretrain_data10
Viewer
•
Updated
•
40.9M
•
8
Aletheia-ng/low_resource_languages_pretrain_data4
Viewer
•
Updated
•
469M
•
4
Aletheia-ng/pretrain_data11
Updated
Aletheia-ng/pretrain_data9
Viewer
•
Updated
•
79.1M
•
2
Aletheia-ng/pretrain_data5
Viewer
•
Updated
•
9.43M
•
2
Aletheia-ng/pretrain_data4
Viewer
•
Updated
•
124M
•
23
Aletheia-ng/pretrain_data7
Viewer
•
Updated
•
13M
Aletheia-ng/pretrain_data3
Viewer
•
Updated
•
143M
•
80
Viewer
•
Updated
•
136
•
1
Aletheia-ng/pretrain_data
Viewer
•
Updated
•
109M
•
33
Aletheia-ng/pretrain_data2
Viewer
•
Updated
•
18.2M
•
21
Aletheia-ng/low_resource_languages_pretrain
Viewer
•
Updated
•
202M
•
1.29k
•
1
Aletheia-ng/masakhaner_eval
Aletheia-ng/noisy_dataset
Viewer
•
Updated
•
84k
•
1
Viewer
•
Updated
•
84k
•
3
Aletheia-ng/personal_finance_v0.2
Viewer
•
Updated
•
56.6k
•
6
•
1
Aletheia-ng/bloomberg-news-articles-pretraining-dataset
Viewer
•
Updated
•
437k
•
3
•
5
Aletheia-ng/ChatML-aya_dataset
Viewer
•
Updated
•
202k
•
3
Aletheia-ng/yo_wiki_processed
Viewer
•
Updated
•
43.5k
•
1
Viewer
•
Updated
•
270k
•
2
Viewer
•
Updated
•
4.4k
•
1
Viewer
•
Updated
•
43.5k
•
2
Viewer
•
Updated
•
288
•
1
Viewer
•
Updated
•
1.01k
•
4
Viewer
•
Updated
•
3.67k
•
2