Aletheia-ng/amharic-pretraining-corpus
• Updated • 600k • 36
• Updated • 690M • 210
• Updated • 11M • 2.13k
• Updated • 12.2M • 5
Aletheia-ng/processed_data
• Updated • 2.81M • 6
• Updated • 94.8M • 16
• Updated • 158M • 1.95k
• Updated • 200M • 25
Aletheia-ng/pidgin-corpus-synth
• Updated • 57.1k • 17
Aletheia-ng/yoruba-corpus-synth
• Updated • 20.2k • 5
Aletheia-ng/nigerian-pidgin-corpus-synth
Updated • 11
Aletheia-ng/pretrain_data10
• Updated • 40.9M • 16
Aletheia-ng/low_resource_languages_pretrain_data4
• Updated • 469M • 298
Aletheia-ng/low_resource_languages_pretrain_data5
• Updated • 212M • 114
Aletheia-ng/pretrain_data11
Aletheia-ng/pretrain_data9
• Updated • 79.1M • 155
Aletheia-ng/pretrain_data5
• Updated • 9.43M • 15
Aletheia-ng/pretrain_data4
• Updated • 124M • 40
Aletheia-ng/pretrain_data7
• Updated • 13M • 26
Aletheia-ng/pretrain_data3
• Updated • 143M • 170
Aletheia-ng/low_resource_languages_pretrain_data2
• Updated • 587M • 353
Aletheia-ng/low_resource_languages_pretrain_data
• Updated • 734M • 317
Aletheia-ng/pretrain_data6
• Updated • 205M • 209
• Updated • 136 • 8
Aletheia-ng/pretrain_data
• Updated • 109M • 16
Aletheia-ng/pretrain_data2
• Updated • 18.2M • 30
Aletheia-ng/low_resource_languages_pretrain
• Updated • 202M • 688 • 1
Aletheia-ng/masakhaner_eval