ngusadeep 's Collections

Swahili Datasets

~1.69M raw Swahili text samples from news, government, education, and legal domains, ideal for LLM pretraining and unsupervised NLP research.