Tokenizers for linguistic families
WikiLangs
community
AI & ML interests
Wikilangs is an open-source initiative to democratize access to natural language processing models for every language represented on Wikipedia. A project by @OmarKamali, graciously sponsored by Featherless.ai.
models
323
wikilangs/tokenizers_uralic-finnic
wikilangs/tokenizers_austronesian-malay
wikilangs/tokenizers_bantu-all
wikilangs/tokenizers_atlantic-gur
wikilangs/vi (Text Generation)
wikilangs/tr (Text Generation)
wikilangs/th (Text Generation)
wikilangs/sl (Text Generation)
wikilangs/sh (Text Generation)
wikilangs/ro (Text Generation)
datasets
0
None public yet