AI & ML interests

Tools for processing Russian and other languages for the development of corpus linguistics

Recent Activity

morozow  updated a collection 5 days ago
Ruscorpora Morpheme Segmentation Datasets
morozow  updated a collection 5 days ago
Ruscorpora Morpheme Segmentation Datasets
morozow  published a dataset 8 days ago
ruscorpora/morphodict-bel-wordforms
View all activity

Organization Card

Welcome to Russian National Corpus!

The Russian National Corpus is a representative collection of texts in Russian, counting more than 2 bln tokens and completed with linguistic annotation and search tools. We also prepare datasets and develop tools for linguistic markup of languages, primarily Russian. Feel free to visit Russian National Corpus website!

models 0

None public yet