Catherine Arnett

catherinearnett

AI & ML interests

multilingual NLP, tokenization

Recent Activity

updated a dataset about 12 hours ago
catherinearnett/monolingual_tokenizers
updated a dataset about 13 hours ago
catherinearnett/bilingual_tokenizers2
published a dataset about 13 hours ago
catherinearnett/monolingual_tokenizers
View all activity

Organizations

Blog-explorers's profile picture Language and Cognition Lab (UCSD)'s profile picture Common Crawl Foundation's profile picture Beetles's profile picture