Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
7
7
15
Catherine Arnett
catherinearnett
Follow
mrajbrahma's profile picture
pszemraj's profile picture
derguene's profile picture
113 followers
·
38 following
https://catherinearnett.github.io/
linguist_cat
catherinearnett
catherinearnett.bsky.social
AI & ML interests
multilingual NLP, tokenization
Recent Activity
updated
a dataset
about 19 hours ago
catherinearnett/bilingual_tokenizers2
published
a dataset
2 days ago
catherinearnett/bilingual_tokenizers
updated
a dataset
2 days ago
catherinearnett/monolingual_tokenizers
View all activity
Organizations
catherinearnett
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a dataset
28 days ago
mrlbenchmarks/global-piqa-parallel
Viewer
•
Updated
6 days ago
•
13.5k
•
1.52k
•
8
liked
a dataset
3 months ago
commoncrawl/CommonLID
Viewer
•
Updated
Feb 10
•
373k
•
239
•
52
liked
4 datasets
4 months ago
aaparajit02/punjabi-asr
Viewer
•
Updated
Jul 23, 2023
•
39.2k
•
101
•
3
aznlp/azerbaijani-blogs
Viewer
•
Updated
Apr 14, 2024
•
6.93k
•
23
•
3
MWirelabs/assamese-monolingual-corpus
Viewer
•
Updated
Nov 13, 2025
•
1.61M
•
32
•
1
Atnafu/Afri-MCQA
Viewer
•
Updated
Jan 15
•
15.3k
•
600
•
17
liked
a dataset
6 months ago
mrlbenchmarks/global-piqa-nonparallel
Viewer
•
Updated
6 days ago
•
13.5k
•
10.9k
•
35
liked
a dataset
8 months ago
nlip/DIWALI
Viewer
•
Updated
22 days ago
•
8.82k
•
175
•
6
liked
4 datasets
10 months ago
classla/ParlaSpeech-PL
Viewer
•
Updated
Jul 2, 2025
•
531k
•
149
•
6
classla/ParlaSpeech-HR
Viewer
•
Updated
Jul 2, 2025
•
868k
•
364
•
5
classla/ParlaSpeech-CZ
Viewer
•
Updated
Jul 2, 2025
•
711k
•
1.87k
•
5
classla/ParlaSpeech-RS
Viewer
•
Updated
Dec 1, 2025
•
278k
•
439
•
4
liked
a dataset
11 months ago
filbench/UD_Tagalog-NewsCrawl
Viewer
•
Updated
Jul 23, 2025
•
15.6k
•
52
•
1
liked
a dataset
about 1 year ago
jumelet/multiblimp
Viewer
•
Updated
May 16, 2025
•
121k
•
7.7k
•
17
liked
a dataset
almost 2 years ago
ambean/lingOly
Viewer
•
Updated
Jun 11, 2024
•
90
•
6.05k
•
9