Hugging Face
Catherine Arnett
catherinearnett
112 followers · 38 following
https://catherinearnett.github.io/
linguist_cat
catherinearnett
catherinearnett.bsky.social
AI & ML interests
multilingual NLP, tokenization
Recent Activity
updated a Space about 4 hours ago: catherinearnett/multiblimp-leaderboard
published a Space about 4 hours ago: catherinearnett/multiblimp-leaderboard
updated a collection about 4 hours ago: Multilingual Leaderboards
Organizations
catherinearnett's datasets (31)
Sort: Recently updated
catherinearnett/apertus_multiblimp • Updated about 5 hours ago • 266
catherinearnett/bilingual_tokenizers • Updated 6 days ago • 841 • 1
catherinearnett/low_resource_clean • Viewer • Updated 7 days ago • 1.74M • 34
catherinearnett/low_german • Viewer • Updated 7 days ago • 97.5k • 12
catherinearnett/komi_permyak • Viewer • Updated 7 days ago • 1.25k • 13
catherinearnett/komi_zyrian • Viewer • Updated 7 days ago • 22.5k • 10
catherinearnett/erzya • Viewer • Updated 7 days ago • 30.8k • 15
catherinearnett/veps • Viewer • Updated 7 days ago • 9.07k • 15
catherinearnett/moksha • Viewer • Updated 7 days ago • 4.06k • 14
catherinearnett/blimps_evals • Updated 9 days ago • 4
catherinearnett/urubu_kaapor • Viewer • Updated 10 days ago • 7.75k • 10
catherinearnett/warlpiri • Viewer • Updated 10 days ago • 9.45k • 11
catherinearnett/zacatlan_nahuatl • Viewer • Updated 10 days ago • 7.96k • 11
catherinearnett/skolt_sami • Viewer • Updated 10 days ago • 2.48k • 10
catherinearnett/livvi • Viewer • Updated 10 days ago • 8.31k • 14
catherinearnett/karelian • Viewer • Updated 10 days ago • 6.78k • 17
catherinearnett/kiche • Viewer • Updated 10 days ago • 4.77k • 13
catherinearnett/hp_nahuatl • Viewer • Updated 11 days ago • 150k • 14
catherinearnett/kangri • Viewer • Updated 11 days ago • 198k • 34
catherinearnett/gothic • Viewer • Updated 11 days ago • 4.48k • 19
catherinearnett/hittite • Viewer • Updated 13 days ago • 145 • 14
catherinearnett/abkhazian • Viewer • Updated 13 days ago • 129 • 14
catherinearnett/classical_armenian • Viewer • Updated 13 days ago • 1.35k • 15
catherinearnett/ancient_egyptian • Viewer • Updated 13 days ago • 118k • 20 • 1
catherinearnett/old_church_slavonic • Viewer • Updated 14 days ago • 260k • 53
catherinearnett/gheg_albanian • Viewer • Updated 14 days ago • 3.12k • 17
catherinearnett/classical_armenian_pd • Viewer • Updated 28 days ago • 102 • 84
catherinearnett/bilingual-tokenizer-training-data • Viewer • Updated Feb 21 • 30.7M • 1.63k
catherinearnett/montok • Updated Sep 19, 2025 • 4.55k • 3
catherinearnett/morphscore • Viewer • Updated Jul 10, 2025 • 5.09M • 134 • 4