A physical commonsense reasoning benchmark for 100+ languages, written in collaboration with 300+ researchers from 65 countries.
Catherine Arnett
catherinearnett
AI & ML interests
multilingual NLP, tokenization
Recent Activity
updated a dataset about 6 hours ago
catherinearnett/monolingual_tokenizers updated a dataset about 7 hours ago
catherinearnett/bilingual_tokenizers2 published a dataset about 7 hours ago
catherinearnett/monolingual_tokenizersOrganizations
Multilingual Leaderboards
Leaderboards for languages other than English
- Running on CPU UpgradeAgents76
La Leaderboard
🌸76Evaluate open LLMs in the languages of LATAM and Spain.
- Running on CPU UpgradeAgents125
Open Chinese LLM Leaderboard
🏆125Explore LLM benchmark scores and submit your model
- Running on CPU UpgradeAgents180
Open Arabic LLM Leaderboard
🏆180Track, rank and evaluate open Arabic LLMs and chatbots
- Build errorAgents40
OpenLLM French leaderboard 🇫🇷
🥇40Explore and submit LLM benchmarks
Low Resource Language Datasets
B-GPT
Bilingual GPT-2 models with checkpoints
-
catherinearnett/B-GPT_en_nl_simultaneous
Text Generation • 0.1B • Updated • 1.68k -
catherinearnett/B-GPT_nl_en_simultaneous
Text Generation • 0.1B • Updated • 1.45k -
catherinearnett/B-GPT_en_nl_sequential
Text Generation • 0.1B • Updated • 1.13k -
catherinearnett/B-GPT_nl_en_sequential
Text Generation • 0.1B • Updated • 1.07k
Monolingual Models with Checkpoints
Global PIQA
A physical commonsense reasoning benchmark for 100+ languages, written in collaboration with 300+ researchers from 65 countries.
B-GPT
Bilingual GPT-2 models with checkpoints
-
catherinearnett/B-GPT_en_nl_simultaneous
Text Generation • 0.1B • Updated • 1.68k -
catherinearnett/B-GPT_nl_en_simultaneous
Text Generation • 0.1B • Updated • 1.45k -
catherinearnett/B-GPT_en_nl_sequential
Text Generation • 0.1B • Updated • 1.13k -
catherinearnett/B-GPT_nl_en_sequential
Text Generation • 0.1B • Updated • 1.07k
Multilingual Leaderboards
Leaderboards for languages other than English
- Running on CPU UpgradeAgents76
La Leaderboard
🌸76Evaluate open LLMs in the languages of LATAM and Spain.
- Running on CPU UpgradeAgents125
Open Chinese LLM Leaderboard
🏆125Explore LLM benchmark scores and submit your model
- Running on CPU UpgradeAgents180
Open Arabic LLM Leaderboard
🏆180Track, rank and evaluate open Arabic LLMs and chatbots
- Build errorAgents40
OpenLLM French leaderboard 🇫🇷
🥇40Explore and submit LLM benchmarks
Monolingual Models with Checkpoints
Low Resource Language Datasets