Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
fair-forward
/
languagebench
like
19
Running
App
Files
Files
Community
1
Fetching metadata from the HF Docker repository...
f8a3dad
languagebench
/
evals
/
tasks.py
Commit History
Implement MMLU task
a683732
David Pomerenke
commited on
Apr 18, 2025
MMLU data loader for 3 parallel datasets
47170a5
David Pomerenke
commited on
Apr 18, 2025
Add Global MMLU benchmark
ce2acb0
David Pomerenke
commited on
Apr 17, 2025
Translation both from and to
731eddd
David Pomerenke
commited on
Apr 13, 2025
Run on 100 languages, adjust display
8274634
David Pomerenke
commited on
Apr 6, 2025
spBLEU tokenizer, run on more languages
eaf2d97
David Pomerenke
commited on
Mar 25, 2025
Refactor eval code into files
da6e1bc
David Pomerenke
commited on
Mar 15, 2025