Spaces:
Running
Running
Commit History
Upload from GitHub Actions: model name no bracket stuff aa92add verified
Upload from GitHub Actions: drop normalization 972026c verified
Upload from GitHub Actions: improve norwegian fix 6f0e312 verified
Upload from GitHub Actions: add filters 3018273 verified
Upload from GitHub Actions: Merge pull request #22 from datenlabor-bmz/dev 2cdada4 verified
Upload from GitHub Actions: Add auto-translated datasets 68a93b5 verified
Upload from GitHub Actions: Merge pull request #18 from datenlabor-bmz/pr-17 a0d1624 verified
Upload from GitHub Actions: Add auto-translated datasets c790fdb verified
Upload from GitHub Actions: updated frontend and backend to fix bugs 4e8cb1a verified
Upload from GitHub Actions: Merge pull request #9 from datenlabor-bmz/jn-dev 7c06aef verified
Upload from GitHub Actions: fixed Type Error 71ab1e9 verified
Upload from GitHub Actions: Merge pull request #8 from datenlabor-bmz/jn-dev 3665390 verified
Upload from GitHub Actions: Merge pull request #5 from datenlabor-bmz/jn-dev abd65a6 verified
Upload from GitHub Actions: Fix crashes when searching low-resource languages fe700d4 verified
Upload from GitHub Actions: Get more results, compute average based on all tasks 98c6811 verified
Upload from GitHub Actions: Correlation plot b0aa389 verified
Upload from GitHub Actions: Fix linter problems in frontend e8341d2 verified
Upload from GitHub Actions: More models and languages a73f888 verified
Upload from GitHub Actions: Improve UX and style 70582ce verified
Upload from GitHub Actions: Improve UX and style 53d2039 verified
Upload from GitHub Actions: Merge remote changes with local frontend updates 760c6c6 verified
Upload from GitHub Actions: Merge remote changes and apply terminology updates: Commercial->closed-source, Open->open-source ebaf279 verified
Upload from GitHub Actions: Use task subset for average score b1e5b40 verified
Upload from GitHub Actions: Eavaluate on 40 languages 941d5c5 verified
Upload from GitHub Actions: Add math benchmarks 549360a verified
Upload from GitHub Actions: Quick fixes 9c2c019 verified
Upload from GitHub Actions: Display N/A scores as such 1e8952a verified
Add symbols for progress plot 68e918f
David Pomerenke commited on
Display more language names de40d0a
David Pomerenke commited on
Run on 40 languages, additional models 260c1a3
David Pomerenke commited on
Add scores to world map hover title 3680a5f
David Pomerenke commited on
Fix response when no evals data is available c856043
David Pomerenke commited on
Remove unnecessary function a5cf2d9
David Pomerenke commited on
Fix: sort copy, not in place 2eeba23
David Pomerenke commited on
Improve plots and dataset table a9e6b9b
David Pomerenke commited on
Add model history plot f52ec6e
David Pomerenke commited on
Add nice cumulative language population plot b54f543
David Pomerenke commited on
Implement MMLU task a683732
David Pomerenke commited on
Add dataset metadata about human/machine translation d8f2dee
David Pomerenke commited on
Refactor score columns 4106f13
David Pomerenke commited on
Translation both from and to 731eddd
David Pomerenke commited on
Add OpenRouter metadata to models 9002fc2
David Pomerenke commited on
Run on 100 languages, adjust display 8274634
David Pomerenke commited on
Dataset table grouping 9051509
David Pomerenke commited on
Adjust font sizes 51cb38c
David Pomerenke commited on
Add Dockerfile 4d13673
David Pomerenke commited on
Fix world map and apply filters for it 92d8154
David Pomerenke commited on
AutoComplete improvements and examples a3e21c6
David Pomerenke commited on
Speed things up 566c57e
David Pomerenke commited on