Spaces: ar08/zzz
Paused
Branch: main
Path: zzz/evaluation/benchmarks
6.23 MB, 1 contributor
History: 1 commit
Latest commit: ar08, "Upload 1040 files", 246d201 (verified), about 1 year ago
Name                 Last commit        Updated
EDA                  Upload 1040 files  about 1 year ago
agent_bench          Upload 1040 files  about 1 year ago
aider_bench          Upload 1040 files  about 1 year ago
biocoder             Upload 1040 files  about 1 year ago
bird                 Upload 1040 files  about 1 year ago
browsing_delegation  Upload 1040 files  about 1 year ago
commit0_bench        Upload 1040 files  about 1 year ago
discoverybench       Upload 1040 files  about 1 year ago
gaia                 Upload 1040 files  about 1 year ago
gorilla              Upload 1040 files  about 1 year ago
gpqa                 Upload 1040 files  about 1 year ago
humanevalfix         Upload 1040 files  about 1 year ago
logic_reasoning      Upload 1040 files  about 1 year ago
miniwob              Upload 1040 files  about 1 year ago
mint                 Upload 1040 files  about 1 year ago
ml_bench             Upload 1040 files  about 1 year ago
scienceagentbench    Upload 1040 files  about 1 year ago
swe_bench            Upload 1040 files  about 1 year ago
the_agent_company    Upload 1040 files  about 1 year ago
toolqa               Upload 1040 files  about 1 year ago
webarena             Upload 1040 files  about 1 year ago