GAIA release - a gaia-benchmark Collection

gaia-benchmark 's Collections

GAIA release

updated Nov 23, 2023

Gather the items of the GAIA release

GAIA: a benchmark for General AI Assistants

Paper • 2311.12983 • Published Nov 21, 2023 • 249

Note The arxiv paper (arxiv.org/abs/2311.12983) describing the benchmark and dataset creation methodology.
Running on CPU Upgrade

Agents

613

GAIA Leaderboard

🦾

613

Submit and view GAIA model evaluation leaderboard

Note The leaderboard itself with the scored models and information on how to submit a new model.
gaia-benchmark/GAIA

Viewer • Updated Oct 28, 2025 • 932 • 14.4k • 721

Note The dataset with questions for the GAIA benchmark.
gaia-benchmark/results_public

Viewer • Updated about 13 hours ago • 3.59k • 3k • 25

Note Open dataset of submission results.