Gaia2 Agents Evaluation Leaderboard
Explore AI model performance on the Gaia2 benchmark.