Running Agents 95 Nexus Function Calling Leaderboard π 95 Display benchmark results for models on various tasks
Running on CPU Upgrade Agents 604 GAIA Leaderboard π¦Ύ 604 Submit and score your model on the GAIA benchmark