Nexus Function Calling Leaderboard (Agents, Running, 95): Display benchmark results for models on various tasks
GAIA Leaderboard (Agents, Running on CPU Upgrade, 611): Submit and view GAIA model evaluation leaderboard