Open Agent Leaderboard An open benchmark for comparing full agent systems across diverse real-world tasks. Reports both quality and cost. Running 3 Open Agent Leaderboard 🤖 3 Explore AI agents' performance leaderboard and efficiency chart Running The Open Agent Leaderboard 📊 Compare AI agents' performance and cost across benchmarks open-agent-leaderboard/results Viewer • Updated May 18 • 150 • 169 • 6 open-agent-leaderboard/agent-cards Updated Mar 30 • 12
Open Agent Leaderboard An open benchmark for comparing full agent systems across diverse real-world tasks. Reports both quality and cost. Running 3 Open Agent Leaderboard 🤖 3 Explore AI agents' performance leaderboard and efficiency chart Running The Open Agent Leaderboard 📊 Compare AI agents' performance and cost across benchmarks open-agent-leaderboard/results Viewer • Updated May 18 • 150 • 169 • 6 open-agent-leaderboard/agent-cards Updated Mar 30 • 12