General Agent Evaluation
Paper • 2602.22953 • Published • 12
This is a tracking repo for OpenAI Solo, used by the Open Agent Leaderboard to report evaluation results on HuggingFace.
OpenAI's Agent SDK for building single-agent workflows with tool use and structured outputs.