metadata
tags:
- agent
- agent-evaluation
- agent-card
Claude Code
This tracking repo is used by the Open Agent Leaderboard to report Claude Code evaluation results on Hugging Face.
Claude Code is Anthropic's agentic coding tool. It uses extended thinking, file editing, and shell execution to solve tasks autonomously.
- Framework: claude-code
- Leaderboard: Open Agent Leaderboard
- Paper: arXiv:2602.22953