Claude Code
This is a tracking repo for Claude Code, used by the Open Agent Leaderboard to report evaluation results on Hugging Face.
Claude Code is Anthropic's agentic coding tool. It uses extended thinking, file editing, and shell execution to solve tasks autonomously.
- Framework: claude-code
- Leaderboard: Open Agent Leaderboard
- Paper: arXiv:2602.22953
Evaluation results
- open-agent-leaderboard/results
- Overall
- model: Claude Opus 4.5, score: 0.67 *
- model: DeepSeek V3.2, score: 0.42 *
- model: GPT-5.2, score: 0.39 *
