Paper • 2602.22953 • Published • 12
OpenAI Solo
This is a tracking repo for OpenAI Solo, used by the Open Agent Leaderboard to report evaluation results on HuggingFace.
OpenAI's Agent SDK for building single-agent workflows with tool use and structured outputs.
- Framework: openai-agents-python
- Leaderboard: Open Agent Leaderboard
- Paper: arXiv:2602.22953
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support
Paper for Exgentic/openai-solo
Evaluation results
- open-agent-leaderboard/results
- Overall
- model: Claude Opus 4.5 View evaluation results source 0.73 *
- model: DeepSeek V3.2 View evaluation results source 0.32 *
- model: GPT-5.2 View evaluation results source 0.39 *
