VOOZH about

URL: https://thenewstack.io/openai-codex-claude-code/

⇱ I tested the new OpenAI Codex features on a real Python codebase, and it's the strongest Claude Code rival yet - The New Stack


TNS
SUBSCRIBE
Join our community of software engineering leaders and aspirational developers. Always stay in-the-know by getting the most important news and exclusive content delivered fresh to your inbox to learn more about at-scale software development.
REQUIRED
It seems that you've previously unsubscribed from our newsletter in the past. Click the button below to open the re-subscribe form in a new tab. When you're done, simply close that tab and continue with this form to complete your subscription.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.
Welcome and thank you for joining The New Stack community!
Please answer a few simple questions to help us deliver the news and resources you are interested in.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Great to meet you!
Tell us a bit about your job so we can cover the topics you find most relevant.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Welcome!

We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.

What’s next?

Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.

Follow TNS on your favorite social media networks.

Become a TNS follower on LinkedIn.

Check out the latest featured and trending stories while you wait for your first TNS newsletter.

PREV
1 of 2
NEXT
VOXPOP
As a JavaScript developer, what non-React tools do you use most often?
Angular
0%
Astro
0%
Svelte
0%
Vue.js
0%
Other
0%
I only use React
0%
I don't use JavaScript
0%
Thanks for your opinion! Subscribe below to get the final results, published exclusively in our TNS Update newsletter:
NEW! Try Stackie AI
From clobbered drafts to real-time sync
Apr 14th 2026 10:00am, by David Moore
TypeScript 6.0 RC arrives as a bridge to a faster future
Mar 14th 2026 9:00am, by Darryl K. Taft
Mastra empowers web devs to build AI agents in TypeScript
Jan 28th 2026 11:00am, by Loraine Lawson
2026-05-07 11:27:18
I tested the new OpenAI Codex features on a real Python codebase, and it's the strongest Claude Code rival yet
AI Agents / AI Engineering / Developer tools

I tested the new OpenAI Codex features on a real Python codebase, and it’s the strongest Claude Code rival yet

OpenAI has updated Codex with computer use, an in-app browser, and PR reviews. Testing on HTTPie revealed how these features handle real-world Python bug fixes.
May 7th, 2026 11:27am by Jessica Wachtel
👁 Featued image for: I tested the new OpenAI Codex features on a real Python codebase, and it’s the strongest Claude Code rival yet
Illustration by Alghozy on Unsplash.

Late last month, OpenAI shipped a new product they dubbed “Codex for (almost) everything.” Its goal was to shift Codex from a code editing product to a general-purpose tool.

The “Codex for (almost) everything” launch included features such as computer use, an in-app browser, PR review, SSH connections to remote dev boxes, and over 90 new plugins. More than 3 million developers use Codex every week, and many of them probably opened the app the next morning and didn’t notice anything different.

I noticed. And now I have opinions.

The setup

I spent a day testing three of the biggest new features against the same real-world codebase I used to test Cursor 3 and Claude Code in this article (but tested a different bug). The test codebase is HTTPie, a popular open source Python CLI tool for making HTTP requests.

I ran my tests on the Codex desktop app. I was able to access all its new features with my OpenAI account. I use the free version. Computer use requires a Mac and specific system permissions (more on that later).

Codex in-app browser

This is the feature I was most excited to test because it changes the prompt workflow entirely. Instead of copying and pasting a bug description into the chat, I opened the GitHub issue directly inside Codex and pointed the agent at it. The browser is a plugin, so there isn’t an option for the browser in the initial pop-up menu in the text box. You’ll find it once you select the plugin option. Yes, that tripped me up. No, I don’t want to talk about it.

I opened GitHub issue #1665 in the in-app browser and typed:

“I have the GitHub issue open in the browser. Please read it and fix the bug described there.”

And I was very happy with everything that followed. The page opened in a split-screen layout, directly beside the chat interface.

Codex understands not only the task at hand but also the codebase itself.

Codex fixed it in 3 minutes. It read the issue, traced the bug to three files in the codebase, wrote a fix, added a regression test, and ran the relevant tests. It also noticed that downloads.py had unrelated changes from my earlier testing and explicitly left it untouched. That let me know Codex understands not only the task at hand but also the codebase itself.

Codex computer use test

Computer use is a big feature. Codex can now see your screen, move its own cursor, click, and type in apps on your Mac. I had mixed feelings about granting Codex that much access, but I was willing to try for the sake of this article.

A coding agent with unrestricted terminal access is a security risk.

I granted screen recording and accessibility permissions and asked Codex to open Terminal, navigate to the HTTPie repo, and fix the same bug hands-free. Codex immediately flagged that Terminal.app is blocked from computer use in this session, citing security reasons. It completed the task using its built-in shell instead. I think this limitation is for the best. A coding agent with unrestricted terminal access is a security risk.

I moved to a less risky task. I asked Codex to open Finder, navigate to the project folder, and take a screenshot. It navigated the folder correctly, but the screenshot capture failed. It generated a text-based rendering of the folder contents instead and acknowledged the limitation. The text-based rendering was accurate, though.

Computer use is real, and it works for GUI tasks such as running desktop apps, performing browser-based workflows, and visual UI testing. For terminal-heavy developer workflows, I wouldn’t recommend it. For frontend developers testing UI flows or operating desktop apps, it may prove more beneficial.

Codex pull request review test

For this test, I pushed a branch with a fixed bug to my GitHub fork and asked Codex to review the PR. Codex read the pull request, confirmed the fix was conceptually correct, and cited the relevant urllib3 and Requests documentation to support its assessment. It then ran the specific regression tests and flagged a genuine gap in the test coverage. The mock test doesn’t exercise real gzip streaming end-to-end, it said, and suggested a follow-up integration test.

The sandbox blocked the full test suite again due to port binding restrictions. This keeps showing up across every tool I have tested, and it is a real limitation of sandboxed agent environments.

It wasn’t perfect, but it worked. The PR review read a real GitHub PR, cited documentation, performed tests, and gave actionable feedback.

Codex is now more than the standard coding agent we’ve come to know and love in the past year. Though not without room for improvement (looking at you, computer use), it’s the most complete alternative to Claude Code.

TRENDING STORIES
Jessica Wachtel is a developer marketing writer at InfluxData where she creates content that helps make the world of time series data more understandable and accessible. Jessica has a background in software development and technical journalism.
Read more from Jessica Wachtel
SHARE THIS STORY
TRENDING STORIES
TNS owner Insight Partners is an investor in: OpenAI.
SHARE THIS STORY
TRENDING STORIES
TNS DAILY NEWSLETTER Receive a free roundup of the most recent TNS articles in your inbox each day.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.