VOOZH about

URL: https://thenewstack.io/openai-says-its-new-codex-max-model-is-better-faster-and-cheaper/

⇱ OpenAI Says Its New Codex-Max Model Is Better, Faster and Cheaper - The New Stack


TNS
SUBSCRIBE
Join our community of software engineering leaders and aspirational developers. Always stay in-the-know by getting the most important news and exclusive content delivered fresh to your inbox to learn more about at-scale software development.
REQUIRED
It seems that you've previously unsubscribed from our newsletter in the past. Click the button below to open the re-subscribe form in a new tab. When you're done, simply close that tab and continue with this form to complete your subscription.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.
Welcome and thank you for joining The New Stack community!
Please answer a few simple questions to help us deliver the news and resources you are interested in.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Great to meet you!
Tell us a bit about your job so we can cover the topics you find most relevant.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Welcome!

We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.

What’s next?

Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.

Follow TNS on your favorite social media networks.

Become a TNS follower on LinkedIn.

Check out the latest featured and trending stories while you wait for your first TNS newsletter.

PREV
1 of 2
NEXT
VOXPOP
As a JavaScript developer, what non-React tools do you use most often?
Angular
0%
Astro
0%
Svelte
0%
Vue.js
0%
Other
0%
I only use React
0%
I don't use JavaScript
0%
Thanks for your opinion! Subscribe below to get the final results, published exclusively in our TNS Update newsletter:
NEW! Try Stackie AI
From clobbered drafts to real-time sync
Apr 14th 2026 10:00am, by David Moore
TypeScript 6.0 RC arrives as a bridge to a faster future
Mar 14th 2026 9:00am, by Darryl K. Taft
Mastra empowers web devs to build AI agents in TypeScript
Jan 28th 2026 11:00am, by Loraine Lawson
2025-11-19 10:01:32
OpenAI Says Its New Codex-Max Model Is Better, Faster and Cheaper
AI / AI Agents / Large Language Models

OpenAI Says Its New Codex-Max Model Is Better, Faster and Cheaper

OpenAI launched a new variant of its Codex frontier models for coding tasks today that it says is smarter and will save developers time and money.
Nov 19th, 2025 10:01am by Frederic Lardinois
👁 Featued image for: OpenAI Says Its New Codex-Max Model Is Better, Faster and Cheaper
OpenAI CEO Sam Altman on stage at the company’s DevDay 2025 (Credit: The New Stack).

OpenAI today launched GPT-5.1-Codex-Max, a new variant of its GPT-5.1-Codex foundation model that was specifically trained to excel at coding tasks and which powers OpenAI’s Codex agent.

The original Codex model launched about two months ago and, at the time, it was extremely competitive — and often led the competition — on most benchmarks. But nobody in this field is standing still. OpenAI itself launched the 5.1 versions of its GPT models, including Codex, just a few days ago, and Google’s Gemini 3, which launched earlier this week, also pushed the envelope for coding with frontier models.

Codex-Max, OpenAI said, was specifically trained on agentic tasks related to software engineering, math, research  and more. It’s meant to handle long-running tasks; OpenAI stressed that this is also the first model it trained to work across multiple context windows. Using compaction to compress context into more manageable units, OpenAI claims the Codex agent can now work “over millions of tokens in a single task.”

What Are Codex-Max’s Benchmarks?

That’s likely part of why Codex-Max does quite well on the standard coding benchmarks, too. Codex-Max, on its highest settings, scores 77.9% on the SWE-Bench Verified benchmark, for example, which tests how well the agent can handle real-world pull requests from a number of popular Python projects.

The GPT-5.1-Codex model on its high setting scored 73.1%, Anthropic’s Sonnet 4.5 got to 77.2% (though with test-time compute added, it got to 82%) and Google’s new Gemini 3 comes in at 76.2%.

On TerminalBench, Codex-Max scores 58.1%, while GPT-5.1-Codex achieved 52.8%, Sonnet 4.5 hit 50% and Gemini 3 scored 54.2%.

👁 Image

GPT-5.1-Codex-Max benchmarks (Credit: OpenAI).

Is Codex-Max Better and Cheaper?

Like with most modern models, Codex-Max will feature different reasoning modes that govern how many reasoning tokens the model can use to perform a given task. For Codex-Max, OpenAI is adding a new extra high (“xhigh”) mode, which lets developers push the model’s rephrasing efforts even further. This obviously increases the latency and may not be ideal for all use cases, but it does improve accuracy by a few percentage points.

Benchmarks aren’t everything, though. How well the model performs on real-world tasks remains to be seen.

What’s maybe even more important for developers, though (and especially those who use the API), is that in OpenAI’s tests, Codex-Max was often able to produce similar or better results with fewer tokens and tool calls — and it produced fewer lines of code to get to the same results. Because of this, OpenAI argues that Codex-Max is 27 to 42% faster on real-world coding tasks than its predecessor.

One place where it will surely do well, though, is on Windows machines. OpenAI notes that this is the first model the company has trained to operate in Windows environments.

What’s Codex-Max’s Availability?

The new model is now available in Codex in the CLI, IDE extension, cloud and code review, and will be available for all users with ChatGPT Plus, Pro, Business, Edu and Enterprise plans. Access for users who want to use it in Codex via their API key is coming soon.

TRENDING STORIES
Before joining The New Stack as its senior editor for AI, Frederic was the enterprise editor at TechCrunch, where he covered everything from the rise of the cloud and the earliest days of Kubernetes to the advent of quantum computing....
Read more from Frederic Lardinois
SHARE THIS STORY
TRENDING STORIES
TNS owner Insight Partners is an investor in: Anthropic, OpenAI.
SHARE THIS STORY
TRENDING STORIES
TNS DAILY NEWSLETTER Receive a free roundup of the most recent TNS articles in your inbox each day.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.