VOOZH about

URL: https://thenewstack.io/intels-generational-on-chip-change-apx-will-make-all-the-apps-faster/

⇱ Intel’s Generational On-Chip Change APX Will Make All the Apps Faster - The New Stack


TNS
SUBSCRIBE
Join our community of software engineering leaders and aspirational developers. Always stay in-the-know by getting the most important news and exclusive content delivered fresh to your inbox to learn more about at-scale software development.
REQUIRED
It seems that you've previously unsubscribed from our newsletter in the past. Click the button below to open the re-subscribe form in a new tab. When you're done, simply close that tab and continue with this form to complete your subscription.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.
Welcome and thank you for joining The New Stack community!
Please answer a few simple questions to help us deliver the news and resources you are interested in.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Great to meet you!
Tell us a bit about your job so we can cover the topics you find most relevant.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Welcome!

We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.

What’s next?

Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.

Follow TNS on your favorite social media networks.

Become a TNS follower on LinkedIn.

Check out the latest featured and trending stories while you wait for your first TNS newsletter.

PREV
1 of 2
NEXT
VOXPOP
As a JavaScript developer, what non-React tools do you use most often?
Angular
0%
Astro
0%
Svelte
0%
Vue.js
0%
Other
0%
I only use React
0%
I don't use JavaScript
0%
Thanks for your opinion! Subscribe below to get the final results, published exclusively in our TNS Update newsletter:
NEW! Try Stackie AI
From clobbered drafts to real-time sync
Apr 14th 2026 10:00am, by David Moore
TypeScript 6.0 RC arrives as a bridge to a faster future
Mar 14th 2026 9:00am, by Darryl K. Taft
Mastra empowers web devs to build AI agents in TypeScript
Jan 28th 2026 11:00am, by Loraine Lawson
2023-08-09 03:00:54
Intel’s Generational On-Chip Change APX Will Make All the Apps Faster
Hardware / Linux / Software Development

Intel’s Generational On-Chip Change APX Will Make All the Apps Faster

Intel previously had 16 registers, but the APX doubles that to 32, resulting in faster and more power-efficient load and restore times for programs.
Aug 9th, 2023 3:00am by Agam Shah
👁 Featued image for: Intel’s Generational On-Chip Change APX Will Make All the Apps Faster
Feature image by Hans Linde from Pixabay.

Intel has made an under-the-radar change [pdf download] in its chips that have long-term implications in helping software run faster on servers and PCs.

The chip maker has doubled the number of registers on its x86 chip architecture, which will give developers an instant boost in application performance on all computers.

Coders simply need to recompile code [pdf download] to take advantage of the new features. Intel executives said programs do not need to be rewritten.

Intel previously had 16 registers [pdf download], but the new Advanced Performance Extensions (APX) doubles that to 32. The technology will result in faster and more power-efficient load and restore times for programs.

“APX is around general purpose, integer computing, it gives you more registers. Recompile, and you get better performance,” said Ronak Singhal, a senior fellow at Intel.

The compiler uses registers to store local variables, and it previously ran out after 16. After that, compilers had to go to memory to manage the variables, which affected performance.

“Now the compiler can go all the way to 32 local variables — we’re running out of these registers all the time and the compiler has to manage those in memory, which costs some of the runtime. We are giving the compiler room to optimize, basically, because there is more space to do things faster,” said Arjan van de Ven, also a senior fellow at Intel.

Modest Gains

Intel’s goal with APX is to provide an incremental benefit on every workload, but do not expect a radical 10x-type boost, Singhal said.

“It’s not hard for people to use. It does not require a restructuring of your application, coming up with new algorithms for application, none of that,” Singhal said.

Intel did not comment on when the APX instructions would be in chips. Intel’s chip lineup includes server chips Emerald Rapids (due later this year), Granite Rapids and Sierra Forest (next year), and PC chips Meteor Lake (this year) and Arrow Lake (next year). Intel typically boosts performance and power efficiency with new generations of chips.

“Who will be the first people to use this — think of the companies that control their own code, and it’s easy for them to recompile and get that benefit. And they are also the savviest users,” Singhal said.

Intel has worked a lot to make the transition simpler, but it will take time to get APX-related tools to get to developers. The company has posted documentation on APX to begin engaging the open source community.

“GCC — we have patches coming out soon. Same for the LLVM compiler. As you can imagine, we will be doing similar things with Microsoft and their compilers, all of them,” van de Ven said.

Application developers have the choice to boost performance for an entire application, or parts of the source code that are performance sensitive.

Binary Capability

“The nice thing with APX is we keep compatibility with existing binaries completely. You can mix and match. You can make one piece new and faster, and the part of the application that is not performance sensitive you do not have to do that. You can just keep it,” van de Ven said.

If a developer cares about performance, it might be useful to ship two copies of an application or parts of your application that are very performance sensitive, van de Ven said.

“You can imagine that the Linux distribution might be able to ship a second build of the same source code that is … optimized for the processor and it’s always completely compatible,” van de Ven said.

APX can also benefit discerning programmers who write code direct to GPUs, CPUs, and other hardware.

“When you’re a compiler writer or you do things like that, yes, this gives you more freedom, which tends to result in better code that is generated,” van de Ven said.

Singhal said more registers were needed as programming is much more complex today than it was 20 to 30 years ago. Computing is forking in multiple ways with applications and programming frameworks.

Applications have also evolved to run in parallel across CPUs and accelerators, and Intel had to bring parallelism on chips, and APX creates a foundation to boost performance on other parallel environments.

Intel has added new instruction sets such as AMX for on-chip AI acceleration, and TDX for on-chip data security. APX will also provide a minor boost to those instructions.

APX also undoes many risky performance-improvement features that Intel has implemented in previous chips.

The company uses a feature called “speculative executive” to anticipate processor behavior. By predicting behavior, the chip was able to reduce delays and run some applications much faster.

But speculative execution has its own issues and was at the center of the Meltdown vulnerability detected on Intel chips in 2018.

The APX instructions have provided an opportunity to remove branch prediction, which typically assigns a task for execution based on “true” and “false” values.

“We can remove that and turn it into a conditional move. If that condition is this, then move this or that? No branch needed,” Singhal said.

Intel always had capabilities for conditionals, but “this makes it much richer and much easier for compilers to take advantage,” Singhal said.

Intel also introduced the new AVX10 instructions, which impacts coders that write applications for high-performance computing.

The AVX10 instructions are a successor to the AVX-512 instructions, which are used for scientific computing, machine learning, security, and other applications.

The biggest improvement in the new AVX10 is around usability, not performance. Intel has had multiple generations of AVX, but the listing of features got convoluted, which made it difficult for programmers to match up the right set of features and CPUs.

“If we have a hard time figuring this out, outside of Intel, nobody has a chance,” van de Ven said.

AVX10 reorganizes the versioning into a linear form, such as 10.1 or 10.2, with each enumeration listing the new features. Customers do not need to check various versions to identify CPUs and match up the AVX features.

The AVX10 feature set will first appear in the upcoming Xeon server chip code-named Granite Rapids, which is due earlier next year. Granite Rapids is based on an entirely new processor design, and it will be made on Intel 3 manufacturing process, in which the chip maker will use Extreme Ultraviolet (EUV) technologies to etch finer features on chips.

Intel has been aggressively pushing its OneAPI computing tools for developers to develop a common codebase that can be easily exported across hardware and accelerators. Intel may first include the APX and AVX10 tools in OneAPI, and make it available through Intel Dev Cloud.

TRENDING STORIES
Agam Shah has covered enterprise IT for more than a decade. Outside of machine learning, hardware and chips, he's also interested in martial arts and Russia.
Read more from Agam Shah
SHARE THIS STORY
TRENDING STORIES
SHARE THIS STORY
TRENDING STORIES
TNS DAILY NEWSLETTER Receive a free roundup of the most recent TNS articles in your inbox each day.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.