URL: https://thenewstack.io/why-ai-pcs-are-not-for-developers/

Why AI PCs Are Not for Developers

There is no compelling reason for developers to compile local AI models that take advantage of new AI processors installed on AI PCs.
Sep 26th, 2024 1:00pm by Agam Shah

There is no compelling reason for developers to compile local AI models that take advantage of new AI processors installed on up-and-coming AI PCs.

There are multiple issues: the hardware isn’t capable, the models aren’t available, and the development tools are a headache to deploy.

I spent months testing so-called AI PCs running Windows 11 on specialized AI processors, in the hope of running a local LLM with the internet connection turned off. The laptops included chips from Intel and Qualcomm with neural processors, which were designed for AI.

Microsoft has hyped its AI PCs as supporting lightweight AI models — or SLMs (Small Language Models) — such as Meta’s Llama 2 and Microsoft’s Phi Silica.

Trying to load those models onto the PCs was a frustrating experience, with bumps at every step. Finding lightweight models compatible with the neural processors in the Qualcomm and Intel chips was the first challenge. Setting up the Jupyter notebook and the neural network runtime in which to run those SLMs was another.

When I did run the models, the SLMs were not using specialized AI processors, but were instead relying on GPUs or CPUs.

The Hype

Microsoft announced Copilot+ PCs at its Build conference this year. The first Copilot+ PCs included hardware that lets customers run inference on the device, saving a trip to the cloud.

The Copilot+ PCs had some minimum requirements, which included AI performance of at least 40 TOPS. The first AI PCs, with Qualcomm’s Snapdragon chips rated at 45 TOPS, met that requirement.

Microsoft’s CEO Satya Nadella said the company had 40-plus models available out of the box to run locally on Copilot+ PCs. One was Phi Silica, a 3.8-billion parameter SLM.

DirectML and the ONNX Runtime allow users to run the Phi-3 models on Windows devices, but Qualcomm wasn’t ready when the devices came out. Qualcomm provides its own list of AI models it supports on its Snapdragon X Elite chips, via its AI Hub.

An early attempt to load Llama v2 wasn’t smooth and didn’t work for me. I sought Qualcomm’s help to load models, but there was no clear outcome.

Creating a Jupyter notebook using the tools recommended by Qualcomm was confusing, and I couldn’t load any AI model manually. Qualcomm recommended downloading the ONNX Runtime to exploit the NPU, which only added to the confusion.
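One quick sanity check before loading any model is to ask the ONNX Runtime which execution providers the installed package actually exposes. The sketch below is a minimal illustration, not Qualcomm's recommended procedure: it assumes the `onnxruntime` Python package, uses the provider names that ONNX Runtime registers for its Qualcomm (QNN), DirectML and CPU backends, and the helper names `provider_order` and `available_providers` are hypothetical.

```python
from typing import Sequence

# Provider names as registered by ONNX Runtime. Which ones a given install
# exposes depends on the package build that was installed (assumption: a
# QNN-enabled build is needed before the Qualcomm NPU backend shows up).
NPU_PROVIDER = "QNNExecutionProvider"   # Qualcomm NPU backend
GPU_PROVIDER = "DmlExecutionProvider"   # DirectML backend
CPU_PROVIDER = "CPUExecutionProvider"   # default fallback


def provider_order(available: Sequence[str]) -> list[str]:
    """Return the providers to request, NPU first, keeping only those present."""
    preferred = [NPU_PROVIDER, GPU_PROVIDER, CPU_PROVIDER]
    return [name for name in preferred if name in available]


def available_providers() -> list[str]:
    """List the providers of the local ONNX Runtime install, or [] if absent."""
    try:
        import onnxruntime as ort
    except ImportError:
        return []
    return list(ort.get_available_providers())


if __name__ == "__main__":
    found = available_providers()
    print("Providers exposed by this install:", found or "onnxruntime not installed")
    print("Would request, in order:", provider_order(found))
    if NPU_PROVIDER not in found:
        print("No NPU provider here, so a model would run on the GPU or CPU.")
```

If the NPU provider is missing from that list, no amount of notebook configuration will move a model onto the NPU, which matches the behavior described here.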

More recently, LM Studio has begun providing a version of its AI software for Qualcomm chips.

I loaded the 8-billion parameter Llama v3.1 model using LM Studio, but it used only the Snapdragon CPU, not the GPU or NPU. It delivered 17.34 tokens per second, but it consumed 87% of memory after just a few queries.

Meaningful models aren’t yet ready to take advantage of Qualcomm’s NPUs, which, like GPUs, are designed to speed up AI. Even if the NPUs worked, the Copilot+ PCs don’t have the memory to run long queries. Running models locally would also drain the battery quickly.
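A rough back-of-the-envelope calculation shows why memory fills up so fast. The sketch below is an estimate under stated assumptions, not a measurement from my tests: it assumes a 16 GiB laptop, and it assumes the publicly documented Llama 3 8B layout (32 layers, 8 key-value heads, head dimension 128) with a 16-bit cache. The function names are hypothetical.

```python
GIB = 1024 ** 3


def weight_bytes(n_params: float, bits_per_weight: float) -> float:
    """Memory needed just to hold the model weights."""
    return n_params * bits_per_weight / 8


def kv_cache_bytes(n_tokens: int, n_layers: int = 32, n_kv_heads: int = 8,
                   head_dim: int = 128, bytes_per_value: int = 2) -> int:
    """Memory for the attention cache, which grows with every token in context.

    The leading factor of 2 covers keys and values. The defaults are an
    assumption based on the published Llama 3 8B configuration.
    """
    return 2 * n_layers * n_kv_heads * head_dim * bytes_per_value * n_tokens


if __name__ == "__main__":
    ram_gib = 16  # assumed laptop memory, not a figure from the article
    for bits in (16, 8, 4):
        gib = weight_bytes(8e9, bits) / GIB
        print(f"{bits:>2}-bit weights: {gib:5.1f} GiB ({gib / ram_gib:.0%} of {ram_gib} GiB)")
    for context in (2048, 8192, 32768):
        print(f"KV cache at {context:>5} tokens: {kv_cache_bytes(context) / GIB:.2f} GiB")
```

The weights are a fixed cost, but the cache grows linearly with the length of the conversation, and both compete with Windows and every other application for the same memory.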

Microsoft is providing the tooling for developers to integrate AI capabilities within desktop applications. The company has little reason to help users load Llama v3.1, since it already ships Copilot features on PCs.

Microsoft’s Phi Silica support is aimed more at developers who want to bring LLM-style querying capabilities to Windows applications via the Windows App SDK.

Meteor Lake Failures

Intel got into the AI PC game late last year with a chip called Meteor Lake, which had a neural processing unit.

The chip is now a paperweight, and those who bought laptops with the chip for on-PC AI have been abandoned. There are no useful applications; the NPU was utilized for basic AI models like TinyLlama.

To be sure, Intel’s Meteor Lake chips don’t qualify under Microsoft’s minimum specs for an AI PC. Intel claimed 34 TOPS (trillion operations per second) of AI performance on Meteor Lake, which is lower than the 40 TOPS required for Copilot+ PCs.

Meteor Lake opened to poor reviews. It was slower than the previous-generation laptop chip and offered no improvement in battery life.

About six months after releasing Meteor Lake, Intel shipped its next-generation AI PC chip, Lunar Lake, which has made it into PCs and offers 120 TOPS of AI performance.

I tried manually running AI models locally on Meteor Lake PCs.

Loading a neural network to exploit the NPU involved installing OpenVINO 2024.2 and following the instructions on OpenVINO’s website.

The install provides the NPU plugin, which you’d expect to run when loading a model on a Jupyter notebook. Intel said that I’d need the right NPU driver and firmware.
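A simple way to confirm whether the driver and plugin are actually visible is to list the devices OpenVINO reports before loading anything. The following is a minimal sketch, assuming the `openvino` Python package and its `Core().available_devices` property; the helpers `pick_device` and `list_openvino_devices` are hypothetical names, and the NPU-first preference order is my own choice rather than Intel's guidance.

```python
from typing import Optional, Sequence


def pick_device(devices: Sequence[str],
                preference: Sequence[str] = ("NPU", "GPU", "CPU")) -> Optional[str]:
    """Pick the first preferred device that OpenVINO reports.

    OpenVINO can suffix device names when several exist (for example "GPU.0"),
    so a prefix match is used as well as an exact match.
    """
    for wanted in preference:
        for name in devices:
            if name == wanted or name.startswith(wanted + "."):
                return name
    return None


def list_openvino_devices() -> list[str]:
    """Return the devices OpenVINO can see, or [] if the package is missing."""
    try:
        import openvino as ov
    except ImportError:
        return []
    return list(ov.Core().available_devices)


if __name__ == "__main__":
    devices = list_openvino_devices()
    print("OpenVINO devices:", devices or "openvino not installed")
    print("Would compile for:", pick_device(devices))
    if not any(d == "NPU" or d.startswith("NPU.") for d in devices):
        print("NPU not listed: check the NPU driver and firmware first.")
```

If "NPU" is absent from the list, the problem lies with the driver or firmware rather than with the notebook or the model.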

Installing the new NPU driver was itself a challenge: I had to uninstall the old driver within Windows’ Device Manager settings and then have Windows detect the new driver. In the end, I just updated the driver using driver search.

I ran models like TinyLlama from the Jupyter notebook. They ran just fine but gave poor answers. And as with the Qualcomm laptop, they didn’t use the NPU.

A handful of models, such as Stable Diffusion 1.4, did use the NPU, but only when run directly within the GIMP interface.

Intel’s AI software development is largely focused on its server CPUs.

Back to Nvidia

Developers should stick to Nvidia to run Jupyter notebooks for any meaningful AI on their PCs.

Buy AI PCs for productivity, but not for AI-related coding or trial and error. The chip makers’ NPUs aren’t friendly to developers. The problems start with initiating neural networks, and every chip maker has its own toolchain for doing so. But on-device AI is an emerging field with plenty of opportunities for developers to experiment, such as optimizing AI for PCs through quantization.
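For readers unfamiliar with quantization, the idea is to store weights in fewer bits and accept a small rounding error in exchange for a smaller memory footprint. The toy sketch below shows symmetric 8-bit quantization on a plain Python list. It is an illustration of the general technique only, not the scheme used by any particular vendor toolkit, and the function names are hypothetical.

```python
def quantize_int8(weights):
    """Map floats to integers in [-127, 127] using one shared scale factor."""
    max_abs = max((abs(w) for w in weights), default=0.0)
    scale = max_abs / 127 if max_abs > 0 else 1.0
    quantized = [max(-127, min(127, round(w / scale))) for w in weights]
    return quantized, scale


def dequantize(quantized, scale):
    """Recover approximate floats from the stored integers."""
    return [q * scale for q in quantized]


if __name__ == "__main__":
    original = [0.82, -0.31, 0.05, -1.20, 0.47]
    q, scale = quantize_int8(original)
    restored = dequantize(q, scale)
    print("int8 values:", q)
    for before, after in zip(original, restored):
        print(f"{before:+.4f} -> {after:+.4f} (error {abs(before - after):.4f})")
```

Each weight now fits in one byte instead of two or four, and the error per weight is bounded by half the scale factor.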

For adventurous coders, the typical Windows challenges come into play, such as ensuring you have the right drivers and development toolkits. Qualcomm and Intel each have their own preferred tools with which to compile and load models.

Thankfully, the Windows command line and PowerShell make those adventures fun.

Expect AI features that take advantage of NPUs to be pre-packaged in applications. Intel is working with companies to take advantage of NPUs. That work is much the same as making software compatible with a specific chip architecture.

AI hardware is emerging quickly, and Intel is hyping up its latest Lunar Lake chips. Recent reviews have praised the chip for its great battery life. But don’t buy it for development purposes: it doesn’t have enough memory or bandwidth to run language models locally.

Agam Shah has covered enterprise IT for more than a decade. Outside of machine learning, hardware and chips, he's also interested in martial arts and Russia.