VOOZH about

URL: https://thenewstack.io/streaming-ai-energy-efficiency/

⇱ The software fix that could shrink AI's energy bill without new hardware - The New Stack


TNS
SUBSCRIBE
Join our community of software engineering leaders and aspirational developers. Always stay in-the-know by getting the most important news and exclusive content delivered fresh to your inbox to learn more about at-scale software development.
REQUIRED
It seems that you've previously unsubscribed from our newsletter in the past. Click the button below to open the re-subscribe form in a new tab. When you're done, simply close that tab and continue with this form to complete your subscription.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.
Welcome and thank you for joining The New Stack community!
Please answer a few simple questions to help us deliver the news and resources you are interested in.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Great to meet you!
Tell us a bit about your job so we can cover the topics you find most relevant.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Welcome!

We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.

What’s next?

Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.

Follow TNS on your favorite social media networks.

Become a TNS follower on LinkedIn.

Check out the latest featured and trending stories while you wait for your first TNS newsletter.

PREV
1 of 2
NEXT
VOXPOP
As a JavaScript developer, what non-React tools do you use most often?
Angular
0%
Astro
0%
Svelte
0%
Vue.js
0%
Other
0%
I only use React
0%
I don't use JavaScript
0%
Thanks for your opinion! Subscribe below to get the final results, published exclusively in our TNS Update newsletter:
NEW! Try Stackie AI
From clobbered drafts to real-time sync
Apr 14th 2026 10:00am, by David Moore
TypeScript 6.0 RC arrives as a bridge to a faster future
Mar 14th 2026 9:00am, by Darryl K. Taft
Mastra empowers web devs to build AI agents in TypeScript
Jan 28th 2026 11:00am, by Loraine Lawson
2026-05-15 12:00:00
The software fix that could shrink AI's energy bill without new hardware
sponsor-confluent,sponsored-post-contributed,
AI Infrastructure / AI Operations / Data Streaming

The software fix that could shrink AI’s energy bill without new hardware

Reduce AI energy consumption by moving from batch to data streaming. Flatten the compute curve and cut idle infrastructure waste today.
May 15th, 2026 12:00pm by Warren Vella
👁 Featued image for: The software fix that could shrink AI’s energy bill without new hardware
Alghozy for Unsplash
Confluent sponsored this post.

The load on the energy infrastructure that AI is placing should not be underestimated.

Most approaches to addressing the AI energy crisis focus on hardware, such as more efficient chips, better cooling, and greener data centers. Those matters, but there’s a faster, cheaper lever that gets less attention — the way organizations process data.

Shifting more workloads from batch processing to real-time data streaming is one of the most accessible and near-term ways to reduce AI’s energy footprint. The main difference is in the load profile. Batch processing creates sharp spikes in demand that require infrastructure to be provisioned for peak load. Streaming flattens that curve, distributing compute more evenly over time. 

“Batch processing creates sharp spikes in demand that require infrastructure to be provisioned for peak load. Streaming flattens that curve, distributing compute more evenly over time.”

The implications for energy consumption are significant and address an important issue. Electricity prices jumped 6.9% last year, and data centers will account for 40% of electricity demand growth through the end of the decade, according to Goldman Sachs. Meanwhile, hyperscalers are signing long-term power purchase agreements on a vast scale, and grid operators in several regions have already flagged capacity concerns.

Why batch processing deserves more scrutiny

Batch processing is still the most common approach to data analysis, dating back to the mainframe era. With batch loads, data is accumulated over time, staged in storage, and then processed in large, scheduled runs. 

Because these batch jobs run in concentrated bursts, operators have to provision infrastructure for peak load, meaning capacity sits idle between runs and consumes energy without doing any useful work. When a batch job kicks off, CPU and memory demand spike, taxing cooling systems and drawing heavily on power for a relatively short window. Then the cycle repeats.

In energy terms, it’s like flooring the accelerator from a standing start rather than maintaining a steady cruising speed. The approach made sense when compute was scarce, and data volumes were modest, but it’s less practical when AI systems require both speed and scale simultaneously.

A more efficient architecture

Streaming technologies like Apache Kafka and Apache Flink are already widely used in industries with real-time data needs, like financial services, retail, and telecommunications. But the operational case for streaming now extends beyond latency into total cost of ownership and sustainability.

Because data is processed continuously as it arrives, event by event, data streaming shifts the resource profile from spiky and unpredictable to steady and manageable. The compute load is distributed over time, which means peak demand is lower and provisioning can be more precise. 

Systems no longer need to be sized for the worst-case burst capacity; they can scale dynamically in response to actual throughput. This reduces idle compute running in reserve, one of the more significant sources of energy waste.

“Systems no longer need to be sized for the worst-case burst capacity; they can scale dynamically in response to actual throughput.”

There are further efficiencies downstream. Streaming architectures typically clean and deduplicate data in transit, before it reaches storage. That means data warehouses have less redundant data and the queries that run against them are leaner. Disk I/O, another energy-intensive operation in data processing, is reduced as a result. 

Shifting to a decoupled, event-driven architecture also means that individual systems can process data independently, without triggering cascading compute loads across tightly integrated pipelines.

Where to start

Not every workload needs to move to streaming at once. A strong initial candidate is preprocessing for AI workloads — using a stream processor to filter, aggregate, and normalize data before it reaches an AI model. This produces leaner, curated inputs instead of raw logs or wide tables, reducing memory, CPU, and GPU load.

A streaming architecture can also improve AI performance, because agents often need continuous access to current data. Static datasets that are periodically refreshed lead to outdated context or require reprocessing. Batch processing can end up being a bottleneck more than the models themselves.

Harnessing short-term gain

Migrating data pipelines from batch to streaming typically occurs at the software layer, so it doesn’t require waiting for new power or cooling infrastructure. It won’t eliminate AI’s energy problem, but it offers a fast, low-investment way to measurably reduce unnecessary consumption.

As AI workloads continue to grow, the pressure to be responsible energy stewards will only intensify from regulators, customers, and communities where data centers are built. Hardware improvements are already underway. The software conversation is overdue.

Confluent, founded by the original creators of Apache Kafka, pioneered a complete data streaming platform that streams, connects, processes, and governs data as it flows throughout a business. With Confluent, any organization can modernize their business and run it in real-time.
Learn More
The latest from Confluent
TRENDING STORIES
Warren's career started off as a 13-year adventure at the Australian Energy Market Operator (AEMO), where he supported and architected critical energy market integration platforms. After moving from operations into a strategic enterprise architecture role, he joined Confluent as a...
Read more from Warren Vella
Confluent sponsored this post.
SHARE THIS STORY
TRENDING STORIES
SHARE THIS STORY
TRENDING STORIES
TNS DAILY NEWSLETTER Receive a free roundup of the most recent TNS articles in your inbox each day.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.