VOOZH about

URL: https://thenewstack.io/automation-can-solve-resource-overprovisioning-in-kubernetes/

⇱ Automation Can Solve Resource Overprovisioning in Kubernetes - The New Stack


TNS
SUBSCRIBE
Join our community of software engineering leaders and aspirational developers. Always stay in-the-know by getting the most important news and exclusive content delivered fresh to your inbox to learn more about at-scale software development.
REQUIRED
It seems that you've previously unsubscribed from our newsletter in the past. Click the button below to open the re-subscribe form in a new tab. When you're done, simply close that tab and continue with this form to complete your subscription.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.
Welcome and thank you for joining The New Stack community!
Please answer a few simple questions to help us deliver the news and resources you are interested in.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Great to meet you!
Tell us a bit about your job so we can cover the topics you find most relevant.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Welcome!

We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.

What’s next?

Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.

Follow TNS on your favorite social media networks.

Become a TNS follower on LinkedIn.

Check out the latest featured and trending stories while you wait for your first TNS newsletter.

PREV
1 of 2
NEXT
VOXPOP
As a JavaScript developer, what non-React tools do you use most often?
Angular
0%
Astro
0%
Svelte
0%
Vue.js
0%
Other
0%
I only use React
0%
I don't use JavaScript
0%
Thanks for your opinion! Subscribe below to get the final results, published exclusively in our TNS Update newsletter:
NEW! Try Stackie AI
From clobbered drafts to real-time sync
Apr 14th 2026 10:00am, by David Moore
TypeScript 6.0 RC arrives as a bridge to a faster future
Mar 14th 2026 9:00am, by Darryl K. Taft
Mastra empowers web devs to build AI agents in TypeScript
Jan 28th 2026 11:00am, by Loraine Lawson
2025-03-05 14:00:52
Automation Can Solve Resource Overprovisioning in Kubernetes
kubecon-cloudnativecon-eu-2025,sponsor-cncf,sponsored-post-contributed,
Kubecon Cloudnativecon EU 2025 / Kubernetes / Operations

Automation Can Solve Resource Overprovisioning in Kubernetes

Organizations can significantly reduce cloud waste and improve resource utilization with automation to address inefficiencies in Kubernetes environments.
Mar 5th, 2025 2:00pm by Laurent Gil
👁 Featued image for: Automation Can Solve Resource Overprovisioning in Kubernetes
Image from Joshua Sortino on Unsplash.
CNCF sponsored this post.

Teams running Kubernetes applications in the cloud often fall into the overprovisioning trap while trying to guarantee high performance and availability. As a result, they generate cloud waste that translates into a significant cost item for the organization.

Recently, Cast AI published the 2025 Kubernetes Cost Benchmark Report, which showed that the gap between provisioned and requested resources is still considerable — 40% for CPUs and 57% for memory. This indicates teams are deploying fewer workloads onto clusters than their capacity.

Notably, 99.94% of clusters analyzed were overprovisioned with CPU, a problem consistent across major cloud providers (AWS, Google Cloud Platform and Microsoft Azure), with no significant variations in resource efficiency.

These inefficiencies suggest that the challenge lies not with the cloud platforms themselves but with the complexities of managing Kubernetes clusters manually.

Overprovisioning Is Just One Side of the Coin

The average levels of Kubernetes resource utilization show that the problem doesn’t only relate to the size or type of compute instances selected. CPU utilization across Kubernetes clusters averaged 10%, with memory utilization at 22%. This shows teams face another challenge: setting the right requests for Kubernetes workloads without too much headroom.

Despite the widespread adoption of Kubernetes in cloud native environments, managing cloud resources remains manual and effort-intensive. When teams spend time on repetitive tasks and micromanage the cloud infrastructure, they tend to overprovision their clusters, fail to efficiently assign applications to them and generate cloud waste by leaving large headrooms in workload requests.

The data underscores the need for better tooling and automation in Kubernetes environments to reduce overprovisioning, improve resource utilization and eliminate unnecessary cloud costs.

Six Ways to Boost Resource Utilization and Efficiency

Flexible Compute Generation Selection

An automation engine lets teams dynamically select from different generations of compute instances based on real-time price trends. This will enable them to capitalize on the latest hardware for performance or select older, cost-effective generations to balance their budgets.

The graph below illustrates the price evolution of three compute instances representing three generations, showing that the flexibility in instance choice can be a game changer.

👁 Image

Automating Processor Architecture Selection (x86 vs. Arm)

The choice between x86 and Arm processors can lead to significant cost savings as Arm CPUs are generally more affordable than x86. Arm spot instances on platforms like Azure, GCP and AWS consistently offer better pricing, with up to 65% savings. By automating workload placement across architectures, teams can ensure the best performance-to-cost ratio without manual intervention.

Cloud provider Azure GCP AWS
Avg. x86 spot price per CPUper hour $0.0254 $0.0212 $0.0389
Avg. x86 on-demand price per CPU per hour $0.1354 $0.0659 $0.0783
Avg. Arm spot price per CPU per hour $0.0079 $0.0156 $0.0200
Avg. Arm on-demand price per CPU per hour $0.0474 $0.0410 $0.0496

Custom Autoscalers for Dynamic Resource Scaling

Companies like Akamai have usedcustom autoscalers to adjust cloud resources based on real-time demand automatically. This approach ensures applications always have the necessary resources while minimizing waste during low-usage periods. Automated scaling optimizes both costs and performance, eliminating the need for manual adjustments.

👁 Image

Bin-Packing Workloads for Maximum Efficiency

Organizations can significantly reduce overprovisioning by bin-packing workloads, particularly spot-friendly, stateless workloads. Heureka Group, for example, achieved a 30% reduction in compute costs by automatically optimizing workload placement and removing unused nodes. This technique reduces the number of idle CPUs and boosts overall utilization.

Autonomous Request Setting Based on Real-Time Data

Stateful workloads are memory-intensive. The example below illustrates what may happen if a company uses autoscaling solutions at the workload level and, at some point, turns it off.

The resulting sharp rise in memory requests forced the system to provision more resources. This, in turn, drove CPU provisioning up due to the correlation between memory and CPU, causing substantial overspending on resources.

👁 Image

Leveraging Spot Instances Safely

Spot instances offer substantial discounts but are often underused due to the risk of interruptions. Automation can help monitor price fluctuations and interruption rates, allowing teams to adopt spot instances for non-critical workloads confidently. By automating this process, companies can maximize savings without risking performance.

Conclusion

Automation is a powerful solution for addressing the inefficiencies that plague Kubernetes environments. Organizations can significantly reduce cloud waste and improve resource utilization by using automated tools and best practices like flexible compute generation selection, dynamic autoscaling and intelligent workload placement.

As cloud environments grow more complex, this approach will become essential to ensuring that resources are efficiently used, workloads are optimized and teams continue to innovate without overspending on cloud infrastructure.

To learn more about Kubernetes and the cloud native ecosystem, join us at KubeCon + CloudNativeCon Europe in London on April 1-4.

The Cloud Native Computing Foundation (CNCF) hosts critical components of the global technology infrastructure including Kubernetes, OpenTelemetry, and Argo. CNCF is the neutral home for cloud native collaboration, bringing together the industry’s top developers, end users, and vendors.
Learn More
The latest from CNCF
TRENDING STORIES
Laurent Gil is cofounder and president of Cast AI, overseeing product and business development. Previously, he co-founded Zenedge, where he served as chief product and business officer before its acquisition by Oracle in 2018. He was also CEO and cofounder...
Read more from Laurent Gil
CNCF sponsored this post.
SHARE THIS STORY
TRENDING STORIES
TNS owner Insight Partners is an investor in: Real.
SHARE THIS STORY
TRENDING STORIES
TNS DAILY NEWSLETTER Receive a free roundup of the most recent TNS articles in your inbox each day.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.