VOOZH about

URL: https://thenewstack.io/how-to-slash-cloud-waste-without-annoying-developers/

⇱ How to Cut Cloud Waste Without Constricting Developer Productivity - The New Stack


TNS
SUBSCRIBE
Join our community of software engineering leaders and aspirational developers. Always stay in-the-know by getting the most important news and exclusive content delivered fresh to your inbox to learn more about at-scale software development.
REQUIRED
It seems that you've previously unsubscribed from our newsletter in the past. Click the button below to open the re-subscribe form in a new tab. When you're done, simply close that tab and continue with this form to complete your subscription.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.
Welcome and thank you for joining The New Stack community!
Please answer a few simple questions to help us deliver the news and resources you are interested in.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Great to meet you!
Tell us a bit about your job so we can cover the topics you find most relevant.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Welcome!

We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.

What’s next?

Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.

Follow TNS on your favorite social media networks.

Become a TNS follower on LinkedIn.

Check out the latest featured and trending stories while you wait for your first TNS newsletter.

PREV
1 of 2
NEXT
VOXPOP
As a JavaScript developer, what non-React tools do you use most often?
Angular
0%
Astro
0%
Svelte
0%
Vue.js
0%
Other
0%
I only use React
0%
I don't use JavaScript
0%
Thanks for your opinion! Subscribe below to get the final results, published exclusively in our TNS Update newsletter:
NEW! Try Stackie AI
From clobbered drafts to real-time sync
Apr 14th 2026 10:00am, by David Moore
TypeScript 6.0 RC arrives as a bridge to a faster future
Mar 14th 2026 9:00am, by Darryl K. Taft
Mastra empowers web devs to build AI agents in TypeScript
Jan 28th 2026 11:00am, by Loraine Lawson
2025-05-07 12:00:07
How to Cut Cloud Waste Without Constricting Developer Productivity
kubecon-cloudnativecon-eu-2025,sponsor-scaleops,sponsored-event-coverage,
Cloud Native Ecosystem / Kubecon Cloudnativecon EU 2025 / Operations

How to Cut Cloud Waste Without Constricting Developer Productivity

Yes, you can reduce wasteful cloud spending without undermining developer productivity, agility or autonomy, says ScaleOps.
May 7th, 2025 12:00pm by B. Cameron Gain
👁 Featued image for: How to Cut Cloud Waste Without Constricting Developer Productivity
Featured image by Getty Images for Unsplash+.
ScaleOps sponsored this post. Insight Partners is an investor in ScaleOps and TNS.

LONDON — The massive and surging volume of software being generated with AI is contributing to skyrocketing cloud costs. It’s also exponentially increasing demand for additional compute resources to manage all of that code across cloud native infrastructures. These factors make cloud cost optimization even more critical for enterprises than ever.

Adding to the challenge, Kubernetes introduces significant complexity at scale. There’s often limited visibility into how resources are used and what is being spent — a difficult environment for cost optimization. But existing GitOps practices orchestrated by Flux and Argo CD, based on traditional GitHub- or Git-type operations, are inadequate for proper resource optimization.

What’s required is an abstraction geared specifically towards resource optimization. It must automate analytics that cover the minutiae of infrastructure inside every pod and at scale across potentially multiple clusters and cloud and on-premises environments. In addition to automating the operations aspect of infrastructure management, such a platform should remove any concern around resource provisioning that might otherwise involve manual YAML configuration or other operations-related tasks.

Waste Makes Waste

Waste in cloud spending is not necessarily due to negligence or a lack of resources; it’s often due to poor visibility and understanding of how to optimize costs and resource allocations. Ironically, Kubernetes and GitOps were designed to enable DevOps practices by providing building blocks to facilitate collaboration between operations teams and developers, Gartner’s Tony Iams wrote in “Cost Optimization for Containers and Kubernetes in the Cloud.”

“However, while the operations team is responsible for only some sizing aspects, the responsibility for specifying the necessary resources to applications ultimately falls to developer teams,” Iams wrote. “The key to implementing optimization is to facilitate collaboration between development and operations teams. With the right tools and practices in place, and by working together, these teams can control the costs of running containers in the cloud and eventually optimize them.”

Many organizations that run large-scale or even mid-scale production environments on Kubernetes face a common issue: Workloads are either grossly overprovisioned or severely underprovisioned. Developers set the resource requests for CPU and memory, but they are forced to overprovision because the dynamics of the clusters constantly change. Overprovisioning means resources are allocated but remain unused, but the alternative, underprovisioning, negatively impacts performance. This inefficiency is exacerbated by the increasing workloads and complexity of modern software development, including CI/CD and AI.

The main challenge is the multidimensional character of scheduling decisions within Kubernetes clusters. On the one hand, developers set high resource limits and provision for peak demand, while administrators lack accurate data on actual resource needs over time, Torsten Volk, an analyst with TechTarget’s Enterprise Strategy Group, told me. At the same time, affinity rules are often set based on simple static labels that are not informed by an application’s actual performance requirements. Resource requests only consider CPU and memory, without being able to define network throughput and latency. High-priority apps might unnecessarily evict lower-priority ones, despite not actually requiring more resources, Volk said.

“All of this leads to a multilevel guessing game, where developers want to make sure to build in a static buffer to proof their application for worst-case scenarios, while operators are unable to cut down the resulting waste as they do not understand how an application might react,” Volk said. “In a nutshell, Kubernetes does not know anything about the real-life resource requirements of an application, while the application pod is entirely unaware of the performance of the underlying cluster hardware.”

Oh, So Human

Historically, resource allocation strategies were specified statically in YAML files and synced by a human manager using Flux or Argo CD. While Flux and Argo CD vendors and users seek to integrate resource optimization into their feature sets, they often fall short.

A few years ago, Git-based workflows were the standard for resource provisioning. However, with the acceleration of software development and CI/CD pipelines, and the rise of AI-generated code, “these traditional methods are breaking down,” noted Guy Baron, ScaleOps’ CTO and co-founder, when I spoke with him at KubeCon+CloudNativeCon Europe in April.

Instead, automating resource allocation in real time is required, along with optimizing vertical and horizontal scaling and pod placement. Such a platform continuously adjusts resource allocation based on workload needs and cluster health, ensuring efficiency and cost savings across associated multicloud and hybrid infrastructures at scale.

The optimization should cover:

  • Vertical scaling: Adjusting the amount of CPU and memory assigned to each pod.
  • Horizontal scaling: Optimizing the number of replicas in Horizontal Pod Autoscaler (HPA) or Argo-supported workloads.
  • Placement optimization: Intelligently scheduling pods to maximize cluster efficiency.

ScaleOps’ platform serves as an example of an option that abstracts and automates the process. It’s positioned not as a platform for analysis and visibility but for resource automation. ScaleOps automates decision-making by eliminating the need for manual analysis and intervention, helping resource management become a continuous optimization of the infrastructure map.

Scaling decisions, such as determining how to vertically scale, horizontally scale, and schedule pods onto the cluster to maximize performance and cost savings, are then made in real time. This capability forms the core of the ScaleOps platform.

Savings and scaling efficiency are achieved through real-time usage data and predictive algorithms that determine the correct amount of resources needed at the pod level at the right time. The platform is “fully context-aware,” automatically identifying whether a workload involves a MySQL database, a stateless HTTP server, or a critical Kafka broker, and incorporating this information into scaling decisions, Baron said.

With ScaleOps, cluster state is monitored continuously. If a pod is scheduled on a node with noisy neighbors that impact performance or health checks, it’s automatically migrated to a more suitable node with greater available resources.

Recognizing that a WordPress website, a Kafka broker, a MySQL database, and an Airflow pipeline have different availability requirements and criticality levels, each workload is treated uniquely. Resource allocation and scaling decisions are adjusted dynamically to meet these needs.

The platform also responds in real time to changes within the cluster, supporting auto-healing and adjusting to usage spikes.

In practice, developers no longer have to worry about operational responsibilities, like cost tracking and resource allocation, and have more time to code and engineer software to solve business needs more directly. Operations teams are no longer perceived as blockers to software development and deployment by imposing rigid constraints that stifle agility. They also do not have to worry as much about overprovisioning and overpaying for redundant resources to account for spikes in future demand by the developers, which CTOs and — especially — CFOs do not appreciate amid rising cloud costs.

The platform serves developers by automating resource allocation and requests. Instead of developers handling these tasks manually, the platform introduces an infrastructure layer that takes care of them. At the same time, it gives developers visibility and insight into how their workloads and resources are being utilized in production, Baron said. “Ultimately, it’s about finding a balance — automating repetitive tasks while keeping developers informed and empowered.”

Taking ScaleOps Out for a Ride

ScaleOps has a free trial option, which is a good platform to get started, because the playground offers an immediate way to link ScaleOps directly to your cluster. For example, I easily attached ScaleOps to Helm, and then ScaleOps began seeking the cluster to manage.

The interface itself is very straightforward, allowing you to easily navigate between different time periods — whether it’s 70 days, 30 days, etc. — in a time series graph that shows CPU usage and memory over time. Metrics such as optimized requests, usage and waste are among the key indicators displayed in the time series graph.

👁 ScaleOps UI

For more detailed analysis, each workload offers a range of functionalities and data analysis that is both straightforward and accessible. This includes cost savings achieved for each workload. It’s also possible to change policies — whether it’s batch, cost, high availability, production or others — meaning the system offers leeway to dynamically change or update policy on an as-needed basis.

For a positive developer experience, the platform was designed to automate the mundane and dynamic aspects of resource allocation and scaling while keeping developers “in the loop,” garnering insights into how their workloads and resources are performing in production, Baron said.

The platform serves developers by automating resource allocation and requests. Instead of developers handling these tasks manually, the platform introduces an infrastructure layer that takes care of them. At the same time, it gives developers visibility and insight into how their workloads and resources are being utilized in production, Baron said. “Ultimately, it’s about finding a balance — automating repetitive tasks while keeping developers informed and empowered.”

ScaleOps is a real-time, automated Kubernetes resource management platform that helps DevOps teams cut cloud costs by up to 80% and keep apps running smoothly. Trusted in mission-critical production environments, fully self-hosted and installed with a single Helm command.
Learn More
The latest from ScaleOps
TRENDING STORIES
BC Gain is founder and principal analyst for ReveCom Media. His obsession with computers began when he hacked a Space Invaders console to play all day for 25 cents at the local video arcade in the early 1980s. He then...
Read more from B. Cameron Gain
ScaleOps sponsored this post. Insight Partners is an investor in ScaleOps and TNS.
SHARE THIS STORY
TRENDING STORIES
TNS owner Insight Partners is an investor in: ScaleOps.
SHARE THIS STORY
TRENDING STORIES
TNS DAILY NEWSLETTER Receive a free roundup of the most recent TNS articles in your inbox each day.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.