VOOZH about

URL: https://thenewstack.io/kubernetes-complexity-realigns-platform-engineering-strategy/

⇱ Kubernetes Complexity Realigns Platform Engineering Strategy - The New Stack


TNS
SUBSCRIBE
Join our community of software engineering leaders and aspirational developers. Always stay in-the-know by getting the most important news and exclusive content delivered fresh to your inbox to learn more about at-scale software development.
REQUIRED
It seems that you've previously unsubscribed from our newsletter in the past. Click the button below to open the re-subscribe form in a new tab. When you're done, simply close that tab and continue with this form to complete your subscription.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.
Welcome and thank you for joining The New Stack community!
Please answer a few simple questions to help us deliver the news and resources you are interested in.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Great to meet you!
Tell us a bit about your job so we can cover the topics you find most relevant.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Welcome!

We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.

What’s next?

Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.

Follow TNS on your favorite social media networks.

Become a TNS follower on LinkedIn.

Check out the latest featured and trending stories while you wait for your first TNS newsletter.

PREV
1 of 2
NEXT
VOXPOP
As a JavaScript developer, what non-React tools do you use most often?
Angular
0%
Astro
0%
Svelte
0%
Vue.js
0%
Other
0%
I only use React
0%
I don't use JavaScript
0%
Thanks for your opinion! Subscribe below to get the final results, published exclusively in our TNS Update newsletter:
NEW! Try Stackie AI
From clobbered drafts to real-time sync
Apr 14th 2026 10:00am, by David Moore
TypeScript 6.0 RC arrives as a bridge to a faster future
Mar 14th 2026 9:00am, by Darryl K. Taft
Mastra empowers web devs to build AI agents in TypeScript
Jan 28th 2026 11:00am, by Loraine Lawson
2025-07-09 03:20:16
Kubernetes Complexity Realigns Platform Engineering Strategy
sponsor-nutanix,sponsored-post,
DevOps / Kubernetes / Platform Engineering

Kubernetes Complexity Realigns Platform Engineering Strategy

By simplifying Kubernetes’ inherent complexity, platform engineering eases the DevOps workload and, in turn, developers’ cognitive burden.
Jul 9th, 2025 3:20am by Jennifer Riggins
👁 Featued image for: Kubernetes Complexity Realigns Platform Engineering Strategy
Featured image by Google DeepMind on Unsplash.
Nutanix sponsored this post.

When Kubernetes came onto the tech scene about a decade ago, it quickly became the shining example of the grassroots, bottom-up developer story.

This open source project was a fast developer favorite as it promised an easy way to orchestrate containers at scale, providing automation, self-healing and simplified deployment processes. It facilitated the separation of constraints that devs were dreaming of while also letting them get as creative and custom as they needed.

But this Kubernetes flexibility doesn’t scale smoothly across teams and organizations so easily. Enterprises continue to struggle with scaling, security, governance and platform consistency. And they’re finding themselves stuck again — unable to move between cloud providers and overwhelmed by the complexity of Day 2 operations.

DevOps and this shift toward the cloud have left developers and operations bearing immense cognitive load. Organizations have long-standing tool sprawl and Software as a Service (SaaS) subscription fatigue. All while most teams are trying to do more with less, including managing large-scale AI adoption.

Over the last few years, platform engineering has risen as a way to abstract away layers of operational complexity through creating golden paths — the simplest, self-serviced routes for developers to accomplish “non-differential but not unimportant tasks” like provisioning containers and deploying to the cloud. It’s achieved via an internal developer platform (IDP), which creates a single automated operating model for Kubernetes, operated by a centralized platform engineering team. When implemented well, it also enables a common language among technical, business and compliance stakeholders.

I sat down with the team behind Nutanix Kubernetes Platform (NKP) to further dive into these trends dominating enterprise IT strategy.

The Kubernetes Complexity Challenge

According to the Cloud Native Computing Foundation (CNCF), 80% of organizations have adopted Kubernetes. But just because it’s prevalent, doesn’t mean Kubernetes management is easy.

After all, Kubernetes is complex by design, whether you’re deploying across clouds, on premises or in edge environments.

This container orchestrator was originally built and then open sourced by Google to simplify application development by abstracting away the underlying infrastructure. However, this means any team in charge of adopting Kubernetes needs to then not only understand Kubernetes — including pods, deployments, services and ingresses — but also its many interacting components, including the Kubernetes API server, etcd and Redis stores, kubelet, kube-proxy, policy management, controller managers, controller schedulers and more. These measures are incredibly difficult to scale.

While many organizations choose the cloud for simplicity, there are still challenges with Kubernetes due to hidden costs as well as lack of portability.

For example, Kubernetes is known for its flexibility, allowing for a vast array of configuration options, which only adds to variety — and complexity. This makes Kubernetes logging and monitoring particularly challenging without centralized visibility into clusters. Visibility is not only essential for compliance and security, but also for performance.

Managed Kubernetes from cloud providers can provide the necessary monitoring, but that can greatly increase costs — and take away the option of portability, one of the primary benefits of Kubernetes.

There are also state-of-the-art open source solutions available, but using them means increased responsibility for the platform engineering team, including mitigating common vulnerabilities and exposures, testing compatibility and rolling out new versions across a fleet. They must also have a solid grasp of the underlying infrastructure, including the codebases, networking and operating systems, to avoid common cloud migration roadblocks.

Since most enterprises are running Kubernetes in more than one location — on premises, in the cloud and at the edge — platform engineering teams have to be ready to deal with varying levels of abstraction, automation and responsibilities, as Kubernetes can be deployed across bare metal, virtual machines (VMs) and/or in the cloud.

Don’t forget enterprise decision paralysis. Most notably, the Cloud Native Computing Foundation landscape seems to be ever-expanding, meaning that these open source tooling options and adoptions are continually adding to this complexity.

“There are a lot of hidden costs associated with deploying Kubernetes in the cloud. While cloud providers focus primarily on Kubernetes orchestration, enterprises still need to integrate additional capabilities to Kubernetes enterprise-grade including observability, networking, service mesh and other tools needed alongside Kubernetes,” said Dan Ciruli, cloud native product leader at Nutanix and co-founder of the OpenAPI Initiative.

Even if they adopt the best tool stack available, individual DevOps and platform teams still have to spend time making updates — configuring security patches and new features — then testing compatibility, before rolling out the patches and upgrades across the organization.

And, because Kubernetes doesn’t have built-in tools to audit whether protection is correctly configured, these teams need to work with site reliability engineering (SRE) and cybersecurity teams to integrate the right security policies and monitoring. It’s no wonder that DevOps and platform teams are facing burnout in response to widespread Kubernetes adoption.

However, by establishing consistency across environments, the Nutanix Kubernetes Platform (NKP) aims to reduce this operational complexity.

A Return to Stateful Storage

While Kubernetes was originally created with ephemeral, stateless workloads in mind, as more and more workloads are deployed in Kubernetes, the need for stateful workloads becomes greater.

Stateful workloads retain persistent, reliable and consistent data, within context, across sessions, requests and even application restarts. Stateful data is necessary for databases, financial systems, real-time communication, email servers, messaging queues, content management systems and even e-commerce shopping carts. And while cloud providers often provide storage for their hosted solutions, both industry and regional regulations increasingly require on-premises data storage for compliance. It’s also essential for quick and detailed incident management and disaster recovery.

No matter where the data sits, higher volumes of stateful data are being deployed in a Kubernetes environment. And there’s a demand to support both stateful and stateless workloads, despite the industry still lacking “prescriptive guidelines for choosing between stateless and stateful architectures, often leaving architects to rely on contextual judgments and experiential heuristics,” according to MIT researchers Mei Song and Margaret Joshua.

Kubernetes provides persistent volumes (PVs) and persistent volume claims (PVCs) to address the stateful demand for reliable storage, consistent network identities and stable scheduling. Even so, managing storage access modes, performance and capacity adds significant complexity for DevOps teams to manage across stateful workloads in Kubernetes.

Add to this, homegrown IDPs and most out-of-the-box container platforms don’t have integrated storage or data services, or perhaps they don’t have enough storage to meet the demand of new customers or new AI-based workloads.

Enterprises are often left with having to “bolt on” extra storage capacity — either cloud storage, which limits portability, or buying on-premises storage, which introduces yet another vendor, said Ciruli.

This “less-than-duct-tape” approach to extra storage, he continued, usually results in a fragile integration that risks data inconsistencies, which in turn can trigger both finger pointing and tracing problems.

Once you get that data in the cloud or on-premises, it becomes again very difficult to transfer that data to support a hybrid or multicloud strategy.

AI Accelerates Adoption of Kubernetes and Cloud

Now that AI is scaling both the demand for compute and the sheer size of codebases, it’s all about to get even more complicated.

Gartner researchers predicted in May that, by 2029, 50% of cloud compute resources will be devoted to AI and machine learning (ML) workloads.

“Now is the time for organizations to assess whether their data centers and cloud strategies are ready to handle this surge in AI and ML demand,” said Joe Rogus, director of advisory at Gartner, at the most recent Gartner IT Infrastructure, Operations and Cloud Strategies conference. “In many cases, they might need to bring AI to where the data is to support this growth.”

Ciruli agreed, telling me, “I think AI is going to be on prem. It’s going to be in the cloud. It’s going to be everywhere. It’s going to be on a mobile phone — it’s already there. All cloud apps are eventually going to be AI apps, right?”

However, he emphasized, AI is hard to implement due to the complexity of Kubernetes.

“With AI applications, it’s good to build them on containers because that gives you scalability and the portability that you would need,” Ciruli said.

“Because AI requires these containers as a foundation, a lot of companies are now starting to think about actually having to manage these containers at scale,” often for the first time.

A lot of organizations still don’t have the basics down, Ciruli added, including microservices and cloud native architecture. Let alone how to do Kubernetes and cloud at scale.

The Case for Platform Engineering

Enterprises would need to spend 3.5 times more on software than they currently do if open source software didn’t exist. But by no means does open source translate to free.

The complexity of open source — and especially the cloud computing and Kubernetes that drive that adoption — has triggered massive tool sprawl. Most enterprises don’t have insight into which  tools have been adopted and what data is being exposed across tens of thousands of internal and external APIs. And they are not prepared to scale and maintain Kubernetes.

Most organizations are left with do-it-yourself, ad-hoc ways, as engineers have engineered their own pathways to the cloud. This simply doesn’t scale across hundreds, thousands or tens of thousands of engineers within the enterprise environment.

Perhaps most importantly, Kubernetes management has become toil that distracts from the true objectives of DevOps teams:

  • Flow toward business value.
  • Faster feedback loops.
  • Continuous experimentation, learning and improvement.

Having DevOps manage Kubernetes quickly becomes inefficient and leads to greater technical debt, including manual processes, a lack of automation and configuration drift, when there’s a disparity between what’s in your repositories and what is actually deployed.

As Govardhana Miriyala Kannaiah wrote in “Kubernetes Anti-Patterns,” this debt then leaves DevOps teams spending a significant portion of their time managing existing systems, instead of innovating: “It’s the interest that accrues as you choose shortcuts, bypass best practices and opt for expedient solutions.”

DevOps teams benefit from the platform’s paved roads as much as application teams do.

Platform engineering has risen as an answer to this complexity at scale, as a way to better abstract that Kubernetes abstraction. And the platform team has formed to ease much of the DevOps workload. Which in turn also eases developers’ cloud native cognitive load.

“There’s a lot of ambiguity, knowledge gaps, things that developers know they don’t know. Developers don’t need to know the infrastructure platform,” Ciruli said. “We are, in fact, asking the platform engineers to build that platform and to serve it to them so that they don’t have to worry about the backend. They just have to worry about applications.”

Then as enterprises approach Kubernetes adoption, he continued, “they want to serve up a platform to DevOps, to make their lives and experience a lot richer and easier, and to streamline that process.”

What a Platform Engineering Strategy and IDP Deliver

A platform engineering strategy, whether based on a build-your-own or a SaaS-based platform offering, is a way to do more with less. It takes away some of the extreme developer autonomy and tool sprawl that marked the 2010s and trades it for the convenience of golden paths or opinionated workflows for common processes, standardized permissions and greater visibility across organizations.

Often, a platform engineering strategy kicks off with a focus on discoverability. It allows platform engineering, SRE and security teams to get a cross-organizational view of:

  • What’s already adopted.
  • Where dependencies are.
  • Who owns which services and APIs.
  • What data is being exposed within and outside the organization.

An internal developer platform emphasizes self-service so that developers can figure things out faster and just get things done. This usually cuts costs because it enables greater reusability of those services and APIs, instead of just building everything from scratch. An IDP is usually built with a search engine or, more recently, a conversational AI overlay, which enables search and documentation, helping get back the average one day a week developers waste looking for things.

While an IDP intentionally focuses on building what internal DevOps and engineering customers actually want to use, it does include some features of its predecessors, like role-based access control.

An internal developer platform centralizes software life cycle management, enabling single-deploy security patches and upgrades. This is essential because, according to the Infosec Institute, on average, it takes 60 to 150 days to remediate a vulnerability, which, depending on the vulnerability, can threaten your uptime, reputation and more.

Besides IT leadership feeling the need to cut costs while putting up more guardrails and processes in place, Ciruli said, “platform engineering for us is delivering that platform, which is open, complete, easy to install [and] easy to manage for DevOps [teams], to get their applications faster to market with easy-to-use self-service tools and capabilities.”

As a global leader in cloud software, Nutanix transforms the way organizations do business. We offer a single platform to run all your apps and data across on-premises, public clouds, hybrid environments, and at the edge, while simplifying operations and reducing complexity.
Learn More
The latest from Nutanix
TRENDING STORIES
Jennifer Riggins is a tech storyteller and journalist, event and panel host. She bridges the gap between business, culture and technology, with her work grounded in the developer experience. She has been a working writer since 2003, and is based...
Read more from Jennifer Riggins
Nutanix sponsored this post.
SHARE THIS STORY
TRENDING STORIES
SHARE THIS STORY
TRENDING STORIES
TNS DAILY NEWSLETTER Receive a free roundup of the most recent TNS articles in your inbox each day.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.
👁 Image
Take a quick test drive. See how easy it is to build, deploy, manage, and scale modern apps.