VOOZH about

URL: https://thenewstack.io/diy-kubernetes-agentic-ai/

⇱ Why your DIY Kubernetes stack won't survive the era of agentic AI - The New Stack


TNS
SUBSCRIBE
Join our community of software engineering leaders and aspirational developers. Always stay in-the-know by getting the most important news and exclusive content delivered fresh to your inbox to learn more about at-scale software development.
REQUIRED
It seems that you've previously unsubscribed from our newsletter in the past. Click the button below to open the re-subscribe form in a new tab. When you're done, simply close that tab and continue with this form to complete your subscription.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.
Welcome and thank you for joining The New Stack community!
Please answer a few simple questions to help us deliver the news and resources you are interested in.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Great to meet you!
Tell us a bit about your job so we can cover the topics you find most relevant.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Welcome!

We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.

What’s next?

Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.

Follow TNS on your favorite social media networks.

Become a TNS follower on LinkedIn.

Check out the latest featured and trending stories while you wait for your first TNS newsletter.

PREV
1 of 2
NEXT
VOXPOP
As a JavaScript developer, what non-React tools do you use most often?
Angular
0%
Astro
0%
Svelte
0%
Vue.js
0%
Other
0%
I only use React
0%
I don't use JavaScript
0%
Thanks for your opinion! Subscribe below to get the final results, published exclusively in our TNS Update newsletter:
NEW! Try Stackie AI
From clobbered drafts to real-time sync
Apr 14th 2026 10:00am, by David Moore
TypeScript 6.0 RC arrives as a bridge to a faster future
Mar 14th 2026 9:00am, by Darryl K. Taft
Mastra empowers web devs to build AI agents in TypeScript
Jan 28th 2026 11:00am, by Loraine Lawson
2026-02-26 04:00:45
Why your DIY Kubernetes stack won't survive the era of agentic AI
sponsor-vmware,sponsored-post-contributed,
AI Agents / Kubernetes / Platform Engineering

Why your DIY Kubernetes stack won’t survive the era of agentic AI

Your DIY Kubernetes stack can't handle Agentic AI. Learn why enterprises must replace Frankenstein platforms with invisible infrastructure.
Feb 26th, 2026 4:00am by Oren Penso
👁 Featued image for: Why your DIY Kubernetes stack won’t survive the era of agentic AI
Graphicook Studio for Unsplash+
VMware Tanzu sponsored this post.

We are at an inflection point in enterprise IT. For the last decade, “modernization” has been synonymous with containerization — essentially packaging applications so they can run anywhere. But the goalposts have moved. The challenge is no longer just about running static microservices; it is about preparing for an era of Agentic AI and dynamic, resource-hungry workloads that defy traditional infrastructure planning.

This isn’t just a technical pivot; it’s a survival strategy driven by a few massive shifts in the landscape.

First, we have the AI reality check. We are moving quickly from simple chatbots to autonomous agents -software that plans, reasons, and acts. These agents don’t fit the predictable “9-to-5” usage patterns of legacy apps. They are bursty, demanding massive compute for short inference windows and scaling to zero when idle. Attempting to support these fluid workloads with rigid, ticket-based infrastructure provisioning is a non-starter. You simply cannot manually provision for an agent that needs to scale now.

Then there is the efficiency collapse. Organizations are drowning in the “Day 2” costs of their DIY platforms. By stitching together disparate open-source tools, teams have inadvertently created a massive maintenance burden. Highly paid engineers are spending their days patching ingress controllers and debugging YAML instead of shipping business value. The cost of maintaining the platform has eclipsed the value of the applications it hosts.

Finally, we have the private cloud mandate. As data gravity and sovereignty concerns grow, the public cloud isn’t always the answer. Enterprises need a “Cloud Operating Model” — the ability to vend APIs, not tickets — within their own data centers.

The bottom line is simple: We need to stop building platforms and start consuming them. The infrastructure must become invisible so that the intelligence, both human and artificial, can flourish.

The end of “roll your own platform”

We’ve spent the last decade stuck in a false binary. Organizations were either “Cloud Native” running Kubernetes, or “Legacy” running Virtual Machines. The industry has poured billions of dollars and endless engineering hours into rewriting the laws of physics, convincing ourselves that if we just containerized everything, we’d hit operational nirvana.

Spoiler alert: It didn’t happen.

Instead, we built the “Frankenstein Platform.” Look at your average platform engineering team. They are drowning in complexity, trying to stitch together ArgoCD, Istio, Tekton, Prometheus, and a dozen other CNCF projects just to recreate the developer experience Pivotal Cloud Foundry (now Tanzu Platform) solved back in 2012.

“The hidden cost of these Frankenstein platforms isn’t just initial engineering; it’s the “Day 2″ nightmare … Your platform team stops innovating and starts firefighting.”

The hidden cost of these Frankenstein platforms isn’t just initial engineering; it’s the “Day 2” nightmare. When a security vulnerability hits a specific version of Istio, or when a Prometheus update breaks your Grafana dashboards, your platform team stops innovating and starts firefighting. They become overworked operators of a fragile stack rather than enablers of business value. We promised developers a self-service experience, but we gave them a ticket queue for a platform team that is perpetually underwater.

But looking at where infrastructure is heading in 2026, we recognize a new pattern. It isn’t about choosing between the stability of the past and the flexibility of the future. It’s about stacking them.

The universal control plane

The most important development in Kubernetes recently has nothing to do with containers. It is the realization that the Kubernetes API is effectively the universal control plane for the data center. This is the main idea behind the notion that Kubernetes is a platform for building platforms; a better place to start, not the endgame.

It took us longer than it should have to understand this and instead treated Kubernetes as a product sold directly to developers, forcing them to wrangle YAML and figure out pod disruption budgets. Eventually, we understood that Kubernetes was never meant to be the user interface. It’s the assembly code of the modern data center, the invisible layer we build on.

The missing link: Autonomous operations

While Kubernetes solved the infrastructure API, it arguably made the application lifecycle worse. We traded the guardrails of a Platform as a Service (PaaS) for the raw flexibility of endless configuration. But once again, we look to Cloud Foundry for insight into the future. If the ultimate goal is truly autonomous operations, then you need a platform that enforces standardization to ensure consistency and scale. By delivering standard tools and practices through the platform, we can achieve automation nirvana!

Let’s consider BOSH, the heart of Cloud Foundry. While on the surface, BOSH looks like a deployment tool, it functions more like an Availability Engineer. It not only deploys code but also monitors system health, resurrects failed components, rotates credentials, and patches operating systems without downtime.

Unlike standard Kubernetes operators, which often focus on the application layer, BOSH deeply understands infrastructure dependencies. It knows how to drain a node, reprovision the OS, and reattach storage without the application and the developer ever noticing a blip. It provides the “dynamic repair” capability that raw Kubernetes assumes is handled by someone else. In a world of increasing complexity, having a platform that can self-heal at the VM level is not a luxury; it is a prerequisite for scale.

The AI catalyst

This need for an opinionated, consistent application platform is getting pushed hard by the explosion of Agentic AI.

Look at how the public clouds are solving for AI Agents. They definitely aren’t handing developers raw Kubernetes clusters. Public cloud agentic platforms abstract the infrastructure entirely. Why? Because AI agents are non-deterministic. They scale fast, chain complex tasks, and need ephemeral environments. Managing the lifecycle of thousands of autonomous agents at the “Pod” level is a nightmare.

“Public cloud agentic platforms abstract the infrastructure entirely. Why? Because AI agents are non-deterministic. They scale fast, chain complex tasks, and need ephemeral environments.”

Furthermore, the resource demands of Agentic AI surge unpredictably. An agent might sit idle for hours and then suddenly require massive compute to parallelize a complex reasoning chain. Hard-coding this infrastructure is inefficient and expensive. We need a platform that treats these agents as first-class citizens, automatically scaling the underlying compute down to zero when idle and bursting instantly when the agent needs to “think,” without a human operator needing to provision a single server.

The industry is voting with its feet here: AI needs high-level, opinionated platforms. If we want to run these workloads on-premise (where the data lives), we can’t rely on raw plumbing. We need a stack that mimics the public cloud model.

The post-hype era

The era of “Resume Driven Development” – picking tools because they are trendy – is over. The next wave of thought leadership is about Convergence.

We are entering a phase of “Industrialized Kubernetes.” The tinkering phase is over. The goal is no longer to see if we can build it, but to see how fast we can ship value on top of it. The winners of the next decade won’t be the ones building the most complex Kubernetes clusters. They will be the ones who realize the best platform is the one you don’t have to build yourself.

Trusted by enterprises and loved by developers, VMware Tanzu is built for platform and data teams who want to accelerate agentic software delivery and AI-ready data. Tanzu provides a pre-engineered, agentic app platform and an AI-ready data intelligence platform that helps enterprises build, run, manage and safeguard agents, their integrations and data so you can capitalize on AI at scale. 
Learn More
The latest from VMware Tanzu
Hear more from our sponsor
TRENDING STORIES
Oren Penso is a seasoned IT executive and senior product strategist with more than 25 years of experience in automation, security, observability, native public clouds and platform engineering. His leadership roles span across public financial companies and VMware. Beyond his...
Read more from Oren Penso
VMware Tanzu sponsored this post.
SHARE THIS STORY
TRENDING STORIES
SHARE THIS STORY
TRENDING STORIES
TNS DAILY NEWSLETTER Receive a free roundup of the most recent TNS articles in your inbox each day.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.