URL: https://thenewstack.io/agentic-ai-and-platform-engineering-how-they-can-combine/


Agentic AI and Platform Engineering: How They Can Combine 

Agentic AI can build on platform engineering initiatives to enable asymmetric scaling in the face of Kubernetes complexity.
Apr 1st, 2025 11:00am by Jennifer Riggins
More than a decade after Kubernetes was introduced, and even as adoption of the container orchestrator has skyrocketed, a skills gap persists.

This is a big problem for enterprises that need K8s to scale. For Sebastian Kister, Kubernetes has become the public transport for compute.

“Kubernetes makes it possible to supply computing power automatically, at scale and, most of all, securely and reliably — which is not the case for many, many of the other technologies that we had before,” said Kister, product team lead of the container competence center, platforms and operations team at the car maker Audi and transformation consultant for other enterprises.

But that doesn’t mean Kubernetes has become easier to work with.

“The challenge is especially in the skillset of people using it,” he said. “The market makes it difficult to find truly senior people who have a deep understanding of Kubernetes.”

It all came to a head recently when one of his teams wanted to add 12 new clusters, and the site reliability engineering team responded: We need time to find and hire two more SREs.

With all the automation around Kubernetes in place, Kister was surprised by so many barriers to scaling. In the face of these persistent complexities, vulnerabilities and incidents, Kister looked toward AI.

Six months ago, Kister adopted the Kubiya agentic AI platform to support security responses that are, as he put it, “real-time, context-aware and continuously updated.” This adoption of agentic AI not only took an enterprise he works with from risk acceptance to active, intelligent remediation — it decreased team friction and stopped the blame game.

Agentic AI Aids Asymmetric Scaling

Like most companies of late, Kister’s platform engineering and operations teams felt urgent pressure to scale while facing shrinking budgets and rigid processes.

“We couldn’t hire fast enough, and educating junior talent at scale was too slow and unpredictable. The market made it nearly impossible to attract top-tier talent,” Kister said.

“We had to find another way — an asymmetric way to scale that didn’t rely on scarce resources.”

Kister aimed to use AI agents to take over toil and incident remediation, freeing senior developers from operations tasks and all developers from focus drift. He looked to agentic AI platforms, where agents can be trained on specialized tasks to eliminate repetitive work and shift teams’ focus toward features, innovation and enablement of projects that use the platform.

Building an Army of Very Specific AI Agents

The plan to leverage AI agents is not about deploying an AI agent for every use case.

It does not even follow the common platform engineering practice of covering use cases that affect 80% of engineers. Right now, Kister’s team is prioritizing AI agent use cases around runtime security, reliability and incident remediation that affect all engineering teams.

Kubiya has an “agentic native” internal developer platform for programmable agents that are configured to act as dedicated SRE AI agent soldiers for software development teams. There are 200 AI agent use cases out of the box but, like all platform engineering initiatives, organizations can build on top with custom agents for specific use cases.

Kubiya runs within this company’s Red Hat OpenShift clusters, scaling across its environments and integrating with its identity and access management (IAM) and role-based access control (RBAC) policies, with all the production-ready security and compliance guardrails in place.

“We have full visibility and control, and we trust these agents to do exactly what they’re supposed to — no more, no less,” Kister said.

Unlike other AI agent platforms that are still prone to hallucinations, Kubiya has added programmability and predictability controls, so even when a developer asks the AI agent to do something out of scope, it will limit the response to only the tool calls and permissions granted to it.

That scope is specific to the policy or environment to which the agent has access. It is enforced by Open Policy Agent, so it works on premises or in air-gapped environments.
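The scoping idea described above can be pictured with a minimal Python sketch. This is not Kubiya’s implementation and it does not use the Open Policy Agent API; the names `AgentScope` and `is_allowed`, along with the tool and environment names, are hypothetical and chosen only to illustrate a deny-by-default check on tool calls.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class AgentScope:
    """Hypothetical scope: the tools and environments one agent may touch."""
    allowed_tools: frozenset = field(default_factory=frozenset)
    allowed_environments: frozenset = field(default_factory=frozenset)


def is_allowed(scope: AgentScope, tool: str, environment: str) -> bool:
    """Deny by default: permit a call only if tool and environment are both in scope."""
    return tool in scope.allowed_tools and environment in scope.allowed_environments


# Hypothetical agent that may only read pod logs and restart deployments in staging.
sre_scope = AgentScope(
    allowed_tools=frozenset({"get_pod_logs", "restart_deployment"}),
    allowed_environments=frozenset({"staging"}),
)

print(is_allowed(sre_scope, "get_pod_logs", "staging"))      # True
print(is_allowed(sre_scope, "delete_namespace", "staging"))  # False
print(is_allowed(sre_scope, "get_pod_logs", "production"))   # False
```

In this sketch, a request that falls outside the granted tools or environments is simply refused, whatever the developer asks for.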

“It’s not a Software as a Service,” Kister said. “It’s your very special trained little Navy SEAL, sitting there doing this one job every day, every night, 24/7.” It heavily contributes to enterprise resiliency, he added.

In addition, by relying on Kubiya’s in-house SREs to create an AI agentic workforce, some of his clients’ platform teams were able to scale the technology without adding another round of training, or “an enormous team,” as he put it, to learn these nascent skills.

Kubiya has a full-stack AI platform that allows organizations to build on top of or bring their own AI agents for production-ready use cases. It also offers an enterprise version that includes on-premises deployments, a choice of large language models, and service assistance, which Kister’s team leaned on to avoid adding another skills gap.

“I bought an AI ‘platform engineer’ to deploy agentic workflows in a production-grade environment,” he said. “Then, as the requirements expand, we can take leverage of this asymmetric way to scale our workforce into new areas of the business.”

“Right now, as I don’t have the people or the knowledge to scale horizontally, I use their repository of pre-built AI agents to augment my teams’ efforts in running operations without needing to think twice about it.”

Measuring the Success of an AI Agent Platform

An engineering strategy is only as good as it is measured to be.

Before Kubiya, common vulnerabilities and exposures (CVEs) would sit in Jira, Kister said, treated like routine tasks — although they are anything but that.

“That backlog delayed responses and exposed risks,” he said. “With Kubiya, we automated mission-critical operations — on-call handling, real-time remediation and operational deflection — freeing our top developers from context overload so they can focus on innovation.”

In just six months, the approach has proven that security can work at scale:

  • Mean time to resolution (MTTR) dropped from eight hours to 30 minutes.
  • Weekly resolution time went from 64 hours to four.
  • Incidents reduced by 80%, due to proactive, AI-powered troubleshooting.
  • Repetitive requests for engineers dropped by 80%.
  • Annual run-rate for cloud infrastructure costs dropped by 20%, by identifying failed deployments running unnecessarily.
  • Compliance audits and security checks now take half the time to generate.
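As a quick check on the first two figures in the list, the relative reductions can be worked out directly. The helper `percent_reduction` is a hypothetical name introduced here for illustration; the inputs are the numbers quoted above, converted to hours.

```python
def percent_reduction(before: float, after: float) -> float:
    """Return the percentage drop from `before` to `after`."""
    return (before - after) / before * 100


# Figures quoted in the list above, expressed in hours.
mttr = percent_reduction(before=8.0, after=0.5)     # eight hours to 30 minutes
weekly = percent_reduction(before=64.0, after=4.0)  # 64 hours to four per week

print(f"MTTR reduction: {mttr:.2f}%")                       # 93.75%
print(f"Weekly resolution time reduction: {weekly:.2f}%")   # 93.75%
print(f"Weekly hours freed: {64.0 - 4.0:.0f}")              # 60
```

Both figures come to the same reduction of 93.75%, or 60 engineering hours freed each week.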

The project doubled the team’s value proposition, Kister said, because the cost of tooling increased by only 10%, all managed by his small, focused team.

AI Agents Help Developers Communicate

Kubiya didn’t just remove some of the biggest technical frustrations. It removed a lot of the interpersonal ones, too.

“This little agent talks to your junior developer and it can provide insights, and we got rid of finger pointing,” Kister said, because if something doesn’t meet standards, the platform won’t allow it to be deployed, and the developer knows exactly why.
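The “won’t allow it to be deployed, and the developer knows exactly why” behavior can be sketched as a gate that returns its reasons. The rules below (pinned image tags, a required memory limit) and the names `check_deployment`, `image` and `memory_limit` are hypothetical examples and not the standards Kister’s team enforces.

```python
def check_deployment(manifest: dict) -> list[str]:
    """Return human-readable reasons a deployment is blocked (empty list = allowed)."""
    reasons = []
    # Look only at the last path segment so a registry port is not mistaken for a tag.
    image_name = manifest.get("image", "").rsplit("/", 1)[-1]
    # Hypothetical rule: images must be pinned to an explicit, non-"latest" tag.
    if ":" not in image_name or image_name.endswith(":latest"):
        reasons.append("image must be pinned to a specific tag, not 'latest'")
    # Hypothetical rule: every deployment must declare a memory limit.
    if not manifest.get("memory_limit"):
        reasons.append("memory_limit is required")
    return reasons


blocked = check_deployment({"image": "registry.example.com/shop:latest"})
allowed = check_deployment(
    {"image": "registry.example.com/shop:1.4.2", "memory_limit": "512Mi"}
)

print(blocked)  # two reasons explaining why the deployment is rejected
print(allowed)  # []
```

Returning the reasons alongside the rejection is what lets a developer see the cause without a round of finger pointing.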

Developers simply have a conversation with the AI agent, asking: What happened here? What’s your advice? In the future, he said, his team will test making remediation more automated, too.

Now, “80% of troubleshooting is just off the table because it’s instantly clear through the AI, through the little agent that sits there,” he said. “You ask it, what happened here? And it’s like: Do you have a root cause for that? Yes, and it tells you the root cause and you just know what happened.”

Many of these core developer productivity metrics translate into cost savings, because the platform reduces the engineering hours spent on the frustration of finding what went wrong and reallocates that time to creating new features faster.

With Kubiya’s new AI agent platform, Kister’s team — and its internal developer customers — unlock visibility, scale builds asymmetrically, and truly do more with less. Or, better put: Do more with exactly the team he has.

Jennifer Riggins is a tech storyteller and journalist, event and panel host. She bridges the gap between business, culture and technology, with her work grounded in the developer experience. She has been a working writer since 2003, and is based...