VOOZH about

URL: https://thenewstack.io/automated-infrastructure-hidden-costs/

⇱ Why "automated" infrastructure might cost more than you think - The New Stack


TNS
SUBSCRIBE
Join our community of software engineering leaders and aspirational developers. Always stay in-the-know by getting the most important news and exclusive content delivered fresh to your inbox to learn more about at-scale software development.
REQUIRED
It seems that you've previously unsubscribed from our newsletter in the past. Click the button below to open the re-subscribe form in a new tab. When you're done, simply close that tab and continue with this form to complete your subscription.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.
Welcome and thank you for joining The New Stack community!
Please answer a few simple questions to help us deliver the news and resources you are interested in.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Great to meet you!
Tell us a bit about your job so we can cover the topics you find most relevant.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Welcome!

We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.

What’s next?

Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.

Follow TNS on your favorite social media networks.

Become a TNS follower on LinkedIn.

Check out the latest featured and trending stories while you wait for your first TNS newsletter.

PREV
1 of 2
NEXT
VOXPOP
As a JavaScript developer, what non-React tools do you use most often?
Angular
0%
Astro
0%
Svelte
0%
Vue.js
0%
Other
0%
I only use React
0%
I don't use JavaScript
0%
Thanks for your opinion! Subscribe below to get the final results, published exclusively in our TNS Update newsletter:
NEW! Try Stackie AI
From clobbered drafts to real-time sync
Apr 14th 2026 10:00am, by David Moore
TypeScript 6.0 RC arrives as a bridge to a faster future
Mar 14th 2026 9:00am, by Darryl K. Taft
Mastra empowers web devs to build AI agents in TypeScript
Jan 28th 2026 11:00am, by Loraine Lawson
2026-02-24 04:00:46
Why "automated" infrastructure might cost more than you think
sponsor-pagerduty,sponsored-post-contributed,
AI Operations / DevOps / Infrastructure as Code

Why “automated” infrastructure might cost more than you think

Stop losing engineering velocity to automation sprawl. Identify hidden costs, consolidate operational patterns, and use operations-as-code.
Feb 24th, 2026 4:00am by Justyn Roberts
👁 Featued image for: Why “automated” infrastructure might cost more than you think
Milhad for Unsplash+
PagerDuty sponsored this post.

Somewhere in the organization, there’s a Jenkins job that nobody wants to touch. The job is mission-critical and deploys to production every Thursday. It was written three years ago by someone who’s since left, references environment variables that may or may not still exist, and the only documentation is a Slack message that reads “Just run it. It works.”

This is automation sprawl, and it’s quietly bleeding the organization’s engineering velocity dry.

The hidden costs nobody tracks

The thing that doesn’t show up in the infrastructure bill is the cognitive load of remembering which automation lives where. The thirty minutes it may take to figure out why the same job works in Jenkins, but fails in GitHub Actions. The subtle drift between environments occurs because two different tools are managing overlapping resources. Teams may find senior engineers spending 20% of their time simply maintaining automation and keeping existing pipelines functional, rather than building new capabilities.

That’s not velocity. That’s running to stand still.

Teams may find senior engineers spending 20% of their time simply maintaining automation and keeping existing pipelines functional… That’s not velocity. That’s running to stand still.”

While everyone is focused on cloud spend, the real costs start to surface through the meetings to coordinate changes across platforms, the incident response, where three teams need to be on a call because nobody knows which automation modified what, and the security review that takes a month, after all, auditors need to understand five different permission models.

What consolidation actually looks like

Successful consolidation efforts share common traits.

The smart teams start with an inventory. It’s not a spreadsheet that goes stale in a month, but an actual, maintained, living catalog of what automation exists, who owns it, and what it does. Teams can’t consolidate what they can’t see.

They establish patterns before platforms. The question isn’t “Should we use Tool X or Tool Y?” It’s “What are the three or four automation patterns we actually need, and what’s the simplest way to implement each?” Most organizations need scheduled jobs, event-triggered workflows, human-initiated runbooks, and deployment pipelines. Everything else is probably a special case that should justify its own existence.

These teams accept that migration is a longer-term strategy. The sprawl didn’t happen overnight, and it won’t be fixed overnight. The goal is to stop adding to the problem while gradually reducing the existing footprint.

The operations-as-code evolution

With operations-as-code, teams can transition from patterns to templates. These templates serve as foundations for extending applications and for infrastructure-as-code (IaC). The same principles that drove IaC — version control, repeatability, peer review — are now being applied to operations themselves. Not just in a manner of “How do we deploy this service?” but “How do we respond to this alert?” and “How do we onboard a new database?”

The teams doing this well aren’t treating runbooks as documentation that might be out of date, but as executable code that self-documents through its implementation. When the steps change, the automation changes. When the automation runs, it generates an audit trail.

This also matters beyond efficiency. When operational knowledge lives in people’s heads (or worse, in undocumented scripts on someone’s laptop), the organization is one resignation away from an outage.

Practical steps forward

For organizations looking to get past this trap, here’s where to start:

Audit before acting. Spend a week cataloging. Talk to team leads to find out what happens when X breaks, which scripts they use, and what happens if the person who usually runs the scripts isn’t there. Grep repos for pipeline definitions. Teams may find automation they didn’t know existed, as well as automation that nobody uses anymore, but everyone’s afraid to delete

Define ownership explicitly. Every piece of automation should have an owner who knows they’re the owner and is tagged to a service where appropriate. The number of orphaned pipelines in most organizations suggests this isn’t as obvious as it sounds. Teams can move, but services generally don’t.

Create a decision framework. When someone needs new automation, where should it live? Write down the criteria, such as “If it’s CI/CD, use X. If it’s scheduled operations, use Y. If it’s incident response, use Z.” Make the default path obvious and consider creating a GitHub repo to store these artifacts.

Measure the tax. Track time spent maintaining existing automation separately from time building new capabilities. If maintenance consistently accounts for more than 30% of automation-related work, the organization has a consolidation problem.

Bringing it together

The pattern that works isn’t about finding one tool to rule them all. It’s about establishing clear boundaries for what goes where.

CI/CD pipelines belong in the CI/CD system. Infrastructure provisioning belongs in Terraform, Pulumi, or whatever tool the team has standardized on. But the middle layer, which includes scheduled maintenance tasks, incident response procedures, and the ad-hoc operational work that currently lives in 20 different places, is where consolidation pays off the fastest.

This is the space where runbook automation platforms like Rundeck have carved out a niche. It’s not about replacing existing tools, but providing a unified control plane for operational work. The jobs orchestrate across the organization’s infrastructure rather than duplicating it. Access controls let teams delegate execution without handing out SSH keys. Audit trails give the compliance story that scattered scripts never will.

Other tools play in this space, too. Ansible Automation Platform, various Kubernetes operators, and even well-structured GitHub Actions with proper OpenID Connect (OIDC). Operational automation deserves the same rigor as application code, which means consolidating it so teams can actually manage it.

The uncomfortable truth

Automation sprawl persists because it’s locally optimal. Each team, solving its immediate problem with the tools they know, is making a reasonable choice. The sprawl emerges from reasonable choices compounding over time.

“Automation sprawl persists because it’s locally optimal… The sprawl emerges from reasonable choices compounding over time.”

Fixing it requires thinking at a different level. It’s not just about choosing the best tool for the job, but also about selecting the best tool ecosystem for the organization over the next five years. It’s a harder question to answer, and one that requires the sort of buy-in most individual contributors can’t secure on their own.

However, the alternative is technical debt that doesn’t look like technical debt and appears to be business as usual. That is, until someone leaves, or an audit happens, or an incident exposes just how fragile the foundation really is.

PagerDuty is the global leader in AI-first operations management serving more than 35,000 organizations worldwide. The PagerDuty Operations Cloud is a comprehensive, multi-product operations cloud platform that sits at the center of the enterprise technology stack.
Learn More
The latest from PagerDuty
Hear more from our sponsor
TRENDING STORIES
Justyn Roberts is a Principal Solutions Consultant for Automation at PagerDuty, where he helps enterprises untangle automation sprawl and build sustainable operational practices.
Read more from Justyn Roberts
PagerDuty sponsored this post.
SHARE THIS STORY
TRENDING STORIES
SHARE THIS STORY
TRENDING STORIES
TNS DAILY NEWSLETTER Receive a free roundup of the most recent TNS articles in your inbox each day.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.