
URL: https://thenewstack.io/how-ai-and-automation-can-improve-operational-resiliency/



AI / DevOps / Operations

How AI and Automation Can Improve Operational Resiliency

In this episode, Dormain Drewitz of PagerDuty explains how using automation fueled by AI can help companies bounce back from incidents faster and gain the confidence to keep taking risks.
Nov 3rd, 2023 6:00am by Heather Joslyn
PagerDuty sponsored this post.

When asked to define “operational resiliency,” Dormain Drewitz, vice president of platform advocacy at PagerDuty, recalled a recent conversation she had with Sam Newman, the noted O’Reilly author and technology consultant, about this very topic.

Newman cited the timeless sentiment of “Tubthumping,” the 1997 global smash by Chumbawamba about surviving a boozy night at the pub: “I get knocked down/but I get up again.”

Resiliency, Drewitz said in this episode of the New Stack Makers podcast, is “about having the ability to bounce back and recover.”

But, she added, “to Sam’s point, it’s more than just a sort of technical backup and recovery type of conversation. It has to also manifest in organizational recovery.”

True resiliency, she said, also means surviving incidents with your collective willingness to take risks intact: “You are able to deal with issues when they arise and you don’t let that stop you from still trying to move fast.”

She spoke to Heather Joslyn, host of this episode of TNS Makers, about the role AI and automation can play in dealing with incidents, and how they can help to build and maintain operational resiliency.

This conversation was sponsored by PagerDuty.

How to Avoid ‘Squeezing the Balloon’

With economic pressures pushing teams to be more productive than ever, automation aimed at developers, including automation fueled by the new wave of generative AI such as code completion tools, has become more widespread.

But making developers more productive can create what Drewitz called a “squeezing of the balloon problem”: moving bottlenecks from one part of the organization (developers) to another (operations).

“We face a test in our maturity around DevOps, when suddenly you’re gonna give the people ‘throwing things over the wall,’ so to speak: they’ve now got a forklift, and they’re throwing more,” she said.

What can automation coupled with AI do to improve the productivity of team members who aren’t frontend developers?

By looking at the entire value chain, Drewitz said, organizations can identify areas where AI coupled with automation can help. It takes “thinking about what happens after things get shipped to production and the teams that are involved there,” she said.

Steps that are always performed in various types of incidents (checking whether the Kubernetes API is responding, closing a particular port, checking whether the database is loading, and so on) can be automated. “Those are all opportunities to automate because they’re small, but they add up in terms of the blast radius, from an interruption and productivity loss perspective.”
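To make the idea concrete, here is a minimal sketch in Python of how such routine first-response checks might be scripted. It is an illustration only, not PagerDuty's runbook feature: the names `CheckResult`, `tcp_port_open` and `run_diagnostics` are hypothetical, and a plain TCP connection test stands in for whatever "is it responding?" means in a given environment.

```python
import socket
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class CheckResult:
    """Outcome of one diagnostic step (hypothetical structure)."""
    name: str
    ok: bool
    detail: str


def tcp_port_open(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and DNS failures.
        return False


def run_diagnostics(checks: Dict[str, Callable[[], bool]]) -> List[CheckResult]:
    """Run every named check in order and record the outcome.

    A check that raises is recorded as failed instead of aborting the run,
    so one broken step does not hide the results of the others.
    """
    results: List[CheckResult] = []
    for name, check in checks.items():
        try:
            ok = bool(check())
            detail = "passed" if ok else "failed"
        except Exception as exc:
            ok, detail = False, f"error: {exc}"
        results.append(CheckResult(name, ok, detail))
    return results
```

A caller would pass a dictionary of named checks, for example one entry that calls `tcp_port_open` against the Kubernetes API server's host and port and another that does the same for the database, then attach the returned results to the incident record. Each check is small on its own; the point of the sketch is that running them all automatically removes a string of manual interruptions.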

Using AI to Draft Postmortems

PagerDuty’s AI-powered operations platform is using generative AI to help automate repetitive tasks, with the idea of freeing engineers to tackle the causes of incidents and restore service more quickly.

For instance, users can now create automated runbooks to handle the repetitive troubleshooting tasks that must be carried out during an incident.


And, Drewitz said, PagerDuty now uses generative AI to draft status updates during an incident. “You still get to look over and say, ‘Yes, that looks right,’ or ‘I would change that here.’ But not having that blinking cursor, when you’re in the midst of an incident? … it’s valuable for folks.”

It can also draft an incident postmortem report. “It can take time to go back and do that, go back and relive this battle we just fought and write it down,” Drewitz said. “And if you pause and come back to it, then you may not have everything fresh in your mind anymore.”

Having an operations platform that can simply generate a postmortem draft at the push of a button helps save time and stress. “It’s so much easier to review and edit and approve something that’s been drafted,” she said. “Then you don’t have to be starting from scratch.”

Listen to the full episode for more on how automation and AI can improve incident management.

PagerDuty is the global leader in AI-first operations management serving more than 35,000 organizations worldwide. The PagerDuty Operations Cloud is a comprehensive, multi-product operations cloud platform that sits at the center of the enterprise technology stack.
Heather Joslyn is the former editor-in-chief of The New Stack. She previously worked as editor-in-chief of Container Solutions, a Cloud Native consulting company, and as an editor/reporter at The Chronicle of Philanthropy and the Baltimore City Paper.
PagerDuty is a sponsor of The New Stack.
TNS owner Insight Partners is an investor in: Pragma.