VOOZH about

URL: https://thenewstack.io/re-evaluating-mttr-as-key-metric-for-operational-performance/

⇱ Re-Evaluating MTTR as Key Metric for Operational Performance - The New Stack


TNS
SUBSCRIBE
Join our community of software engineering leaders and aspirational developers. Always stay in-the-know by getting the most important news and exclusive content delivered fresh to your inbox to learn more about at-scale software development.
REQUIRED
It seems that you've previously unsubscribed from our newsletter in the past. Click the button below to open the re-subscribe form in a new tab. When you're done, simply close that tab and continue with this form to complete your subscription.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.
Welcome and thank you for joining The New Stack community!
Please answer a few simple questions to help us deliver the news and resources you are interested in.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Great to meet you!
Tell us a bit about your job so we can cover the topics you find most relevant.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Welcome!

We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.

What’s next?

Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.

Follow TNS on your favorite social media networks.

Become a TNS follower on LinkedIn.

Check out the latest featured and trending stories while you wait for your first TNS newsletter.

PREV
1 of 2
NEXT
VOXPOP
As a JavaScript developer, what non-React tools do you use most often?
Angular
0%
Astro
0%
Svelte
0%
Vue.js
0%
Other
0%
I only use React
0%
I don't use JavaScript
0%
Thanks for your opinion! Subscribe below to get the final results, published exclusively in our TNS Update newsletter:
NEW! Try Stackie AI
From clobbered drafts to real-time sync
Apr 14th 2026 10:00am, by David Moore
TypeScript 6.0 RC arrives as a bridge to a faster future
Mar 14th 2026 9:00am, by Darryl K. Taft
Mastra empowers web devs to build AI agents in TypeScript
Jan 28th 2026 11:00am, by Loraine Lawson
2021-12-10 09:33:21
Re-Evaluating MTTR as Key Metric for Operational Performance
contributed,sponsor-bmc,sponsored,sponsored-post-contributed,
DevOps / Observability

Re-Evaluating MTTR as Key Metric for Operational Performance

If mean time to repair (MTTR) doesn't accurately portray the success of an operations team, what is a new metric in an AIOps-enabled future?
Dec 10th, 2021 9:33am by Ali Siddiqui
👁 Featued image for: Re-Evaluating MTTR as Key Metric for Operational Performance
Photo by RODNAE Productions from Pexels.
BMC sponsored this post.
Ali Siddiqui
Ali is chief product officer for BMC Software Inc. In this role, he has end-to-end responsibility for the company’s entire product portfolio, including the BMC Helix suite, Control-M and its Automated Mainframe Intelligence (AMI) solutions. Ali earned a Bachelor of Science degree in electrical engineering from the California Institute of Technology and a Master of Science degree from Stanford University.

While the technology for monitoring systems and applications has changed dramatically over the years, the way we measure performance and availability hasn’t changed much at all. But it might be time to think differently about the metrics we use when it comes to managing our IT systems.

Most IT organizations use fairly standard metrics to assess operational performance: application performance and availability, service-level agreement (SLA) fulfillment, incident number and severity, and mean time to repair (MTTR).

When these numbers perform well, we know that our systems are generally stable, our teams and their workflows are well-balanced, we are managing issues competently, and we are recovering quickly when there are problems.

With these numbers in hand, IT can effectively demonstrate its value to the business, the business can better plan its workload and deliverables, and both can look for ways to make changes and improvements backed by data.

Within IT teams, these numbers are frequently used to set benchmarks and reward those who surpass them, because if we are making continual improvements in how quickly we respond and problem-solve, we are surely improving the customer’s experience and their impression of our business.

But with the growing scope and use of artificial intelligence for IT operations, or AIOps, at least one of these metrics may soon be seen differently.

AIOps Enables Effective Use of Data

Though you may not yet have adopted it in any obvious way in your own organization, in its most elemental form, AIOps was developed to help better manage today’s astounding volumes and varieties of data.

What’s the problem with so much data? As with anything, too much of a good thing isn’t actually a good thing. Too much data means more time to comb through the data to find any sort of actionable insights.

If you have an outage and 100 alerts are triggered, how much time are you wasting investigating 99 false alarms before you get to the one that can tell you what really went wrong?

Enter AIOps. Combining big data and machine learning to automate the kinds of IT operations processes that have up to now required massive time and effort, AIOps creates efficiencies at scale, enables visibility across your infrastructure, and helps your team derive the insights needed to make powerful, data-driven business decisions more easily.

When event correlation, anomaly detection and root cause determination are essentially taken off your team’s work docket, thanks to the analytical capabilities of AIOps, IT teams will find themselves with more time to dedicate to more interesting, and more productive, projects.

Oh, the Irony

But, there’s a catch. Think about the powerful problem-solving abilities you gain with AIOps. With the greater efficiency, visibility and insight provided by the machine learning capabilities of AIOps, your MTTR numbers may actually go … up.

So, if you’ve been evaluating your team’s performance based on incremental reductions in the time it takes to restore services, you may soon want a new measure. Here’s why:

As AIOps-enabled solutions automate routine testing and proactively find, suggest fixes for, and potentially even remediate the issues — all without human intervention or oversight — these disruptions will actually cease to exist. Your AIOps solution has stopped that outage before it even happened.

But what’s left? The bigger, more complex service and operations issues that can’t be automated. The ones that may indeed require the talent of your operations staff, and possibly a lot more time.

All isn’t lost though. While these remaining types of challenges may be gnarlier, they’re also the kinds of problems that engineering minds love, that you actually want to pay those competitive wages for — and that ultimately leads to innovation.

BMC delivers industry-leading automation, operations, and service management solutions to customers and partners around the world, including 86% of the Forbes Global 50, helping them free up time and space to become an Autonomous Digital Enterprise that conquers the opportunities ahead.
Learn More
The latest from BMC

Metrics Moving Forward

If MTTR isn’t going to accurately portray the success of an operations team, then what is a metric to watch in an AIOps-enabled future? Size of the problem resolved? Complexity index? A clever ratio between problem severity and time to fix? Or have we truly entered an era where the “if you can’t measure it, you can’t manage it” axiom no longer fits?

Maybe this is your team’s next puzzle to solve. No matter how you frame the parameters of progress in this next era, the beauty is that we all win: fewer small and mundane problems, more interesting big ones and greater overall efficiency. Those are the numbers that count.

BMC delivers industry-leading automation, operations, and service management solutions to customers and partners around the world, including 86% of the Forbes Global 50, helping them free up time and space to become an Autonomous Digital Enterprise that conquers the opportunities ahead.
Learn More
The latest from BMC
TRENDING STORIES
Ali is chief product officer for BMC Software Inc. In this role, he has end-to-end responsibility for the company’s entire product portfolio, including the BMC Helix suite, Control-M and its Automated Mainframe Intelligence (AMI) solutions. Ali earned a Bachelor of...
Read more from Ali Siddiqui
BMC sponsored this post.
SHARE THIS STORY
TRENDING STORIES
TNS owner Insight Partners is an investor in: Pragma.
SHARE THIS STORY
TRENDING STORIES
TNS DAILY NEWSLETTER Receive a free roundup of the most recent TNS articles in your inbox each day.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.