VOOZH about

URL: https://thenewstack.io/ai-powered-service-models-speed-troubleshooting/

⇱ Diagnose Issues Faster With AI-Powered Service Modeling - The New Stack


TNS
SUBSCRIBE
Join our community of software engineering leaders and aspirational developers. Always stay in-the-know by getting the most important news and exclusive content delivered fresh to your inbox to learn more about at-scale software development.
REQUIRED
It seems that you've previously unsubscribed from our newsletter in the past. Click the button below to open the re-subscribe form in a new tab. When you're done, simply close that tab and continue with this form to complete your subscription.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.
Welcome and thank you for joining The New Stack community!
Please answer a few simple questions to help us deliver the news and resources you are interested in.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Great to meet you!
Tell us a bit about your job so we can cover the topics you find most relevant.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Welcome!

We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.

What’s next?

Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.

Follow TNS on your favorite social media networks.

Become a TNS follower on LinkedIn.

Check out the latest featured and trending stories while you wait for your first TNS newsletter.

PREV
1 of 2
NEXT
VOXPOP
As a JavaScript developer, what non-React tools do you use most often?
Angular
0%
Astro
0%
Svelte
0%
Vue.js
0%
Other
0%
I only use React
0%
I don't use JavaScript
0%
Thanks for your opinion! Subscribe below to get the final results, published exclusively in our TNS Update newsletter:
NEW! Try Stackie AI
From clobbered drafts to real-time sync
Apr 14th 2026 10:00am, by David Moore
TypeScript 6.0 RC arrives as a bridge to a faster future
Mar 14th 2026 9:00am, by Darryl K. Taft
Mastra empowers web devs to build AI agents in TypeScript
Jan 28th 2026 11:00am, by Loraine Lawson
2024-08-07 06:16:40
Diagnose Issues Faster With AI-Powered Service Modeling
sponsor-bmc,sponsored-post-contributed,
AI / Observability / Operations

Diagnose Issues Faster With AI-Powered Service Modeling

Service modeling with AI enables faster root cause analyses, continuous optimization and continuous compliance to resolve problems faster.
Aug 7th, 2024 6:16am by Mary Chen
👁 Featued image for: Diagnose Issues Faster With AI-Powered Service Modeling
Featured image by Unsplash+ in collaboration with Getty Images.
BMC sponsored this post.

If you manage a modern distributed IT environment, context is critical for troubleshooting and analyzing the business impact of production issues. But that context can be hard to acquire.

You might have different teams and observability solutions managing the different layers that contribute to a business service, or different tools that generate useful telemetry data, such as metrics, events, logs, traces and topology, but they operate in silos. Maybe you don’t have a model of the connections in your environment. Or possibly all the knowledge about cause-and-effect relationships, actions and consequences is not documented but locked in someone’s institutional memory.

To pinpoint the root cause of service issues accurately and quickly in complex environments, you need deep understanding of critical paths and dependency levels across the application, API and network layers.

Highly performant graph databases, dynamic service modeling capabilities and causal AI can help you understand and model the cause-and-effect relationships between different applications, APIs and network and infrastructure layers. Modeling your service — building a visualization of services and the relationships between various system and infrastructure components — provides critical context for troubleshooting. A well-defined service gives you an end-to-end view to quickly identify an impacted node for faster root cause analysis.

How Service Modeling Works

Assuming you have a dynamic and reconciled graph database of your IT landscape where all types of ingested data (metrics, events, logs, traces, topology) are normalized, modeling your service involves the following steps:

  • Identify end-user services that you want to model and add service details as inputs to the service modeling tool. An application performance monitoring (APM) tool can provide application-specific details about software components and their relationships across cloud, mainframe and container topologies. Infrastructure and network monitoring tools and scanning tools can detail the infrastructure’s connectivity to underlying virtual and physical devices, such as servers, databases, switches, routers, firewalls and load balancers.
  • Use blueprints to dynamically traverse all layers to automatically connect the application topology to the hosts and network devices. Discovery and monitoring tools can provide service blueprints to simplify creating and maintaining dynamic service models. These service models support modern technologies like microservices, Kubernetes, cloud services, application performance tracking and mainframes to keep accurate tabs on all IT resources and relationships. Blueprints make it easy to express a simple rule that identifies all the elements of your service. You define the rule once and apply it to as many services as needed.
  • Calculate the health score for a service. Understanding a service’s current and historical health based on indicators, abnormalities and events in the service model’s components helps you identify root causes of health impacts or service performance degradation. Machine learning (ML) algorithms can calculate health scores, so you can quickly understand the scope of an issue.

How to Incorporate AI for Faster Troubleshooting

AI technologies such as causal AI and generative AI (GenAI) can help accelerate the troubleshooting process by connecting cause to effect and translating root cause insights. True AIOps requires a complete system designed to collect and model data through the lens of end users and business impacts. Service modeling, using the process above, allows you to confidently use AI to generate reliable insights.

Causal AI integrates knowledge graph and transformer-based AI techniques to understand and model relationships across telemetry data variables. Casual AI can reason about casual relations or patterns using topological data. A knowledge graph–based causality analysis analyzes how causal relationships change depending on how the variables influence one another.

Using causal AI in production troubleshooting:

  • Helps you understand and explain a problem by providing visual representations of how events were correlated and how the root cause was identified.
  • Accelerates troubleshooting by automatically identifying whether a similar situation previously occurred. If you’ve already seen and resolved a problem, you shouldn’t need to go through the entire discovery process again. Causal AI fingerprints recurring situations for future identification to help speed mean time to recovery (MTTR) and reduce incident noise.

GenAI also has a powerful role in the troubleshooting process. It can be used to generate:

  • Plain-language summaries, making it quicker and simpler to understand an issue compared to decoding a string of output error codes.
  • Best-action recommendations to resolve issues.
  • Responses to commonly asked questions during troubleshooting.

For AI algorithms to give results that you trust, the quality of your data matters. Establishing the right foundation with well-defined service models is critical.

Real-World Applications

Service modeling is already making a significant impact in managing services. It decreases investigation time, helping you see and respond to issues before they impact the business.

Here are examples of how service modeling enables faster root cause analyses, continuous optimization and continuous compliance.

  • Root cause analysis: By modeling service dependencies as a reconciled topology, you can isolate the root cause of an issue, whether it’s:
    • The application software components: Don’t impact the infrastructure.
    • The network: Impacts the infrastructure and application.
    • The mainframe database: Impacts distributed applications.
  • Capacity optimization: By analyzing interactions between services, service modeling can provide insights into how to right-size and align IT resources with changing business requirements. When used with AI to analyze bottlenecks and recommend areas to minimize risk and cost, you can continuously optimize your IT environment’s performance.
  • Continuous compliance: Collecting and modeling IT assets, services and relationships provides up-to-date information and processes to meet security and regulatory compliance requirements. Instead of chasing down individual developers to document what’s running and where, you can stay ahead of ever-growing risk and complexity with automated discovery and service modeling.

There’s no question that AI will continue to play an important role in observability. It can greatly accelerate the troubleshooting workflow and improve efficiency with the right contextual data.

The BMC Helix IT Operations Management (ITOM) portfolio provides out-of-the-box service blueprints that make it easier to create and maintain dynamic service models. And BMC Helix Operations Management with AIOps (BMC Helix AIOps) takes it a step further with a fully integrated, cloud-native, observability and AIOps solution that gives you the right tools and data to significantly reduce the time to isolate root causes and resolve problems.

BMC delivers industry-leading automation, operations, and service management solutions to customers and partners around the world, including 86% of the Forbes Global 50, helping them free up time and space to become an Autonomous Digital Enterprise that conquers the opportunities ahead.
Learn More
The latest from BMC
TRENDING STORIES
Mary Chen is a Principal Solutions Marketing Manager for the BMC Helix IT Operations Management (ITOM) portfolio. She previously held various marketing roles in the observability and security markets at Check Point Software, Splunk, VMware, and a couple of startups....
Read more from Mary Chen
BMC sponsored this post.
SHARE THIS STORY
TRENDING STORIES
TNS owner Insight Partners is an investor in: Real, Root.
SHARE THIS STORY
TRENDING STORIES
TNS DAILY NEWSLETTER Receive a free roundup of the most recent TNS articles in your inbox each day.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.