![]() |
VOOZH | about |
We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.
Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.
Follow TNS on your favorite social media networks.
Become a TNS follower on LinkedIn.
Check out the latest featured and trending stories while you wait for your first TNS newsletter.
If you manage a modern distributed IT environment, context is critical for troubleshooting and analyzing the business impact of production issues. But that context can be hard to acquire.
You might have different teams and observability solutions managing the different layers that contribute to a business service, or different tools that generate useful telemetry data, such as metrics, events, logs, traces and topology, but they operate in silos. Maybe you don’t have a model of the connections in your environment. Or possibly all the knowledge about cause-and-effect relationships, actions and consequences is not documented but locked in someone’s institutional memory.
To pinpoint the root cause of service issues accurately and quickly in complex environments, you need deep understanding of critical paths and dependency levels across the application, API and network layers.
Highly performant graph databases, dynamic service modeling capabilities and causal AI can help you understand and model the cause-and-effect relationships between different applications, APIs and network and infrastructure layers. Modeling your service — building a visualization of services and the relationships between various system and infrastructure components — provides critical context for troubleshooting. A well-defined service gives you an end-to-end view to quickly identify an impacted node for faster root cause analysis.
Assuming you have a dynamic and reconciled graph database of your IT landscape where all types of ingested data (metrics, events, logs, traces, topology) are normalized, modeling your service involves the following steps:
AI technologies such as causal AI and generative AI (GenAI) can help accelerate the troubleshooting process by connecting cause to effect and translating root cause insights. True AIOps requires a complete system designed to collect and model data through the lens of end users and business impacts. Service modeling, using the process above, allows you to confidently use AI to generate reliable insights.
Causal AI integrates knowledge graph and transformer-based AI techniques to understand and model relationships across telemetry data variables. Casual AI can reason about casual relations or patterns using topological data. A knowledge graph–based causality analysis analyzes how causal relationships change depending on how the variables influence one another.
Using causal AI in production troubleshooting:
GenAI also has a powerful role in the troubleshooting process. It can be used to generate:
For AI algorithms to give results that you trust, the quality of your data matters. Establishing the right foundation with well-defined service models is critical.
Service modeling is already making a significant impact in managing services. It decreases investigation time, helping you see and respond to issues before they impact the business.
Here are examples of how service modeling enables faster root cause analyses, continuous optimization and continuous compliance.
There’s no question that AI will continue to play an important role in observability. It can greatly accelerate the troubleshooting workflow and improve efficiency with the right contextual data.
The BMC Helix IT Operations Management (ITOM) portfolio provides out-of-the-box service blueprints that make it easier to create and maintain dynamic service models. And BMC Helix Operations Management with AIOps (BMC Helix AIOps) takes it a step further with a fully integrated, cloud-native, observability and AIOps solution that gives you the right tools and data to significantly reduce the time to isolate root causes and resolve problems.