VOOZH about

URL: https://thenewstack.io/distributed-tracing-is-failing-how-can-we-save-it/

⇱ Distributed Tracing Is Failing. How Can We Save It? - The New Stack


TNS
SUBSCRIBE
Join our community of software engineering leaders and aspirational developers. Always stay in-the-know by getting the most important news and exclusive content delivered fresh to your inbox to learn more about at-scale software development.
REQUIRED
It seems that you've previously unsubscribed from our newsletter in the past. Click the button below to open the re-subscribe form in a new tab. When you're done, simply close that tab and continue with this form to complete your subscription.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.
Welcome and thank you for joining The New Stack community!
Please answer a few simple questions to help us deliver the news and resources you are interested in.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Great to meet you!
Tell us a bit about your job so we can cover the topics you find most relevant.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Welcome!

We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.

What’s next?

Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.

Follow TNS on your favorite social media networks.

Become a TNS follower on LinkedIn.

Check out the latest featured and trending stories while you wait for your first TNS newsletter.

PREV
1 of 2
NEXT
VOXPOP
As a JavaScript developer, what non-React tools do you use most often?
Angular
0%
Astro
0%
Svelte
0%
Vue.js
0%
Other
0%
I only use React
0%
I don't use JavaScript
0%
Thanks for your opinion! Subscribe below to get the final results, published exclusively in our TNS Update newsletter:
NEW! Try Stackie AI
From clobbered drafts to real-time sync
Apr 14th 2026 10:00am, by David Moore
TypeScript 6.0 RC arrives as a bridge to a faster future
Mar 14th 2026 9:00am, by Darryl K. Taft
Mastra empowers web devs to build AI agents in TypeScript
Jan 28th 2026 11:00am, by Loraine Lawson
2023-05-03 06:52:43
Distributed Tracing Is Failing. How Can We Save It?
sponsor-chronosphere,sponsored-post-contributed,
Observability / Software Development / Software Testing

Distributed Tracing Is Failing. How Can We Save It?

Why not break down silos by putting metric and trace data in a single view? Then users can interact with observability data in both simple and advanced ways.
May 3rd, 2023 6:52am by Joel Groen
👁 Featued image for: Distributed Tracing Is Failing. How Can We Save It?
Chronosphere sponsored this post.

Distributed tracing has been around for over a decade, and it arrived on the scene with great fanfare. There were promises that distributed tracing would unlock the mysteries of large, complex microservices environments.

However, the early wave of tools was complex, and the majority of its value was centered on a small number of power users. This created workflow bottlenecks — and made it hard for casual users to understand just how beneficial distributed tracing can be.

Ten years in, we now know distributed tracing isn’t a silver bullet. To truly offer value, distributed tracing must get out of its silo and harness expert insight and deliver more “out of the box” functionality for casual users. It must be accessible across the entire organization and become integrated across the observability platform so users can solve problems in ways they couldn’t before.

Chronosphere, a Palo Alto Networks company, is the observability platform built for control in the modern, containerized world. Recognized as a leader by major analyst firms, Chronosphere empowers customers to focus on the data and insights that matter to reduce data complexity, optimize costs, and remediate issues faster. Visit chronosphere.io.
Learn More
Hear more from our sponsor

Distributed Tracing Isn’t Useful Siloed

Engineers are to some degree creatures of habit. The engineering organizations I’ve spent time with have a deep level of comfort with dashboards, and statistics show that’s where engineers spend the most time — they provide data in an easy-to-understand graphical user interface (GUI) for engineers to quickly answer questions.

However, it’s challenging when trace data is kept in its own silo. To access its value, an engineer must navigate away from their primary investigation to a separate place in the app — or worse, a separate app. Then the engineer must try to recreate whatever context they had when they determined that trace data could supplement the investigation.

Over time, all but a few power users start to drift away from using the trace query page on a regular basis. Not because the trace query page is any less useful. It’s simply outside of the average engineer’s scope. It’s like a kitchen appliance with lots of uses when you’re cooking, but because it’s kept out of sight in the back of a drawer, you never think to use it — even if it’s the best tool for the job.

In the end, a handful of tracing power users take advantage of the trace query page while the rest slide back into solving problems with familiar tools. Engineers require tools that present specific data with actionable insights. Even with all of distributed tracing’s potential, it’s still not integrated well enough into engineers’ toolkits for everyday use.

How Do We Make Distributed Tracing Better?

To make distributed tracing better, it must be useful and accessible.

How Do We Make Distributed Tracing Useful?

For engineering organizations to realize the full potential of trace data, engineers have to use it. Any calculation that attempts to measure the value of tracing is going to include some measure of breadth of use or the number of engineers who regularly use trace data.

The obvious answer to this question is to build a solution that has two different modes of operations: One that can meet the needs of both advanced power users and casual users.

Of course this is easier said than done. A truly valuable distributed tracing product must harness the workflows and insights of your expert users to deliver more “out of the box” value for the casual users. The ideal environment will offer a standard UI/UX across the organization but enough customization options so that power users can extract the insights they need.

How Do We Make Trace Data Accessible? 

Trace data can, and should be, integrated into workflows that matter to engineers, whether that is making alert notifications more actionable (including where to route them) or providing service-dependency data alongside dashboards responsible for that service.

Trace data isn’t valuable if it’s inaccessible. Every engineer in the organization must have easy access to trace data setup in tools they already work with on a regular basis. For a huge portion of software developers, that ends up being good, old reliable dashboards.

Dashboards historically are the visualization of metric time series as line charts, gauges and bar graphs. But engineering teams shouldn’t limit themselves to just displaying metrics; they can use dashboards to display insights from trace and other observability data types as well.

How to Put Trace Data to Work

This can be done by breaking down silos and bringing metric and trace data together under a single interface — like dashboards or alerts. Solutions are practical and effective; they are inclusive to the audience they cater to, bring together data in a meaningful way and take advantage of the real potential of distributed trace data.

We must treat dashboards like a tool in our observability toolkit that enhances other observability data but also relies on other observability data to fully achieve its own potential. Dashboards can provide the foundation to integrate distributed trace data across the entire organization instead of keeping it siloed, only to be used once in a great while.

The reality is dashboards are a go-to spot for observability data, so why not break down silos by including metric and trace data together in a single view? By doing so, an experienced user or a new team member can interact with observability data in both simple and advanced ways, such as out-of-the-box dashboards or advanced queries. With dashboards, engineers don’t have to learn an entirely new tracing tool, which means they can start using tracing data immediately.

Chronosphere’s observability platform brings together your metrics and traces data into one view so that everyone gets the most value out of all available observability data. This makes it easier to connect an incident’s details, like location and performance, without needing to think of the underlying data types, product definitions or dead ends.

Want to learn about how Chronosphere can help implement distributed tracing? Watch a live demo now.

Chronosphere, a Palo Alto Networks company, is the observability platform built for control in the modern, containerized world. Recognized as a leader by major analyst firms, Chronosphere empowers customers to focus on the data and insights that matter to reduce data complexity, optimize costs, and remediate issues faster. Visit chronosphere.io.
Learn More
Hear more from our sponsor
TRENDING STORIES
Joel Groen is a member of the product team at Chronosphere, and has guided a series of early stage companies to success in emerging markets.
Read more from Joel Groen
Chronosphere sponsored this post.
SHARE THIS STORY
TRENDING STORIES
SHARE THIS STORY
TRENDING STORIES
TNS DAILY NEWSLETTER Receive a free roundup of the most recent TNS articles in your inbox each day.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.