VOOZH about

URL: https://thenewstack.io/honeycombs-charity-majors-go-ahead-test-in-production/

⇱ Honeycomb's Charity Majors: Go Ahead, Test in Production - The New Stack


TNS
SUBSCRIBE
Join our community of software engineering leaders and aspirational developers. Always stay in-the-know by getting the most important news and exclusive content delivered fresh to your inbox to learn more about at-scale software development.
REQUIRED
It seems that you've previously unsubscribed from our newsletter in the past. Click the button below to open the re-subscribe form in a new tab. When you're done, simply close that tab and continue with this form to complete your subscription.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.
Welcome and thank you for joining The New Stack community!
Please answer a few simple questions to help us deliver the news and resources you are interested in.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Great to meet you!
Tell us a bit about your job so we can cover the topics you find most relevant.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Welcome!

We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.

What’s next?

Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.

Follow TNS on your favorite social media networks.

Become a TNS follower on LinkedIn.

Check out the latest featured and trending stories while you wait for your first TNS newsletter.

PREV
1 of 2
NEXT
VOXPOP
As a JavaScript developer, what non-React tools do you use most often?
Angular
0%
Astro
0%
Svelte
0%
Vue.js
0%
Other
0%
I only use React
0%
I don't use JavaScript
0%
Thanks for your opinion! Subscribe below to get the final results, published exclusively in our TNS Update newsletter:
NEW! Try Stackie AI
From clobbered drafts to real-time sync
Apr 14th 2026 10:00am, by David Moore
TypeScript 6.0 RC arrives as a bridge to a faster future
Mar 14th 2026 9:00am, by Darryl K. Taft
Mastra empowers web devs to build AI agents in TypeScript
Jan 28th 2026 11:00am, by Loraine Lawson
2018-10-02 13:43:14
Honeycomb's Charity Majors: Go Ahead, Test in Production
analysis,
Observability

Honeycomb’s Charity Majors: Go Ahead, Test in Production

Continuing to question the traditional wisdom that software changes should be tested in their own sandbox environment,  Charity Majors, CEO of observability software provider Honeycomb.io, spoke to the audience at Gremlin's chaos engineering conference last week, ChaosConf about the benefits of testing in the production environment.
Oct 2nd, 2018 1:43pm by Joab Jackson
👁 Featued image for: Honeycomb’s Charity Majors: Go Ahead, Test in Production

Continuing to question the traditional wisdom that software updates should be tested in their own sandbox environment, Charity Majors, CEO of observability software provider Honeycomb.io, spoke at Gremlin‘s chaos engineering conference last week, ChaosConf, about the benefits of testing in the production environment.

“Testing in prod has gotten a bad rap,” she told the audience, referring to the conventional wisdom that running untested code on live users is asking for trouble — and customer dissatisfaction.

She explained that the initial negative reaction to the idea is based on a false dichotomy. It assumes that there are only two options for software development which is, at heart, an exercise in repeated testing. One is to test software totally in its own “sandboxed” environment. The other option would be to upload the new code to the cloud for all users.

But there are multiple techniques, such as A/B testing and Canary testing, that allow you to try code on a small portion of the entire user-base, so you can collect metrics before rolling the update out system-wise.

“Trying to mirror your staging environment to production is a fool’s errand. Just give up.”

What we think of as the typical software upgrade cycle is due for an upgrade, she said, especially as our architectures move towards container-based distributed systems, which can’t be easily managed by the old tools.

“Distributed systems are incredibly hostile to being cloned or imitated, or monitored or staged,” she said. “Trying to mirror your staging environment to production is a fool’s errand. Just give up.”

Most architectures are way more complicated than the standard LAMP stack. “Finding the right level of detail to ask the right questions is challenging,” she said.

With distributed systems, what can go wrong is this infinitely long tail of things that will probably never happen, but one day they do. Photos load slowly for some people but not for others. “How are you going to find that in a staging environment? Spoiler alert: You’re not.”

This is why observability is so important — which Honeycomb specializes in — because you can’t predict where the failure will occur. Maintaining distributed systems involves more than attaching a standard set of monitoring agents to your application.

👁 Image

Adrian Cockcroft

Majors’ views were echoed earlier in the day by by Amazon Vice President, and all-around scalable microservices expert, Adrian Cockcroft. “Your failure model will not include the outlier that breaks everything,” he said.

Problems with distributed systems will not show up in a dashboard, Majors noted. The temptation is to create a new dashboard entry that would capture the problem the next time it occurs. The problem is, that exact problem will probably never occur again. You need to take a more open-ended, exploratory approach, she said.

“Monitoring itself is not enough for complex systems,” she said. “Dashboards are a relic.”

Events Not Metrics

“The hard part is figuring out where the problem lies. The hard part is not debugging the code. The hard part is figuring out which part of the code to debug,” Majors said.

She talked about how most problems in distributed systems are ones involving high-cardinality. Cardinality is the number of elements in a set, and perfectly cardinality is one where each element gets its own set. The answers to most all problems in distributed sets come from high-cardinality data — or a very small subset of factors that somehow worked together to create trouble.

A traditional control theory definition of observability is that it is a measure of how well internal states of a system can be inferred from knowledge of its external outputs. In this light, the trick would be to gather all the data you would need so that you could ask any question, without having to write more code to harvest more data.

Cockcroft’s presentation was in accordance on this point as well. For something to “observable” means you can totally predict its behavior just from the metrics it provides. By having no state, microservices are inherently observable. Monoliths are harder to model because they have so many potential states.

The hard part of any debugging any distributed system is finding the right question to ask. As anyone who runs them knows, distributed systems are never fully operational. There are always some issues threatening to take them down at any point.

You ask questions. Based on the answers you get, you ask more questions, and follow the breadcrumbs. “You have to be able to ask questions of your raw events,” she said.

This is an AWESOME piece by @joab_jackson.. clear, crisp overview of the industrywide lurch towards distributed systems, and some of the ripple effects as experienced by tooling and teams.

I'd like to sharpen up a couple small points. https://t.co/DzYyB5EOpY

— Charity Majors (@mipsytipsy) October 4, 2018

More insights from the Chaos Conference can be found here:

ChaosConf 2018

Gremlin assisted in the travel costs of covering this event.

TRENDING STORIES
Joab Jackson is a senior editor for The New Stack, covering cloud native computing and system operations. He has reported on IT infrastructure and development for over 30 years, including stints at IDG and Government Computer News. Before that, he...
Read more from Joab Jackson
SHARE THIS STORY
TRENDING STORIES
TNS owner Insight Partners is an investor in: Honeycomb.io, Honeycomb.
SHARE THIS STORY
TRENDING STORIES
TNS DAILY NEWSLETTER Receive a free roundup of the most recent TNS articles in your inbox each day.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.