VOOZH about

URL: https://thenewstack.io/how-the-opentelemetry-collector-scales-observability/

⇱ How the OpenTelemetry Collector Scales Observability - The New Stack


TNS
SUBSCRIBE
Join our community of software engineering leaders and aspirational developers. Always stay in-the-know by getting the most important news and exclusive content delivered fresh to your inbox to learn more about at-scale software development.
REQUIRED
It seems that you've previously unsubscribed from our newsletter in the past. Click the button below to open the re-subscribe form in a new tab. When you're done, simply close that tab and continue with this form to complete your subscription.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.
Welcome and thank you for joining The New Stack community!
Please answer a few simple questions to help us deliver the news and resources you are interested in.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Great to meet you!
Tell us a bit about your job so we can cover the topics you find most relevant.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Welcome!

We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.

What’s next?

Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.

Follow TNS on your favorite social media networks.

Become a TNS follower on LinkedIn.

Check out the latest featured and trending stories while you wait for your first TNS newsletter.

PREV
1 of 2
NEXT
VOXPOP
As a JavaScript developer, what non-React tools do you use most often?
Angular
0%
Astro
0%
Svelte
0%
Vue.js
0%
Other
0%
I only use React
0%
I don't use JavaScript
0%
Thanks for your opinion! Subscribe below to get the final results, published exclusively in our TNS Update newsletter:
NEW! Try Stackie AI
From clobbered drafts to real-time sync
Apr 14th 2026 10:00am, by David Moore
TypeScript 6.0 RC arrives as a bridge to a faster future
Mar 14th 2026 9:00am, by Darryl K. Taft
Mastra empowers web devs to build AI agents in TypeScript
Jan 28th 2026 11:00am, by Loraine Lawson
2023-12-13 06:00:28
How the OpenTelemetry Collector Scales Observability
Observability

How the OpenTelemetry Collector Scales Observability

Two talks at KubeCon+CloudNativeCon 2023 showed the range of tools and services in the observability space.
Dec 13th, 2023 6:00am by B. Cameron Gain
👁 Featued image for: How the OpenTelemetry Collector Scales Observability

You can manage without an OpenTelemetry collector, an open source observability framework, but you probably won’t want to, especially if you’re deploying and monitoring applications at scale.

You’ll probably also want to use the OpenTelemetry Collector whenever multiple applications or microservices are involved, particularly for security considerations.

This becomes apparent as OpenTelemetry expands its scope and becomes widely accepted as a way to use your favorite observability tools in a unified interface or component that’s compatible as vendors seek to meet the OpenTelemetry standard.

OpenTelemetry Collector is an observability pipeline middleware that can receive, process and export data at scale, explained Evan Bradley, a senior software engineer at Dynatrace, described during a talk “OTTL Me Why Transforming Telemetry in the OpenTelemetry Collector” with Tyler Helmuth senior software engineer, open source, at Honeycomb.io during KubeCon + CloudNativeCon last month.

“So why might you want to use the collector? Well, there are numerous reasons but the first significant one is that you can process at the edge — processing at the edge allows you to split this work across multiple machines, which can help increase data throughput in your pipeline,” Bradley said. “You can run the collector at the edge or anywhere else in your pipeline because it can be deployed anywhere and it can be deployed in containerized, virtualized, or even functions as a service environment. Moreover, you can process data close to its origins or further away, such as at critical points of your pipeline, like at the point of ingress at the boundary of a secure network.”

The collector is designed to adapt well to different use cases since it’s fast and versatile, Bradley said. “It has been designed with high throughput and low latency in mind, so it won’t slow down your pipeline. Additionally, it has low CPU, memory, and disk space requirements,” Bradley said.

What Does an OpenTelemetry Collector Do?

An OpenTelemetry collector serves to collect data sent to it from one or many sources. In addition to receiving data, it sends data to an endpoint, such as for visualization with a Grafana panel.

With it, it can be configured to collect certain types of logs, traces and metrics for observability.

Initially, you can opt not to use it, especially when employing a monitoring application that collects and transfers all data directly to the observability platform or through OpenTelemetry, gathering metrics, logs, traces, etc.

However, this approach becomes challenging when monitoring multiple applications or microservices. Without the OpenTelemetry collector, you’d need to configure each backend or user monitoring separately for those, which can be cumbersome.

On the contrary, an OpenTelemetry collector serves as a single endpoint for all microservices, streamlining access to applications and microservices through a unified point facilitated by the collector.

Utilizing this collector, you can view and manage them collectively, offering a consolidated view on a platform like Grafana. While Grafana provides certain alternatives without an OpenTelemetry collector, the collector significantly simplifies this process.

Custom Collectors

A custom collector can also be tailored to fit the situation at hand by selecting only the components you need, Bradley said. For cases where existing options are unavailable, all collector components are written using the same core APIs, allowing you to leverage these to add your own code to accomplish a task,” Bradley said.

👁 Image

The data flow through the collector is organized into pipelines, composed of individual components, each handling a specific task, Bradley said. The collector has five classes of components, but in his talk, receivers, processors and exporters were covered. The diagram above illustrates an example pipeline, where data enters the collector at one of the points on the left, proceeds through the pipeline, and is emitted on the right, Bradley said.

With the OpenTelemetry Transformation Language, the OpenTelemetry Collector’s filter or processing function can be used to filter the kinds of telemetry data it receives and sends. Helmuth showed how OTTL supports the filter functionality.

During his presentation, Helmuth showed when it makes sense to reduce the ingest volume by dropping events categorized as completed, as they are deemed unnecessary, he said.

👁 Image

In the image above, the intention was to utilize the filter processor to implement the decision of which data to drop, which operated based on an OTTL condition. These conditions interacted with the underlying telemetry without altering it. The filter processor employed OTTL conditions to select data for dropping; when the condition was satisfied, the processor removed the data, Helmuth said.

In the case of a Kubernetes objects receiver, it would emit Kubernetes events in the form of logs, with these events existing as nested maps within the log body.

Any body not structured as anticipated (i.e., not resembling a K8s event) was to be discarded, Helmuth described. In the top box of the image above, the body was a map containing a nested map within the object key, so the conditions were not met, and the data was retained. Conversely, in the second box in the above image, the body was a string, which did not align with the expected map structure, Helmuth said.

Google Cloud’s @RageCage64_ on how to gauge your observability agent performance at Observability Days today @KubeCon_ @CloudNativeFdn @nybooks pic.twitter.com/wGAcUdsXxf

— BC Gain (@bcamerongain) November 6, 2023

Different alternatives exist for telemetry collection. As such the OpenTelemetry Collector falls under the category of an observability agent. Observability agents, such as the OpenTelemetry Collector, include FluentBit, Vector, and others, “exhibit high robustness and perform various tasks to achieve their remarkable outcomes,” Braydon Kains, software developer at Google, said during his own KubeCon + CloudNativeCon talk “How Much Overhead How to Evaluate Observability Agent Performance.”

At the end of the talk, the question was asked about which collector is the best collector. Kains described how the Google Cloud Ops agent is a fusion of two agents. Behind the scenes, it combines Fluent Bit for log collection and OpenTelemetry for gathering metrics and traces, he said.

The team manages a central configuration layer responsible for generating configurations for both the underlying OpenTelemetry and Fluent Bit. These configurations include recommended optimizations tailored for users primarily operating on virtual machines, like plain VMs, to efficiently collect metrics using OpenTelemetry, he said.

“There are a lot of knobs to keep track of and it can be hard for a customer who’s new to this to keep track of them all,” Kains said. “We take on the responsibility of keeping track of those knobs, and try to come up with the settings that are going to be optimal in the most general cases.”

TRENDING STORIES
BC Gain is founder and principal analyst for ReveCom Media. His obsession with computers began when he hacked a Space Invaders console to play all day for 25 cents at the local video arcade in the early 1980s. He then...
Read more from B. Cameron Gain
SHARE THIS STORY
TRENDING STORIES
Dynatrace and CNCF are sponsors of The New Stack.
TNS owner Insight Partners is an investor in: Bit, Honeycomb.io, Honeycomb.
SHARE THIS STORY
TRENDING STORIES
TNS DAILY NEWSLETTER Receive a free roundup of the most recent TNS articles in your inbox each day.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.