![]() |
VOOZH | about |
We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.
Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.
Follow TNS on your favorite social media networks.
Become a TNS follower on LinkedIn.
Check out the latest featured and trending stories while you wait for your first TNS newsletter.
Data collaboration platform LiveRamp helps companies build enduring brand and business value with technology solutions for customer intelligence, identity enhancement, cross-screen measurement, media networks and more.
I first joined LiveRamp 15 years ago as an intern. Today, I am responsible for LiveRamp’s data platform architecture, with particular emphasis on ingestion and activation — in other words, all the data coming into and out of LiveRamp. This includes customer data that is processed, masked and made available for hundreds of downstream systems.
Part of the value LiveRamp adds is connecting consumer data with durable and privacy-conscious identifiers. There is a lot of complexity in this: You might have offline attributes like names, addresses and phone numbers, and these can change over time. The same is true for online identifiers like IP addresses and device IDs. LiveRamp replaces these changing and inconsistent attribute sets with durable, secure and pseudonymized identifiers that are sustainable against evolving regulations and privacy policies.
The LiveRamp data collaboration platform enables privacy-centric customer engagement.
While real-time data streaming with gigabytes per second throughput is the direction all our workloads are headed, currently we still manage a large number of batch processing systems built on Hadoop and Spark. That’s because the AdTech ecosystem in which a large portion of our business operates has traditionally been oriented around encrypted batch files, where systems read one record at a time.
However, the industry is changing rapidly. Our customers increasingly need real-time solutions for time-sensitive problems like cart abandonment and ad suppression. They also need to prepare for industry headwinds such as Google Chrome’s phasing out of third-party cookies in 2024, which will drive ever more demand for more optimized identifiers.
So we embarked on a journey to modernize our data infrastructure from batch systems to streaming data systems. We knew it was going to be a gradual process, and we started with a newer use case, our pixel traffic application — a system that helps customers understand users’ web and mobile traffic trends. We also recognized that we would need to select the streaming data platform that would drive our real-time transformation.
LiveRamp developed our own batch systems, but in our migration to real-time data pipelines we wanted to enable faster development and more efficient collaboration with partners. This led us to seek a partner with a robust streaming data ecosystem around its APIs.
Redpanda fulfilled this need with a Kafka API-compatible platform offering everything needed to stream data — brokers, HTTP proxy, schema registry, Raft consensus and cluster balancing. And because Redpanda consumes about 1/3rd of the compute resources as other vendors due to its lean design, it’s much more cost-efficient.
In addition to simplicity and cost-efficiency, Redpanda brought other benefits to the table including:
Redpanda’s Bring Your Own Cloud (BYOC) deployment helps LiveRamp enjoy SaaS benefits while keeping data inside its own network.
We started with a system that collects pixel traffic for mobile applications. When that implementation proved successful, we next adopted it for our application monitoring tooling and saw improved reliability. We are now expanding our use of Redpanda across the organization to gradually support all incoming and outgoing data across the more than five hundred different platform integrations we manage.
Redpanda currently sits on the edges of LiveRamp. The producers read from files and put messages on Redpanda, and consumers read the data and make outbound calls. Consumers can be simple Go or Java processes that can massage data for specific platforms like Facebook. In this way, we store data from the edges and use it where it makes sense. We also push data from Redpanda to our internal warehouse built on SingleStore, where it becomes available for analytical and measurement use cases.
This event-driven architecture eliminates the complexity of our legacy batch system, which requires a complex architecture for parallel processing. As a result, we’ve boosted engineer productivity and enhanced the maintainability of our simplified codebase. We’ve also significantly lowered our infrastructure costs and reduced our carbon footprint as a result of Redpanda’s hardware-efficient design.
If there’s a constant in our industry, it’s change. It’s been widely reported that Chrome is phasing out third-party cookies in 2024. LiveRamp has been preparing for this future for more than five years by helping publishers, marketers and the ecosystem at large transition to addressable audiences without relying on third-party cookies or mobile identifiers. Redpanda will remain a key partner for safely syncing data as LiveRamp advances its offline reference graph, which has thousands of data sources and over a billion syncs a day.
As streaming data continues to become the heart of our infrastructure, we’re looking to further simplify operations with the use of WebAssembly (Wasm). With Wasm transformations directly in Redpanda, we’ll be able to read data, prepare messages and make API calls without the “data ping pong.”
In the meantime, we’re excited to continue on our batch-to-streaming journey and help pioneer the real-time evolution of the AdTech industry.