VOOZH about

URL: https://thenewstack.io/intuits-numaflow-abstracts-away-infrastructure-for-ml-engineers/

⇱ Intuit's Numaflow Abstracts Away Infrastructure for ML Engineers - The New Stack


TNS
SUBSCRIBE
Join our community of software engineering leaders and aspirational developers. Always stay in-the-know by getting the most important news and exclusive content delivered fresh to your inbox to learn more about at-scale software development.
REQUIRED
It seems that you've previously unsubscribed from our newsletter in the past. Click the button below to open the re-subscribe form in a new tab. When you're done, simply close that tab and continue with this form to complete your subscription.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.
Welcome and thank you for joining The New Stack community!
Please answer a few simple questions to help us deliver the news and resources you are interested in.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Great to meet you!
Tell us a bit about your job so we can cover the topics you find most relevant.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Welcome!

We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.

What’s next?

Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.

Follow TNS on your favorite social media networks.

Become a TNS follower on LinkedIn.

Check out the latest featured and trending stories while you wait for your first TNS newsletter.

PREV
1 of 2
NEXT
VOXPOP
As a JavaScript developer, what non-React tools do you use most often?
Angular
0%
Astro
0%
Svelte
0%
Vue.js
0%
Other
0%
I only use React
0%
I don't use JavaScript
0%
Thanks for your opinion! Subscribe below to get the final results, published exclusively in our TNS Update newsletter:
NEW! Try Stackie AI
From clobbered drafts to real-time sync
Apr 14th 2026 10:00am, by David Moore
TypeScript 6.0 RC arrives as a bridge to a faster future
Mar 14th 2026 9:00am, by Darryl K. Taft
Mastra empowers web devs to build AI agents in TypeScript
Jan 28th 2026 11:00am, by Loraine Lawson
2025-10-06 07:00:54
Intuit's Numaflow Abstracts Away Infrastructure for ML Engineers
Data Streaming / Kubecon Cloudnativecon NA 2025 / Kubernetes

Intuit’s Numaflow Abstracts Away Infrastructure for ML Engineers

Numaflow allows engineers without Kubernetes experience to easily build and manage data processing pipelines for high-throughput workloads.
Oct 6th, 2025 7:00am by Joab Jackson
👁 Featued image for: Intuit’s Numaflow Abstracts Away Infrastructure for ML Engineers

Built by the team at Intuit that created Argo CDNumaflow is a Kubernetes-based open source stream processing engine with a UI that allows engineers to easily compose data processing pipelines. No experience in Kubernetes necessary.

Built for high-throughput workloads, Numaflow connects to Kafka, Pulsar and SQS, and can analyze, filter or process the stream of data before sending it along to its destination. Easily scalable, it will work as fast as you need.

Last week, at the Kubecrash 2025 virtual conference, two Intuit team members on the project described how Numaflow could be used for running AI pipelines.

👁 screenshot

The Role of Stream Processing in AI

Think of stream processing as the backbone of AI.

Turns out there is a lot of event processing in AI: feature engineering, where features are calculated and added to the model; inferencing, where a trained model makes predictions; and, of course, training, where the models get the latest data.

A real-time stream processing platform is essential if “you want to understand or process events and then try to respond as they’re happening,” said Sriharsha Yayi, Numaflow product manager for Intuit. For instance, user behavior could be tracked in real time to provide recommendations. Fraudulent activity can be thwarted while it is still going on.

Yet building data processing pipelines can be a thorny task, let alone making it scalable and real time.

Common Challenges in Event Processing on Kubernetes

Numaflow set out to solve a number of challenges with event processing on Kubernetes, Yayi said.

For one, data engineers, who know procedural logic, weren’t super familiar with the Java and Scala platforms they had to design upon. Nor are there many other developers who also wanted to tie into a stream engine.

“We have observed where people wanted to have a stream processing capability or framework that is beyond Java,” Yayi said.

Also, setting up an entire data stream for some sort of processing involved writing a lot of boilerplate code, such as all the duplicated functionality needed across the multiple messaging queues.

“If I’m a developer or maybe an ML [machine learning] guy, why should I really spend a lot of time writing these integrations again and again whenever I write these new pipelines or consumers?” Yayi asked.

Lastly, scaling is a hurdle. In event processing, the need for scalability was measured by an event backlog, but had to be expressed — through the Kubernetes Horizontal Pod Autoscaler — through additional pods needed at that moment. Some users even hand-tuned the number of pods needed when traffic surged in.

👁 Screenshot

How Numaflow Solves Common Stream Processing Challenges

“Numaflow is a serverless platform for stream processing,” explained Krithika Vijayakumar, Intuit senior software engineer. It was designed to hide (“abstract”) all the infrastructure bits away from the data engineers.

Numaflow allows ML engineers to “focus just on their stream processing or inferencing, and eliminate the need for them understanding the underlying infrastructure,” Vijayakumar said.

It also whisks away the need to learn all the event processing complexities behind such concepts, such as sinks and sources, abstracting them down to a single data object.

“We realize that ML engineers are focused largely on the payload, and they don’t really care about where they are reading the data from. ‘Is it Kafka? Or is it Pulsar? Or is it HTTP?'” Vijayakumar elaborated.

👁 screenshot

So, details around the sinks and sources are hidden from the engineers, who can get back to worrying about their inferencing and processing logic. Users write their inference logic as user-defined functions (UDFs).

Also, the platform automatically scales based on traffic coming in. No more spinning up pods manually!

Building an AI Pipeline With Numaflow: A Demo

Vijayakumar ran a demo of a simple task of image recognition. Numaflow comes bundled with a UI, so you can see the pipelines as you build and run them:

👁 screenshot

The data is pulled from the source and sent to a prediction vertex. A vertex is a core computational component, which in this case returns a written description of the contents of the image back to the sink, an HTTP endpoint. The vertex itself is run with a local Natural Language Processing model.

The pipelines themselves are defined in YAML, a declarative language.

She also showed a glimpse of a working anomaly-detection pipeline, one in production for a year. Pipelines can have multiple sources, sinks and UDFs. UDFs can be written in a mixture of Python or Java. In the GUI, Vertices can display the number of pods they are running. They work independently, so they can each scale according to their own incoming workload.

A ‘Pretty Impressive’ Data Stack

“If you are a native Kubernetes shop, this is the way to go,” said data engineer Dan Young in his walkthrough video of Numaflow. He suggested that Numaflow, along with Argo, could be used to build a “pretty impressive data processing stack.”

If you want to learn more, the Numaflow engineers will also be presenting at upcoming AllThingsOpen and KubeCon North America.

TRENDING STORIES
Joab Jackson is a senior editor for The New Stack, covering cloud native computing and system operations. He has reported on IT infrastructure and development for over 30 years, including stints at IDG and Government Computer News. Before that, he...
Read more from Joab Jackson
SHARE THIS STORY
TRENDING STORIES
SHARE THIS STORY
TRENDING STORIES
TNS DAILY NEWSLETTER Receive a free roundup of the most recent TNS articles in your inbox each day.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.