![]() |
VOOZH | about |
We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.
Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.
Follow TNS on your favorite social media networks.
Become a TNS follower on LinkedIn.
Check out the latest featured and trending stories while you wait for your first TNS newsletter.
The current focus on AI chatbots overlooks the real opportunity for businesses: building autonomous agents. We want AI systems that don’t merely tell a customer their shipment is delayed, but one that can prevent the delay in the first place. This requires a system that is always on, embedded deep within your infrastructure and capable of monitoring the constant stream of events that defines the state of your business.
This is the true promise of enterprise AI: not a chatbot that waits for a question, but a fleet of agents that constantly watches, understands and reacts. An agent that detects a fraudulent pattern and freezes an account before the money is gone. An agent that sees a surge in user activity and proactively scales a service.
However, this vision exposes a fundamental architectural gap.
You cannot build these always-on, state-aware systems using the stateless, request-response models designed for chatbots. They require an architecture built from the ground up to process continuous event streams and manage evolving state.
Let’s explore that architecture. We will make the case for stateful stream processing as the necessary foundation for this new class of AI and demonstrate how Apache Flink provides the robust, low-latency engine required to bring these autonomous agents to life.
The vision for autonomous AI agents is compelling.
We imagine them intelligently automating everything from supply chain logistics to real-time customer personalization. While large language models (LLMs) have become incredibly powerful, a major obstacle prevents this vision from becoming a widespread reality. The problem isn’t the agent’s brain; it’s the plumbing.
To make decisions, an agent needs access to fresh, contextual data from across the business. The common approach today is to stitch together a patchwork of disconnected systems: one for data streaming (like Apache Kafka), another for workflow orchestration, one for aggregating all the possible contextual data the agent might need and a separate application runtime for the agent’s logic.
This “stitching” approach creates a system that is both operationally complex and technically fragile. Engineers are left managing a brittle architecture where data is handed off between systems, introducing significant latency at each step. This process often relies on polling or micro-batching, meaning the agent is always acting on slightly stale data. Furthermore, when something goes wrong, debugging is a nightmare because there is no unified view or observability across the entire workflow.
This reveals a clear gap in today’s technology stack: the absence of a unified, native framework for building, running and scaling agents directly on the real-time data streams they need to be effective.
To overcome the infrastructure bottleneck, we need to fundamentally change the way we think about agent architecture. Instead of treating agents as request-response applications that we bolt onto our systems, we should build them as event-driven microservices.
This paradigm can be broken down into a simple formula: An agent is the combination of event-driven logic, which triggers its operation; fresh, contextual data, which informs its decisions; and an LLM reasoning engine, which powers its intelligence. Viewing agents through this lens reveals why a streaming architecture isn’t just a choice, but a necessity.
This model provides three critical advantages:
This event-driven worldview requires a new kind of infrastructure, an engine built for continuous computation on unbounded streams of data. This is where Apache Flink comes in. Flink is an open source stream processing framework designed from the ground up for stateful computations, making it the ideal foundation for building autonomous agents.
Out of the box, Flink provides the core capabilities that are essential for reliable agentic systems:
While Flink provides the perfect engine, the community recognized the need for better native support for agent-specific workflows. This led to Streaming Agents, designed to make Flink the definitive platform for building agents. Crucially, this is not another tool to stitch into your stack. It’s a native framework that directly extends Flink’s own DataStream and Table APIs, making agent development a first-class citizen within the Flink ecosystem.
This native approach unlocks the most powerful benefit: the seamless integration of data processing and AI. Before, an engineer might have one Flink job to enrich data, which then writes to a message queue for a separate Python service to apply the AI logic. With Streaming Agents, complex data transformations, like joining event streams, aggregating data into features and enriching it with context, can happen within the same unified pipeline as the agent’s reasoning and decision-making logic.
This eliminates the inefficient and error-prone boundary between the “data world” and the “AI world,” creating a single, observable and end-to-end consistent system.
The true power of Streaming Agents is unlocked when they move beyond simple data transformation and into the realm of “closed-world” automation. This involves automating specific, high-volume business processes where the agent operates on a defined set of data streams, tools and objectives.
In insurance, claim processing is a core function ripe for automation. A Streaming Agent can be designed to continuously monitor a stream of incoming claims. When a new claim event is ingested, the agent autonomously gathers the necessary context by querying the policyholder’s details, their claims history and associated damage reports from various internal databases and object stores.
Using an LLM, it performs critical checks for fraud indicators, policy compliance and data completeness. Based on its analysis, the agent then intelligently routes the claim to the next step: auto-approving low-risk claims, flagging complex cases for human review or requesting more information from the customer, dramatically accelerating resolution times.
Modern supply chains generate a torrent of real-time data. A Streaming Agent can be deployed to constantly monitor multiple streams of logistics data, including shipment locations from carriers, current warehouse inventory levels and external weather alerts.
The agent’s goal is to proactively detect potential disruptions. When its analysis of the combined data streams reveals a likely delay, for instance, a stalled shipment heading to a warehouse with low stock of that item, it autonomously plans and triggers a corrective action. This could involve rerouting a different shipment, splitting an order or creating a stock transfer order, all while simultaneously notifying stakeholders of the disruption and the solution.
For online marketplaces and e-commerce platforms like Instacart, maintaining a clean, consistent product catalog is a massive operational challenge. A Streaming Agent can automate this process by ingesting streams of product data from hundreds or thousands of partners.
As the varied data arrives, the agent uses an LLM to understand and standardize it, normalizing descriptions ( e.g., “12 oz” vs. “a dozen ounces”), enriching product attributes and assigning them to the correct categories. It can also identify inconsistencies like mismatched pricing or missing images, automatically flagging them for review or, in clear-cut cases, updating the central product catalog directly, ensuring a high-quality customer experience.
The goal of Streaming Agents is to democratize development, empowering millions of developers worldwide to build intelligent applications. By providing familiar APIs and abstractions, we bring AI development into the mainstream software engineering life cycle, allowing every engineer to leverage these powerful new capabilities.
However, agents will only succeed if they are built like robust, production-grade software, not brittle demos. This is where Flink provides the critical foundation.
It treats the essential components of real-world applications, state management, observability and fault-tolerant reliability as first-class citizens. This ensures that the agents you build are not just intelligent, but also scalable, resilient and ready for the demands of production.
Flink provides the essential infrastructure to turn agentic AI from a promising concept into scalable, production systems that drive real business value. The future is being built on data streams, and with Streaming Agents, every developer can have a hand in building it.