![]() |
VOOZH | about |
We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.
Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.
Follow TNS on your favorite social media networks.
Become a TNS follower on LinkedIn.
Check out the latest featured and trending stories while you wait for your first TNS newsletter.
Before the web had Hypertext Transfer Protocol (HTTP), and before email had Simple Mail Transfer Protocol (SMTP), we were stuck with custom integrations, fragmented systems and brittle workflows. It wasn’t until open protocols and shared infrastructure emerged that the internet truly scaled, unlocking the modern web, global communication and entire economies.
Today, AI agents are in this same pre-standardization phase. They’re powerful, capable and multiplying fast, but they don’t work together. One agent analyzes data. Another one drafts code. A third automates customer relationship management (CRM) workflows. But they’re isolated, siloed and unaware of each other’s existence.
That’s starting to change.
A new stack is emerging to support this next layer of the internet — one built not for humans browsing websites, but for autonomous agents collaborating across systems. At the core are four open components:
Let’s explore how these technologies fit together, why protocols alone aren’t enough and how this new stack provides the infrastructure needed to move from disconnected bots to dynamic, intelligent agent ecosystems.
If the hype is right — and it’s looking more like inevitability than speculation — most companies won’t just deploy one AI agent; they’ll deploy dozens. These agents will write code, triage support tickets, analyze customer data, manage onboarding, monitor infrastructure and more.
But today’s tooling isn’t ready for that future.
We’re not just facing the “island of agents” problem, where agents operate in silos and can’t communicate; we’re facing a broader ecosystem fragmentation problem:
As the 12-Factor Agents project argues, agents need to follow cloud native principles: They must be observable, loosely coupled, reproducible and infrastructure-aware. But today, most are built as brittle scripts, stitched together by hand and assumed to run in isolation.
The result? Silos. Duplication. Fragility.
The solution isn’t to shove all agents into one monolithic platform. It’s to build a shared stack, a new foundation based on open protocols, event-driven architecture and real-time processing.
Agent2Agent addresses part of the problem by giving agents a common protocol for discovery and communication. But to go beyond toy demos, to reach the scale and reliability production systems demand, we need more than protocols. We need infrastructure.
As mentioned, the agent ecosystem today looks a lot like the early web: powerful systems, each doing useful work, but siloed and incompatible. Just like browsers once struggled to talk to servers without a standard protocol, AI agents today can’t easily discover, communicate or collaborate with one another.
Google’s A2A protocol is a bold attempt to fix that. It’s not another agent framework: It’s a universal protocol meant to connect any agent, regardless of who built it or where it runs.
Just like HTTP standardized how websites communicate, A2A defines a shared language for agents. It lets them:
AgentCard, a JSON descriptor that declares what an agent can do and how to interact with it.What makes A2A promising is that it doesn’t try to reinvent the wheel. It builds on decades of internet protocol history, just like HTTP and SMTP did, by leveraging familiar, battle-tested web standards. That makes adoption easier and integration faster.
But A2A is only one half of the picture.
Anthropic’s MCP tackles the other half: how agents use tools and access context. MCP standardizes how agents invoke APIs, call functions and integrate with external systems — essentially, how they think and act in the world. A2A, on the other hand, defines how agents talk to each other.
If MCP is about giving agents access to tools, A2A is about giving them access to each other.
Together, these two protocols offer a blueprint for a connected agent ecosystem:
And just like HTTP and SMTP didn’t succeed in isolation, they required adoption, infrastructure and developer tooling, A2A and MCP will need an ecosystem to realize their potential.
But even with standardization like A2A and MCP, a fundamental question remains: How do these agent communications scale effectively across a complex, dynamic enterprise environment? Relying solely on direct, point-to-point connections defined by these protocols creates its own set of challenges, particularly around scalability, resilience and observability. This leads us to the need for a robust underlying communication infrastructure.
Imagine running a company where every employee can communicate only by sending direct, one-on-one messages. Need to share an update? You have to message each person individually. Want to coordinate a project across five teams? You’re stuck manually relaying information between every group.
Now imagine trying to scale that to hundreds of employees. Chaos.
That’s exactly what happens in agent ecosystems built on direct connections. Every agent must know who to talk to, how to reach them and when they’re available. As the number of agents grows, the number of required connections grows exponentially. The system becomes brittle, hard to manage and nearly impossible to scale.
A2A and MCP give agents the language and structure to communicate and act, but language alone isn’t enough. To coordinate dozens or hundreds of agents across an enterprise, you also need infrastructure for how those messages move and how agents react to them.
That’s where Apache Kafka and Apache Flink come in.
Apache Kafka is a distributed event streaming platform originally developed at LinkedIn and now part of the Apache Software Foundation. It acts as a durable, high-throughput message bus, allowing systems to publish and subscribe to streams of events in real time. Kafka is used everywhere, from financial systems to fraud detection to telemetry pipelines, because it decouples producers from consumers and ensures data is durable, replayable and scalable.
Flink, also an Apache project, is a real-time stream-processing engine. It was designed from the ground up for stateful, high-throughput, low-latency event processing. Where Kafka handles the movement of data, Flink handles the transformation, enrichment, monitoring and orchestration of that data as it flows through a system.
Together, they form a powerful duo: Kafka is the bloodstream, Flink is the reflex system.
Just as A2A is emerging as the HTTP of the agent world, Kafka and Flink form the event-driven foundation that can support scalable agent communication and computation. They solve problems that direct, point-to-point communication can’t:
"TaskCompleted", "InsightGenerated") to a topic; any interested agent or system can subscribe.In short:
Protocols like A2A and MCP are essential for standardizing agent behavior and communication. But without an event-driven substrate like Kafka and a stream-native runtime like Flink, these agents remain stuck in isolated interactions, unable to coordinate flexibly, scale gracefully or reason over time.
To fully realize the vision of enterprise-grade, interoperable AI agents, we need four layers:
Together, this is the new internet stack for AI agents — a foundation for building systems that are not only intelligent, but also collaborative, observable and production-ready.
How A2A, MCP, Kafka and Flink work together. (Source: Confluent)
We’re at a pivotal moment in the evolution of software.
Just as the original internet stack — protocols like HTTP and SMTP, and infrastructure like TCP/IP — unlocked a new era of global connectivity, a new stack is emerging for AI agents. But instead of humans navigating pages or sending emails, this stack is built for autonomous systems working together to reason, decide and act.
A2A and MCP provide the protocols for agent communication and tool use. Kafka and Flink provide the infrastructure for real-time coordination, observability and resilience. Together, they make it possible to move from disconnected agent demos to scalable, intelligent production-grade ecosystems.
This isn’t just about solving engineering challenges. It’s about enabling a new kind of software where agents collaborate across boundaries, providing insight and action flow in real time, allowing intelligence to become a distributed system.
But this vision won’t realize itself. We need to build it: openly, interoperably and with the lessons of the last internet revolution in mind.
So the next time you’re building an agent, don’t ask just what it can do. Ask how it fits into the larger system. Can it communicate? Can it coordinate? Can it evolve?
Because the future isn’t just agent-powered; it’s agent-connected.