URL: https://thenewstack.io/microsoft-foundry-build-2026-ai-agents/

With Foundry, Microsoft bets the enterprise AI battle is about reliability, not capability

At Build 2026, Microsoft Foundry shipped the infrastructure layer enterprise developers have been patching together themselves — and made the case that Foundry should be the platform it runs on.
Jun 8th, 2026 8:00am by Darryl K. Taft
Photo by Rick Rothenberg on Unsplash

The agentic AI wave has produced no shortage of impressive demos. What it has produced less of is agents that hold up in production — under real load, real data, real compliance requirements.

At the recent Microsoft Build 2026 conference, the company addressed that gap directly, shipping a cluster of updates to Microsoft Foundry that collectively add up to something closer to an enterprise runtime than a developer preview.

The updates span hosted agent infrastructure, evaluation tooling, an open governance specification, memory, knowledge retrieval, and expanded model options. Taken piece by piece, each is a reasonable product update. Taken together, they suggest Microsoft has decided the next competitive front in enterprise AI is not capability but reliability and governability.

“At Build 2026, Microsoft Foundry added more of the platform pieces developers need for production agents: runtime, tools, memory, grounding, models, observability, and governance,” writes Nick Brady, senior program manager for developer experience at Microsoft, in a blog post recapping the announcements.

A runtime that doesn’t require a rewrite

The anchor of the infrastructure story is hosted agents in Foundry Agent Service, which Microsoft expects to reach general availability by early July 2026. The setup is a managed runtime where every session runs in its own sandbox with dedicated compute, memory, and durable filesystem access.

What makes it worth paying attention to is the framework posture. Agents built on Microsoft Agent Framework, GitHub Copilot SDK, LangGraph, or other SDKs can be deployed without rewrites. Two protocols are supported: a Responses API for OpenAI-compatible, stateful interactions, and an invocations protocol for pass-through scenarios in which developers control the request and response formats themselves. That second option is going to matter for teams that have already built custom orchestration they’re not about to discard.

The hosted runtime also supports routines, now in public preview, which let any agent run on a timer or schedule — overnight issue triage, daily reporting, that kind of workload. Long-running autonomous agents get durable state.

Alongside the runtime, Foundry Toolkit for VS Code is now generally available. Brady described it as the tool that lets developers “create agents from templates or with GitHub Copilot, debug runs locally with trace visualization, connect to Toolboxes, and deploy to Foundry Agent Service from VS Code.”

Toolboxes and the tool governance problem

As agent tool counts grow, tool governance becomes an engineering problem in its own right. Toolboxes in Foundry, now in public preview, give an agent a single managed endpoint for every tool type. Configure once, point any MCP client to a single URL, and Foundry handles auth, lifecycle, and governance.

Skills, versioned in a project-scoped catalog and discoverable as MCP resources, are now first-class citizens in the Toolboxes model. A tool search capability, also in preview, helps select appropriate tools per task rather than dumping the entire catalog on the model. That matters both for quality and for keeping context windows from bloating.
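
To make the idea of per-task tool selection concrete, here is a minimal sketch in Python. It is not Foundry's tool search, whose mechanics the announcement does not describe. The `Tool` class, the `select_tools` function, the word-overlap scoring, and the sample catalog are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Tool:
    # Hypothetical catalog entry: a name plus a free-text description.
    name: str
    description: str


def _tokens(text: str) -> set[str]:
    # Lowercase word set with surrounding punctuation stripped.
    words = (w.strip(".,;:!?").lower() for w in text.split())
    return {w for w in words if w}


def select_tools(task: str, catalog: list[Tool], k: int = 2) -> list[Tool]:
    """Return up to k tools whose descriptions share the most words with the task."""
    task_words = _tokens(task)
    scored = [(len(task_words & _tokens(t.description)), t) for t in catalog]
    # Drop tools with no overlap, then rank best match first.
    ranked = sorted((s for s in scored if s[0] > 0), key=lambda s: s[0], reverse=True)
    return [tool for _, tool in ranked[:k]]


# Made-up catalog for the example.
CATALOG = [
    Tool("create_ticket", "open a new issue ticket in the tracker"),
    Tool("send_email", "send an email message to a recipient"),
    Tool("query_sales", "run a sales report query against the warehouse"),
]

if __name__ == "__main__":
    picked = select_tools("open a ticket for this issue", CATALOG, k=1)
    print([t.name for t in picked])
```

A real system would likely rank with embeddings or a learned retriever rather than word overlap. The point of the sketch is only that the model sees a short, task-relevant list instead of the whole catalog.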

Toolboxes also connect to Microsoft IQ — including Work IQ, Fabric IQ with the Fabric data agent, Ontology, and semantic models — so agents can tap into enterprise data without custom plumbing for each source.

Evaluating against policy, not just benchmarks

Two governance announcements stood out. The first is Adaptive Spec-driven Scoring for Evaluation and Regression Testing (ASSERT), Microsoft’s new open-source framework for policy-driven agent evaluation, built on Microsoft Research work. Rather than running agents against static benchmarks, ASSERT converts written policies into concrete, measurable evaluations and generates targeted scenarios to surface safety and quality defects before they reach production. It works across LangChain, CrewAI, LightLLM, OpenAI, and other frameworks.

The second is Agent Control Specification (ACS), an open industry spec for placing deterministic safety and security controls at five checkpoints in an agent’s lifecycle: input, LLM, state, tool execution, and output. ACS is expressed as a portable YAML contract — versionable, auditable, and framework-agnostic. Launch partners include Infosys, KPMG, IBM, Aviatrix, BigSpin, and CrewAI.
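
As a rough illustration of what deterministic controls at named checkpoints could look like, consider the Python sketch below. Only the five checkpoint names come from the announcement. The ACS YAML schema is not described here, so the `CONTRACT` dictionary, the `enforce` function, and both example rules are hypothetical stand-ins and do not follow the ACS format.

```python
from typing import Callable

# The five checkpoint names come from the article; everything else is hypothetical.
CHECKPOINTS = ("input", "llm", "state", "tool_execution", "output")

# A check returns True when the payload is allowed through.
Check = Callable[[str], bool]


def no_long_digit_run(payload: str) -> bool:
    # Deterministic rule: block payloads containing 9 or more consecutive digits.
    run = 0
    for ch in payload:
        run = run + 1 if ch.isdigit() else 0
        if run >= 9:
            return False
    return True


def tool_allowlisted(payload: str) -> bool:
    # Deterministic rule: only these made-up tool names may execute.
    return payload in {"search_docs", "create_ticket"}


# Hypothetical contract: which checks run at which checkpoint.
CONTRACT: dict[str, list[Check]] = {
    "input": [no_long_digit_run],
    "tool_execution": [tool_allowlisted],
    "output": [no_long_digit_run],
}


def enforce(checkpoint: str, payload: str) -> bool:
    """Run every check registered for a checkpoint; all of them must pass."""
    if checkpoint not in CHECKPOINTS:
        raise ValueError(f"unknown checkpoint: {checkpoint}")
    return all(check(payload) for check in CONTRACT.get(checkpoint, []))


if __name__ == "__main__":
    print(enforce("input", "my id is 123456789"))
    print(enforce("tool_execution", "create_ticket"))
```

Because each rule is a plain function with no model call, the same input always produces the same decision, which is what makes controls of this kind auditable.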

The combination points to a real problem. Agent failures in production tend not to be random — they cluster around predictable input types, tool misuse patterns, and output edge cases. ASSERT makes those failure modes testable. ACS makes the controls portable across frameworks and auditable across organizations.

Rounding out the evaluation stack: Guided Guardrail Setup, a questionnaire-driven wizard in Foundry Agent Builder that recommends PII filters, jailbreak protection, and task adherence controls without requiring security expertise; and a Rubric evaluator that auto-generates weighted quality criteria from an agent’s definition and use case.
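
For readers unfamiliar with weighted rubrics, the arithmetic behind a weighted quality score can be sketched in a few lines of Python. This is a generic weighted average, not the Rubric evaluator's actual scoring method, which the announcement does not detail. The criterion names and weights are invented for the example.

```python
def weighted_score(scores: dict[str, float], weights: dict[str, float]) -> float:
    """Weighted average of per-criterion scores, with each score in [0, 1]."""
    if set(scores) != set(weights):
        raise ValueError("scores and weights must cover the same criteria")
    total_weight = sum(weights.values())
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive number")
    return sum(scores[name] * weights[name] for name in weights) / total_weight


if __name__ == "__main__":
    # Hypothetical rubric: accuracy counts three times as much as tone.
    weights = {"accuracy": 3.0, "tone": 1.0}
    scores = {"accuracy": 1.0, "tone": 0.0}
    print(weighted_score(scores, weights))
```

In this made-up case a response that is fully accurate but fails on tone still scores 0.75, because accuracy carries three quarters of the total weight.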

Memory and knowledge

Memory in Foundry Agent Service, in public preview, now covers three types. Procedural memory, new at Build, helps agents learn how to do the work across runs — not just what was said in a session. Brady cites early Tau-bench results showing +7–14% absolute success-rate gains at near-baseline cost, which is specific enough to warrant independent replication. User memory retains preferences and facts across sessions. Session memory maintains context within a thread.
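
An "absolute" gain means percentage points added to the success rate, which is not the same as a relative improvement. The short Python sketch below works through the difference. The 50% baseline is an assumption chosen for easy arithmetic; no baseline figure is reported here.

```python
def apply_absolute_gain(baseline_pct: float, gain_points: float) -> float:
    """Add percentage points to a baseline success rate, capped at 100."""
    return min(100.0, baseline_pct + gain_points)


def relative_improvement(baseline_pct: float, improved_pct: float) -> float:
    """Relative change, in percent, from the baseline to the improved rate."""
    if baseline_pct <= 0:
        raise ValueError("baseline must be positive")
    return (improved_pct - baseline_pct) / baseline_pct * 100.0


if __name__ == "__main__":
    baseline = 50.0  # hypothetical baseline success rate, in percent
    for gain in (7.0, 14.0):
        improved = apply_absolute_gain(baseline, gain)
        rel = relative_improvement(baseline, improved)
        print(f"+{gain:.0f} points: {baseline:.0f}% -> {improved:.0f}% ({rel:.0f}% relative)")
```

At the assumed 50% baseline, gains of 7 and 14 points correspond to relative improvements of roughly 14% and 28%. The lower the true baseline, the larger the relative improvement the same point gain represents.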

On the knowledge side, Foundry IQ got a serverless option in public preview, new knowledge sources spanning Work IQ, Fabric IQ, File Search, Azure SQL, and MCP, and knowledge bases generally available with SLA-backed retrieval. Brady described the pitch as replacing custom RAG pipelines with “a dedicated knowledge layer behind your Foundry agents.” Web IQ adds sub-200ms web grounding with zero data retention for agents that need live external context.

Models and compute

Microsoft announced four first-party MAI models entering public preview: MAI-Thinking-1 for chat and reasoning, MAI-Image-2.5 for image generation and editing, MAI-Transcribe-2 for speech-to-text with speaker diarization, and MAI-Voice-2 for multilingual text-to-speech with voice cloning.

Fireworks AI on Foundry reached general availability, bringing open-model inference through a single Azure endpoint with enterprise SLAs, SOC 2 readiness, and PTU Data Zone support — enterprise access to open models without separate infrastructure or contracts.

Microsoft also claimed that Frontier Tuning is more than 10x as cost-efficient as GPT-5.5 for tasks such as producing technical Microsoft documentation. That’s specific enough to be testable and general enough to warrant skepticism until it is.

The bigger picture

What Brady’s recap makes clear is that the Foundry announcements at Build 2026 were designed to fit together. The hosted runtime handles deployment. Toolboxes handle tool governance. ASSERT and ACS handle evaluation and control. Memory handles state. Foundry IQ handles grounding. The Rubric evaluator, currently in public preview, and Agent ROI connect agent performance to business outcomes.

Microsoft argues that enterprise agentic AI requires platform-level infrastructure — and that the platform should be Foundry.

Whether that argument lands will depend less on what shipped at Build and more on what enterprises actually find when they try to move agents from demo to deployment.

That is the gap Microsoft says it’s closing.

Darryl K. Taft covers DevOps, software development tools and developer-related issues from his office in the Baltimore area. He has more than 25 years of experience in the business and is always looking for the next scoop. He has worked...
TNS owner Insight Partners is an investor in: Aviatrix, OpenAI, CrewAI.