VOOZH about

URL: https://thenewstack.io/agentic-ai-data-governance/

⇱ How MCP and synthetic data are reshaping compliance in the agentic era - The New Stack


TNS
SUBSCRIBE
Join our community of software engineering leaders and aspirational developers. Always stay in-the-know by getting the most important news and exclusive content delivered fresh to your inbox to learn more about at-scale software development.
REQUIRED
It seems that you've previously unsubscribed from our newsletter in the past. Click the button below to open the re-subscribe form in a new tab. When you're done, simply close that tab and continue with this form to complete your subscription.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.
Welcome and thank you for joining The New Stack community!
Please answer a few simple questions to help us deliver the news and resources you are interested in.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Great to meet you!
Tell us a bit about your job so we can cover the topics you find most relevant.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Welcome!

We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.

What’s next?

Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.

Follow TNS on your favorite social media networks.

Become a TNS follower on LinkedIn.

Check out the latest featured and trending stories while you wait for your first TNS newsletter.

PREV
1 of 2
NEXT
VOXPOP
As a JavaScript developer, what non-React tools do you use most often?
Angular
0%
Astro
0%
Svelte
0%
Vue.js
0%
Other
0%
I only use React
0%
I don't use JavaScript
0%
Thanks for your opinion! Subscribe below to get the final results, published exclusively in our TNS Update newsletter:
NEW! Try Stackie AI
From clobbered drafts to real-time sync
Apr 14th 2026 10:00am, by David Moore
TypeScript 6.0 RC arrives as a bridge to a faster future
Mar 14th 2026 9:00am, by Darryl K. Taft
Mastra empowers web devs to build AI agents in TypeScript
Jan 28th 2026 11:00am, by Loraine Lawson
2026-05-23 09:00:00
How MCP and synthetic data are reshaping compliance in the agentic era
sponsor-perforce,sponsored-post-contributed,
AI Agents / Compliance / DevOps

How MCP and synthetic data are reshaping compliance in the agentic era

Learn how MCP and synthetic data deliver automated data governance and continuous compliance for autonomous AI agents in DevOps.
May 23rd, 2026 9:00am by Brian Muskoff
👁 Featued image for: How MCP and synthetic data are reshaping compliance in the agentic era
Alghozy for Unsplash
Perforce sponsored this post.

Software development has always had a way of quietly distributing sensitive data in unexpected places, and many organizations have lost track of where it is. 

However, the rise of agentic AI has pushed that problem into new territory. AI agents are not just accelerating the software development lifecycle (SDLC); they are evolving the process by touching data at every stage in ways that teams may not fully see. The concern is that it is interacting with potentially sensitive data without a specific request. The scale and speed at which this can occur can exceed the governance frameworks of many organizations.

The encouraging news is that this is a solvable problem. Teams that build strong data governance practices designed for machine speed and autonomous systems, and not just human workflows, will comply more confidently and innovate faster. This, in turn, will support organizations’ ability to build AI they can trust.

Test Data Management best practices have been relatively well understood for years. Managing the test data throughout the product development cycle can be performed safely and efficiently. 

Unfortunately, sensitive data still often appears across the SDLC, including development sandboxes, CI/CD pipelines, model training datasets, feature stores, regression testing environments, and AI agent memory stores. It can be present in every environment and across every model, throughout the development lifecycle, creating significant organizational risk.

This matters because the volume of code and test data are growing sharply. With the rise of agentic AI, fewer humans are involved in the coding, and more are instructing AI agents to do it. 

That shift places greater emphasis on the need for test data because, as more code is generated, more of it needs to be tested. 

Agentic AI is increasingly capable of autonomously driving this process. As a result, many organizations are reporting that AI adoption is accelerating faster than their data privacy strategies can keep up.

Non-production environments and known risks

There is a persistent blind spot in how organizations think about data security. Production and non-production environments are treated very differently, even when both hold large volumes of sensitive data. Production environments are equipped with SOC monitoring, strict access controls, and incident response protocols. 

Non-production environments, on the other hand, include development, test, analytics, and AI. These environments were simply not built to withstand the same level of threats as production data, which is precisely why allowing real customer data, financial records, or health information to flow into them is high risk.

The problem is compounded by the economics of convenience, doing more of what is easier. DevOps culture has encouraged environment proliferation, such as spinning up multiple production-like clones, refreshing data regularly, and accelerating delivery pipelines. More environments mean more copies of data. 

When shortcuts feel low-risk, they become the default. Conversely, when properly governed, using techniques such as virtualization and masked data makes access just as frictionless, and teams do the right thing. The answer is not to restrict the data; it is to make compliance the path of least resistance.

“The answer is not to restrict the data; it is to make compliance the path of least resistance.”

Data governance frameworks were built for human workflows and allow for manual reviews, approval committees, and periodic audits. That model was already straining before AI arrived. With autonomous agents now capable of making hundreds or thousands of data requests per hour, it is simply incompatible with the new reality.

Governance needs to function as a service, with automated controls that enforce policy at the point of data delivery, in real time. Data compliance is increasingly being executed at runtime, meeting the ultimate requirement for continuous compliance. That puts much greater pressure on organizations to know exactly what kind of data they are working with, meaning that classification and data intelligence need to be embedded into the pipeline, not bolted on as an afterthought.

Building governance for the agentic era

DevOps best practices do not become obsolete in the age of agentic AI. In fact, they become more important. The 2026 State of DevOps Report reinforces that mature DevOps is foundational to AI success. The same holds for data governance. Here are the practices that matter most:

  • Embed compliance controls into the data pipeline itself, not as a downstream review step. Compliance logic should execute automatically when data is requested or delivered. This is what transforms governance from a bottleneck into a service.
  • Replace production data copies with virtualized, masked, and synthetic alternatives. Virtualized environments allow teams to spin up production-like database copies in seconds without moving raw sensitive data. Synthetic data can be generated to exact specifications, including edge cases that do not exist in production.
  • Use the Model Context Protocol (MCP) to provide agentic DevOps pipelines with a standard interface for testing data environments. MCP allows AI agents and developers to interact with data infrastructure through natural-language conversational prompts, removing the need to log in to multiple systems or wait for infrastructure teams to establish integrations. When requesting a properly governed data copy is as easy as cloning a production one, teams will do the right thing by default.
  • Shift from audit-based governance to runtime enforcement. Classification and tagging should be done initially, continuously revised, and then enforced at runtime.

Two scenarios illustrate how this plays out in modern engineering organizations. In the first case, a testing agent runs regression tests overnight and discovers it needs a fresh copy of a payments database, masked for PCI compliance. 

No human is available to approve the request. The agent calls a data API, receives a virtualized, masked copy within 90 seconds, completes its tests, and tears the environment down without a compliance ticket ever being raised.

In the second situation, a QA agent needs to test how a payment system handles 10,000 simultaneous expired credit cards during a leap year. That scenario does not exist in production data. The agent generates a synthetic dataset with exactly those characteristics, runs the tests, validates a fix, and closes the defect before the team’s morning stand-up. No real customer data was involved at any point.

What both scenarios share is a design philosophy of being compliant with production-quality data on demand. All this is done through an API or natural language interface, with policy enforcement built into the delivery mechanism, rather than applied as a gate afterward.

The SDLC has never been more productive, and it has never exposed more sensitive data to more systems, more agents, and more environments simultaneously. The window to get governance right is narrowing, as regulatory frameworks like the EU AI Act raise the bar for compliant AI development and data breaches in non-production environments continue to make headlines. 

“The SDLC has never been more productive, and it has never exposed more sensitive data to more systems, more agents, and more environments simultaneously.”

It’s no surprise that 86% of enterprises are looking to invest in AI and data privacy solutions, according to the 2025 State of AI and Data Privacy Report from Perforce Delphix.

The organizations that will navigate this well are not those that build manual compliance processes. They are the ones that redesign governance as infrastructure: automated, embedded, real-time, and built for a world where the primary consumers of data are autonomous systems operating at machine speed. Approached the right way, a solid backbone of trusted data tangibly accelerates innovation.  

That is not a future state. The tools, processes, and techniques are in place; now is the time for engineering leaders to build this foundation.

Perforce is the trusted partner to govern software delivery in the age of AI execution. Its solutions enforce guardrails across code, quality, infrastructure, and data—enabling innovation without introducing risk. Built for high-stakes, revenue-critical applications where failure isn’t an option. 
Learn More
The latest from Perforce
Hear more from our sponsor
TRENDING STORIES
Brian leads global teams to build category-defining DevOps for data products that help the world's largest enterprises move faster and safer across multi-cloud and AI environments. As the VP of Product at Perforce Delphix, he leads portfolio strategy for its...
Read more from Brian Muskoff
Perforce sponsored this post.
SHARE THIS STORY
TRENDING STORIES
SHARE THIS STORY
TRENDING STORIES
TNS DAILY NEWSLETTER Receive a free roundup of the most recent TNS articles in your inbox each day.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.