VOOZH about

URL: https://thenewstack.io/mitigating-safety-risks-with-ai-powered-applications/

⇱ Mitigating Safety Risks with AI-Powered Applications - The New Stack


TNS
SUBSCRIBE
Join our community of software engineering leaders and aspirational developers. Always stay in-the-know by getting the most important news and exclusive content delivered fresh to your inbox to learn more about at-scale software development.
REQUIRED
It seems that you've previously unsubscribed from our newsletter in the past. Click the button below to open the re-subscribe form in a new tab. When you're done, simply close that tab and continue with this form to complete your subscription.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.
Welcome and thank you for joining The New Stack community!
Please answer a few simple questions to help us deliver the news and resources you are interested in.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Great to meet you!
Tell us a bit about your job so we can cover the topics you find most relevant.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Welcome!

We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.

What’s next?

Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.

Follow TNS on your favorite social media networks.

Become a TNS follower on LinkedIn.

Check out the latest featured and trending stories while you wait for your first TNS newsletter.

PREV
1 of 2
NEXT
VOXPOP
As a JavaScript developer, what non-React tools do you use most often?
Angular
0%
Astro
0%
Svelte
0%
Vue.js
0%
Other
0%
I only use React
0%
I don't use JavaScript
0%
Thanks for your opinion! Subscribe below to get the final results, published exclusively in our TNS Update newsletter:
NEW! Try Stackie AI
From clobbered drafts to real-time sync
Apr 14th 2026 10:00am, by David Moore
TypeScript 6.0 RC arrives as a bridge to a faster future
Mar 14th 2026 9:00am, by Darryl K. Taft
Mastra empowers web devs to build AI agents in TypeScript
Jan 28th 2026 11:00am, by Loraine Lawson
2024-08-16 05:05:00
Mitigating Safety Risks with AI-Powered Applications
sponsor-vmware,sponsored-post-contributed,
AI / Large Language Models / Security

Mitigating Safety Risks with AI-Powered Applications

Organizations using GenAI to enhance end-user experiences need to take a risk-management approach that addresses regulation and security.
Aug 16th, 2024 5:05am by Serdar Badem and Camille Crowell-Lee
👁 Featued image for: Mitigating Safety Risks with AI-Powered Applications
Featured image by Dario Morandotti on Unsplash.
VMware Tanzu sponsored this post.

Enterprises want to take advantage of efficient customer interactions by adding generative AI (GenAI) to their chatbot applications. However, navigating the safety concerns and risks of incorporating large language models (LLMs) into existing or new applications can leave a lot of open questions.

According to the analyst firm IDC, 30% of organizations cite “loss of control over data and IP” (intellectual property) as a hurdle to adopting GenAI. But data loss is just one of the risks you need to plan around when implementing GenAI in your applications.

Get Prepared: Regulation Is Coming to AI

The EU AI Act, along with regulatory frameworks in the United States, China and India, are setting guidelines to mitigate the risks associated with AI-powered applications.

Besides being prepared for emerging regulation, defining a safety strategy for your AI-powered apps is essential to avoid security pitfalls. Consider the business impacts and how to mitigate risks when incorporating the LLMs needed for delivering GenAI applications.

Here are the elements to consider to incorporate GenAI into your applications, safely:

  • Content anomaly detection. Monitor outputs for anomalies like inaccurate responses or hallucinations to help ensure content is appropriate and correct. Models have been known to misbehave (think of chatbots that recommended adding glue to pizza), so they need constant monitoring and tuning.
  • Data protection. Safeguard data privacy and security throughout the AI life cycle to prevent leakage of private data. For example, in the event that chatbot users accidentally input personal, identifiable information into the model, redaction strategies should be implemented in the model to prevent proliferating that data.
  • Accidental IP infringement. LLMs trained on large datasets can inadvertently leak sensitive information. Queries designed to exploit these errors can extract confidential data, posing privacy risks. Organizations should also consider the provenance of the data in their base model. Content should be ethically sourced and not indiscriminately scraped from the web or based on “fair use” rules (think Andersen v. Stability AI in 2023).
  • Explainability and transparency. Ensure that AI models provide clear and understandable outputs. This enables easier troubleshooting and identification of errors or biases in the model, key to building trust for your organization’s brand.
  • Application security. Implement security measures to protect intelligent applications from vulnerabilities. Just like any application, code security is paramount.

Adversarial Resistance: Protect Your LLMs From Bad Actors

There are inadvertent missteps as outlined above, but what if you are a victim of bad actors?

Malicious attackers that attempt to exploit your AI-powered applications can quickly become an issue if you have not properly prepared.

LLMs are vulnerable to these adversarial attacks because they are stateless and malleable. If a bad actor injects inputs that are intentionally crafted to confuse the model, they can force it to produce incorrect or harmful outputs.

👁 Model protection and safety considerations

Safety considerations in using a large language model. (Source: VMware by Broadcom)

Protecting your models from being compromised is essential for ensuring you don’t end up in a model jailbreak scenario. The risks of not adopting a proactive security stance for your LLMs can be devastating to your organization’s brand in several ways.

  • Spreading misinformation. LLMs can be used to spread false or misleading information deliberately. This includes generating fake news, conspiracy theories or biased content that can misinform and manipulate public opinion (consider chatbots that answered U.S. elections questions incorrectly 27% of the time).
  • Harmful prompts and responses. LLMs can be prompted to generate harmful, dangerous or illegal content. This includes instructions on creating weapons, committing crimes or self-harm.
  • Facilitation of criminal activities. LLMs can assist in planning and executing criminal activities by providing detailed instructions on illegal activities or helping to coordinate illicit operations.
  • Bias and discrimination. LLMs can perpetuate and amplify biases present in their training data, leading to discriminatory or prejudiced outputs. This can harm marginalized groups and reinforce stereotypes. (think of Microsoft‘s Tay launch).
  • Evasion of safety mechanisms. Fine-tuning with harmful data can easily compromise the safety mechanisms of base LLMs. This allows malicious actors to create versions of the models that ignore or bypass built-in safety features.
  • Manipulation and deception. Advanced LLMs can create realistic fake content, including deepfakes, that can deceive individuals. This poses risks in personal, professional and political contexts (consider debunked deepfake audio from a British politician).
  • Interaction with vulnerable users. Interactions with LLMs can adversely affect vulnerable users. For example, individuals experiencing mental health crises might receive responses that exacerbate their condition. Over-relying on LLMs for critical decisions without adequate oversight may lead to poor decision-making and harmful outcomes.

Benefits and Action Plan for Risk Mitigation

Organizations using GenAI to enhance end-user experiences need to take a risk-management approach that addresses people and technology transformation. With a proper AI risk-mitigation plan in place, you can capitalize on GenAI, safely.

Model safety is a new consideration when you are implementing intelligent software delivery programs. Assessing all the building blocks that go into a retrieval-augmented generation (RAG) app is crucial for complying with appropriate parameters downstream, so being intentional early on with your plan will help you avoid problems with your model later.

Implementing a mitigation plan not only helps ensure clear governance over data, application design, training processes and outputs, but it also increases confidence in the decisions made by AI models. With stronger security measures and trustworthy AI outputs, organizations can gain their customers’ trust and increase customer satisfaction.

Here are activities for improving the risk, security management and governance of your AI-powered apps.

  • Identify governance needs. Assess the risks specific to your organization, including the data sources used, potential biases and security threats. Identify areas where controls are needed to mitigate these risks.
  • Establish a governance structure. Create and implement a governance structure that includes clear roles and responsibilities for managing AI models and their outputs. This structure should also outline processes for handling potential issues or incidents from both a personnel and technology standpoint.
  • Create an AI model inventory. Keep track of all AI models in use, their purpose and the associated risks for proper governance and risk management. Understand the potential risks associated with each AI project you undertake and determine which components of your policy apply.
  • Design your intelligent apps for security. Design your AI applications with security in mind, including using secure coding practices and performing regular vulnerability assessments. Use industry-standard security measures such as encryption, authentication and access control to safeguard data and models.
  • Continuously monitor for anomalies. Regularly monitor and assess AI model outputs for anomalies or potential biases, using tools such as content anomaly detection systems or bias detection algorithms. Continuously monitor for vulnerabilities that may compromise the safety and trustworthiness of your application.
  • Implement explainability and transparency. Incorporate interpretability techniques into your model design to provide transparent outputs. Ensure that your AI models can provide clear explanations for their decisions, and make sure all stakeholders understand the reasoning behind these outputs.
  • Train your team on best practices and reevaluate regularly. Educate your team on the latest best practices, including security measures, explainability techniques and data protection protocols. As new risks arise or data sources change, it’s crucial to update and reevaluate your AI trust, risk and security management controls regularly to maintain a robust security posture.

Enterprises are anxious to move from experimentation with AI to running intelligent workloads in production safely. While AI brings unique risks, reinforcing fundamental cloud native patterns and practices can help you get there more quickly.


If you are considering refactoring or building new GenAI apps, safety is just one of the considerations. Join VMware and Forrester AI experts on August 21 for an insightful discussion where we will cover the latest GenAI trends and essential considerations to help you execute on a comprehensive AI strategy quickly. Register today!

Trusted by enterprises and loved by developers, VMware Tanzu is built for platform and data teams who want to accelerate agentic software delivery and AI-ready data. Tanzu provides a pre-engineered, agentic app platform and an AI-ready data intelligence platform that helps enterprises build, run, manage and safeguard agents, their integrations and data so you can capitalize on AI at scale. 
Learn More
The latest from VMware Tanzu
Hear more from our sponsor
TRENDING STORIES
Serdar is the Product Management Lead for Tanzu AI Solutions, a new initiative at VMware by Broadcom focused on helping customers develop, operate and optimize AI-powered applications. Before joining VMware, Serdar led the product management team for Uber’s developer platform,...
Read more from Serdar Badem
Camille Crowell-Lee is a solutions marketing leader who focuses on VMware Tanzu by Broadcom. She has been in technology marketing for over 17 years where she has built strategic marketing initiatives for hyperscale cloud providers and for ISVs, including containerization...
Read more from Camille Crowell-Lee
VMware Tanzu sponsored this post.
SHARE THIS STORY
TRENDING STORIES
Microsoft is also a sponsor of The New Stack.
SHARE THIS STORY
TRENDING STORIES
TNS DAILY NEWSLETTER Receive a free roundup of the most recent TNS articles in your inbox each day.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.