
URL: https://thenewstack.io/nvidia-launches-ai-guardrails-llm-turtles-all-the-way-down/

Nvidia Launches AI Guardrails: LLM Turtles All the Way Down - The New Stack



Nvidia has announced NeMo Guardrails for AI applications, but is the toolkit overly reliant on LLMs for its safety and security measures?
Apr 25th, 2023 6:00am by Richard MacManus
[Featured image via Nvidia]

Nvidia has announced a new safety toolkit for AI chatbots called NeMo Guardrails, which acts as a kind of censor for applications built on large language models (LLMs). The software has been released as an open source project.

Jonathan Cohen, VP of Applied Research at Nvidia, spoke about the new software yesterday in a briefing with journalists.

“A guardrail is a guide that helps keep the conversation between a human and an AI on track,” said Cohen.

According to the company, NeMo Guardrails enables developers to set up three kinds of boundaries:

  • Topical guardrails to “prevent apps from veering off into undesired areas.” The example Cohen used in the briefing was an employee asking an HR chatbot which employees had adopted children. The guardrails prevented the chatbot from attempting to answer this.
  • Safety guardrails, a broad category that covers fact-checking (to prevent hallucinations), filtering out unwanted language, and blocking hateful content.
  • Security guardrails “restrict apps to making connections only to external third-party applications known to be safe.”
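
To make the security guardrail idea concrete, here is a minimal Python sketch of an allowlist check on outbound connections. It is not NeMo Guardrails code: the hostnames and the function names `is_connection_allowed` and `call_external_app` are hypothetical, and the real toolkit may enforce this differently.

```python
from urllib.parse import urlparse

# Hypothetical allowlist; these hostnames are placeholders, not from Nvidia.
ALLOWED_HOSTS = {"api.example.com", "hooks.example.org"}


def is_connection_allowed(url: str) -> bool:
    """Return True only for HTTPS URLs whose host is on the allowlist."""
    parsed = urlparse(url)
    return parsed.scheme == "https" and parsed.hostname in ALLOWED_HOSTS


def call_external_app(url: str) -> str:
    """Gate an outbound call; the actual network request is left out of this sketch."""
    if not is_connection_allowed(url):
        return "blocked"
    return "allowed"
```

Checking the parsed hostname, rather than searching the raw URL string, avoids being fooled by an allowed name that appears in a path or query string.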

Developers can also create their own custom rules “with a few lines of code.”

NeMo Guardrails can run on a variety of tools that use LLMs. The primary one mentioned in the briefing was LangChain, an open source toolkit used by developers to plug third-party applications into LLMs.

It can also work with LLM-enabled applications such as Zapier.

While NeMo Guardrails can be used separately via GitHub, Nvidia is also integrating it into a couple of its own product offerings. It’s available in the NeMo framework, “which includes everything users need to train and tune language models using a company’s proprietary data.” In addition, Nvidia has made it available as a separate paid-for service.

Fact-Checking Using Other LLMs

Interestingly, the fact-checking mentioned as part of the safety guardrails is done not by a human… but by another LLM. Cohen explained that this is because organizations can customize and train an LLM to be a fact-checker on specific data.

“There [are] very general purpose language models,” he said, “but there’s also a lot of value in training a language model with a lot of data on a very specific task, and we have a lot of evidence — and the community has a lot of evidence — that when you fine-tune these models with lots of examples, they actually can perform much better.”
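
The pattern Cohen describes, one model checking another, can be sketched in Python as follows. This is an illustration under stated assumptions, not Nvidia's implementation: `guarded_answer` is a hypothetical helper, and the two stub functions stand in for real LLM calls (the stub checker only does a substring match against the evidence, where a real one would be a fine-tuned model).

```python
from typing import Callable

Answerer = Callable[[str], str]        # question -> answer
Checker = Callable[[str, str], bool]   # (evidence, answer) -> is the answer supported?


def guarded_answer(
    question: str,
    evidence: str,
    answer_model: Answerer,
    fact_checker: Checker,
    fallback: str = "I'm not sure about that.",
) -> str:
    """Return the model's answer only if the checker says the evidence supports it."""
    answer = answer_model(question)
    return answer if fact_checker(evidence, answer) else fallback


def stub_answer_model(question: str) -> str:
    """Stand-in for the primary LLM; always gives the same answer."""
    return "The office opens at 9am."


def stub_fact_checker(evidence: str, answer: str) -> bool:
    """Stand-in for a fact-checking LLM; here just a case-insensitive substring test."""
    return answer.lower() in evidence.lower()
```

Passing both models in as callables keeps the guard independent of any particular LLM provider, so the checker can be swapped for a model trained on an organization's own data.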

[Image: Nvidia Guardrails. Source: Nvidia]

Colang

In a technical blog post, Nvidia stated that NeMo Guardrails is built on Colang, a modeling language, and its associated runtime for conversational AI. Cohen described it as “a domain-specific language for describing conversational flows.”

According to Nvidia, interacting with Colang is “like a traditional dialog manager.” You create guardrails by using three key concepts:

  • Canonical form (“a simplified paraphrase of an utterance”)
  • Messages
  • Flows (“a tree or a graph of interactions between the user and the bot”)
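
As a rough illustration of how these three concepts relate, the Python sketch below models them as plain data structures. It is not Colang and not the NeMo Guardrails API: the class names, fields, and the example flow are all hypothetical, and the flow is simplified to a linear sequence rather than a tree or graph.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class Message:
    """A canonical form plus example utterances that it paraphrases."""
    speaker: str                 # "user" or "bot"
    canonical_form: str          # e.g. "ask about politics"
    examples: Tuple[str, ...]


@dataclass(frozen=True)
class Flow:
    """An ordered list of canonical forms: who says what, in sequence."""
    name: str
    steps: Tuple[Tuple[str, str], ...]   # (speaker, canonical_form) pairs

    def next_bot_step(self, user_form: str) -> Optional[str]:
        """Return the bot's canonical form that follows the given user form, if any."""
        for current, following in zip(self.steps, self.steps[1:]):
            if current == ("user", user_form) and following[0] == "bot":
                return following[1]
        return None


# Hypothetical example data, not taken from Nvidia's documentation.
ask_politics = Message("user", "ask about politics", ("Who should I vote for?",))
refuse_politics = Message("bot", "refuse politics", ("I can't discuss politics.",))
politics_flow = Flow(
    "politics",
    (("user", "ask about politics"), ("bot", "refuse politics")),
)
```

In this sketch the flow refers to messages only by canonical form, so the example utterances can change without touching the flow.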

“Colang has this really nice property, that Colang source code reads very much like natural language,” said Cohen, “and so it’s a very easy-to-use tool. It’s very powerful and it lets you essentially script a language model in something that looks almost like English.”

[Image: an example of Colang code]

As for how the guardrails are implemented, Cohen explained that it’s a Python module that runs Colang scripts. The runtime “monitors the human speaking and the bot speaking, and tracks the state of the dialogue.”

The key, according to Cohen, is that the runtime is “able to determine whether a guardrail applies or not.” Once again, however, LLMs are used to make this determination.
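
A toy version of such a runtime might look like the Python sketch below. It is an assumption-laden illustration, not the NeMo Guardrails module: `DialogueRuntime`, `RAILS`, and the keyword lists are hypothetical, and simple keyword matching stands in for the LLM that the real runtime uses to decide whether a guardrail applies.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical rails: canonical form -> (trigger keywords, scripted bot reply).
RAILS: Dict[str, Tuple[Tuple[str, ...], str]] = {
    "ask about personal data": (
        ("adopted", "salary", "medical"),
        "Sorry, I can't share personal information about employees.",
    ),
}


class DialogueRuntime:
    """Tracks both sides of the conversation and applies a rail when one matches."""

    def __init__(self, rails: Dict[str, Tuple[Tuple[str, ...], str]]) -> None:
        self.rails = rails
        self.history: List[Tuple[str, str]] = []   # (speaker, text) pairs

    def match_rail(self, utterance: str) -> Optional[str]:
        """Keyword stand-in for the LLM step that decides whether a guardrail applies."""
        text = utterance.lower()
        for form, (keywords, _reply) in self.rails.items():
            if any(keyword in text for keyword in keywords):
                return form
        return None

    def respond(self, utterance: str, llm_reply: str) -> str:
        """Record the user turn, then return the scripted reply or the model's reply."""
        self.history.append(("user", utterance))
        form = self.match_rail(utterance)
        reply = self.rails[form][1] if form is not None else llm_reply
        self.history.append(("bot", reply))
        return reply
```

Here `llm_reply` is passed in by the caller to keep the sketch self-contained; in a real system the runtime would call the underlying model itself when no rail matches.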

Conclusion

One thing that’s noticeable about Nvidia’s new guardrails software is that it’s “LLMs all the way down”, to paraphrase the old “turtles all the way down” adage.

Cohen defended this by saying, “Why wouldn’t we use large language models? [It’s] such a powerful technology for in-context understanding and generalizing, and this kind of fuzzy inferencing.”

Of course, the heavy reliance on LLMs does make one wonder about the reliability of the system, from a fact-checking and security perspective. But this is surely why it’s being released as open source software — to let the community deal with that can of worms.

Richard MacManus is a Senior Editor at The New Stack and writes about web and application development trends. Previously he founded ReadWriteWeb in 2003 and built it into one of the world’s most influential technology news sites. From the early...