VOOZH about

URL: https://thenewstack.io/should-you-try-small-language-models-for-ai-app-development/

⇱ Should You Try Small Language Models for AI App Development? - The New Stack


TNS
SUBSCRIBE
Join our community of software engineering leaders and aspirational developers. Always stay in-the-know by getting the most important news and exclusive content delivered fresh to your inbox to learn more about at-scale software development.
REQUIRED
It seems that you've previously unsubscribed from our newsletter in the past. Click the button below to open the re-subscribe form in a new tab. When you're done, simply close that tab and continue with this form to complete your subscription.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.
Welcome and thank you for joining The New Stack community!
Please answer a few simple questions to help us deliver the news and resources you are interested in.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Great to meet you!
Tell us a bit about your job so we can cover the topics you find most relevant.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Welcome!

We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.

What’s next?

Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.

Follow TNS on your favorite social media networks.

Become a TNS follower on LinkedIn.

Check out the latest featured and trending stories while you wait for your first TNS newsletter.

PREV
1 of 2
NEXT
VOXPOP
As a JavaScript developer, what non-React tools do you use most often?
Angular
0%
Astro
0%
Svelte
0%
Vue.js
0%
Other
0%
I only use React
0%
I don't use JavaScript
0%
Thanks for your opinion! Subscribe below to get the final results, published exclusively in our TNS Update newsletter:
NEW! Try Stackie AI
From clobbered drafts to real-time sync
Apr 14th 2026 10:00am, by David Moore
TypeScript 6.0 RC arrives as a bridge to a faster future
Mar 14th 2026 9:00am, by Darryl K. Taft
Mastra empowers web devs to build AI agents in TypeScript
Jan 28th 2026 11:00am, by Loraine Lawson
2025-04-30 10:00:27
Should You Try Small Language Models for AI App Development?
sponsor-couchbase,sponsored-post-contributed,
AI / AI Agents / Large Language Models / Software Development

Should You Try Small Language Models for AI App Development?

SLMs may offer greater privacy, security and business opportunities than LLMs powering GenAI, but they aren’t right for every use case.
Apr 30th, 2025 10:00am by Mohan Varthakavi
👁 Featued image for: Should You Try Small Language Models for AI App Development?
Featured image by Leka Sergeeva on Shutterstock.
Couchbase sponsored this post.

Most enterprises share a common goal: to bring their most critical business operations as close as possible to the audiences they serve. This might mean making it as painless as possible for consumers to order groceries online; helping staff deal with customer queries and orders wherever they meet them on the shop floor; or demonstrating to regulators that a bank is swiftly identifying and dealing with fraud.

In a world that is increasingly embracing AI, this should be straightforward. Enterprises can create innovative applications that can address, and even predict, their audience’s needs. Yet creating these applications can be more challenging than ever. How can the enterprise ensure it creates applications quickly enough to meet demands that evolve as fast as, if not faster than, the technology? How can it ensure those applications will behave as expected and give users accurate, useful insights and experiences? And how can it do this at scale to reach thousands or millions of users?

While industries have been fixated on the potential of large language models (LLMs) powering generative AI (GenAI), small language models (SLMs) and other approaches are often a more effective solution.

The Right Tools for the Job

As with any project, when developing any new application, it’s important to understand its purpose and who it is for — and, increasingly, use this insight to select the right AI tool for the job. For instance, AI doomsayers are quick to repeat the downsides of GenAI powered by LLMs. They can be resource-intensive, and they require careful data management and training to ensure the AI doesn’t put sensitive data at risk, nor suffer from hallucinations or biases that make any conclusions unreliable at best.

But at the same time, there’s no rule saying an LLM is always the answer. While they provide powerful general capabilities, they are not equipped to answer every question pertaining to an organization’s specific business domain. A more specialized SLM, trained on domain-specific data, can be equally, if not more, effective.

Larger Isn’t Always Better

The benefits of SLMs are clear. Their focus on more specific data means they can more easily and accurately answer specific queries, either by themselves or in concert with an LLM. For applications expecting to fulfill a specialized function or answer specialized queries, this can be invaluable.

There are also the advantages of security and privacy. Unlike an LLM that may rely on public data, an SLM can more easily be made proprietary: the algorithm, the training and — most importantly — the data are all controlled by the owner organization.

Whether an organization is concerned about sharing business-critical or personal data beyond its walls, worried that it will accidentally access and use copyrighted or otherwise sensitive data from another source, or fears finding its language model has been corrupted through exposure to conflicting or irrelevant data, an SLM can offer some peace of mind.

Remember the Basics

SLMs have a number of advantages over other approaches to AI. But, like any other technology, this doesn’t mean they are perfect. Instead, enterprises need to understand what approach best fits their needs and then ensure they have everything in place to make that approach succeed.

For instance, an SLM’s effectiveness is still completely reliant on data. If the organization cannot manage data at speed and at scale, it will inevitably find its AI applications are compromised. A key expectation of modern AI is that it works in real time and can increasingly decipher and speak in human-like language. Any data architecture needs to be able to support this, or applications will be missing vital capabilities.

There is also the question of where and how the SLM processes data. SLMs can be implemented directly onto mobile devices, and the ability to operate directly at the edge will be essential in many use cases. This is not only to ensure applications can operate when disconnected from a central server, but also to minimize latency and ensure the fastest possible operation. For instance, take the drive to build more effective autonomous vehicles. Most manufacturers, not to mention passengers, will want this AI to operate immediately, with no lag.

SLMs have a number of advantages over other approaches to AI. But, like any other technology, this doesn’t mean they are perfect.

It is important to note that many foundational models also offer smaller versions. The smaller versions of foundational models offer several advantages, including increased efficiency, lower computational costs and reduced latency, making them ideal for deployment on edge devices and local machines. They are also more interpretable, easier to fine-tune and more environmentally friendly due to lower energy consumption. However, they come with trade-offs such as reduced accuracy, limited knowledge retention and weaker adaptability, often requiring more fine-tuning for specific tasks.

Finally, there is the question of who owns and manages the data itself. Maintaining control over data is critical, not only to ensure privacy and compliance with a range of data protection laws, but to ensure that the data any language model is learning from and using is accurate, complete and entirely trusted by the organization.

Choosing Your Model

Choosing the right model size depends on balancing performance needs with resource constraints. SLMs can offer businesses a host of opportunities and prove that when it comes to AI, bigger doesn’t always equal better.

Choosing models (large or small) is part of the workflow that Couchbase helps to support. Couchbase offers vector search across its entire product line, including support for vector similarity search on mobile devices. The company also launched Capella AI Services to facilitate AI-enabled agent development and deployment workflows. Sign up to try Couchbase Capella for free. A private preview of Capella AI Services is also available.

Couchbase delivers Capella, the cloud database platform for modern applications. Capella enables developers and architects to quickly build the apps of the future and deliver always-on experiences to customers, on a mission to simplify how businesses develop, deploy and consume modern applications.
Learn More
The latest from Couchbase
TRENDING STORIES
Mohan Varthakavi is vice president of software development, AI and edge at Couchbase. He previously held executive roles at Cruise, AWS and Microsoft.
Read more from Mohan Varthakavi
Couchbase sponsored this post.
SHARE THIS STORY
TRENDING STORIES
SHARE THIS STORY
TRENDING STORIES
TNS DAILY NEWSLETTER Receive a free roundup of the most recent TNS articles in your inbox each day.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.