VOOZH about

URL: https://thenewstack.io/the-pros-and-con-of-customizing-large-language-models/

⇱ The Pros (And Con) of Customizing Large Language Models - The New Stack


TNS
SUBSCRIBE
Join our community of software engineering leaders and aspirational developers. Always stay in-the-know by getting the most important news and exclusive content delivered fresh to your inbox to learn more about at-scale software development.
REQUIRED
It seems that you've previously unsubscribed from our newsletter in the past. Click the button below to open the re-subscribe form in a new tab. When you're done, simply close that tab and continue with this form to complete your subscription.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.
Welcome and thank you for joining The New Stack community!
Please answer a few simple questions to help us deliver the news and resources you are interested in.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Great to meet you!
Tell us a bit about your job so we can cover the topics you find most relevant.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Welcome!

We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.

What’s next?

Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.

Follow TNS on your favorite social media networks.

Become a TNS follower on LinkedIn.

Check out the latest featured and trending stories while you wait for your first TNS newsletter.

PREV
1 of 2
NEXT
VOXPOP
As a JavaScript developer, what non-React tools do you use most often?
Angular
0%
Astro
0%
Svelte
0%
Vue.js
0%
Other
0%
I only use React
0%
I don't use JavaScript
0%
Thanks for your opinion! Subscribe below to get the final results, published exclusively in our TNS Update newsletter:
NEW! Try Stackie AI
From clobbered drafts to real-time sync
Apr 14th 2026 10:00am, by David Moore
TypeScript 6.0 RC arrives as a bridge to a faster future
Mar 14th 2026 9:00am, by Darryl K. Taft
Mastra empowers web devs to build AI agents in TypeScript
Jan 28th 2026 11:00am, by Loraine Lawson
2023-07-03 10:08:26
The Pros (And Con) of Customizing Large Language Models
AI / Frontend Development / Large Language Models / Software Development

The Pros (And Con) of Customizing Large Language Models

Tabnine has added a natural language chat to its AI assistant, giving developers access to a custom-trained developer bot from within the IDE.
Jul 3rd, 2023 10:08am by Loraine Lawson
👁 Featued image for: The Pros (And Con) of Customizing Large Language Models
Image via Unsplash

Before the artificial intelligence collective destroy humanity, it’s going to have to settle a few things in its own house first. For instance: Which is better — a large language model (LLM) or a custom-trained LLM?

Tabnine’s is team custom-trained, while its primary competitor — GitHub Copilot — is driven by the more broadly trained LLM GPT-4.

The company added chat functionality to its AI-powered assistant for developers last week. The chat assistant can write code and answer questions from the model, which is trained on fully permissive open source code, but it can also be custom-trained on an enterprise’s code base, said Brandon Jung, vice president of Ecosystem at Tabnine.

“Tabnine was the first AI code assistant on the planet,” Jung stressed to The New Stack in a recent interview. “Tabnine has a unique UI/UX, we use three LLMs at any one time — one local, one for longer completions and one for chat — and they’re purpose-built. …The second piece is we build our own models. ”

The chat is available for free early access now but will be generally available to Pro and Enterprise customers by fall.

Custom Versus General Purpose

By custom training a model on its own code, enterprises can ensure the languages they want are supported and therefore leverage (we hope) secure code for its examples. The model is then run on-premise or on the company’s virtual private cloud (VPC). Jung drops a few names: Samsung, the Israeli Army and Telsa run it on-premise, while Accenture runs it in a VPC.

“I don’t mean this to be flippant, but students aren’t committing code that has to go to code base, [but] it’s not part of the software development piece, it’s just build and learn how to do it, so a Copilot’s a good fit,” he said. “But if you’re Samsung, you’re going to say I have some of the most valuable code on the planet, that I don’t want leaked, I don’t want to be in someone’s data. And I need to build a model that my thousands of developers are getting suggestions from my code base, not from some random open source somewhere.”

If a new developer wants to know how to use an API, the code will pull up the details from a company’s specific code, rather than code for generic APIs, Jung said. It can also be used to document code that maybe has bad documentation or no documentation. Another differentiator for Tabnine is that the AI assistant integrates with popular IDEs, which means you don’t have to rely on a plugin being available.

“We may think general AI is magic, but at the end of the day it’s just machine learning,” he said. “It’s good data in, good data out, bad in and bad data out.”

Implications for the Frontend

For the frontend, custom training may not have the impact it will with backend systems, Jung acknowledged. But it can still be useful to get code that mirrors the specifications the development team has for its frontend coding, for instance.

“You also may have some very specific patterns for how you’re doing that front end, and you want that repeated. Citibank wants their frontend JavaScript to all look the same. So there’s still value in it,” he said.

There are tradeoffs in a custom trade model: You do have to know where you’ll be using the model, Jung said. A pocket knife is a good general tool, but it’s not as effective as a torque wrench for a lug nut, he pointed out. If the bulk of the custom model is trained on backend languages, it might not be as good a fit for a JavaScript frontend.

That’s one of the mistakes Jung says underlies our understanding of generative AI.

“The idea of a universal model sits as the way people think about it — I don’t have to know anything, I throw everything in the black box, and it gives me magic answer,” he said. “When you get to a customized model, you have to know.”

TRENDING STORIES
Loraine Lawson is a veteran technology reporter who has covered technology issues from data integration to security for 25 years. Before joining The New Stack, she served as the editor of the banking technology site Bank Automation News. She has...
Read more from Loraine Lawson
SHARE THIS STORY
TRENDING STORIES
SHARE THIS STORY
TRENDING STORIES
TNS DAILY NEWSLETTER Receive a free roundup of the most recent TNS articles in your inbox each day.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.