URL: https://thenewstack.io/cohere-sovereign-coding-model-north-mini-code/

Cohere sold sovereign AI to enterprises, now it’s targeting developers with its first coding model

Nick Frosst says North Mini Code is a response to developers demanding the same sovereignty guarantees that regulated industries have long required.
Jun 15th, 2026 10:54am by Paul Sawers
Canadian foundation model company Cohere has spent the past few years selling a specific idea to banks, governments, and healthcare providers: that AI should run on their infrastructure, under their control, with their data never leaving the perimeter.

Cohere’s pitch went down well in regulated industries. Now the company is taking it to a different audience, with the launch of North Mini Code — its first coding model, released under an Apache 2.0 license from the get-go.

Model access as infrastructure

The sovereignty argument Cohere has long made to enterprise customers is, at its root, about ownership. Regulated industries have hard requirements: data can’t leave certain boundaries, and the intelligence layer running on sensitive infrastructure needs to be something the organization controls. That requirement shaped how Cohere built its products — deployable anywhere, runnable on private infrastructure.

What’s changed, according to Cohere co-founder Nick Frosst, is who is asking those same questions.

“We’re now hearing similar concerns from developers,” Frosst tells The New Stack. “They’re starting to think of model access as infrastructure, and infrastructure should be something you own and control. That is an extension of sovereignty.”

North Mini Code is a direct response to that demand. It’s a 30-billion-parameter Mixture of Experts (MoE) model with just 3 billion active parameters and is designed for agentic coding tasks: the kind of multi-step, tool-using work that coding agents like Claude Code and Cursor are built around.

Cohere says it runs on a single Nvidia H100 GPU, making self-hosting practical without a larger multi-GPU deployment. Developers who would rather not manage their own infrastructure can access it via API instead.
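
As a rough back-of-the-envelope check on the single-GPU claim, the sketch below estimates the memory needed just to hold the model's weights at a few common precisions. The 30 billion and 3 billion parameter counts come from the figures above. The 80 GB capacity (one common H100 configuration), the choice of precisions, and the helper name `weight_memory_gb` are assumptions for illustration, and the estimate ignores KV cache, activations, and runtime overhead.

```python
# Rough weight-memory estimate for a Mixture of Experts model.
# Parameter counts are from the article; everything else is an assumption.

TOTAL_PARAMS = 30e9      # total parameters (article)
ACTIVE_PARAMS = 3e9      # parameters active per token (article)
H100_MEMORY_GB = 80.0    # assumption: 80 GB H100 variant

# Assumption: common storage precisions, in bytes per parameter.
BYTES_PER_PARAM = {"bf16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gb(num_params: float, bytes_per_param: float) -> float:
    """Memory needed to store the weights alone, in GB (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9


if __name__ == "__main__":
    for precision, nbytes in BYTES_PER_PARAM.items():
        needed = weight_memory_gb(TOTAL_PARAMS, nbytes)
        headroom = H100_MEMORY_GB - needed
        print(f"{precision}: {needed:.0f} GB weights, {headroom:.0f} GB left over")

    # Share of the weights used for any single token.
    print(f"active fraction: {ACTIVE_PARAMS / TOTAL_PARAMS:.0%}")
```

Under these assumptions, 16-bit weights come to roughly 60 GB, which leaves some headroom on an 80 GB card. In a typical MoE deployment every expert stays resident in memory, so the full 30 billion parameters count toward the footprint, while the 3 billion active parameters mainly determine per-token compute.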

“We want to give developers a capable, fast, open-weight model they can run locally on their own terms, and that fits in their compute environments,” Frosst says.

Cohere claims North Mini Code outperforms comparable open-weight models, including Alibaba’s Qwen3 and Google’s Gemma 4, on the Artificial Analysis Coding Index, where it scores 33.4. The company also says the model delivers up to 2.8x higher output throughput than Mistral’s Devstral Small 2 on identical hardware.

Cohere’s own benchmark testing shows North Mini Code leading on terminal and code generation tasks, but results are mixed across the full evaluation suite: Qwen 3.6 is ahead on SWE-Bench Verified and LiveCodeBench v6, as Cohere’s chart illustrates. Those comparisons are based on Cohere’s own testing and should be taken as indicative.

North Mini Code’s performance in agentic software engineering and terminal tasks, along with complex code generation benchmarks, compared to leading open-source models of a similar size. (Credit: Cohere)

A growing club

Cohere’s timing puts it alongside a growing group of international companies that have made open-weight coding models a deliberate product choice. Mistral, the Paris-based AI company, launched Devstral in May 2025 — its first dedicated agentic coding model, also under Apache 2.0 — and followed it with Devstral 2 in December. JetBrains, the Czech developer tools company, recently open-sourced Mellum2, its second-generation coding model.

The emphasis differs. Mistral has explicitly linked open weights to AI sovereignty and the ability to deploy models on private infrastructure, while JetBrains focuses on latency, cost and deployment flexibility. In practice, both approaches give developers and enterprises more control over where models run and how they are operated.

Owning the infrastructure

The appetite for open-weight alternatives to frontier models is clearly there. AI agent platform Lindy recently announced it had moved 100% of its inference traffic from Anthropic to China’s DeepSeek, saying the switch would save the company millions while actually improving performance on its core use cases. Lindy’s CEO Flo Crivello addressed the obvious question about routing through a Chinese-developed model: the company uses Atlas Cloud, a US-based inference provider that hosts DeepSeek on American soil. The open-weight nature of DeepSeek made that possible — the model can be hosted by any provider, in any jurisdiction.

That’s precisely the dynamic Frosst is pointing to. Open weights give developers optionality that a proprietary API does not: the ability to choose where the model runs, who operates it, and under what terms. For companies whose inference bill has grown to exceed payroll — as Crivello noted is the case at Lindy — those are decisions with real commercial consequences.

Cohere’s Command family — its flagship line of enterprise models built for agentic, multilingual, and multimodal tasks — had previously shipped as open-weight models under more restrictive licenses. With Command A+, the company moved to Apache 2.0 in May, making the legal terms around use and redistribution significantly more permissive.

Frosst draws a direct line between the enterprise sovereignty argument Cohere has made for years and the thinking behind North Mini Code. The open-source coding model, he says, is a response to the same concentration problem Cohere saw in enterprise AI — only now playing out at the developer layer.

“Open-source development was concentrated in a small number of jurisdictions, and organizations running critical infrastructure had no reliable alternative,” Frosst says. “North Mini Code extends that thinking to the developer layer. As coding agents become the infrastructure software engineering runs on, whoever controls those systems controls how they work, how they evolve, and what they’re optimized for. We think that developers and enterprises should be in control.”

TNS owner Insight Partners is an investor in: Anthropic.