VOOZH about

URL: https://thenewstack.io/lambda-labs-hyperscaler-focuses-on-the-ai-developer/

⇱ Lambda HyperScaler Focuses on the AI Developer - The New Stack


TNS
SUBSCRIBE
Join our community of software engineering leaders and aspirational developers. Always stay in-the-know by getting the most important news and exclusive content delivered fresh to your inbox to learn more about at-scale software development.
REQUIRED
It seems that you've previously unsubscribed from our newsletter in the past. Click the button below to open the re-subscribe form in a new tab. When you're done, simply close that tab and continue with this form to complete your subscription.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.
Welcome and thank you for joining The New Stack community!
Please answer a few simple questions to help us deliver the news and resources you are interested in.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Great to meet you!
Tell us a bit about your job so we can cover the topics you find most relevant.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Welcome!

We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.

What’s next?

Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.

Follow TNS on your favorite social media networks.

Become a TNS follower on LinkedIn.

Check out the latest featured and trending stories while you wait for your first TNS newsletter.

PREV
1 of 2
NEXT
VOXPOP
As a JavaScript developer, what non-React tools do you use most often?
Angular
0%
Astro
0%
Svelte
0%
Vue.js
0%
Other
0%
I only use React
0%
I don't use JavaScript
0%
Thanks for your opinion! Subscribe below to get the final results, published exclusively in our TNS Update newsletter:
NEW! Try Stackie AI
From clobbered drafts to real-time sync
Apr 14th 2026 10:00am, by David Moore
TypeScript 6.0 RC arrives as a bridge to a faster future
Mar 14th 2026 9:00am, by Darryl K. Taft
Mastra empowers web devs to build AI agents in TypeScript
Jan 28th 2026 11:00am, by Loraine Lawson
2025-03-06 11:05:13
Lambda HyperScaler Focuses on the AI Developer
AI Operations / Frontend Development / Serverless / Software Development

Lambda HyperScaler Focuses on the AI Developer

Lambda is an AI cloud hyperscaler built by machine learning engineers, but it has evolved into a platform for all AI developers.
Mar 6th, 2025 11:05am by Loraine Lawson
👁 Featued image for: Lambda HyperScaler Focuses on the AI Developer
Image by Getty Images via Unsplash.

There’s a little secret that Lambda, an artificial intelligence (AI) hyperscaler, knows: Not all AI developers are software developers, suggested Robert Brooks IV, a Lambda founding team member and the vice president of revenue.

“These folks are not necessarily software engineers. They’re experts in math; that is what their skill set is,” Brooks told The New Stack. “If you allow those people to not have to focus on the DevOps side of running a cloud and just focus on the math, a.k.a. building the machine learning model, that’s why we’ve gotten to build a big brand in the space.”

Lambda Focuses on AI Developers

Lambda, named after Alonzo Church’s Lambda Calculus, historically focused on machine learning engineers — and predominantly on the training and fine tuning side of the AI market. That meant developers were either building something from scratch or taking an open source model and fine-tuning it to their own needs.

Now, it’s marketing itself as a hyperscaler for all AI developers, from the hardcore deep learning researcher to the web developer plugging a machine learning API into an application’s frontend.

“Really, within the last two years, we’ve seen inference explode on our platform because the greatest thing about an NVIDIA chip is it has that dual purpose of being good for training and inference,” Brooks said. “We’ve seen an explosion in terms of our user growth related to the software engineer [who’s] using AI within their code.”

In AI, inference is the process where a trained machine learning model uses its knowledge to make predictions or decisions on new, unseen data. It’s what essentially creates the output of AI.

Why a Cloud for AI Developers?

There’s a remarkably simple reason why Lambda decided to focus on a hyper cloud for AI developers: They were machine learning engineers who wanted to build an infrastructure to support their own apps.

The platform sits on 25,000 GPUs, making it “one of the few companies in the world that have the ability to use our infrastructure to actually serve these applications correctly,” Brooks said.

“Few companies have as many NVIDIA GPUs as we do. That’s why we’re on this particular path,” he added. “The reason we deploy infrastructure is [that] we ultimately want to be able to use it, because we’re actually machine learning engineers.”

“Because we have all that infrastructure, we allow folks to grow with us and not be limited artificially — no pun intended on that word.”
— Robert Brooks IV, Lambda founding team member and the vice president of revenue

Their founder, Stephen Balaban, is a machine learning engineer who was at a computer vision contest when AI started to be able to identify a dog from a cat. He built a computer vision app that ran locally on the iPhone called DreamScope, which used a crude, bespoke neural network to do what generative AI does today, Brooks said.

“Ultimately, by building these applications, we’ve been relying on more and more NVIDIA GPUs to build and then productionize AI,” he said. “So we had this problem in 2015, 2016, and around 2017 we fully pivoted into building this infrastructure for other AI teams, as we were very familiar with that problem.”

To the best of their knowledge, Lambda posts one of the lowest cost inference APIs in the world, Brooks claimed. The company made that decision on purpose.

“We’re vertically integrated so there’s a lot of people that we compete with that are just built on top of other people’s GPU clouds, but because we actually own the infrastructure, we can control the cost better,” he said. “We own the data centers. We own the GPUs. We own everything. We write all the software. We actually market ourselves as the cheapest inference API out there. It is not rate limited.”

Companies often allow developers to use up to X amount of million of tokens per hour, day or month, but then the developer has to talk to a salesperson, he said.

“Because we have all that infrastructure, we allow folks to grow with us and not be limited artificially — no pun intended on that word,” Brooks added.

Many worry about the shelf life of GPUs because Nvidia is starting to launch new ones every two years, while historically that cadence has been every three to four years, Brooks said. Lambda doesn’t worry about that because it deploys GPUs for customers today, and once the contract ends, Lambda uses the GPUs for their own AI training and services.

“There’s this really nice feedback loop, because we’re machine learning engineers running a cloud where it’s allowing us to grow faster and produce more AI research and more machine learning software as time goes on,” he said.

Tools for AI Developers

One way the startup helps machine learning engineers and AI developers is that it has a plug-and-play approach to on-boarding.

“When a machine learning engineer or AI developer goes on to our cloud and spins up a GPU, all of their tools, frameworks, libraries that they need to rely on — essentially, their app store — is already pre-built and in there for them, so they can actually focus on their work,” Brooks said.

Lambda launched its Inference API, which became generally available in December. It allows developers to use AI language models in their applications without the overhead of managing the complex infrastructure.

“We now have dedicated services towards helping customers actually serve models within their apps. It’s been a nice, gradual uptick towards much more production-focused AI, or what we call inference,” Brooks said.

It also introduced Lambda Chat, which provides access to various large language models, many of which are open source. Lambda Chat is their first foray into serving consumers with open source models, Brooks said. It’s free for consumers, although Lambda does charge developers for the API calls.

Last month, the company raised $480M in series D funding to expand its AI cloud platform. The plan is to use that money to build more software tools for AI developers and deploy more GPUs to meet customer demand.

The company also said it will continue to develop Lambda Chat, which hosts DeepSeek-R1 and other open source models.

DeepSeek Enters the Chat

TNS asked Brooks how DeepSeek R1 has changed things in AI for hyperscalers. DeepSeek R1 is a large 671 billion parameter Mixture of Experts (MoE) model, meaning its designed to handle complex tasks by strategically activating different “expert” sub-networks within the larger model. It made headlines for being able to do AI with fewer GPUs.

The introduction of DeepSeek to the market led to a boost in demand, Brooks added.

“We’ve experienced sort of extreme demand from that, related to not only machine learning and engineers spinning up instances on our cloud, but also folks using our inference API to plug into their application,” he said. “So it was a democratizing effect in terms of starting to spread more of that functionality across more apps.”

But ultimately, Deep Seek itself doesn’t have enough GPUs to run the production, he said. That’s why there’s a limit to how many questions you can ask.

He said Lambda serves many larger enterprises with these private cloud buildouts of 4,000 to 8,000 GPU clusters. Enterprises are interested in running DeepSeek but with more GPUs.

Competitors

Other companies that are offering GPUs as a service include:

Editor’s Note: Updated March 7, 2025 at 1:17 p.m. EST to reflect the name of the company is now Lambda.

TRENDING STORIES
Loraine Lawson is a veteran technology reporter who has covered technology issues from data integration to security for 25 years. Before joining The New Stack, she served as the editor of the banking technology site Bank Automation News. She has...
Read more from Loraine Lawson
SHARE THIS STORY
TRENDING STORIES
SHARE THIS STORY
TRENDING STORIES
TNS DAILY NEWSLETTER Receive a free roundup of the most recent TNS articles in your inbox each day.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.