VOOZH about

URL: https://thenewstack.io/5-steps-to-deploy-efficient-cloud-native-foundation-ai-models/

⇱ 5 Steps to Deploy Efficient Cloud Native Foundation AI Models - The New Stack


TNS
SUBSCRIBE
Join our community of software engineering leaders and aspirational developers. Always stay in-the-know by getting the most important news and exclusive content delivered fresh to your inbox to learn more about at-scale software development.
REQUIRED
It seems that you've previously unsubscribed from our newsletter in the past. Click the button below to open the re-subscribe form in a new tab. When you're done, simply close that tab and continue with this form to complete your subscription.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.
Welcome and thank you for joining The New Stack community!
Please answer a few simple questions to help us deliver the news and resources you are interested in.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Great to meet you!
Tell us a bit about your job so we can cover the topics you find most relevant.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Welcome!

We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.

What’s next?

Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.

Follow TNS on your favorite social media networks.

Become a TNS follower on LinkedIn.

Check out the latest featured and trending stories while you wait for your first TNS newsletter.

PREV
1 of 2
NEXT
VOXPOP
As a JavaScript developer, what non-React tools do you use most often?
Angular
0%
Astro
0%
Svelte
0%
Vue.js
0%
Other
0%
I only use React
0%
I don't use JavaScript
0%
Thanks for your opinion! Subscribe below to get the final results, published exclusively in our TNS Update newsletter:
NEW! Try Stackie AI
From clobbered drafts to real-time sync
Apr 14th 2026 10:00am, by David Moore
TypeScript 6.0 RC arrives as a bridge to a faster future
Mar 14th 2026 9:00am, by Darryl K. Taft
Mastra empowers web devs to build AI agents in TypeScript
Jan 28th 2026 11:00am, by Loraine Lawson
2023-06-29 11:05:20
5 Steps to Deploy Efficient Cloud Native Foundation AI Models
podcast,video,
AI

5 Steps to Deploy Efficient Cloud Native Foundation AI Models

We discuss how to tackle resource allocation and gain efficiencies with Kubernetes in deployment with Huamin Chen of Red Hat.
Jun 29th, 2023 11:05am by Alex Williams
👁 Featued image for: 5 Steps to Deploy Efficient Cloud Native Foundation AI Models

The five steps to deploy cloud native sustainable foundation AI models starts with the obvious two: containers to manage the workloads and Kubernetes to deploy across a distributed infrastructure.

They may use PyTorch for programming and Jupyter Notebooks for debugging and evaluation, said Huamin Chen, who works in R&D at Red Hat’s Office of the CTO, in an interview recorded at the Open Source Summit North America in Vancouver earlier this Spring. Due to their facile deployment, Chen said Docker community files work well to containerize workloads for deployment.

With Kubernetes in deployment, the challenge is resource allocation and gaining efficiencies.

That may sound easy, but Chen asks, “How do you take care of the little difference when apportioning resources?”

The third step: measurement.

Chen cited Prometheus, the open source tool for event monitoring and alerting. Applications and infrastructure create metrics. With Prometheus, developers can correlate the workloads in foundation models and the runtime environments in a system, allowing for correlations and analysis.

Analytics represents the fourth step.

Chen said developers might use their analytics, but it’s helpful to have guidelines or some heuristics to build upon. To achieve bigger impacts, Chen said they place queries into Prometheus to get the basic metrics in place. They use that information to establish benchmarks, for example, to determine, for example, energy usage from foundation models, then correlate with performance metrics.

Energy usage relates to the performance of the model. It’s assumed that better performance comes with more energy expended.

“Your intuition may not always work,” Chen said. “And our discovery is that you can get the same performance without using more energy.”

The action taken from the analytics represents the fifth step. It culminates with applying the efficiencies and performance attained through the five stages.

“Number five is what we believe is most important for the community, for society, and for the environment,” he said. “Once you are able to optimize the energy profiles for our foundation models, then the more energy we can save, and the better environment we are going to have in the future.”

TRENDING STORIES
Alex Williams is founder and publisher of The New Stack. He's a longtime technology journalist who did stints at TechCrunch, SiliconAngle and what is now known as ReadWrite. Alex has been a journalist since the late 1980s, starting at the...
Read more from Alex Williams
SHARE THIS STORY
TRENDING STORIES
TNS owner Insight Partners is an investor in: Docker.
SHARE THIS STORY
TRENDING STORIES
TNS DAILY NEWSLETTER Receive a free roundup of the most recent TNS articles in your inbox each day.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.