VOOZH about

URL: https://thenewstack.io/why-infrastructure-must-be-serverless-in-the-ai-age/

⇱ Why Infrastructure Must Be Serverless in the AI Age - The New Stack


TNS
SUBSCRIBE
Join our community of software engineering leaders and aspirational developers. Always stay in-the-know by getting the most important news and exclusive content delivered fresh to your inbox to learn more about at-scale software development.
REQUIRED
It seems that you've previously unsubscribed from our newsletter in the past. Click the button below to open the re-subscribe form in a new tab. When you're done, simply close that tab and continue with this form to complete your subscription.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.
Welcome and thank you for joining The New Stack community!
Please answer a few simple questions to help us deliver the news and resources you are interested in.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Great to meet you!
Tell us a bit about your job so we can cover the topics you find most relevant.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Welcome!

We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.

What’s next?

Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.

Follow TNS on your favorite social media networks.

Become a TNS follower on LinkedIn.

Check out the latest featured and trending stories while you wait for your first TNS newsletter.

PREV
1 of 2
NEXT
VOXPOP
As a JavaScript developer, what non-React tools do you use most often?
Angular
0%
Astro
0%
Svelte
0%
Vue.js
0%
Other
0%
I only use React
0%
I don't use JavaScript
0%
Thanks for your opinion! Subscribe below to get the final results, published exclusively in our TNS Update newsletter:
NEW! Try Stackie AI
From clobbered drafts to real-time sync
Apr 14th 2026 10:00am, by David Moore
TypeScript 6.0 RC arrives as a bridge to a faster future
Mar 14th 2026 9:00am, by Darryl K. Taft
Mastra empowers web devs to build AI agents in TypeScript
Jan 28th 2026 11:00am, by Loraine Lawson
2024-11-20 11:00:00
Why Infrastructure Must Be Serverless in the AI Age
contributed,
AI / AI Engineering / Infrastructure as Code / Serverless

Why Infrastructure Must Be Serverless in the AI Age

Developers shift from creating single apps to orchestrating fleets of AI-built applications.
Nov 20th, 2024 11:00am by Nikita Shamgunov
👁 Featued image for: Why Infrastructure Must Be Serverless in the AI Age
Photo by Arian Darvishi on Unsplash.

Replit’s new AI agent will write your code, then configure, provision, build, and deploy that code in seconds. You can go from natural language description to implemented, working application before VS Code has loaded all your extensions.

If an AI agent can build and deploy an app in seconds, minutes are too long to spin up the resources. If an AI agent can spin up and spin down thousands of these apps each hour, $10 per database becomes an extreme overhead. Suddenly, we’ve gone from a team of developers deploying a single app to a single developer deploying a team of apps, all working under the direction of AI agents.

This approach changes the perspective on development and demands a new view of infrastructure. Traditional infrastructure is too slow, too permanent, and too complex for agentic workflows. The future of infrastructure demands instantaneous, serverless, and simplified tooling — this is what must be built.

The Core Needs of AI Agent Infrastructure

AI agents operate at a scale and speed that makes traditional infrastructure management technically and economically impractical. A good “Agentic Experience” will emphasize three core features.

  1. Simplicity

Any code and integration will need to be simple. This world will be built on straightforward API calls rather than IAM policies and multistep provisioning. Consider starting a new RDS instance: a VPC, security groups, rules, subnet groups, and IAM roles. Each step requires multiple API calls, deliberation of options, and troubleshooting.

A human DevOps engineer understands these dependencies and can debug issues when they arise. An AI agent needs everything to work perfectly the first time and every time. This complexity isn’t just a barrier to entry; it’s a barrier to automation. This is what starting a database on Neon requires:

👁 Image

Code example from @neondatabase/toolkit SDK

Three lines of code to provision a database, one API call, and immediate availability. This isn’t just better DevEx — it’s the only way to make infrastructure accessible to AI agents. This simplicity also helps with two factors:

  • Cost: More steps = more cost. Every API call an agent makes consumes tokens, and complex infrastructure operations can require dozens of calls. Simple APIs are more than just more straightforward to use; they’re fundamentally more economical at scale.
  • Security: Though the AWS RDS setup is highly secure, you can’t give away your root keys to a machine. Modern infrastructure needs to be sandboxed and self-contained, with clear boundaries that let agents experiment freely without risking production systems.
  1. Immediacy

The above code can spin up a new database in under a second, ready for use by an agent.

Traditional infrastructure timelines simply need to be revised in an agent-driven world. Code creation speed used to be the rate-limiting factor in development, but when an AWS RDS instance takes 10 minutes to provision, that infrastructure becomes that rate-limiting factor.

Moving to an immediacy model opens up what is possible just as much as AI does. The core principle of agent-driven development is disposability. An agent might create an application, test it, and discard it within minutes. Infrastructure needs to match this lifecycle — spinning up instantly when needed and disappearing just as quickly when it’s not. An agent should be able to build and destroy a disposable app in the time it takes a traditional database to spin up.

  1. Ephemerality

This also leads to the critical component of disposable apps — ephemerality. A traditional approach presumes an app is permanent, so it needs permanent infrastructure.

Not so. Look at how developers already use Vercel’s v0, a dev tool for generating apps from text prompts. Instead of searching for a “mortgage calculator” online, you can instead ask v0 to create one for you:

👁 Image

This works. It needs no infra, but that’s a likely next step. Most importantly, a user can build a massive amount in a single session with AI. The vast majority will be discarded and never used again, but some might be shared and turned into long-lived applications. This is the future of tooling — you use AI and AI agents to build precisely what you want, with a trail of abandoned alternates in its wake.

This demands an infrastructure that can scale to zero. When a resource isn’t being used, its cost should drop to zero. With this capability, the economics of disposable apps will work. Consider a company spinning up dozens of databases per hour in the process of building with AI. Who is going to delete the unused ones? Who is even going to decide which are unused?

When infrastructure can genuinely scale to zero, it enables new development patterns. Agents can freely experiment with different approaches, testing multiple solutions in parallel without worrying about cleanup or ongoing costs. This removes economic constraints from the development process — you no longer need to carefully consider the cost implications of each new database or service.

The result is a development environment where resources are truly disposable. Create what you need, use it for as long as you need it, and let it disappear when you’re done. This isn’t just more efficient — it’s the only way to make agent-driven development economically viable at scale.

Simpler, Faster, Cheaper — Pick Three

In six months, we’ve gone from a demo of Cognition AI’s Devin AI programmer to Replit’s AI developer/DevOps agents working in the wild. What will SOTA be like this time next year?

Nobody knows, but it’s clear what kind of infrastructure will get us there. That infrastructure must move from the human developer to the AI agent as a builder. But creating good AgentEx will also make us build good DevEx, as simpler, faster, and cheaper will also work for the humans in the loop. This virtuous cycle — where improvements for AI agents create better tools for humans, and vice versa — will accelerate the transformation of how developers build and deploy software.


This article is part of The New Stack’s contributor network. Have insights on the latest challenges and innovations affecting developers? We’d love to hear from you. Become a contributor and share your expertise by filling out this form or emailing Matt Burns at mattburns@thenewstack.io.

TRENDING STORIES
Neon co-founder and CEO Nikita Shamgunov is a database builder turned entrepreneur. Prior to Neon, Nikita co-founded SingleStore, a unicorn data and analytics company valued over $1.3 billion. He served as a founding CTO and then CEO, successfully scaling the...
Read more from Nikita Shamgunov
SHARE THIS STORY
TRENDING STORIES
SHARE THIS STORY
TRENDING STORIES
TNS DAILY NEWSLETTER Receive a free roundup of the most recent TNS articles in your inbox each day.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.