VOOZH about

URL: https://thenewstack.io/strands-agents-tool-design/

⇱ Cut AI token usage by 96%? Here's how AWS Strands Agents does it. - The New Stack


TNS
SUBSCRIBE
Join our community of software engineering leaders and aspirational developers. Always stay in-the-know by getting the most important news and exclusive content delivered fresh to your inbox to learn more about at-scale software development.
REQUIRED
It seems that you've previously unsubscribed from our newsletter in the past. Click the button below to open the re-subscribe form in a new tab. When you're done, simply close that tab and continue with this form to complete your subscription.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.
Welcome and thank you for joining The New Stack community!
Please answer a few simple questions to help us deliver the news and resources you are interested in.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Great to meet you!
Tell us a bit about your job so we can cover the topics you find most relevant.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Welcome!

We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.

What’s next?

Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.

Follow TNS on your favorite social media networks.

Become a TNS follower on LinkedIn.

Check out the latest featured and trending stories while you wait for your first TNS newsletter.

PREV
1 of 2
NEXT
VOXPOP
As a JavaScript developer, what non-React tools do you use most often?
Angular
0%
Astro
0%
Svelte
0%
Vue.js
0%
Other
0%
I only use React
0%
I don't use JavaScript
0%
Thanks for your opinion! Subscribe below to get the final results, published exclusively in our TNS Update newsletter:
NEW! Try Stackie AI
From clobbered drafts to real-time sync
Apr 14th 2026 10:00am, by David Moore
TypeScript 6.0 RC arrives as a bridge to a faster future
Mar 14th 2026 9:00am, by Darryl K. Taft
Mastra empowers web devs to build AI agents in TypeScript
Jan 28th 2026 11:00am, by Loraine Lawson
2026-04-29 14:52:32
Cut AI token usage by 96%? Here's how AWS Strands Agents does it.
podcast,sponsor-amazon-web-services-aws,sponsored-post,video,
AI Agents / Model Context Protocol / Model Context Protocol (MCP) / Operations

Cut AI token usage by 96%? Here’s how AWS Strands Agents does it.

AWS developer advocate Morgan Willis on Strands Agents, intent-based tools, MCP gateways, and how smarter tool design cut agent token usage from 52K to 2K.
Apr 29th, 2026 2:52pm by Frederic Lardinois
👁 Featued image for: Cut AI token usage by 96%? Here’s how AWS Strands Agents does it.
AWS sponsored this post.

For this episode of The New Stack Makers, I sat down with AWS developer advocate Morgan Willis to talk about Strands Agents, the company’s open source agentic framework, which has seen over 14 million downloads since it launched just under a year ago. Willis brought a hands-on demo built around a simple accounting API to show what building with Strands looks like in practice.

The demo walks through three iterations of the same task: looking up the latest invoice for a customer. First, Willis mapped each API endpoint directly to an agent tool, the way most developers would by default. The agent needed five chained API calls and burned roughly 52,000 tokens. Then she swapped in intent-based tools that are built around an outcome rather than a data operation. With the same query, getting an answer now took one tool call and only 2,000 tokens.

“It’s calling multiple API’s, but rolling them up into one intent-based tool for the agent that it’s going to have a better time using — and understanding when exactly to use it. […]

“The fewer tools that you expose to your agent, the less likely it is to call the wrong one.”

“Your agent is going to have a better time reasoning around what tool to use and when, because these tools are more aligned to a task and less aligned to data,” Willis tells The New Stack. “The fewer tools that you expose to your agent, the less likely it is to call the wrong one.”

The third iteration moved those tools to a remote MCP server via AWS Agent Core Gateway and enabled semantic search across the tool catalog, so the agent received only the tools relevant to each query, rather than the full set of 16. That cut token usage roughly in half again compared to loading everything.

Willis says the broader principle at work here is that narrowly scoped agents tend to outperform general-purpose ones. 

“I think agents that are more narrowly defined tend to perform better than general use case agents. If you’re looking for context efficiency, speed, and accuracy, I would also look at your agent design as well.” 

Having many agents, each doing a small number of things, lets you design tools precisely for each use case rather than building a more general agent that tries to do everything. As MCP servers proliferate and tool catalogs grow, the question of which tools an agent actually sees on a given run is going to matter as much as the tools themselves.

Since its inception, Amazon Web Services (AWS) has been the best place for customers to build and run open source software in the cloud. AWS is proud to support open source projects, foundations, and partners.
Learn More
The latest from AWS
Hear more from our sponsor
TRENDING STORIES
Before joining The New Stack as its senior editor for AI, Frederic was the enterprise editor at TechCrunch, where he covered everything from the rise of the cloud and the earliest days of Kubernetes to the advent of quantum computing....
Read more from Frederic Lardinois
AWS sponsored this post.
SHARE THIS STORY
TRENDING STORIES
SHARE THIS STORY
TRENDING STORIES
TNS DAILY NEWSLETTER Receive a free roundup of the most recent TNS articles in your inbox each day.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.