VOOZH about

URL: https://thenewstack.io/david-vs-goliath-use-tactics-over-size-when-building-ai-agents/

⇱ David vs. Goliath: Use Tactics Over Size When Building AI Agents - The New Stack


TNS
SUBSCRIBE
Join our community of software engineering leaders and aspirational developers. Always stay in-the-know by getting the most important news and exclusive content delivered fresh to your inbox to learn more about at-scale software development.
REQUIRED
It seems that you've previously unsubscribed from our newsletter in the past. Click the button below to open the re-subscribe form in a new tab. When you're done, simply close that tab and continue with this form to complete your subscription.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.
Welcome and thank you for joining The New Stack community!
Please answer a few simple questions to help us deliver the news and resources you are interested in.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Great to meet you!
Tell us a bit about your job so we can cover the topics you find most relevant.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Welcome!

We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.

What’s next?

Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.

Follow TNS on your favorite social media networks.

Become a TNS follower on LinkedIn.

Check out the latest featured and trending stories while you wait for your first TNS newsletter.

PREV
1 of 2
NEXT
VOXPOP
As a JavaScript developer, what non-React tools do you use most often?
Angular
0%
Astro
0%
Svelte
0%
Vue.js
0%
Other
0%
I only use React
0%
I don't use JavaScript
0%
Thanks for your opinion! Subscribe below to get the final results, published exclusively in our TNS Update newsletter:
NEW! Try Stackie AI
From clobbered drafts to real-time sync
Apr 14th 2026 10:00am, by David Moore
TypeScript 6.0 RC arrives as a bridge to a faster future
Mar 14th 2026 9:00am, by Darryl K. Taft
Mastra empowers web devs to build AI agents in TypeScript
Jan 28th 2026 11:00am, by Loraine Lawson
2025-05-09 06:06:00
David vs. Goliath: Use Tactics Over Size When Building AI Agents
contributed,contributed-bardeen,
AI Agents / Data / Large Language Models

David vs. Goliath: Use Tactics Over Size When Building AI Agents

General AI is impressive, but when the structure and accuracy of the results matter for your business, you need a different approach.
May 9th, 2025 6:06am by Pascal Weinberger
👁 Featued image for: David vs. Goliath: Use Tactics Over Size When Building AI Agents
Photo by Artem Sapegin on Unsplash.

In recent months, remarkable advances have been made in general-purpose browser agents powered by large language models (LLMs). Industry leaders, such as OpenAI and Anthropic, have released these agents, Operator and Computer Use, respectively, for the public to use. These browser tools have demonstrated impressive capabilities, from booking restaurant reservations to answering diverse and complex questions.

General-purpose browser agents, despite their flexibility, fail to perform structured, repeatable business tasks. Precise analytics, automated workflows, and reliable data enrichment depend critically on consistently structured data. Without a predictable structure, extracted data rapidly becomes unreliable, severely limiting its practical value for downstream processes essential to business and technical applications.

Why General Agents Struggle With Scale

Several key issues are limiting the effectiveness of general-purpose AI agents, like OpenAI’s Operator and Claude’s Computer Use, when applied to large-scale data extraction tasks:

  • Lack of Structure: Agents typically produce outputs that aren’t consistently structured. Since most have been built with question-answering as their primary focus, their outputs tend to be paragraph-formatted or loosely organized.
  • Limited Website Coverage: Existing benchmarks to evaluate these agents typically include only a narrow set of websites, often simulated or tightly controlled environments. As a result, general-purpose agents struggle to adapt to the complexity, variability, and messiness of real-world websites.
  • Model Laziness: Open AI Operator or Claude Computer Use frequently stops execution prematurely after extracting only partial data (such as a single page of listings). For example, for specific tasks with several pages of data, these models would save partial information on the first page and terminate execution.

Key Insight: All Problems Have Structure

At Bardeen, our approach to web agents is fundamentally different. Rather than competing head-to-head with large model providers by building increasingly bigger and more computationally expensive models, we chose to leverage the inherent structure of real-world web tasks. Through building millions of automations for team and enterprise customers, we’ve learned that many critical business tasks — like extracting job postings, monitoring company blog updates, or analyzing customer testimonials — rely heavily on structured data.

Moreover, we’ve observed that websites presenting this data consistently use repetitive HTML structures.

By recognizing and exploiting this underlying structure, both in the business problems themselves and the websites presenting the data, we realized we could build a significantly more efficient, accurate, and scalable AI Agent without resorting to brute computational force.

BardeenAgent: A New Approach To Building Browser Agents

Our solution, BardeenAgent, implements this structured approach through a two-step execution process:

  1. Capture the Extraction Structure (Once)

First, BardeenAgent navigates to the desired data on a webpage and uses an LLM to identify and record how to extract a single data item. This step generates robust CSS selectors and a structured extraction script, essentially creating a reusable “recipe” for capturing similar data.

  1. Replay with Precision (Many Times)

Instead of repeatedly invoking an expensive AI inference for every subsequent data item, BardeenAgent replays this recorded extraction script across multiple pages or data points. This approach drastically reduces computational overhead and improves reliability.

Why It Works: Efficiency Through Reusability

This structured approach is powerful because it ensures the following:

  • Consistency: Structured scripts reliably extract data, ensuring consistent output formats.
  • Scalability: Once the structure is captured, extraction scales quickly and efficiently to hundreds or thousands of data points.
  • Cost and Time Efficiency: Fewer AI calls mean dramatically lower costs and faster data extraction.

Real-World Applications and Results: Introducing WebLists

To evaluate BardeenAgent’s effectiveness in extracting structured data, we evaluated how well LLM browser agents performed on our new benchmark, WebLists, composed of use cases requested by real enterprise customers. This includes scenarios like:

  • Tracking job postings to identify competitors’ growth or hiring trends
  • Monitoring company blogs and product updates to prepare informed sales outreach.
  • Extracting customer testimonials for robust competitive analysis.

When evaluating the BardeenAgent against other methods on the WebLists benchmark, the results were clear: Our evaluation demonstrated that BardeenAgent’s structured approach significantly outperforms existing state-of-the-art agents, including Wilbur (Bardeen’s previously published agent), Agent-E, and Perplexity. BardeenAgent achieved a 66.2% recall, more than doubling the best method’s performance. On top of that, the structured method also translated directly into cost efficiency: BardeenAgent achieves roughly 3x lower cost per extracted row compared to competing solutions.

A Different Approach

General AI is impressive, but when the structure and accuracy of the results matter for your business, you need a different approach. BardeenAgent addresses this gap by leveraging the inherent structure of web data, enabling businesses and researchers alike to extract valuable insights efficiently and reliably at scale.

Read our full technical paper and blog post here if you want to move beyond general AI limitations and tap into structured web intelligence.

If you’re interested in testing how your own agent compares or exploring this approach further, reach out to us at ml@bardeen.ai. We’re excited to collaborate with and support the broader AI community.


The owner of TNS, Insight Partners, also invests in Bardeen. As a result, Bardeen receives preference as a contributor.

TRENDING STORIES
Pascal is the Co-founder and CEO of Bardeen.
Read more from Pascal Weinberger
SHARE THIS STORY
TRENDING STORIES
TNS owner Insight Partners is an investor in: Real, bardeen, Anthropic, OpenAI.
SHARE THIS STORY
TRENDING STORIES
TNS DAILY NEWSLETTER Receive a free roundup of the most recent TNS articles in your inbox each day.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.