In recent months, remarkable advances have been made in general-purpose browser agents powered by large language models (LLMs). Industry leaders such as OpenAI and Anthropic have released such agents to the public: Operator and Computer Use, respectively. These browser tools have demonstrated impressive capabilities, from booking restaurant reservations to answering diverse and complex questions.
General-purpose browser agents, despite their flexibility, fail to perform structured, repeatable business tasks. Precise analytics, automated workflows, and reliable data enrichment depend critically on consistently structured data. Without a predictable structure, extracted data rapidly becomes unreliable, severely limiting its practical value for downstream processes essential to business and technical applications.
Several key issues limit the effectiveness of general-purpose AI agents, like OpenAI’s Operator and Anthropic’s Computer Use, when applied to large-scale data extraction tasks:
At Bardeen, our approach to web agents is fundamentally different. Rather than competing head-to-head with large model providers by building increasingly bigger and more computationally expensive models, we chose to leverage the inherent structure of real-world web tasks. Through building millions of automations for team and enterprise customers, we’ve learned that many critical business tasks — like extracting job postings, monitoring company blog updates, or analyzing customer testimonials — rely heavily on structured data.
Moreover, we’ve observed that websites presenting this data consistently use repetitive HTML structures.
By recognizing and exploiting this underlying structure, both in the business problems themselves and in the websites presenting the data, we realized we could build a significantly more efficient, accurate, and scalable AI agent without resorting to brute computational force.
Our solution, BardeenAgent, implements this structured approach through a two-step execution process:
First, BardeenAgent navigates to the desired data on a webpage and uses an LLM to identify and record how to extract a single data item. This step generates robust CSS selectors and a structured extraction script, essentially creating a reusable “recipe” for capturing similar data.
Instead of repeatedly invoking an expensive AI inference for every subsequent data item, BardeenAgent replays this recorded extraction script across multiple pages or data points. This approach drastically reduces computational overhead and improves reliability.
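To make the record-then-replay idea concrete, here is a minimal sketch in Python. It is not BardeenAgent's implementation. The `Recipe` class, the `record_recipe` and `replay` functions, and the sample pages are all hypothetical names and data invented for illustration. The LLM step is stubbed with a hard-coded recipe, and the standard library's ElementTree path syntax stands in for CSS selectors so the example runs without third-party packages.

```python
from dataclasses import dataclass
import xml.etree.ElementTree as ET


@dataclass(frozen=True)
class Recipe:
    """A recorded extraction script: one row locator plus one locator per field."""
    row_path: str                 # locates each repeated item on a page
    field_paths: dict[str, str]   # field name -> locator relative to the row


def record_recipe() -> Recipe:
    # Step 1 (stubbed): in the approach described above, an LLM inspects a
    # single item and produces the selectors. Here they are hard-coded
    # (assumption), so no model is called.
    return Recipe(
        row_path=".//li[@class='job']",
        field_paths={"title": "./h2", "location": "./span[@class='loc']"},
    )


def replay(recipe: Recipe, page: str) -> list[dict[str, str]]:
    # Step 2: apply the recorded locators to a page without any LLM inference.
    root = ET.fromstring(page)
    rows = []
    for item in root.findall(recipe.row_path):
        row = {}
        for name, path in recipe.field_paths.items():
            node = item.find(path)
            row[name] = (node.text or "").strip() if node is not None else ""
        rows.append(row)
    return rows


# Hypothetical, well-formed sample pages that share one repeated structure.
PAGE_1 = """<html><body><ul>
  <li class="job"><h2>Data Engineer</h2><span class="loc">Berlin</span></li>
  <li class="job"><h2>ML Researcher</h2><span class="loc">Remote</span></li>
</ul></body></html>"""

PAGE_2 = """<html><body><ul>
  <li class="job"><h2>Product Designer</h2><span class="loc">Lisbon</span></li>
</ul></body></html>"""

recipe = record_recipe()  # recorded once
all_rows = [row for page in (PAGE_1, PAGE_2) for row in replay(recipe, page)]
for row in all_rows:
    print(row)
```

The recipe is created once and reused for every page, so the cost of the expensive step does not grow with the number of rows. Real pages are rarely well-formed XML, so a production version would need a tolerant HTML parser and a real CSS selector engine.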
This structured approach is powerful because it ensures the following:
To evaluate BardeenAgent’s effectiveness at extracting structured data, we measured how well LLM browser agents performed on our new benchmark, WebLists, which is composed of use cases requested by real enterprise customers. These include scenarios like:
On the WebLists benchmark, the results were clear: BardeenAgent’s structured approach significantly outperforms existing state-of-the-art agents, including Wilbur (Bardeen’s previously published agent), Agent-E, and Perplexity. BardeenAgent achieved 66.2% recall, more than double that of the best competing method. The structured method also translated directly into cost efficiency: BardeenAgent achieves roughly 3x lower cost per extracted row than competing solutions.
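For readers who want to compute comparable numbers for their own agent, the sketch below shows one common way to calculate row-level recall and cost per extracted row. The exact matching rules used by the WebLists benchmark are not described here, so the exact-match comparison, the function names, and the toy rows are assumptions for illustration only and are not benchmark data.

```python
def recall(extracted: list[tuple], expected: list[tuple]) -> float:
    """Fraction of expected rows that appear in the extracted rows (exact match)."""
    expected_set = set(expected)
    if not expected_set:
        return 0.0
    return len(expected_set & set(extracted)) / len(expected_set)


def cost_per_row(total_cost: float, rows_extracted: int) -> float:
    """Total spend divided by the number of rows extracted."""
    if rows_extracted <= 0:
        raise ValueError("rows_extracted must be positive")
    return total_cost / rows_extracted


# Hypothetical toy data: four ground-truth rows, three extracted, two correct.
expected_rows = [
    ("Data Engineer", "Berlin"),
    ("ML Researcher", "Remote"),
    ("Product Designer", "Lisbon"),
    ("Support Lead", "Austin"),
]
extracted_rows = [
    ("Data Engineer", "Berlin"),
    ("ML Researcher", "Remote"),
    ("Sales Manager", "Paris"),   # not in the ground truth
]

print(recall(extracted_rows, expected_rows))   # 2 of 4 expected rows found
print(cost_per_row(0.50, 2))                   # hypothetical spend of 0.50 over 2 rows
```

Exact-match comparison is strict: a row with one wrong field counts as a miss. A field-level or fuzzy comparison would give different numbers.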
General AI is impressive, but when the structure and accuracy of the results matter for your business, you need a different approach. BardeenAgent addresses this gap by leveraging the inherent structure of web data, enabling businesses and researchers alike to extract valuable insights efficiently and reliably at scale.
Read our full technical paper and blog post here if you want to move beyond general AI limitations and tap into structured web intelligence.
If you’re interested in testing how your own agent compares or exploring this approach further, reach out to us at ml@bardeen.ai. We’re excited to collaborate with and support the broader AI community.
The owner of TNS, Insight Partners, also invests in Bardeen. As a result, Bardeen receives preference as a contributor.