#

We evaluated search providers against five open benchmarks covering complementary aspects of agentic search: BrowseComp (hard multi-hop questions that require navigating the live web), Frames (multi-document factoid reasoning), FreshQA (time-sensitive questions where the correct answer depends on recent web information), HLE (Humanity's Last Exam — expert-level academic questions spanning math, science, and humanities), SealQA (ambiguity-robust factoid QA with intentionally misleading snippets), WebWalker (tasks designed around following links across pages to find an answer).

**Evaluation methodology**

Every task is run through a shared deep-research harness: a single GPT-5.4 agent is given two tools (web search and web fetch) with an iterative budget of up to `MAX_TOOL_CALLS=25` tool calls per question. The agent plans sub-queries, fans out searches, fetches specific pages when snippets are insufficient, and returns an answer when it exhausts the number of allowed tool calls or has sufficient information to answer the question. Each answer is then LLM-graded by GPT-5.4. We report accuracy of the final answer.

We measure accuracy and overall cost, which includes LLM token costs and tool call costs.

**Testing dates**

April 19-21, 2026

## Highest accuracy at every price point

State of the art across the most challenging benchmarks

View All Benchmarks

[View All Benchmarks](https://parallel.ai/benchmarks)

### Humanity's Last Exam

| Series | Model | Cost (CPM) | Accuracy (%) |
| -------- | ----------------- | ---------- | ------------ |
| Parallel | Parallel Basic | 451 | 58 |
| Parallel | Parallel Advanced | 315 | 56 |
| Others | Exa | 522 | 57 |
| Others | Tavily | 538 | 54 |

CPM: USD per 1000 requests. Cost is shown on a Log scale.

**Dataset**

**Evaluation methodology**

We measure accuracy and overall cost, which includes LLM token costs and tool call costs.

**Testing dates**

April 19-21, 2026

### FRAMES

| Series | Model | Cost (CPM) | Accuracy (%) |
| -------- | ----------------- | ---------- | ------------ |
| Parallel | Parallel Advanced | 93 | 87 |
| Parallel | Parallel Basic | 165 | 84 |
| Others | Exa | 169 | 87 |
| Others | Tavily | 189 | 83 |

CPM: USD per 1000 requests. Cost is shown on a Log scale.

**Dataset**

**Evaluation methodology**

We measure accuracy and overall cost, which includes LLM token costs and tool call costs.

**Testing dates**

April 19-21, 2026

### WebWalker

| Series | Model | Cost (CPM) | Accuracy (%) |
| -------- | ----------------- | ---------- | ------------ |
| Others | Exa | 210 | 74 |
| Others | Tavily | 202 | 71 |
| Parallel | Parallel Advanced | 101 | 73 |
| Parallel | Parallel Basic | 155 | 71 |

CPM: USD per 1000 requests. Cost is shown on a Log scale.

**Dataset**

**Evaluation methodology**

We measure accuracy and overall cost, which includes LLM token costs and tool call costs.

**Testing dates**

April 19-21, 2026

### FreshQA

| Series | Model | Cost (CPM) | Accuracy (%) |
| -------- | ----------------- | ---------- | ------------ |
| Parallel | Parallel Advanced | 49 | 79 |
| Parallel | Parallel Basic | 90 | 77 |
| Others | Exa | 84 | 78 |
| Others | Tavily | 89 | 78 |

CPM: USD per 1000 requests. Cost is shown on a Log scale.

**Dataset**

**Evaluation methodology**

We measure accuracy and overall cost, which includes LLM token costs and tool call costs.

**Testing dates**

April 19-21, 2026

### Coding

| Series | Model | Cost (CPM) | Accuracy (%) |
| -------- | ----------------- | ---------- | ------------ |
| Parallel | Parallel Advanced | 154 | 82 |
| Parallel | Parallel Basic | 269 | 81 |
| Others | Exa | 331 | 80 |
| Others | Tavily | 352 | 75 |

CPM: USD per 1000 requests. Cost is shown on a Log scale.

**Dataset**

A proprietary coding dataset derived from production queries to Parallel’s search API.

**Evaluation methodology**

We measure accuracy and overall cost, which includes LLM token costs and tool call costs.

**Testing dates**

April 19-21, 2026

Search, built for AIs

The most accurate search tool, to bring web context to your AI agents

Give AIs Search[Give AIs Search](https://platform.parallel.ai/play/search)

SthenP

Learn More[Learn More](https://parallel.ai/products/search)

The most accurate deep and wide research

Run deeper and more accurate research at scale, for the same compute budget

Run a query [Run a query ](https://platform.parallel.ai/play/deep-research)

DthenP

Starting research...

Build a dataset from the web

Define your search criteria in natural language, and get back a structured table of matches

Create a dataset[Create a dataset](https://platform.parallel.ai/play/find-all)

FthenP

Entities

Custom web enrichment

Bring existing data, define output columns to research, and get fresh web enrichments back

Enrich your data[Enrich your data](https://platform.parallel.ai/play)

TthenP

Entities

Product releases

SOC 2 status

Monitor any event on the web

Continuously monitor for any changes on the web

Monitor the web[Monitor the web](https://platform.parallel.ai/play/monitor)

MthenP

New breakthroughs in AI research

Start building

## Towards a programmatic web for AIs

Parallel is building new interfaces, infrastructure, and business models for AIs to work with the web

Try it for free

[Try it for free](https://platform.parallel.ai)Join us

[Join us ](https://parallel.ai/careers)

## Agent onboarding prompt

Use curl to read [parallel.ai/agents.md](https://parallel.ai/agents.md) and perform the setup to install Parallel

## Agent onboarding prompt

Use curl to read parallel.ai/agents.md and perform the setup to install Parallel

👁 API interface screenshot showing search query input and structured data output

Deep research API interface for ChatGPT and AI agents. Enterprise-grade deep research with up to 48% accuracy vs GPT-4's 1%. Built for ChatGPT deep research assistants and complex multi-hop AI workflows.

Latest updates

June 9

[OpenClaw now has free, LLM-optimized web search by default powered by Parallel](https://parallel.ai/blog/free-web-search-openclaw)

Tags:Partnership

May 19

[Introducing Index by Parallel](https://parallel.ai/blog/introducing-index-by-parallel)

Today we're launching Index, a platform that helps content owners understand how AI agents use their work and earn compensation tied to the value they contribute.

Tags:Product Release

May 6

[Parallel Monitor API: New processor tiers, snapshots and event streams, and Basis on every event](https://parallel.ai/blog/monitor-api)

The Monitor API is now generally available, with significant upgrades, bringing us closer to a web that continuously pushes changes to power proactive agents.

Tags:Product Release

April 29

[Parallel Raises at $2 Billion Valuation to Scale Web Infrastructure for Agents](https://parallel.ai/blog/series-b)

Sequoia Capital led the round. Andrew Reed, Partner at Sequoia, joins our board. Existing investors Kleiner Perkins, Index Ventures, Khosla Ventures, First Round Capital, Spark Capital, and Terrain Capital all increased their participation.

Tags:Fundraise

URL: https://parallel.ai/

⇱ Parallel Web Systems | Infrastructure for intelligence on the web

#

## A web API purpose-built for AIs

Highest accuracy at every price point

## Highest accuracy at every price point

### Humanity's Last Exam

### FRAMES

### WebWalker

### FreshQA

### Coding

Search, built for AIs

The most accurate deep and wide research

Build a dataset from the web

Custom web enrichment

Monitor any event on the web

## Towards a programmatic web for AIs

## Agent onboarding prompt