VOOZH about

URL: https://www.explainx.ai/blog/types-of-ai-agents-complete-guide-2026

⇱


← Back to blog
go deep
πŸ‘ Types of AI Agents: Complete Taxonomy and When to Use Each (2026)

Related posts

Jun 27, 2026

Multi-Agent Orchestration Patterns: A Production Guide (2026)

Single agents hit walls. Multi-agent systems break through them β€” but only if you architect them right. This guide covers every major orchestration pattern, state management, error handling, cost control, and which frameworks to use.

Jun 23, 2026

Eric Xing Critique of Agent Model: Agentic vs Agentive AI and the GIC Architecture

Submitted June 22, 2026, Critique of Agent Model argues that most LLM "coding agents" are agentic β€” competence in external scaffolding β€” not agentive, where goals, identity, and learning live inside the system. The paper proposes GIC: hierarchical goals, evolving identity, world-model simulation, self-regulation, and self-directed learning under human oversight.

Jun 16, 2026

What Are AI Agents? The Complete Explainer for 2026

AI agents are systems that perceive their environment, reason about what to do next, take action using tools, observe the results, and repeat β€” until a goal is achieved. This is the definitive explainer on how they work, why they matter, and what you need to know to build with them in 2026.

Every vendor in 2026 calls their product an "AI agent." That label now covers a chat sidebar with web search, a terminal coding assistant that runs for an hour unsupervised, a customer-support bot that opens tickets, and a fleet of twelve specialized subagents coordinating a research report. Those are not the same architecture β€” and picking the wrong type is the fastest way to waste tokens, miss deadlines, or ship something unsafe.

This guide gives you a practical taxonomy: the major types of AI agents, how they differ, real examples of each, and a decision matrix for choosing the right design.

If you need the foundational definition first, start with What Are AI Agents?. If you already know the basics and want the full four-layer stack (prompt β†’ context β†’ loop β†’ harness), see Context vs Prompt vs Loop vs Harness Engineering.


TL;DR β€” agent types at a glance

Type (axis)SubtypesBest forExample products
AutonomyReactive, deliberative, fully autonomousSpeed vs planning depthReactive: autocomplete; Deliberative: Claude Code; Autonomous: background schedulers
Loop architectureReAct, plan-and-execute, reflexion, hierarchicalTask length and error recoveryReAct: most coding agents; Plan-execute: LangGraph workflows
DomainCoding, research, support, browser, workflow, voiceMatching tools to taskCoding: Cursor; Research: Perplexity Deep Research; Browser: Computer Use
Agent countSingle, multi-agent (orchestrator/worker, pipeline, fan-out)Parallelism and specializationSingle: most CLI agents; Multi: Claude Code subagents, CrewAI teams
Tool accessRAG-only, API/MCP, computer use, code executionExternal system integrationMCP: Claude Desktop; Computer use: Anthropic Computer Use API
Human involvementCopilot, supervised, autonomousSafety and trustCopilot: inline suggestions; Supervised: approve-before-send

Axis 1: Autonomy level β€” reactive vs deliberative vs autonomous

Classic AI textbooks (Russell & Norvig) define agent types by how they choose actions. LLM agents map cleanly onto this framework β€” with one important twist: the "model" is doing the reasoning, not hand-coded rules.

Reactive agents

A reactive agent maps the current input directly to an action. No internal world model, no multi-step plan. Think: autocomplete, inline code suggestions, or a classifier that routes a ticket to the right queue.

Strengths: Fast, cheap, predictable on narrow tasks. Weaknesses: Cannot recover from errors across steps; no memory of prior actions unless you inject it.

Modern example: GitHub Copilot inline completions β€” one observation (cursor context), one action (suggest next lines), no loop.

Deliberative (goal-based) agents

A deliberative agent maintains a goal, reasons about what to do next, acts, observes the result, and repeats. This is the dominant LLM agent pattern in 2026.

Strengths: Handles multi-step tasks; can adapt when a tool call fails. Weaknesses: Token cost scales with steps; can drift on very long horizons.

Modern examples: Claude Code, Cursor Agent, Devin, OpenAI Codex CLI β€” all run a goal-directed loop until the task completes or hits a stop condition.

Fully autonomous (background) agents

These agents run on schedules or triggers without a human initiating each session. Anthropic's managed agents, Claude Code's /goal mode, and various "AI employee" products fall here.

Strengths: True automation β€” work happens while you sleep. Weaknesses: Highest risk profile; requires strong guardrails, logging, and rollback.

The critical design choice is not "how autonomous" but which actions stay gated. See Human-in-the-Loop AI for the decision framework.


Axis 2: Loop architecture β€” how the agent thinks step by step

Autonomy tells you whether the agent loops. Loop architecture tells you how each iteration works.

ReAct (Reason + Act)

The default pattern: the model outputs reasoning (optional) and a tool call, the harness executes it, the result goes back into context, repeat.

Goal β†’ [Reason β†’ Tool call β†’ Observe result] β†’ ... β†’ Done

Used by: Claude Code, most LangChain agents, Cursor Agent, OpenAI function-calling loops.

Deep dive: ReAct Prompting guide and Agentic Loop: stop_reason guide.

Plan-and-execute

A planner produces a numbered step list first. An executor runs each step sequentially. Replanning happens only when a step fails or the plan becomes invalid.

When to use: Tasks with 15+ steps where pure ReAct drifts (migrations, multi-file refactors, research reports with fixed sections). Trade-off: Slower to adapt mid-flight; upfront plan can be wrong.

LangGraph's PlanAndExecute chain and CrewAI's task decomposition are common implementations.

Reflexion (self-critique)

After each attempt, a critic model evaluates output quality and injects feedback before the next iteration. Useful when success criteria are fuzzy (writing quality, test coverage, security review).

When to use: Code review agents, content generation with quality bars, eval-driven improvement loops. Trade-off: 2–3x token cost per iteration.

Research: Reflexion paper (Shinn et al., 2023) β€” the pattern that inspired most self-critique loops in production harnesses.

Hierarchical agents

A manager agent decomposes work and delegates to worker agents. Workers may themselves be ReAct loops. The manager synthesizes results.

When to use: Parallel research, large codebases with independent modules, multi-domain tasks (legal + finance + engineering).

Deep dive: Multi-Agent Orchestration Patterns.


Axis 3: Domain β€” what the agent is built to do

Domain type is the axis most product marketing emphasizes. It determines default tools, safety profile, and evaluation criteria.

Coding agents

Terminal or IDE agents with file read/write, shell execution, git, and test runners.

ProductLoopTool accessTypical autonomy
Claude CodeReAct + subagentsShell, files, MCP, gitHigh on edit; gated on push
Cursor AgentReActIDE, terminal, webSupervised
OpenAI Codex CLIReActShell, filesConfigurable
DevinReAct + planFull dev environmentHigh

Pathway: Building AI Agents.

Research agents

Agents optimized for web search, document retrieval, synthesis, and citation. Perplexity Deep Research, Google's research mode, and custom RAG pipelines are the main forms.

Key difference from coding agents: Read-heavy, write-light; evaluation is factual accuracy and source coverage, not test pass rate.

Customer support agents

Ticket routing, knowledge-base lookup, draft responses, escalation to humans. Usually reactive or short-loop deliberative β€” not open-ended autonomy.

Design constraint: Must handle PII, stay within policy, and escalate gracefully. Human gates on every customer-facing send.

Browser / computer-use agents

Agents that control a browser or desktop via screenshots and UI actions. Anthropic's Computer Use, OpenAI's Operator, and various open-source Playwright wrappers.

Strengths: Can interact with any web UI without an API. Weaknesses: Slow (screenshot β†’ action cycles), fragile on dynamic UIs, high token cost.

Workflow / automation agents

Zapier-style agents, n8n AI nodes, Make.com scenarios β€” fixed DAGs with LLM steps at decision points. Less "autonomous loop," more "LLM inside a workflow."

When to use: Repeatable business processes with known steps (invoice processing, lead enrichment, report generation).

Voice agents

Speech-in, speech-out agents with tool access β€” customer phone lines, meeting assistants, real-time translation with action capability.

Extra constraints: Latency budget (under 800ms for natural conversation), interruption handling, and ASR/TTS error propagation.


Axis 4: Single agent vs multi-agent

PatternStructureBest forCost multiplier
Single agentOne loop, one contextSequential tasks, under 20 steps1x
Orchestrator/workerManager decomposes, workers executeParallel independent subtasks2–5x
PipelineAgent A β†’ Agent B β†’ Agent CSequential specialization (research β†’ draft β†’ edit)3x
Fan-out/fan-inN workers in parallel, aggregator synthesizesSearch across many sources simultaneouslyNx
Debate / critiqueTwo agents challenge each otherHigh-stakes decisions, code review2x

Rule of thumb: Start with one agent. Add a second only when you can name the specific subtask that needs a different system prompt, tool set, or parallel execution β€” not because "multi-agent sounds more advanced."

Claude Code subagents are orchestrator/worker at the harness level: the main session spawns specialists via the Task tool. See Claude Code Subagents.


Axis 5: Tool and memory access

RAG-only agents

Retrieve documents, inject into context, generate answer. No external actions.

Good for: Q&A over internal docs, policy lookup, knowledge bases. Not an agent in the strict sense if there is no action loop β€” but vendors often label these "agents" anyway.

API / MCP agents

Connect to structured tools via REST APIs or Model Context Protocol (MCP) servers. The standard pattern for production integrations in 2026.

Good for: Database queries, CRM updates, calendar scheduling, custom internal tools.

Code execution agents

Run Python, shell, or sandboxed code as a tool. The model writes code; the harness executes and returns stdout/stderr.

Good for: Data analysis, file transformation, anything where generated code is more reliable than direct tool calls.

Memory-augmented agents

Agents with persistent memory across sessions β€” MEMORY.md, vector stores, or structured state files.

Types of memory:

Deep dive: Agent Markdown Files and Karpathy's LLM Wiki pattern.


Axis 6: Human involvement β€” the safety spectrum

LevelHuman roleExample
CopilotHuman initiates every action; AI suggestsInline autocomplete, chat sidebar
SupervisedAgent proposes; human approves irreversible stepsClaude Code with permission prompts
CheckpointedAgent runs autonomously until a gateApprove-before-email, approve-before-deploy
Fully autonomousHuman reviews output after the factScheduled report generation, log monitoring

The agent type does not determine safety β€” the harness does. A fully autonomous coding agent with no gates on git push is a different risk class than the same agent with hooks that block production deploys.


Decision matrix β€” which agent type should you build?

Answer these five questions in order:

  1. Is the task reversible?

    • No β†’ Supervised or checkpointed, regardless of domain.
    • Yes β†’ Continue.
  2. How many steps?

    • 1–3 β†’ Reactive or single-shot (maybe not an agent at all).
    • 4–20 β†’ Single ReAct agent.
    • 20+ β†’ Plan-and-execute or multi-agent decomposition.
  3. Does it need external systems?

    • No β†’ RAG or pure generation (chatbot may suffice).
    • Yes β†’ MCP/API agent with defined tool schemas.
  4. Can subtasks run in parallel?

    • No β†’ Single agent.
    • Yes β†’ Orchestrator/worker or fan-out.
  5. What domain?

    • Code β†’ Coding agent harness (terminal + files + tests).
    • Knowledge β†’ Research agent (search + RAG + synthesis).
    • Business process β†’ Workflow agent (DAG + LLM decision nodes).
Your answersRecommended type
Reversible, under 10 steps, needs APIs, sequential, codeSingle ReAct coding agent (Claude Code, Cursor)
Reversible, 20+ steps, needs APIs, sequential, codePlan-and-execute or hierarchical coding agent
Reversible, parallel research, needs webMulti-agent fan-out research system
Irreversible customer commsReactive support agent + human gate on every send
Repeatable business processWorkflow agent with LLM at decision nodes

How the types connect to the four-layer stack

Every agent type above sits on the same underlying stack:

  1. Prompt engineering β€” how each turn is worded
  2. Context engineering β€” what the model sees (history, RAG, tool schemas)
  3. Loop engineering — how observe→act cycles run (ReAct, plan-execute, etc.)
  4. Harness engineering β€” the code that enforces gates, retries, and tool execution

Changing the "type" usually means changing layers 3 and 4, not just the prompt. A multi-agent system is primarily a loop + harness change. A browser agent is primarily a tool-access + harness change.

Full stack guide: Context vs Prompt vs Loop vs Harness Engineering.


Summary

"AI agent" is not one thing. The useful taxonomy spans six axes:

Most production systems in 2026 are deliberative, ReAct-style, domain-specific agents with MCP tool access and supervised gates on irreversible actions. Multi-agent and plan-and-execute patterns appear when task complexity or parallelism demands them β€” not by default.


Related reading on explainx.ai


Product names, API capabilities, and agent features referenced in this guide reflect the landscape as of June 29, 2026. Agent taxonomies evolve quickly β€” check official documentation for the latest capabilities.