URL: https://pecollective.com/tools/cohere-pricing/

Cohere Pricing 2026: Command, Embed, and Rerank Costs


Cohere API Pricing: Every Model Compared (April 2026)

Cohere offers three product lines: Command (text generation), Embed (vector embeddings), and Rerank (search result reranking). Unlike OpenAI and Anthropic, Cohere has a free trial tier that needs no credit card, limited to 100 API calls per minute and 1,000 per month. The paid models are competitively priced: Command R+ at $2.50/$10 per million input/output tokens is cheaper than Claude Sonnet 4.6 ($3/$15), though slightly more expensive than GPT-4.1 ($2/$8). This page covers every model's pricing.

Trial (Free)

$0 (rate-limited)
  • 100 API calls per minute
  • 1,000 calls per month
  • Access to all models
  • No credit card required
  • Enough for prototyping and evaluation

Command R+ (Most Popular)

$2.50 / $10 per 1M input / output tokens
  • Most capable text generation model
  • Strong at RAG and tool use
  • 128K context window
  • Multilingual support (10+ languages)
  • Cheaper than Sonnet 4.6 ($3/$15), slightly pricier than GPT-4.1 ($2/$8)

Command R

$0.15 / $0.60 per 1M input / output tokens
  • Budget text generation model
  • Good for classification and extraction
  • 128K context window
  • Comparable to GPT-4.1 Nano on simple tasks
  • Very cost-effective at scale

Embed v3

$0.10 per 1M tokens
  • High-quality text embeddings
  • 1024 dimensions (configurable)
  • Multilingual support
  • Comparable to OpenAI text-embedding-3-small
  • Well suited to search and RAG

Rerank 3.5

$2.00 per 1,000 searches
  • Reranks up to 100 docs per search
  • Can noticeably improve RAG retrieval accuracy
  • Works with any vector database
  • Simple API: just send a query plus documents
  • Documents over 500 tokens are split into chunks
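
To see how chunk splitting can affect the bill, here is a rough sketch of a Rerank cost estimator. It rests on two simplifying assumptions that are not confirmed on this page: a document splits into ceil(tokens / 500) chunks (query length is ignored), and every additional block of 100 chunks is billed as one more search. The function names are hypothetical.

```python
import math

PRICE_PER_1K_SEARCHES = 2.00  # USD, Rerank 3.5 price quoted on this page
MAX_DOCS_PER_SEARCH = 100     # documents covered by one billed search
CHUNK_TOKENS = 500            # documents above this size are split

def rerank_search_units(doc_token_counts):
    """Billed searches for ONE query (assumption: 100 chunks per search)."""
    # Assumption: a document becomes ceil(tokens / 500) chunks.
    chunks = sum(max(1, math.ceil(t / CHUNK_TOKENS)) for t in doc_token_counts)
    return max(1, math.ceil(chunks / MAX_DOCS_PER_SEARCH))

def rerank_cost(num_queries, doc_token_counts):
    """Estimated USD cost when every query reranks the same document set."""
    units = rerank_search_units(doc_token_counts)
    return num_queries * units * PRICE_PER_1K_SEARCHES / 1000

short_docs = [300] * 100   # 100 documents, all under 500 tokens
long_docs = [1200] * 100   # 100 documents of 1,200 tokens, 3 chunks each

print(rerank_search_units(short_docs))  # one billed search per query
print(rerank_search_units(long_docs))   # three billed searches per query
print(rerank_cost(10_000, short_docs), rerank_cost(10_000, long_docs))
```

Under these assumptions, trimming or pre-chunking long documents before reranking keeps each query at one billed search.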

Cohere vs OpenAI vs Anthropic: Pricing Comparison

Figure: Cohere API Pricing 2026: Command R+ vs. GPT-4.1

Here's how Cohere's models stack up against the competition on price.

Use Case | Cohere | OpenAI | Anthropic
Budget generation | Command R: $0.15/$0.60 | GPT-4.1 Nano: $0.10/$0.40 | Haiku 4.5: $1/$5
Production generation | Command R+: $2.50/$10 | GPT-4.1: $2/$8 | Sonnet 4.6: $3/$15
Embeddings | Embed v3: $0.10/1M | text-embedding-3-small: $0.02/1M | (none listed)
Reranking | Rerank 3.5: $2/1K searches | (none listed) | (none listed)
Free tier | 1,000 calls/month | Limited credits | Limited credits
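
As a quick way to read the production-generation row, the sketch below prices one hypothetical monthly workload (200M input tokens, 50M output tokens) against the three listed rates. The workload figures and the function name are illustrative assumptions; the prices are copied from the comparison above.

```python
# (input, output) prices in USD per 1M tokens, from the comparison above
PRODUCTION_PRICES = {
    "Command R+": (2.50, 10.00),
    "GPT-4.1": (2.00, 8.00),
    "Sonnet 4.6": (3.00, 15.00),
}

def monthly_cost(prices, input_tokens, output_tokens):
    """USD cost for a month of traffic at the given per-1M-token prices."""
    in_price, out_price = prices
    return input_tokens / 1e6 * in_price + output_tokens / 1e6 * out_price

# Hypothetical workload: 200M input and 50M output tokens per month
costs = {
    name: monthly_cost(prices, 200e6, 50e6)
    for name, prices in PRODUCTION_PRICES.items()
}

# Print cheapest first
for name, cost in sorted(costs.items(), key=lambda kv: kv[1]):
    print(f"{name}: ${cost:,.2f}")
```

For this workload, Command R+ lands between GPT-4.1 and Sonnet 4.6. The gap widens as the share of output tokens grows, because output pricing differs more than input pricing.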

Hidden Costs & Gotchas

  • Rerank charges per search, not per document. One search with 100 documents costs the same as one search with 10 documents, provided every document is under 500 tokens. Documents over 500 tokens get split into chunks, and each chunk counts as a separate document toward the 100-document limit.
  • Command R+ output tokens cost 4x input tokens. Long generative responses get expensive. Use Command R ($0.15/$0.60) for tasks that don't need R+ quality.
  • The free trial's 1,000 calls/month is enough for prototyping but not for production. There's no pay-as-you-go middle ground: you go straight from free to production pricing.
  • Embed v3 at $0.10/1M tokens is cheap, but embedding large document collections adds up. At roughly 1.3 tokens per word, a million 500-word documents is about 650M tokens, costing $65 to embed.
  • Cohere's pricing is competitive with OpenAI and Anthropic, but the model quality gap matters. Command R+ is strong at RAG but may lag behind Sonnet 4.6 or GPT-4.1 on general reasoning and coding tasks.
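
The embedding estimate in the list above can be reproduced with a few lines of arithmetic. This is a minimal sketch: the 1.3 tokens-per-word ratio is an assumed rough average for English text, and real counts depend on the tokenizer and the content.

```python
EMBED_PRICE_PER_1M = 0.10  # USD per 1M tokens, Embed v3 price quoted on this page
TOKENS_PER_WORD = 1.3      # assumption: rough average for English text

def embedding_cost(num_docs, words_per_doc, tokens_per_word=TOKENS_PER_WORD):
    """Return (estimated tokens, estimated USD cost) to embed a corpus once."""
    tokens = num_docs * words_per_doc * tokens_per_word
    return tokens, tokens / 1_000_000 * EMBED_PRICE_PER_1M

tokens, cost = embedding_cost(1_000_000, 500)
print(f"{tokens / 1e6:.0f}M tokens, ${cost:.2f}")
```

Re-embedding after a model upgrade or a chunking change incurs the same cost again, so it is worth budgeting for more than one pass.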

Which Plan Do You Need?

RAG pipeline builder

Embed v3 ($0.10/1M tokens) for embeddings + Rerank 3.5 ($2/1K searches) for result quality. This combo is Cohere's strongest use case and competitive with any alternative.
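
A rough monthly budget for this combination can be sketched as follows. The corpus size, query volume, and 20-tokens-per-query figure are hypothetical, and the rerank line assumes each query stays within one billed search (at most 100 documents, each under 500 tokens).

```python
EMBED_PRICE_PER_1M = 0.10   # USD per 1M tokens (Embed v3, from this page)
RERANK_PRICE_PER_1K = 2.00  # USD per 1,000 searches (Rerank 3.5, from this page)

def rag_cost(corpus_tokens, queries, tokens_per_query=20):
    """Estimated USD cost: one-time indexing plus one month of queries."""
    indexing = corpus_tokens / 1e6 * EMBED_PRICE_PER_1M
    query_embedding = queries * tokens_per_query / 1e6 * EMBED_PRICE_PER_1M
    # Assumption: one billed search per query
    reranking = queries / 1000 * RERANK_PRICE_PER_1K
    return {
        "indexing": indexing,
        "query_embedding": query_embedding,
        "reranking": reranking,
        "total": indexing + query_embedding + reranking,
    }

# Hypothetical month: 650M-token corpus, 100,000 queries
breakdown = rag_cost(corpus_tokens=650e6, queries=100_000)
for item, usd in breakdown.items():
    print(f"{item}: ${usd:,.2f}")
```

In this example reranking, not embedding, dominates the recurring cost, so query volume is the number to watch.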

Text generation at scale

Command R ($0.15/$0.60) for high-volume simple tasks. Command R+ ($2.50/$10) when you need stronger reasoning. Both undercut their Anthropic equivalents (Haiku 4.5 and Sonnet 4.6) but cost slightly more than OpenAI's (GPT-4.1 Nano and GPT-4.1).
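
One way to reason about the split between the two models is a blended cost, where only a fraction of traffic is routed to Command R+. The sketch below assumes a hypothetical workload of 100M input and 20M output tokens per month and assumes requests are similar in size regardless of which model handles them.

```python
# (input, output) prices in USD per 1M tokens, from this page
COMMAND_R = (0.15, 0.60)
COMMAND_R_PLUS = (2.50, 10.00)

def generation_cost(prices, input_tokens, output_tokens):
    """USD cost for the given token volumes at per-1M-token prices."""
    in_price, out_price = prices
    return input_tokens / 1e6 * in_price + output_tokens / 1e6 * out_price

def blended_cost(share_r_plus, input_tokens, output_tokens):
    """Cost when `share_r_plus` of traffic (0.0 to 1.0) goes to Command R+."""
    r = generation_cost(COMMAND_R, input_tokens, output_tokens)
    r_plus = generation_cost(COMMAND_R_PLUS, input_tokens, output_tokens)
    return (1 - share_r_plus) * r + share_r_plus * r_plus

# Hypothetical month: 100M input tokens, 20M output tokens
for share in (0.0, 0.1, 1.0):
    print(f"{share:.0%} on R+: ${blended_cost(share, 100e6, 20e6):,.2f}")
```

Even a small share of Command R+ traffic accounts for most of the bill, which is why routing simple tasks to Command R pays off at scale.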

Enterprise with compliance needs

Cohere offers deployment in your own cloud (VPC) and on-premises. Contact sales for enterprise pricing. This is a differentiator versus OpenAI and Anthropic, whose models are available mainly through hosted APIs and cloud partners.

The Bottom Line

Cohere's sweet spot is RAG pipelines. Embed v3 for creating embeddings plus Rerank 3.5 for improving search results is a strong combination at competitive prices. For text generation, Command R+ at $2.50/$10 is cheaper than Sonnet 4.6 ($3/$15) but more expensive than GPT-4.1 ($2/$8), and model quality varies by task. The free trial, with 1,000 calls/month and no credit card required, compares well with the limited credits that OpenAI and Anthropic offer.

Disclosure: Pricing information is sourced from official websites and may change. We update this page regularly but always verify current pricing on the vendor's site before purchasing.

Updated April 2026

Cohere launched embed-v4 and updated Command R+ pricing in Q1 2026. Rerank v3 reduced costs. Enterprise plans now include dedicated capacity options.