Claude API Pricing 2026: $0.25–$75/M tokens (Haiku, Sonnet, Opus)

Price checkPer per million tokens

API (Pay-as-you-go)Custom EnterpriseCustom

All Claude API Plans & Pricing

Plan	Monthly	Annual	Best For
API (Pay-as-you-go)	Contact Sales	Contact Sales	Developers building AI-powered applications on Claude
What's included at API (Pay-as-you-go) Best for: Developers building AI-powered applications on Claude Claude Opus 4.6: $5.00 input / $25.00 output per M tokens Claude Sonnet 4.6: $3.00 input / $15.00 output per M tokens Claude Haiku 4.5: $1.00 input / $5.00 output per M tokens Batch API: 50% off all model rates Prompt caching: ~10% of input cost for cache reads (5-min TTL) Legacy models available: Sonnet 4.5, Opus 4.5, Opus 4.1, Sonnet 4, Opus 4, Haiku 3 Managed Agents: $0.08 per session-hour Web search: $10 per 1,000 searches Code execution: $0.05 per hour per container (50 free hours/day)
Enterprise	Contact Sales	Contact Sales	Large-scale enterprise API deployments
What's included at Enterprise Best for: Large-scale enterprise API deployments Volume discounts on API usage Dedicated support and SLAs Custom rate limits SSO and admin controls

View all features by plan (compare side-by-side)

API (Pay-as-you-go)

Claude Opus 4.6: $5.00 input / $25.00 output per M tokens
Claude Sonnet 4.6: $3.00 input / $15.00 output per M tokens
Claude Haiku 4.5: $1.00 input / $5.00 output per M tokens
Batch API: 50% off all model rates
Prompt caching: ~10% of input cost for cache reads (5-min TTL)
Legacy models available: Sonnet 4.5, Opus 4.5, Opus 4.1, Sonnet 4, Opus 4, Haiku 3
Managed Agents: $0.08 per session-hour
Web search: $10 per 1,000 searches
Code execution: $0.05 per hour per container (50 free hours/day)

Enterprise

Volume discounts on API usage
Dedicated support and SLAs
Custom rate limits
SSO and admin controls

See Claude API Plans

Compare Claude API with alternativesAdjust seats, lock a tier, add up to 2 more products side-by-side. Shareable URL.

Quick Answer

Last verified: May 5, 2026

High confidence

Claude API costs $0.03 to $75 per per million tokens as of June 2026, with 2 plans available. Pricing depends on your chosen tier, contract length, and negotiated discounts.

Use the interactive pricing calculator to estimate your exact cost based on team size and requirements.

Free tier: No free tier available

Claude API offers 2 pricing tiers: API (Pay-as-you-go), Enterprise. The Enterprise plan is large-scale enterprise api deployments.

Compared to other llm api providers software, Claude API is positioned at the budget-friendly price point.

2 documented hidden costs beyond list price

How much does Claude API cost?

Claude API pricing starts at $0.03/per million tokens across 2 plans, with enterprise pricing available on request. Plans include API (Pay-as-you-go) (custom pricing), Enterprise (custom pricing).

Claude API Pricing Overview

Claude API has 2 pricing plans ranging from $0.03 to $75/per million tokens. The API (Pay-as-you-go) plan requires contacting sales for a custom quote and is designed for developers building ai-powered applications on claude. The Enterprise plan requires contacting sales for a custom quote and is designed for large-scale enterprise api deployments.

There are at least 2 documented hidden costs beyond Claude API's list price, including implementation, training, and add-on fees.

This pricing was last verified in May 5, 2026 from 2 independent sources.

See Claude API Plans

The Claude API costs from $0.25 per million input tokens (Claude Haiku 4) to $75 per million output tokens (Claude Opus 4) as of April 2026. Pricing is usage-based with no monthly subscription — you pay per token consumed. Claude Sonnet 4, the most popular model, is priced at $3.00 per million input tokens and $15.00 per million output tokens. Prompt caching reduces costs by up to 90% on repeated context, and the Batch API cuts rates in half for async workloads.

How Claude API Pricing Compares

Compare Claude API pricing against top alternatives in LLM API Providers.

Groq $0-$3.0/per million tokens Compare → Together AI $0.03-$9.95/per million tokens / hour Compare → Fireworks AI $0-$9/per million tokens / hour Compare →

Usage-Based Rates

Per-unit pricing for Claude API API usage.

API (Pay-as-you-go)

Model	Unit	Rate
Claude Opus 4.6	1M input tokens	$5.00
Claude Opus 4.6	1M output tokens	$25.00
Claude Opus 4.6 (cache write)	1M cache write tokens	$6.25
Claude Opus 4.6 (cache read)	1M cached input tokens	$0.500
Claude Sonnet 4.6	1M input tokens	$3.00
Claude Sonnet 4.6	1M output tokens	$15.00
Claude Sonnet 4.6 (cache write)	1M cache write tokens	$3.75
Claude Sonnet 4.6 (cache read)	1M cached input tokens	$0.300
Claude Haiku 4.5	1M input tokens	$1.00
Claude Haiku 4.5	1M output tokens	$5.00
Claude Haiku 4.5 (cache write)	1M cache write tokens	$1.25
Claude Haiku 4.5 (cache read)	1M cached input tokens	$0.100
Claude Sonnet 4.5	1M input tokens	$3.00
Claude Sonnet 4.5	1M output tokens	$15.00
Claude Sonnet 4.5 (cache write)	1M cache write tokens	$3.75
Claude Sonnet 4.5 (cache read)	1M cached input tokens	$0.300
Claude Opus 4.5	1M input tokens	$5.00
Claude Opus 4.5	1M output tokens	$25.00
Claude Opus 4.5 (cache write)	1M cache write tokens	$6.25
Claude Opus 4.5 (cache read)	1M cached input tokens	$0.500
Claude Opus 4.1	1M input tokens	$15.00
Claude Opus 4.1	1M output tokens	$75.00
Claude Opus 4.1 (cache write)	1M cache write tokens	$18.75
Claude Opus 4.1 (cache read)	1M cached input tokens	$1.50
Claude Sonnet 4	1M input tokens	$3.00
Claude Sonnet 4	1M output tokens	$15.00
Claude Sonnet 4 (cache write)	1M cache write tokens	$3.75
Claude Sonnet 4 (cache read)	1M cached input tokens	$0.300
Claude Opus 4	1M input tokens	$15.00
Claude Opus 4	1M output tokens	$75.00
Claude Opus 4 (cache write)	1M cache write tokens	$18.75
Claude Opus 4 (cache read)	1M cached input tokens	$1.50
Claude Haiku 3	1M input tokens	$0.250
Claude Haiku 3	1M output tokens	$1.25
Claude Haiku 3 (cache write)	1M cache write tokens	$0.300
Claude Haiku 3 (cache read)	1M cached input tokens	$0.030

Batch API: 50% discount on all rates
Cache writes: 25% of base input price; cache reads: 10% of base input price (5-min TTL)
Haiku 4.5 is fastest/cheapest; Opus 4.6 is most intelligent
US-only inference available at 1.1x pricing for input/output tokens

Compare Claude API vs Alternatives

Before committing to Claude API, compare pricing with these 3 alternatives in the same category.

VSGroq

Free

Prototyping and evaluation

Full comparison

VSTogether AI

From $0.03/per million tokens / hour

Variable-volume API usage

Full comparison

VSFireworks AI

Free

Variable-volume API usage

Full comparison

All Claude API alternatives & migration guides

What Companies Actually Pay for Claude API

Median per-1M-token pricing across 16 models

Input $5.00/1M

Output $25.00/1M

Flagship models in this provider's catalog

Model	Input /1M	Output /1M	Blended /1M
anthropic_claude-4-1-opus	$15.00	$75.00	$30.00
anthropic_claude-opus-4-6	$5.00	$25.00	$10.00
anthropic_claude-sonnet-4-6	$3.00	$15.00	$6.00
anthropic_claude-4-5-haiku	$1.00	$5.00	$2.00
anthropic_claude-3-haiku	$0.250	$1.25	$0.500

Review scores

Top pricing complaints

API costs at production scale ($3/$15 per 1M tokens for Sonnet) can reach $15,000+ per two-week period for high-volume workloadsOpus-class models at $15/$75 per 1M tokens are significantly more expensive than some comparable competitor offeringsNo published volume discount tiers for standard API customers — enterprise discounts require direct sales negotiation

Source: Artificial Analysis — medians aggregated from 16 models in this provider's catalog. Per-1M-token pricing reflects list rates.

Claude API Year 1 Total Cost by Company Size

Real deployment costs including licenses, implementation, training, and admin — not just the sticker price.

Lightweight Chatbot (Haiku 4) $5 Year 1 total

10M input + 2M output tokens/month using Claude Haiku 4

Mid-Scale App (Sonnet 4) $45 Year 1 total

5M input + 2M output tokens/month using Claude Sonnet 4

Heavy Reasoning (Opus 4) $52 Year 1 total

1M input + 0.5M output tokens/month using Claude Opus 4

High-Volume Production Workload (4.5B tokens / 2 weeks) ~$15,000 per two-week period (developer estimate) Year 1 total

developer estimate

Total ~$15,000 per two-week period (developer estimate)

A startup-scale application processing 4.5 billion tokens over 14 days using Claude Sonnet pricing ($3/$15 per 1M tokens in/out).

Vision Processing at Scale (1M images, 1000×1000px) ~$4,000 for input tokens on 1M images (output costs additional depending on response length) Year 1 total

output costs additional depending on response length

Total ~$4,000 for input tokens on 1M images (output costs additional depending on response length)

Processing one million 1,000×1,000 pixel images through Claude Vision. Each image uses approximately 1,334 tokens at $3/1M input tokens, plus output tokens for the model's response.

How Claude API Pricing Compares

Software	Starting Price	Top Price
Claude API	$0.03/per million tokens	$75/per million tokens
Amazon Bedrock	$0.07/per million tokens	$75/per million tokens
Anyscale	$0.15/per million tokens	$5/per million tokens
Baidu ERNIE API	$0.1/per million tokens	$10/per million tokens
Cerebras Inference API	$0.1/per million tokens	$6/per million tokens
Cloudflare Workers AI	Free	$4.881/per million tokens

Detailed pricing comparisons:

Browse all LLM API Providers pricing →

2 Claude API Hidden Costs Beyond the List Price

Beyond the listed price, Claude API has at least 2 documented hidden costs that can significantly increase total cost of ownership.

Watch for 2 hidden costs

Image/Vision Token Overhead 5-20% of license costs
medium 1 source

Reddit "The formula for token calculation is: (width px * height px) / 750. Here are some examples: 200x200 px Image: 54 tokens, costing roughly $0.0016 per image. 1000x1000 px Image: 1334 tokens, costing about $0.004 per image."
Token Costs Scale Dramatically at Production Volume $15,000-$50,000
high 1 source

Reddit "In the last 14 days that model processed about 4.5B tokens for $280, or about: $0.0007 per million tokens in, and $0.01 per million tokens out. With Claude that would have been $15,000 @ $3/$15."

Tip

Ask your Claude API sales rep about these costs upfront. Getting them in writing before signing can save you from surprise charges later.

Full hidden costs breakdown →

Intelligence sourced from 1 independent sources

Reddit User discussions

Key claims include inline source attribution. Data verified against multiple independent sources. 8 source citations total.

Claude API Contract Terms

Claude API contracts do not auto-renew. Changes require advance notice. These terms are sourced from verified buyer experiences.

Contract Terms

Auto-Renewal No

Mid-Term Downgrade Not allowed

Payment Terms Pay-as-you-go for standard API; Enterprise terms are custom-negotiated

How to Negotiate Claude API Pricing

Claude API contracts are negotiable. These 5 tactics are sourced from real buyer experiences and procurement specialists.

Negotiation Playbook 5 tactics

Route Tasks to Cheapest Capable Model high success

Claude 3 Haiku at $0.25/$1.25 per 1M tokens is 60x cheaper than Claude Opus 4 at $15/$75 per 1M tokens. Route high-volume, lower-complexity tasks (classification, extraction, summarization, routing) to Haiku or Sonnet and reserve Opus only for tasks where frontier-level reasoning is demonstrably required.

Artificial Analysis (artificialanalysis.ai), 2026-04-23

Implement Prompt Caching for Repeated Contexts high success

For applications that repeatedly send large system prompts or shared document context, Anthropic's prompt caching feature can reduce input token consumption on cached portions. This is especially valuable for long-context retrieval-augmented generation (RAG) applications.

Anthropic API documentation

Self-Host Open-Weight Alternatives for Bulk Tasks high success

Developers report that self-hosting models on rented GPUs can reduce per-token costs by 50x or more compared to Claude API for sufficiently high-volume workloads. One developer processed 4.5B tokens for $280 vs. an estimated $15,000 via Claude API. Consider a hybrid approach: Claude API for quality-critical tasks, self-hosted models for bulk/routine tasks.

Reddit r/startups (2025-04-18)

Use Sonnet Over Opus for Most Production Workloads high success

Claude Sonnet 4.6 at $3/$15 per 1M tokens delivers strong performance at a fraction of Opus 4's $15/$75 cost. Multiple developers report Sonnet is sufficient for the majority of production use cases, reserving Opus for complex reasoning or high-stakes generation tasks.

Artificial Analysis (artificialanalysis.ai), 2026-04-23

Negotiate Enterprise Volume Commitments medium success

The Enterprise tier is custom-priced. Organizations with predictable, high-volume API usage should contact Anthropic's sales team and commit to a minimum spend in exchange for per-token rate reductions. Come prepared with token consumption data and projected growth.

CURRENT TIER DATA (Enterprise tier is custom-priced)

Full negotiation guide →

Claude API Pricing FAQ

01 How much does the Claude API cost?

Claude API pricing varies by model. Claude Haiku 4 is the cheapest at $0.25/M input and $1.25/M output tokens. Claude Sonnet 4 is $3.00/$15.00 per million tokens. Claude Opus 4 is $15.00/$75.00 per million tokens.

02 How much would 10 million Sonnet tokens cost?

At Claude Sonnet 4 rates: 10M input tokens × $3.00 = $30.00; 10M output tokens × $15.00 = $150.00. A typical production app consuming 10M tokens/month (mix of input and output) would run roughly $50–75/month depending on the input/output ratio.

03 What is the Batch API discount?

Anthropic's Batch API processes requests asynchronously and offers a 50% discount on all model rates. Haiku 4 drops to $0.125/$0.625 per M tokens, Sonnet 4 to $1.50/$7.50. Ideal for offline workloads with no latency requirement.

04 How does prompt caching work with pricing?

Cache writes cost 25% of the base input price; cache reads cost just 10%. For Sonnet 4: cache write = $0.75/M, cache read = $0.30/M (vs $3.00/M standard input). Caching is profitable once you reuse the same context 2+ times.

05 Is Claude API the same as Anthropic API?

Yes — Anthropic is the company; Claude is the AI model family. The Anthropic API is the Claude API. You access it at api.anthropic.com with an API key from console.anthropic.com.

06 How does Claude API pricing work?

The Claude API uses pay-as-you-go token pricing. You are billed separately for input tokens (the text, images, or documents you send) and output tokens (the text Claude generates back). Prices vary significantly by model: Claude 3 Haiku starts at $0.25/1M input and $1.25/1M output tokens, while Claude Opus 4 costs $15/1M input and $75/1M output tokens. Enterprise pricing is custom-negotiated through Anthropic's sales team.

07 Which Claude model is cheapest for API access?

Claude 3 Haiku is the most affordable model at $0.25/1M input and $1.25/1M output tokens — a blended rate of approximately $0.50/1M tokens. For teams needing more capability, Claude Sonnet 4.6 at $3/$15 per 1M tokens offers a strong middle ground. Claude Opus 4 at $15/$75 per 1M tokens is the most expensive and is best reserved for complex reasoning tasks.

08 How are images billed in the Claude API?

Image inputs are converted to tokens using the formula: (width in pixels × height in pixels) / 750. A 1,000×1,000 pixel image consumes approximately 1,334 tokens, costing about $0.004 per image at Sonnet's $3/1M input token rate. At one million images, vision input costs alone reach roughly $4,000 before accounting for output tokens.

09 Does Anthropic offer volume discounts on the Claude API?

Volume discounts are available through the Enterprise tier, which is custom-quoted. There is no published discount schedule for standard pay-as-you-go API customers. The primary cost-reduction levers available without a custom contract are choosing lower-cost model tiers (Haiku, Sonnet) and implementing prompt caching for repeated contexts.

10 How does Claude API pricing compare to running your own model?

Developers with sufficient volume report that self-hosting open-weight models can be dramatically cheaper than the Claude API. One developer reported processing 4.5 billion tokens over 14 days for $280 on self-hosted infrastructure, estimating the same workload would have cost $15,000 via Claude at $3/$15 per million tokens. However, self-hosting requires engineering overhead and may sacrifice quality on complex tasks.

Is this pricing incorrect? — we'll verify and update it.