Claude API Pricing 2026
Complete pricing guide with plans, hidden costs, and cost analysis
Claude API pricing ranges from $0.03 to $75/per million tokens.
Claude API costs $0.03 to $75 per per million tokens as of April 2026, with 2 plans available. Pricing depends on your chosen tier, contract length, and negotiated discounts.
Use the interactive pricing calculator to estimate your exact cost based on team size and requirements.
- Free tier: No free tier available
Claude API offers 2 pricing tiers: API (Pay-as-you-go), Enterprise. The Enterprise plan is large-scale enterprise api deployments.
Compared to other llm api providers software, Claude API is positioned at the budget-friendly price point.
- 2 documented hidden costs beyond list price
How much does Claude API cost?
Claude API Pricing Overview
Claude API has 2 pricing plans ranging from $0.03 to $75/per million tokens. The API (Pay-as-you-go) plan requires contacting sales for a custom quote and is designed for developers building ai-powered applications on claude. The Enterprise plan requires contacting sales for a custom quote and is designed for large-scale enterprise api deployments.
There are at least 2 documented hidden costs beyond Claude API's list price, including implementation, training, and add-on fees.
This pricing was last verified in April 15, 2026 from 2 independent sources.
The Claude API costs from $0.25 per million input tokens (Claude Haiku 4) to $75 per million output tokens (Claude Opus 4) as of April 2026. Pricing is usage-based with no monthly subscription — you pay per token consumed. Claude Sonnet 4, the most popular model, is priced at $3.00 per million input tokens and $15.00 per million output tokens. Prompt caching reduces costs by up to 90% on repeated context, and the Batch API cuts rates in half for async workloads.
How Claude API Pricing Compares
Compare Claude API pricing against top alternatives in LLM API Providers.
All Claude API Plans & Pricing
| Plan | Monthly | Annual | Best For |
|---|---|---|---|
| API (Pay-as-you-go) | Contact Sales | Contact Sales | Developers building AI-powered applications on Claude |
| Enterprise | Contact Sales | Contact Sales | Large-scale enterprise API deployments |
View all features by plan
API (Pay-as-you-go)
- Claude Opus 4.6: $5.00 input / $25.00 output per M tokens
- Claude Sonnet 4.6: $3.00 input / $15.00 output per M tokens
- Claude Haiku 4.5: $1.00 input / $5.00 output per M tokens
- Batch API: 50% off all model rates
- Prompt caching: ~10% of input cost for cache reads (5-min TTL)
- Legacy models available: Sonnet 4.5, Opus 4.5, Opus 4.1, Sonnet 4, Opus 4, Haiku 3
- Managed Agents: $0.08 per session-hour
- Web search: $10 per 1,000 searches
- Code execution: $0.05 per hour per container (50 free hours/day)
Enterprise
- Volume discounts on API usage
- Dedicated support and SLAs
- Custom rate limits
- SSO and admin controls
Usage-Based Rates
Per-unit pricing for Claude API API usage.
API (Pay-as-you-go)
| Model | Unit | Rate |
|---|---|---|
| Claude Opus 4.6 | 1M input tokens | $5.00 |
| Claude Opus 4.6 | 1M output tokens | $25.00 |
| Claude Opus 4.6 (cache write) | 1M cache write tokens | $6.25 |
| Claude Opus 4.6 (cache read) | 1M cached input tokens | $0.500 |
| Claude Sonnet 4.6 | 1M input tokens | $3.00 |
| Claude Sonnet 4.6 | 1M output tokens | $15.00 |
| Claude Sonnet 4.6 (cache write) | 1M cache write tokens | $3.75 |
| Claude Sonnet 4.6 (cache read) | 1M cached input tokens | $0.300 |
| Claude Haiku 4.5 | 1M input tokens | $1.00 |
| Claude Haiku 4.5 | 1M output tokens | $5.00 |
| Claude Haiku 4.5 (cache write) | 1M cache write tokens | $1.25 |
| Claude Haiku 4.5 (cache read) | 1M cached input tokens | $0.100 |
| Claude Sonnet 4.5 | 1M input tokens | $3.00 |
| Claude Sonnet 4.5 | 1M output tokens | $15.00 |
| Claude Sonnet 4.5 (cache write) | 1M cache write tokens | $3.75 |
| Claude Sonnet 4.5 (cache read) | 1M cached input tokens | $0.300 |
| Claude Opus 4.5 | 1M input tokens | $5.00 |
| Claude Opus 4.5 | 1M output tokens | $25.00 |
| Claude Opus 4.5 (cache write) | 1M cache write tokens | $6.25 |
| Claude Opus 4.5 (cache read) | 1M cached input tokens | $0.500 |
| Claude Opus 4.1 | 1M input tokens | $15.00 |
| Claude Opus 4.1 | 1M output tokens | $75.00 |
| Claude Opus 4.1 (cache write) | 1M cache write tokens | $18.75 |
| Claude Opus 4.1 (cache read) | 1M cached input tokens | $1.50 |
| Claude Sonnet 4 | 1M input tokens | $3.00 |
| Claude Sonnet 4 | 1M output tokens | $15.00 |
| Claude Sonnet 4 (cache write) | 1M cache write tokens | $3.75 |
| Claude Sonnet 4 (cache read) | 1M cached input tokens | $0.300 |
| Claude Opus 4 | 1M input tokens | $15.00 |
| Claude Opus 4 | 1M output tokens | $75.00 |
| Claude Opus 4 (cache write) | 1M cache write tokens | $18.75 |
| Claude Opus 4 (cache read) | 1M cached input tokens | $1.50 |
| Claude Haiku 3 | 1M input tokens | $0.250 |
| Claude Haiku 3 | 1M output tokens | $1.25 |
| Claude Haiku 3 (cache write) | 1M cache write tokens | $0.300 |
| Claude Haiku 3 (cache read) | 1M cached input tokens | $0.030 |
- Batch API: 50% discount on all rates
- Cache writes: 25% of base input price; cache reads: 10% of base input price (5-min TTL)
- Haiku 4.5 is fastest/cheapest; Opus 4.6 is most intelligent
- US-only inference available at 1.1x pricing for input/output tokens
Compare Claude API vs Alternatives
Before committing to Claude API, compare pricing with these 3 alternatives in the same category.
What Companies Actually Pay for Claude API
| Model | Input /1M | Output /1M | Blended /1M |
|---|---|---|---|
| anthropic_claude-4-1-opus | $15.00 | $75.00 | $30.00 |
| anthropic_claude-opus-4-6 | $5.00 | $25.00 | $10.00 |
| anthropic_claude-sonnet-4-6 | $3.00 | $15.00 | $6.00 |
| anthropic_claude-4-5-haiku | $1.00 | $5.00 | $2.00 |
| anthropic_claude-3-haiku | $0.250 | $1.25 | $0.500 |
Claude API Year 1 Total Cost by Company Size
Real deployment costs including licenses, implementation, training, and admin — not just the sticker price.
10M input + 2M output tokens/month using Claude Haiku 4
5M input + 2M output tokens/month using Claude Sonnet 4
1M input + 0.5M output tokens/month using Claude Opus 4
A startup-scale application processing 4.5 billion tokens over 14 days using Claude Sonnet pricing ($3/$15 per 1M tokens in/out).
Processing one million 1,000×1,000 pixel images through Claude Vision. Each image uses approximately 1,334 tokens at $3/1M input tokens, plus output tokens for the model's response.
How Claude API Pricing Compares
| Software | Starting Price | Top Price |
|---|---|---|
| Claude API | $0.03/per million tokens | $75/per million tokens |
| Amazon Bedrock | $0.07/per million tokens | $75/per million tokens |
| Anyscale | $0.15/per million tokens | $5/per million tokens |
| Baidu ERNIE API | $0.1/per million tokens | $10/per million tokens |
| Cerebras Inference API | $0.1/per million tokens | $6/per million tokens |
| Cloudflare Workers AI | $0.05/per million tokens | $5/per million tokens |
Detailed pricing comparisons:
Claude API Contract Terms
Claude API contracts do not auto-renew. Changes require advance notice. These terms are sourced from verified buyer experiences.
How to Negotiate Claude API Pricing
Claude API contracts are negotiable. These 5 tactics are sourced from real buyer experiences and procurement specialists.
Claude 3 Haiku at $0.25/$1.25 per 1M tokens is 60x cheaper than Claude Opus 4 at $15/$75 per 1M tokens. Route high-volume, lower-complexity tasks (classification, extraction, summarization, routing) to Haiku or Sonnet and reserve Opus only for tasks where frontier-level reasoning is demonstrably required.
Artificial Analysis (artificialanalysis.ai), 2026-04-23For applications that repeatedly send large system prompts or shared document context, Anthropic's prompt caching feature can reduce input token consumption on cached portions. This is especially valuable for long-context retrieval-augmented generation (RAG) applications.
Anthropic API documentationDevelopers report that self-hosting models on rented GPUs can reduce per-token costs by 50x or more compared to Claude API for sufficiently high-volume workloads. One developer processed 4.5B tokens for $280 vs. an estimated $15,000 via Claude API. Consider a hybrid approach: Claude API for quality-critical tasks, self-hosted models for bulk/routine tasks.
Reddit r/startups (2025-04-18)Claude Sonnet 4.6 at $3/$15 per 1M tokens delivers strong performance at a fraction of Opus 4's $15/$75 cost. Multiple developers report Sonnet is sufficient for the majority of production use cases, reserving Opus for complex reasoning or high-stakes generation tasks.
Artificial Analysis (artificialanalysis.ai), 2026-04-23The Enterprise tier is custom-priced. Organizations with predictable, high-volume API usage should contact Anthropic's sales team and commit to a minimum spend in exchange for per-token rate reductions. Come prepared with token consumption data and projected growth.
CURRENT TIER DATA (Enterprise tier is custom-priced)Claude API Pricing FAQ
01 How much does the Claude API cost?
Claude API pricing varies by model. Claude Haiku 4 is the cheapest at $0.25/M input and $1.25/M output tokens. Claude Sonnet 4 is $3.00/$15.00 per million tokens. Claude Opus 4 is $15.00/$75.00 per million tokens.
02 How much would 10 million Sonnet tokens cost?
At Claude Sonnet 4 rates: 10M input tokens × $3.00 = $30.00; 10M output tokens × $15.00 = $150.00. A typical production app consuming 10M tokens/month (mix of input and output) would run roughly $50–75/month depending on the input/output ratio.
03 What is the Batch API discount?
Anthropic's Batch API processes requests asynchronously and offers a 50% discount on all model rates. Haiku 4 drops to $0.125/$0.625 per M tokens, Sonnet 4 to $1.50/$7.50. Ideal for offline workloads with no latency requirement.
04 How does prompt caching work with pricing?
Cache writes cost 25% of the base input price; cache reads cost just 10%. For Sonnet 4: cache write = $0.75/M, cache read = $0.30/M (vs $3.00/M standard input). Caching is profitable once you reuse the same context 2+ times.
05 Is Claude API the same as Anthropic API?
Yes — Anthropic is the company; Claude is the AI model family. The Anthropic API is the Claude API. You access it at api.anthropic.com with an API key from console.anthropic.com.
06 How does Claude API pricing work?
The Claude API uses pay-as-you-go token pricing. You are billed separately for input tokens (the text, images, or documents you send) and output tokens (the text Claude generates back). Prices vary significantly by model: Claude 3 Haiku starts at $0.25/1M input and $1.25/1M output tokens, while Claude Opus 4 costs $15/1M input and $75/1M output tokens. Enterprise pricing is custom-negotiated through Anthropic's sales team.
07 Which Claude model is cheapest for API access?
Claude 3 Haiku is the most affordable model at $0.25/1M input and $1.25/1M output tokens — a blended rate of approximately $0.50/1M tokens. For teams needing more capability, Claude Sonnet 4.6 at $3/$15 per 1M tokens offers a strong middle ground. Claude Opus 4 at $15/$75 per 1M tokens is the most expensive and is best reserved for complex reasoning tasks.
08 How are images billed in the Claude API?
Image inputs are converted to tokens using the formula: (width in pixels × height in pixels) / 750. A 1,000×1,000 pixel image consumes approximately 1,334 tokens, costing about $0.004 per image at Sonnet's $3/1M input token rate. At one million images, vision input costs alone reach roughly $4,000 before accounting for output tokens.
09 Does Anthropic offer volume discounts on the Claude API?
Volume discounts are available through the Enterprise tier, which is custom-quoted. There is no published discount schedule for standard pay-as-you-go API customers. The primary cost-reduction levers available without a custom contract are choosing lower-cost model tiers (Haiku, Sonnet) and implementing prompt caching for repeated contexts.
10 How does Claude API pricing compare to running your own model?
Developers with sufficient volume report that self-hosting open-weight models can be dramatically cheaper than the Claude API. One developer reported processing 4.5 billion tokens over 14 days for $280 on self-hosted infrastructure, estimating the same workload would have cost $15,000 via Claude at $3/$15 per million tokens. However, self-hosting requires engineering overhead and may sacrifice quality on complex tasks.
Is this pricing incorrect? — we'll verify and update it.