Replicate Pricing 2026
Complete pricing guide with plans, hidden costs, and cost analysis
Replicate uses custom pricing — contact their sales team for a quote.
Replicate uses custom pricing as of April 2026 with 3 plans available. Contact Replicate directly for a personalized quote. Plans: Free (free), and Pay-as-you-go (free). Enterprise pricing is available on request. Pricing depends on your chosen tier, contract length, and negotiated discounts.
Use the interactive pricing calculator to estimate your exact cost based on team size and requirements.
- Free tier: Yes
Replicate offers 3 pricing tiers: Free, Pay-as-you-go, Enterprise. The Pay-as-you-go plan is developers and teams running ai predictions at any scale.
Compared to other ai productivity software, Replicate is positioned at the budget-friendly price point.
- 2 documented hidden costs beyond list price
How much does Replicate cost?
Replicate Pricing Overview
Replicate uses custom pricing — contact their sales team for a quote. The Free plan is free and is best for trying out ai models and small experiments. The Pay-as-you-go plan is free and is best for developers and teams running ai predictions at any scale. The Enterprise plan requires contacting sales for a custom quote and is designed for organizations with complex requirements or high-volume usage.
There are at least 2 documented hidden costs beyond Replicate's list price, including implementation, training, and add-on fees.
This pricing was last verified in April 15, 2026 from 2 independent sources.
Replicate pricing is based on compute usage with a serverless model that charges per-minute for GPU access. Users report costs of over $5/hr for A100 GPUs and up to $1/minute for serverless workloads. No public tier structure is available, and pricing appears to be usage-based through their API.
How Replicate Pricing Compares
Compare Replicate pricing against top alternatives in AI Productivity.
All Replicate Plans & Pricing
| Plan | Monthly | Annual | Best For |
|---|---|---|---|
| Free | Free | Custom | Trying out AI models and small experiments |
| Pay-as-you-go | Free | Custom | Developers and teams running AI predictions at any scale |
| Enterprise | Contact Sales | Contact Sales | Organizations with complex requirements or high-volume usage |
View all features by plan
Free
- Free credits to get started
- Access to thousands of public models
- Pay-per-use after free credits
Pay-as-you-go
- No monthly subscription fee
- Billed per prediction (per token, per image, or per second)
- Public models: from $0.003/image to $0.25/sec video
- Private model hardware: $0.09/hr (CPU Small) to $43.92/hr (8x H100 GPU)
- GPU options: T4 ($0.81/hr), L40S ($3.51/hr), A100 ($5.04/hr), H100 ($5.49/hr)
- Auto-scaling for private models
- Deploy custom models via Cog
Enterprise
- Dedicated account manager
- Priority support
- Higher GPU limits
- Performance SLAs
- Help with onboarding, custom models, and optimizations
- Volume discounts for large spend
Usage-Based Rates
Per-unit pricing for Replicate API usage.
Pay-as-you-go
| Model | Unit | Rate |
|---|---|---|
| Claude 3.7 Sonnet | 1M input tokens | $3 |
| Claude 3.7 Sonnet | 1K output tokens | $0.015 |
| DeepSeek R1 | 1M input tokens | $3.75 |
| DeepSeek R1 | 1K output tokens | $0.01 |
| FLUX 1.1 Pro (image) | image | $0.04 |
| FLUX.1 [schnell] (image) | image | $0.003 |
| FLUX.1 [dev] (image) | image | $0.025 |
| Ideogram v3 Quality (image) | image | $0.09 |
| Recraft V3 (image) | image | $0.04 |
| Wan 2.1 (480p video) | second | $0.09 |
| Wan 2.1 (720p video) | second | $0.25 |
- Public models billed per prediction (token, image, or second)
- Custom/private models billed per second of hardware time
- A100 GPU: $0.00140/sec; H100: $0.001525/sec
Compare Replicate vs Alternatives
Before committing to Replicate, compare pricing with these 3 alternatives in the same category.
Individuals and small teams getting started with AI calendar management and basic scheduling
Compare pricingIndividual users testing basic writing assistance features
Compare pricingIndividual professionals and small teams needing AI-powered scheduling and task management
Compare pricingWhat Companies Actually Pay for Replicate
Replicate Year 1 Total Cost by Company Size
Real deployment costs including licenses, implementation, training, and admin — not just the sticker price.
Running image generator finetuning on Replicate's serverless versus alternatives shows the cost difference. Replicate charges $1/minute for workloads that could run on a single H100.
HN discussion on finetuning costs
How Replicate Pricing Compares
| Software | Starting Price | Top Price |
|---|---|---|
| Replicate | Custom | Custom |
| Clockwise | $7.75/user/month | $7.75/user/month |
| Grammarly Business | $12/user/month | $30/user/month |
| Motion | $19/user/month | $49/user/month |
| Notion AI | Free | $20/user/month |
| OpenAI | Free | $200/month |
How to Negotiate Replicate Pricing
Replicate contracts are negotiable. These 2 tactics are sourced from real buyer experiences and procurement specialists.
Instead of using Replicate's serverless offering, rent raw compute via Runpod or similar providers. An A100 on Runpod is $1.64/hr in Secure Cloud or $0.49/hr in Community Cloud, versus over $5/hr on Replicate.
HN discussion comparing GPU providersIf you're willing to take some risk of boxes disappearing and don't need much security, Runpod's Community Cloud offers significantly cheaper rates than Replicate's managed service.
HN user comparing pricing modelsReplicate Pricing FAQ
01 How does Replicate's pricing compare to alternatives like Runpod?
Replicate is significantly more expensive than raw compute providers. An A100 GPU costs over $5/hr on Replicate versus $1.64/hr on Runpod's Secure Cloud (about 3x more). Replicate's serverless pricing can reach $1/minute, which is over 20x the cost of equivalent compute on Runpod. The premium pays for convenience and managed infrastructure, but costs add up quickly for sustained workloads.
02 Is Replicate's serverless pricing worth the cost?
Replicate's serverless model charges a significant premium over renting GPUs directly. At $1/minute for some workloads, users report this is 'unreasonably expensive' and over 20x the cost of running equivalent compute on platforms like Runpod. The convenience may be worth it for occasional use or prototyping, but actual users confirm 'the pricing part it can get expensive' for regular production workloads.
Is this pricing incorrect? — we'll verify and update it.