Cerebrium Pricing 2026
Complete pricing guide with plans and cost analysis
Cerebrium has a free plan. The only paid plan with a published price is Standard at $100/month; Enterprise pricing is custom.
Cerebrium platform fees range from $0 to $100 per month as of April 2026, with 3 plans available including a free tier. Plans: Hobby (free), Standard ($100/month), and Enterprise (pricing available on request). Pricing depends on your chosen tier, contract length, and negotiated discounts.
Cerebrium offers 3 pricing tiers: Hobby, Standard, and Enterprise. A free plan is available. The only paid plan with a published price is Standard at $100/month, which is aimed at production teams running continuous inference workloads that need higher concurrency and compliance.
Compared to other AI model hosting and inference software, Cerebrium is positioned at the mid-market price point.
How much does Cerebrium cost?
Cerebrium Pricing Overview
Cerebrium has 3 pricing plans, including a free tier. Published platform fees range from $0 to $100/month. The Hobby plan is free and is best for individual developers and hobbyists experimenting with serverless ML inference. The Standard plan costs $100/month and is best for production teams running continuous inference workloads that need higher concurrency and compliance. The Enterprise plan requires contacting sales for a custom quote and is designed for large-scale inference workloads requiring enterprise compliance, dedicated support, and unlimited capacity.
This pricing was last verified on April 13, 2026 from 2 independent sources.
Cerebrium is a serverless GPU inference platform for deploying ML models without managing infrastructure. It bills per second for GPU, CPU, and memory usage, so teams only pay for active inference time. The Hobby plan has no monthly fee; the Standard plan costs $100/month and unlocks unlimited apps and 30 concurrent GPUs. Enterprise is required for H100 and A100 access. Cerebrium is a Y Combinator company.
All Cerebrium Plans & Pricing
| Plan | Monthly | Annual | Best For |
|---|---|---|---|
| Hobby (deployed apps: 3; user seats: 3) | Free | Custom | Individual developers and hobbyists experimenting with serverless ML inference |
| Standard (container concurrency: 1000; GPU concurrency: 30) | $100/month | Custom | Production teams running continuous inference workloads needing higher concurrency and compliance |
| Enterprise (GPU concurrency: unlimited; container concurrency: unlimited) | Contact Sales | Contact Sales | Large-scale inference workloads requiring enterprise compliance, dedicated support, and unlimited capacity |
All features by plan
Hobby
- No monthly platform fee
- Pay-as-you-go GPU compute (per second billing)
- All GPU types available (T4, L4, A10, L40s, A100, H100, H200, B200)
- T4 GPU: $0.000164/s (~$0.59/hr)
- L4 GPU: $0.000222/s (~$0.80/hr)
- A10 GPU: $0.000306/s (~$1.10/hr)
- L40s GPU: $0.000542/s (~$1.95/hr)
- A100 (40GB): $0.000555/s (~$2.00/hr)
- A100 (80GB): $0.000583/s (~$2.10/hr)
- H100 GPU: $0.000944/s (~$3.40/hr)
- H200 GPU: $0.001166/s (~$4.20/hr)
- B200 GPU: $0.00167/s (~$6.01/hr)
- Up to 3 deployed apps
- 3 user seats
- 500 container concurrency
- 5 concurrent GPUs
- 7-day log retention
- Real-time observability
- Community support
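The per-second GPU rates listed above can be turned into hourly figures and usage estimates with simple arithmetic. The following is a minimal Python sketch, not an official Cerebrium tool: the dictionary keys and the function names `hourly_rate` and `gpu_cost` are illustrative, and the estimate covers GPU time only, since CPU, memory, and storage rates are not listed here.

```python
# Per-second GPU rates in USD, copied from the Hobby plan list above.
# The dictionary keys are illustrative labels, not official identifiers.
GPU_RATE_PER_SECOND = {
    "T4": 0.000164,
    "L4": 0.000222,
    "A10": 0.000306,
    "L40s": 0.000542,
    "A100-40GB": 0.000555,
    "A100-80GB": 0.000583,
    "H100": 0.000944,
    "H200": 0.001166,
    "B200": 0.00167,
}

SECONDS_PER_HOUR = 3600


def hourly_rate(gpu: str) -> float:
    """Convert the listed per-second rate into an hourly rate."""
    return GPU_RATE_PER_SECOND[gpu] * SECONDS_PER_HOUR


def gpu_cost(gpu: str, active_seconds: float, gpu_count: int = 1) -> float:
    """Estimate the GPU-only charge for a number of billed seconds.

    CPU, memory, and storage charges are not included.
    """
    if active_seconds < 0 or gpu_count < 1:
        raise ValueError("active_seconds must be >= 0 and gpu_count >= 1")
    return GPU_RATE_PER_SECOND[gpu] * active_seconds * gpu_count


if __name__ == "__main__":
    for name in GPU_RATE_PER_SECOND:
        print(f"{name}: ${hourly_rate(name):.2f}/hr")
    # Two active hours per day on one A10 for 30 days.
    print(f"A10 example: ${gpu_cost('A10', 2 * 3600 * 30):.2f}")
```

As a worked example, one A10 that is active for two hours a day over 30 days accrues 216,000 billed seconds, which comes to about $66.10 in GPU charges at the listed rate.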
Standard
- $100/month platform fee
- Everything in Hobby
- Unlimited deployed apps
- 10 user seats
- 1000 container concurrency
- 30 concurrent GPUs
- Custom domains
- 30-day log retention
- SOC2 compliance
- Private Slack support
Enterprise
- Everything in Standard
- Unlimited concurrent GPUs
- Unlimited container concurrency
- Volume compute discounts
- Dedicated Slack support
- White glove onboarding
- ML engineering services
- Unlimited log retention
- HIPAA, GDPR, ISO 27001 compliance
- Custom seat allocation
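The plan limits listed above can be encoded as data to find the lowest tier that fits a given workload. This is a rough Python sketch under stated assumptions: the `Plan` class and `smallest_plan` function are hypothetical names, Enterprise "custom seat allocation" is treated as unbounded, and feature requirements such as custom domains or compliance certifications are not modelled.

```python
import math
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Plan:
    name: str
    monthly_fee: Optional[float]  # None means "contact sales"
    max_apps: float
    max_seats: float
    max_container_concurrency: float
    max_concurrent_gpus: float


# Limits taken from the feature lists above; math.inf stands for "unlimited".
# Assumption: Enterprise "custom seat allocation" is treated as unbounded.
PLANS = [
    Plan("Hobby", 0.0, 3, 3, 500, 5),
    Plan("Standard", 100.0, math.inf, 10, 1000, 30),
    Plan("Enterprise", None, math.inf, math.inf, math.inf, math.inf),
]


def smallest_plan(apps: int, seats: int, container_concurrency: int,
                  concurrent_gpus: int) -> Plan:
    """Return the first (lowest) plan whose limits cover the requirements."""
    for plan in PLANS:
        if (apps <= plan.max_apps
                and seats <= plan.max_seats
                and container_concurrency <= plan.max_container_concurrency
                and concurrent_gpus <= plan.max_concurrent_gpus):
            return plan
    # Not reachable with the table above, since Enterprise is unbounded.
    raise ValueError("no plan satisfies the requirements")


if __name__ == "__main__":
    plan = smallest_plan(apps=5, seats=4, container_concurrency=200,
                         concurrent_gpus=8)
    fee = "contact sales" if plan.monthly_fee is None else f"${plan.monthly_fee:.0f}/month"
    print(plan.name, fee)
```

The platform fee returned here is in addition to per-second compute charges, which apply on every plan.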
How Cerebrium Pricing Compares
| Software | Starting Price | Top Price |
|---|---|---|
| Cerebrium | Free | $100/month |
| Baseten | Free | $6500/month |
| BentoML | Free | $5000/month |
| Banana.dev | Custom | Custom |
Cerebrium Pricing FAQ
01 How much does Cerebrium cost?
Cerebrium has two tiers with published pricing: Hobby (no monthly fee, pay-as-you-go compute) and Standard ($100/month plus compute). GPU compute is billed per second: a T4 GPU costs $0.000164/second (~$0.59/hour), an L4 costs $0.000222/second (~$0.80/hour), and an A10 costs $0.000306/second (~$1.10/hour). Enterprise pricing is custom and covers H100+ access.
02 Does Cerebrium have a free plan?
Yes. The Hobby plan has no monthly platform fee — you only pay for the GPU, CPU, and memory you consume, billed per second. New accounts also receive up to $1,000 in free onboarding credits. The Hobby plan is limited to 3 deployed apps, 3 user seats, and standard GPU types (T4, L4, A10, L40s).
03 What GPUs does Cerebrium support?
Cerebrium supports T4, L4, A10, L40s, and AWS Trainium on the Hobby plan. The Standard plan ($100/month) adds A100 40GB and 80GB. The Enterprise plan unlocks H100, H200, B200, and B300 GPUs with up to 8-GPU configurations.
04 How does Cerebrium serverless billing work?
Cerebrium charges separately for GPU time, CPU (vCPU-seconds), memory (GB-seconds), and persistent storage. You only pay while your app is actively processing requests; idle time between requests is not billed. This makes it cost-effective for bursty workloads compared to dedicated GPU instances, which are billed whether or not they are serving traffic.
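To see why per-second billing favours bursty workloads, compare the serverless charge at a given active fraction with an always-on instance. The sketch below is illustrative only: the function names are hypothetical, the dedicated hourly price is an input you would supply yourself rather than a figure from this page, and only GPU time is counted because CPU, memory, and storage rates are not listed here.

```python
HOURS_PER_MONTH = 730  # average month length: 8760 hours / 12


def serverless_monthly_cost(rate_per_second: float, active_fraction: float,
                            hours: float = HOURS_PER_MONTH) -> float:
    """GPU-only monthly cost when billed only for the active share of time."""
    if not 0.0 <= active_fraction <= 1.0:
        raise ValueError("active_fraction must be between 0 and 1")
    return rate_per_second * 3600 * hours * active_fraction


def breakeven_active_fraction(rate_per_second: float,
                              dedicated_hourly_price: float) -> float:
    """Active fraction above which an always-on instance becomes cheaper.

    dedicated_hourly_price is a hypothetical price for a comparable
    dedicated GPU instance; it is not taken from this page.
    """
    return min(1.0, dedicated_hourly_price / (rate_per_second * 3600))


if __name__ == "__main__":
    a10_rate = 0.000306  # USD per second, from the rate list on this page
    print(f"A10 at 10% active: ${serverless_monthly_cost(a10_rate, 0.10):.2f}/month")
    # Hypothetical dedicated A10 price of $0.75/hr, for illustration only.
    print(f"Break-even active fraction: {breakeven_active_fraction(a10_rate, 0.75):.0%}")
```

With the listed A10 rate, a workload that is active 10% of the time costs roughly $80 per month in GPU charges. Against a hypothetical $0.75/hour dedicated instance, per-second billing stays cheaper until the workload is active about 68% of the time.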