Quick Answer

As of April 2026, Cerebrium's platform fee ranges from free to $100 per month across 3 plans: Hobby (free), Standard ($100/month), and Enterprise (custom pricing, available on request). Your total cost depends on your chosen tier and compute usage and, for Enterprise, on contract length and negotiated discounts.

  • Free tier: Yes

Cerebrium offers 3 pricing tiers: Hobby, Standard, and Enterprise. The Hobby plan is free, and the Standard plan costs $100/month. The Standard plan is aimed at production teams running continuous inference workloads that need higher concurrency and compliance.

Compared to other AI model hosting and inference software, Cerebrium is positioned at the mid-market price point.

How much does Cerebrium cost?

Cerebrium offers 3 pricing plans, starting with a free tier and scaling to custom enterprise pricing: Hobby (free), Standard ($100/month), and Enterprise (custom pricing).

Cerebrium Pricing Overview

Cerebrium has 3 pricing plans, including a free tier. Self-serve platform fees range from $0 to $100/month. The Hobby plan is free and is best for individual developers and hobbyists experimenting with serverless ML inference. The Standard plan costs $100/month and is best for production teams running continuous inference workloads that need higher concurrency and compliance. The Enterprise plan requires contacting sales for a custom quote and is designed for large-scale inference workloads requiring enterprise compliance, dedicated support, and unlimited capacity.

This pricing was last verified on April 13, 2026 from 2 independent sources.

Cerebrium is a serverless GPU inference platform for deploying ML models without managing infrastructure. It bills per second for GPU, CPU, and memory usage, so teams only pay for active inference time. The Hobby plan has no monthly fee; the Standard plan costs $100/month and unlocks unlimited apps and 30 concurrent GPUs. Enterprise is required for H100 and A100 access. Cerebrium is a Y Combinator company.

All Cerebrium Plans & Pricing

Plan | Key limits | Monthly | Annual | Best For
Hobby | Deployed apps: 3; user seats: 3 | Free | Custom | Individual developers and hobbyists experimenting with serverless ML inference
Standard | Container concurrency: 1000; GPU concurrency: 30 | $100/month | Custom | Production teams running continuous inference workloads needing higher concurrency and compliance
Enterprise | GPU concurrency: Unlimited; container concurrency: Unlimited | Contact Sales | Contact Sales | Large-scale inference workloads requiring enterprise compliance, dedicated support, and unlimited capacity

Hobby

  • No monthly platform fee
  • Pay-as-you-go GPU compute (per second billing)
  • All GPU types available (T4, L4, A10, L40s, A100, H100, H200, B200)
  • T4 GPU: $0.000164/s (~$0.59/hr)
  • L4 GPU: $0.000222/s (~$0.80/hr)
  • A10 GPU: $0.000306/s (~$1.10/hr)
  • L40s GPU: $0.000542/s (~$1.95/hr)
  • A100 (40GB): $0.000555/s (~$2.00/hr)
  • A100 (80GB): $0.000583/s (~$2.10/hr)
  • H100 GPU: $0.000944/s (~$3.40/hr)
  • H200 GPU: $0.001166/s (~$4.20/hr)
  • B200 GPU: $0.00167/s (~$6.01/hr)
  • Up to 3 deployed apps
  • 3 user seats
  • 500 container concurrency
  • 5 concurrent GPUs
  • 7-day log retention
  • Real-time observability
  • Community support
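
The hourly figures in the list above follow from multiplying each per-second rate by 3,600. A minimal Python sketch of that conversion, using only the rates quoted above; the names `GPU_RATES_PER_SECOND` and `hourly_rate` are illustrative and not part of any Cerebrium API:

```python
# Per-second GPU rates in USD, copied from the Hobby plan list above.
GPU_RATES_PER_SECOND = {
    "T4": 0.000164,
    "L4": 0.000222,
    "A10": 0.000306,
    "L40s": 0.000542,
    "A100 (40GB)": 0.000555,
    "A100 (80GB)": 0.000583,
    "H100": 0.000944,
    "H200": 0.001166,
    "B200": 0.00167,
}

SECONDS_PER_HOUR = 3600


def hourly_rate(gpu: str) -> float:
    """Return the approximate hourly cost in USD, rounded to cents."""
    return round(GPU_RATES_PER_SECOND[gpu] * SECONDS_PER_HOUR, 2)


if __name__ == "__main__":
    for gpu in GPU_RATES_PER_SECOND:
        print(f"{gpu}: ~${hourly_rate(gpu):.2f}/hr")
```

Because billing is per second, the hourly number is only a convenience for comparison with hourly-priced GPU instances; actual charges accrue on the per-second rate.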

Standard

  • $100/month platform fee
  • Everything in Hobby
  • Unlimited deployed apps
  • 10 user seats
  • 1000 container concurrency
  • 30 concurrent GPUs
  • Custom domains
  • 30-day log retention
  • SOC2 compliance
  • Private Slack support

Enterprise

  • Everything in Standard
  • Unlimited concurrent GPUs
  • Unlimited container concurrency
  • Volume compute discounts
  • Dedicated Slack support
  • White glove onboarding
  • ML engineering services
  • Unlimited log retention
  • HIPAA, GDPR, ISO 27001 compliance
  • Custom seat allocation

How Cerebrium Pricing Compares

Software | Starting Price | Top Price
Cerebrium | Free | $100/month
Baseten | Free | $6,500/month
BentoML | Free | $5,000/month
Banana.dev | Custom | Custom

Cerebrium Pricing FAQ

01 How much does Cerebrium cost?

Cerebrium has two self-serve tiers: Hobby (no monthly fee, pay-as-you-go compute) and Standard ($100/month plus compute). GPU compute is billed per second: a T4 GPU costs approximately $0.000164/second (~$0.59/hour), an L4 costs ~$0.000222/second (~$0.80/hour), and an A10 costs ~$0.000306/second (~$1.10/hour). Enterprise pricing is custom for H100+ access.

02 Does Cerebrium have a free plan?

Yes. The Hobby plan has no monthly platform fee — you only pay for the GPU, CPU, and memory you consume, billed per second. New accounts also receive up to $1,000 in free onboarding credits. The Hobby plan is limited to 3 deployed apps, 3 user seats, and standard GPU types (T4, L4, A10, L40s).

03 What GPUs does Cerebrium support?

Cerebrium supports T4, L4, A10, L40s, and AWS Trainium on the Hobby plan. The Standard plan ($100/month) adds A100 40GB and 80GB. The Enterprise plan unlocks H100, H200, B200, and B300 GPUs with up to 8-GPU configurations.

04 How does Cerebrium serverless billing work?

Cerebrium charges separately for GPU time, CPU vCPU-seconds, memory GB-seconds, and persistent storage. You only pay while your app is actively processing requests — idle time between requests is not billed. This makes it cost-effective for bursty workloads compared to dedicated GPU instances.
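
As a rough illustration of how per-second billing adds up, the sketch below estimates a monthly bill from active GPU time plus the flat platform fee. It uses the T4, L4, and A10 rates and the $100 Standard fee quoted on this page. CPU, memory, and storage rates are not listed here, so they are folded into a single optional `other_usage_usd` amount that defaults to zero. The function and parameter names are hypothetical and do not come from any Cerebrium SDK.

```python
# Flat monthly platform fees in USD, as quoted on this page.
PLATFORM_FEE_USD = {"hobby": 0.0, "standard": 100.0}

# Per-second GPU rates in USD, as quoted on this page.
GPU_RATE_PER_SECOND_USD = {"T4": 0.000164, "L4": 0.000222, "A10": 0.000306}


def estimate_monthly_cost(
    plan: str,
    gpu: str,
    requests_per_month: int,
    seconds_per_request: float,
    other_usage_usd: float = 0.0,  # assumption: CPU, memory, storage supplied by the caller
) -> float:
    """Estimate a monthly bill: platform fee + active GPU seconds + other usage."""
    # Only time spent actively processing requests is counted as billable.
    active_seconds = requests_per_month * seconds_per_request
    gpu_cost = active_seconds * GPU_RATE_PER_SECOND_USD[gpu]
    return PLATFORM_FEE_USD[plan] + gpu_cost + other_usage_usd


if __name__ == "__main__":
    for plan in PLATFORM_FEE_USD:
        cost = estimate_monthly_cost(plan, "T4", 100_000, 2.0)
        print(f"{plan}: ${cost:.2f}")
```

For example, 100,000 requests at 2 seconds each on a T4 come to 200,000 × $0.000164 = $32.80 of GPU time, so the estimate is $32.80 on Hobby and $132.80 on Standard before CPU, memory, and storage charges.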
