All Google Cloud Text-to-Speech Plans & Pricing

Plan Monthly Annual Best For
View all features by plan (compare side-by-side)

Gemini 2.5 Flash TTS

Chirp 3: HD voices

Instant custom voice

WaveNet voices

Studio voices

Standard voices

Neural2 voices

Polyglot (Preview) voices

Compare Google Cloud Text-to-Speech with alternativesAdjust seats, lock a tier, add up to 2 more products side-by-side. Shareable URL.
Live calculator

What does Google Cloud Text-to-Speech actually cost you?

Drag the slider. Pick a tier. Watch your projected spend update live.

Tier
Billing
Your projected cost$0.001per month · $0.00003/seat × 25 seats
Year 1 license$0.00912 months at this rate
At a glance

List price by tier (annualized, per seat)

Per-seat list price across Google Cloud Text-to-Speech's plans, annualized. Custom-priced tiers show a hatched bar.

Gemini 2.5 Flash TTSCustom
Chirp 3: HD voices$0/yr
Instant custom voice$0.001/yr
WaveNet voices$0/yr
Studio voices$0.002/yr
Standard voices$0/yr
Neural2 voices$0/yr
Polyglot (Preview) voices$0/yr
Quick Answer
Last verified:
High confidence

Google Cloud Text-to-Speech costs $0.00 to $0.00 per 1 million text tokens as of June 2026, with 8 plans available. Plans: Chirp 3: HD voices at $0.00003/1 million text tokens, Instant custom voice at $0.00006/1 million text tokens, WaveNet voices at $0.000004/1 million text tokens, Studio voices at $0.00016/1 million text tokens, Standard voices at $0.000004/1 million text tokens, Neural2 voices at $0.000016/1 million text tokens, and Polyglot (Preview) voices at $0.000016/1 million text tokens. Enterprise pricing is available on request. Pricing depends on your chosen tier, contract length, and negotiated discounts.

Use the interactive pricing calculator to estimate your exact cost based on team size and requirements.

  • Free tier: No free tier available

Google Cloud Text-to-Speech offers 8 pricing tiers: Gemini 2.5 Flash TTS, Chirp 3: HD voices, Instant custom voice, WaveNet voices, Studio voices, Standard voices, Neural2 voices, Polyglot (Preview) voices. Paid plans include Chirp 3: HD voices at $0.00003/character, Instant custom voice at $0.00006/character, WaveNet voices at $0.000004/character.

Compared to other voice apis (text-to-speech & voice cloning) software, Google Cloud Text-to-Speech is positioned at the budget-friendly price point.

How much does Google Cloud Text-to-Speech cost?

Google Cloud Text-to-Speech pricing starts at $0.00/1 million text tokens across 8 plans, with enterprise pricing available on request. Plans include Gemini 2.5 Flash TTS (custom pricing), Chirp 3: HD voices at $0.00/1 million text tokens, Instant custom voice at $0.00/1 million text tokens, WaveNet voices at $0.00/1 million text tokens, Studio voices at $0.00/1 million text tokens, Standard voices at $0.00/1 million text tokens, Neural2 voices at $0.00/1 million text tokens, Polyglot (Preview) voices at $0.00/1 million text tokens.

Google Cloud Text-to-Speech Pricing Overview

Google Cloud Text-to-Speech has 8 pricing plans ranging from $0.00 to $0.00/1 million text tokens. The Gemini 2.5 Flash TTS plan requires contacting sales for a custom quote. The Chirp 3: HD voices plan costs $0.00/1 million text tokens. The Instant custom voice plan costs $0.00/1 million text tokens. The WaveNet voices plan costs $0.00/1 million text tokens. The Studio voices plan costs $0.00/1 million text tokens. The Standard voices plan costs $0.00/1 million text tokens. The Neural2 voices plan costs $0.00/1 million text tokens. The Polyglot (Preview) voices plan costs $0.00/1 million text tokens.

This pricing was last verified in June 14, 2026 from 1 independent source.

Google Cloud Text-to-Speech offers various pricing tiers based on voice type and usage. Pricing is calculated per character for most voice types, with specific rates for different voice qualities and features.

How Google Cloud Text-to-Speech Pricing Compares

Software Starting Price Top Price
Google Cloud Text-to-Speech $0.000004/1 million text tokens $0.00016/1 million text tokens
Cartesia Free $299/month
PlayHT Free $499/month
Rime Free $40/month
LMNT Free $199/month
Hume AI Free $500/month

Google Cloud Text-to-Speech Pricing FAQ

01 What are the different voice types and their pricing?

Standard voices and WaveNet voices are priced at $0.000004 per character. Neural2 voices and Polyglot (Preview) voices are priced at $0.000016 per character. Chirp 3: HD voices are priced at $0.00003 per character, Instant custom voice at $0.00006 per character, and Studio voices at $0.00016 per character.

02 Is there a plan for custom voice models?

Yes, there is an 'Instant custom voice' tier priced at $0.00006 per character.

03 What is the pricing for Gemini 2.5 Flash TTS?

Gemini 2.5 Flash TTS has custom pricing and is billed per 1 million text tokens.

Is this pricing incorrect? — we'll verify and update it.