Google Cloud Text-to-Speech Pricing 2026
Complete pricing guide with plans, and cost analysis
Google Cloud Text-to-Speech costs $0.00/1 million text tokens for WaveNet voices. Plans range from $0.00 to $0.00/1 million text tokens.
Are you Google Cloud Text-to-Speech? Claim this profile
All Google Cloud Text-to-Speech Plans & Pricing
| Plan | Monthly | Annual | Best For |
|---|---|---|---|
| Gemini 2.5 Flash TTS | Contact Sales | Contact Sales | — |
| What's included at Gemini 2.5 Flash TTS Feature details not yet documented for this tier. | |||
| Chirp 3: HD voices | $0.00003 /character | — | — |
| What's included at Chirp 3: HD voices Feature details not yet documented for this tier. | |||
| Instant custom voice | $0.00006 /character | — | — |
| What's included at Instant custom voice Feature details not yet documented for this tier. | |||
| WaveNet voices | $0.000004 /character | — | — |
| What's included at WaveNet voices Feature details not yet documented for this tier. | |||
| Studio voices | $0.00016 /character | — | — |
| What's included at Studio voices Feature details not yet documented for this tier. | |||
| Standard voices | $0.000004 /character | — | — |
| What's included at Standard voices Feature details not yet documented for this tier. | |||
| Neural2 voices | $0.000016 /character | — | — |
| What's included at Neural2 voices Feature details not yet documented for this tier. | |||
| Polyglot (Preview) voices | $0.000016 /character | — | — |
| What's included at Polyglot (Preview) voices Feature details not yet documented for this tier. | |||
View all features by plan (compare side-by-side)
Gemini 2.5 Flash TTS
Chirp 3: HD voices
Instant custom voice
WaveNet voices
Studio voices
Standard voices
Neural2 voices
Polyglot (Preview) voices
What does Google Cloud Text-to-Speech actually cost you?
Drag the slider. Pick a tier. Watch your projected spend update live.
List price by tier (annualized, per seat)
Per-seat list price across Google Cloud Text-to-Speech's plans, annualized. Custom-priced tiers show a hatched bar.
Google Cloud Text-to-Speech costs $0.00 to $0.00 per 1 million text tokens as of June 2026, with 8 plans available. Plans: Chirp 3: HD voices at $0.00003/1 million text tokens, Instant custom voice at $0.00006/1 million text tokens, WaveNet voices at $0.000004/1 million text tokens, Studio voices at $0.00016/1 million text tokens, Standard voices at $0.000004/1 million text tokens, Neural2 voices at $0.000016/1 million text tokens, and Polyglot (Preview) voices at $0.000016/1 million text tokens. Enterprise pricing is available on request. Pricing depends on your chosen tier, contract length, and negotiated discounts.
Use the interactive pricing calculator to estimate your exact cost based on team size and requirements.
- Free tier: No free tier available
Google Cloud Text-to-Speech offers 8 pricing tiers: Gemini 2.5 Flash TTS, Chirp 3: HD voices, Instant custom voice, WaveNet voices, Studio voices, Standard voices, Neural2 voices, Polyglot (Preview) voices. Paid plans include Chirp 3: HD voices at $0.00003/character, Instant custom voice at $0.00006/character, WaveNet voices at $0.000004/character.
Compared to other voice apis (text-to-speech & voice cloning) software, Google Cloud Text-to-Speech is positioned at the budget-friendly price point.
How much does Google Cloud Text-to-Speech cost?
Google Cloud Text-to-Speech Pricing Overview
Google Cloud Text-to-Speech has 8 pricing plans ranging from $0.00 to $0.00/1 million text tokens. The Gemini 2.5 Flash TTS plan requires contacting sales for a custom quote. The Chirp 3: HD voices plan costs $0.00/1 million text tokens. The Instant custom voice plan costs $0.00/1 million text tokens. The WaveNet voices plan costs $0.00/1 million text tokens. The Studio voices plan costs $0.00/1 million text tokens. The Standard voices plan costs $0.00/1 million text tokens. The Neural2 voices plan costs $0.00/1 million text tokens. The Polyglot (Preview) voices plan costs $0.00/1 million text tokens.
This pricing was last verified in June 14, 2026 from 1 independent source.
Google Cloud Text-to-Speech offers various pricing tiers based on voice type and usage. Pricing is calculated per character for most voice types, with specific rates for different voice qualities and features.
How Google Cloud Text-to-Speech Pricing Compares
| Software | Starting Price | Top Price |
|---|---|---|
| Google Cloud Text-to-Speech | $0.000004/1 million text tokens | $0.00016/1 million text tokens |
| Cartesia | Free | $299/month |
| PlayHT | Free | $499/month |
| Rime | Free | $40/month |
| LMNT | Free | $199/month |
| Hume AI | Free | $500/month |
Browse all Voice APIs (Text-to-Speech & Voice Cloning) pricing →
Google Cloud Text-to-Speech Pricing FAQ
01 What are the different voice types and their pricing?
Standard voices and WaveNet voices are priced at $0.000004 per character. Neural2 voices and Polyglot (Preview) voices are priced at $0.000016 per character. Chirp 3: HD voices are priced at $0.00003 per character, Instant custom voice at $0.00006 per character, and Studio voices at $0.00016 per character.
02 Is there a plan for custom voice models?
Yes, there is an 'Instant custom voice' tier priced at $0.00006 per character.
03 What is the pricing for Gemini 2.5 Flash TTS?
Gemini 2.5 Flash TTS has custom pricing and is billed per 1 million text tokens.
Is this pricing incorrect? — we'll verify and update it.