Quick Answer
Last verified:
High confidence

Google Cloud Text-to-Speech costs $0.00 to $0.00 per 1 million text tokens as of June 2026, with 8 plans available. Plans: Chirp 3: HD voices at $0.00003/1 million text tokens, Instant custom voice at $0.00006/1 million text tokens, WaveNet voices at $0.000004/1 million text tokens, Studio voices at $0.00016/1 million text tokens, Standard voices at $0.000004/1 million text tokens, Neural2 voices at $0.000016/1 million text tokens, and Polyglot (Preview) voices at $0.000016/1 million text tokens. Enterprise pricing is available on request. Pricing depends on your chosen tier, contract length, and negotiated discounts.

Use the interactive pricing calculator to estimate your exact cost based on team size and requirements.

  • Free tier: No free tier available

Google Cloud Text-to-Speech offers 8 pricing tiers: Gemini 2.5 Flash TTS, Chirp 3: HD voices, Instant custom voice, WaveNet voices, Studio voices, Standard voices, Neural2 voices, Polyglot (Preview) voices. Paid plans include Chirp 3: HD voices at $0.00003/character, Instant custom voice at $0.00006/character, WaveNet voices at $0.000004/character.

Google Cloud Text-to-Speech true cost runs 70% above the listed $0.000004-$0.00016/1 million text tokens price as of June 2026. For a 25-person team, expect ~$0 in year-one costs vs the $0.005 base license. Verified from 1 sources by CostBench.

Hidden Costs Breakdown

Example: True Cost for 25 Users

License (25 × $0.000016 × 12) $0.005/yr
Implementation (one-time) +$15,000–$50,000
Premium Support (20%) +$0/yr
Training (25 × $500) +$12,500
Admin (part-time) +$15,000–$25,000/yr
Estimated Year 1 Total ~$0
That's roughly 1.7× the advertised license price.