Speechmatics Pricing 2026
Complete pricing guide with plans, hidden costs, and negotiation tips
Speechmatics pricing varies by team size and features, ranging from $0 to $0.006 per minute in 2026. Your actual cost depends on the tier you choose, contract length, and negotiated discounts.
Use the interactive pricing calculator to estimate your exact cost based on team size and requirements.
- Free tier: Yes
- Billing: Monthly and annual (save 15-20%)
- Hidden costs: Add ~35% for implementation, support, and training
Speechmatics offers 3 pricing tiers: Free, Pro, Enterprise. Standard paid plans include Free at $0/minute. The Pro plan is growing companies needing higher concurrency, volume discounts, and email support for production transcription workloads.
Compared to other ai transcription apis software, Speechmatics is positioned at the budget-friendly price point.
Speechmatics is a multilingual speech recognition platform specializing in speech-to-text and text-to-speech APIs with support for 55+ languages, on-premises deployment options, and per-second billing precision. Unlike competitors focused on real-time streaming (Deepgram) or rich audio intelligence (AssemblyAI), Speechmatics prioritizes multilingual accuracy, deployment flexibility (cloud, on-premises, or air-gapped), and a recurring free tier that resets monthly. The platform is used by enterprises like BBC, Sky, and Vodafone for live captioning, customer call analytics, and media transcription workflows requiring data sovereignty and compliance.
Pricing starts with a Free tier offering 480 minutes/month (8 hours/month) of speech-to-text and 1 million characters/month of text-to-speech (English only), with no credit card required. This free tier resets monthly, unlike competitors' one-time credits (Deepgram's $200, AssemblyAI's $50). The Pro tier costs $0.24/hour ($0.004/min) for speech-to-text, includes the same 480 free minutes/month, and offers automatic 20% discounts above 500 hours/month per transcription type. Enterprise plans have custom pricing (typically $25,000-$50,000+ annually) and include custom models, on-premises deployment, unlimited scale, and dedicated support.
A critical consideration: Speechmatics Pro at $0.004/min is 60% more expensive than AssemblyAI Universal ($0.0025/min) but 48% cheaper than Deepgram Nova-3 ($0.0077/min) for basic transcription. The recurring 480 minutes/month free tier (5,760 min/year) is the most generous for sustained low-volume usage, but Pro tier usage is capped at 6,000 hours/month, requiring Enterprise upgrades for higher volumes. Text-to-speech is English-only on Free and Pro tiers, with multilingual TTS requiring Enterprise pricing. On-premises deployment (Enterprise) adds significant infrastructure overhead ($2,000-$10,000/month) but is essential for data sovereignty in regulated industries.
In this 2026 pricing guide, we break down Speechmatics' three tiers (Free, Pro, Enterprise), calculate real-world costs for multilingual transcription workflows, expose hidden usage caps and infrastructure costs, and compare Speechmatics to alternatives like AssemblyAI, Deepgram, and Rev AI to help you determine if it is the most cost-effective solution for your multilingual transcription needs.
All Speechmatics Plans & Pricing
| Plan | Monthly | Annual | Best For |
|---|---|---|---|
| Free Speech-to-Text: 480 minutes/month (resets monthly)Text-to-Speech: 1 million characters/month (English only) | Free | Free 0 | Developers prototyping applications or teams with low-volume transcription needs (<8 hours/month) |
| Pro Free monthly credits: 480 minutes/month STT + 1M chars/month TTSConcurrency: 50 real-time sessions | Contact | Contact | Growing companies needing higher concurrency, volume discounts, and email support for production transcription workloads |
| Enterprise Minimum commitment: Typically $25,000-$50,000 annuallyConcurrency: Unlimited (negotiable) | Contact | Contact | Large enterprises needing custom models, on-premises deployment, unlimited scale, and dedicated support with compliance guarantees |
View all features by plan
Free
- 480 minutes/month free speech-to-text (recurring monthly)
- 55+ languages including English, Spanish, French, German, Chinese
- 1 million characters/month free text-to-speech (~20 hours, English only)
- Batch and real-time transcription modes
- 2 concurrent real-time sessions
- Per-second billing precision (no rounding up)
- No credit card required to start
- Community support via documentation
- Free tier resets monthly (unlike one-time credits)
Pro
- Base rate of $0.24/hour ($0.004/min) for speech-to-text
- 480 minutes/month free speech-to-text included (same as Free tier)
- 1 million characters/month free text-to-speech included
- 55+ languages with multilingual support
- 50 concurrent real-time sessions (vs 2 on Free)
- 10 file jobs per second for batch processing
- Per-second billing precision (billed to the second)
- 20% automatic volume discount above 500 hours/month per STT type
- Usage capped at 6,000 hours/month (contact sales for higher)
- Online email support with faster response times
- Additional discounts available for 24,000+ hours annually
Enterprise
- All Pro features with custom volume pricing
- Unlimited scale with no rate limits
- Custom-trained models for domain-specific accuracy
- On-premises deployment options (self-hosted infrastructure)
- Multi-region deployment for data residency compliance
- Private cloud or air-gapped deployment
- Highest concurrency limits (no cap)
- Dedicated Customer Success Manager
- Dedicated Solutions Engineer for integration support
- Priority support with custom SLA (99.9%+ uptime)
- Advanced security and compliance (SOC 2, GDPR, HIPAA)
- Custom data retention and deletion policies
- Flexible contract terms (annual or multi-year)
Get a custom Speechmatics quote
Enter your work email and we'll send you a detailed cost breakdown.
Frequently Asked Questions
01 How much does Speechmatics cost?
Speechmatics offers a Free tier with 480 minutes/month of speech-to-text (recurring monthly) and 1 million characters/month of text-to-speech (English only), requiring no credit card. The Pro tier costs $0.24/hour ($0.004/min) for speech-to-text, includes the same 480 free minutes/month, and offers a 20% automatic discount above 500 hours/month per transcription type. Text-to-speech on Pro is English-only; multilingual TTS requires Enterprise. Enterprise plans have custom pricing (typically $25,000-$50,000+ annually) and include custom models, on-premises deployment, unlimited scale, and dedicated support.
02 Is Speechmatics free?
Yes, Speechmatics offers a Free tier with 480 minutes/month of speech-to-text and 1 million characters/month of text-to-speech (English only), with no credit card required. This free tier resets monthly, unlike competitors' one-time credits (Deepgram's $200, AssemblyAI's $50). At 480 minutes/month, the Free tier provides 5,760 minutes/year (96 hours) of ongoing transcription. For teams processing more than 8 hours/month, the Pro tier at $0.004/min is cost-effective. For larger free testing budgets, Deepgram's $200 credit (~433 hours one-time) is more generous upfront.
03 What is Speechmatics?
Speechmatics is a speech recognition platform providing speech-to-text and text-to-speech APIs for developers. It specializes in multilingual transcription with 55+ languages, on-premises deployment options for data sovereignty, and per-second billing precision. Speechmatics differentiates itself through its recurring free tier (480 minutes/month, unlike competitors' one-time credits), automatic volume discounts (20% above 500 hours/month), and enterprise-grade deployment flexibility (cloud, on-premises, or air-gapped). The platform is used by companies like BBC, Sky, and Vodafone for live captioning, call analytics, and media transcription.
04 Speechmatics vs AssemblyAI: which is better?
Speechmatics Pro at $0.004/min ($0.24/hr) is 60% more expensive than AssemblyAI Universal at $0.0025/min ($0.15/hr) for basic transcription. However, Speechmatics includes 480 minutes/month free that resets monthly (5,760 min/year ongoing), while AssemblyAI's $50 credit is one-time (~185 hours). Speechmatics supports 55+ languages natively, while AssemblyAI focuses on English and common languages. AssemblyAI offers richer audio intelligence features (entity detection, topic detection, auto chapters), while Speechmatics excels at multilingual transcription and on-premises deployment. Choose Speechmatics for ongoing free tier, multilingual support, or on-premises needs; choose AssemblyAI for audio intelligence features and lower base pricing.
05 Speechmatics vs Deepgram: which should I choose?
Speechmatics Pro at $0.004/min is 48% cheaper than Deepgram Nova-3 at $0.0077/min for batch transcription. Speechmatics includes 480 minutes/month free that resets monthly (5,760 min/year), while Deepgram offers $200 in one-time credits (~433 hours). Deepgram excels at real-time streaming with <300ms latency, which Speechmatics also supports but with higher base pricing. Speechmatics supports 55+ languages vs Deepgram's 30+. Speechmatics offers on-premises deployment (Enterprise), while Deepgram is cloud-only. Choose Speechmatics for lower per-minute cost, multilingual support, recurring free tier, or on-premises deployment; choose Deepgram for ultra-low latency real-time streaming, larger upfront free credit, or Voice Agent API.
06 What features are included in Speechmatics pricing?
Speechmatics' Free tier includes 480 minutes/month of speech-to-text (55+ languages, batch and real-time), 1 million characters/month of text-to-speech (English only), and 2 concurrent real-time sessions. Pro tier ($0.24/hr or $0.004/min) includes the same 480 free minutes/month plus 50 concurrent real-time sessions, 10 file jobs/second for batch processing, email support, and automatic 20% discount above 500 hours/month per transcription type. Pro usage is capped at 6,000 hours/month. Enterprise includes custom models, on-premises deployment, unlimited scale, dedicated support, and custom SLA. All tiers bill per-second with no rounding.
07 Does Speechmatics charge for silence or non-speech audio?
Yes, Speechmatics charges for the full duration of submitted audio files, including silence, music, and non-speech segments, based on audio file length. However, Speechmatics bills per-second (not per-minute or per-hour), so a 37-second audio file costs exactly 37 seconds worth (37 × $0.004/min ÷ 60 = $0.00247) with no rounding up to a full minute. To minimize costs, preprocess audio to remove long silences using voice activity detection (VAD) or FFmpeg before sending to Speechmatics' API, especially if you frequently process audio with extended silence periods.
08 What is Speechmatics' refund policy?
Speechmatics operates on a usage-based billing model for Pro tier with no subscriptions or advance payments, so there are no refunds -- you are billed only for audio processed beyond the 480 free minutes/month. The Free tier includes 480 minutes/month that resets monthly with no charges. Enterprise customers with prepaid annual commitments should negotiate refund terms directly in their contracts, as prepaid credits may expire annually and are typically non-refundable for unused balances. If you encounter a service issue or are overcharged due to a bug, contact Speechmatics support to request a credit adjustment.
09 Can I use Speechmatics for free long-term?
Yes, Speechmatics offers a recurring free tier with 480 minutes/month (8 hours/month) of speech-to-text and 1 million characters/month of text-to-speech (English only) that resets monthly. This provides 5,760 minutes/year (96 hours) of ongoing transcription at no cost, making it the best free tier for sustained low-volume usage compared to competitors. Deepgram's $200 credit (~433 hours) is more generous upfront but does not refresh. AssemblyAI's $50 credit (~185 hours) is one-time. For teams processing under 8 hours/month long-term, Speechmatics Free tier is the most cost-effective option. For production workloads exceeding 8 hours/month, Pro tier at $0.004/min is competitive.