Speechmatics Pricing 2026
Complete pricing guide with plans, hidden costs, and cost analysis
Speechmatics uses custom pricing — contact their sales team for a quote.
Speechmatics uses custom pricing as of May 2026 with 3 plans available. Contact Speechmatics directly for a personalized quote. Plan: Free (free). Enterprise pricing is available on request. Pricing depends on your chosen tier, contract length, and negotiated discounts.
Use the interactive pricing calculator to estimate your exact cost based on team size and requirements.
- Free tier: Yes
Speechmatics offers 3 pricing tiers: Free, Pro, Enterprise. The Pro plan is growing companies needing higher concurrency, volume discounts, and bolt-on features (translation, summaries, sentiment) for production transcription workloads.
Compared to other ai transcription apis software, Speechmatics is positioned at the budget-friendly price point.
- 6 documented hidden costs beyond list price
How much does Speechmatics cost?
Speechmatics Pricing Overview
Speechmatics uses custom pricing — contact their sales team for a quote. The Free plan is free and is best for developers prototyping applications or teams with low-volume transcription needs (<8 hours/month combined real-time + batch). The Pro plan requires contacting sales for a custom quote and is designed for growing companies needing higher concurrency, volume discounts, and bolt-on features (translation, summaries, sentiment) for production transcription workloads. The Enterprise plan requires contacting sales for a custom quote and is designed for large enterprises needing custom models, on-premises/air-gapped deployment, unlimited scale, audio alignment, custom voice/language development, and dedicated support.
There are at least 6 documented hidden costs beyond Speechmatics's list price, including implementation, training, and add-on fees.
This pricing was last verified in April 30, 2026 from 5 independent sources.
Speechmatics is a multilingual speech recognition platform specializing in speech-to-text and text-to-speech APIs with support for 55+ languages, on-premises deployment options, and per-second billing precision. Unlike competitors focused on real-time streaming (Deepgram) or rich audio intelligence (AssemblyAI), Speechmatics prioritizes multilingual accuracy, deployment flexibility (cloud, on-premises, or air-gapped), and a recurring free tier that resets monthly. The platform is used by enterprises like BBC, Sky, and Vodafone for live captioning, customer call analytics, and media transcription workflows requiring data sovereignty and compliance.
Pricing starts with a Free tier offering 480 minutes/month (8 hours/month) of speech-to-text and 1 million characters/month of text-to-speech (English only), with no credit card required. This free tier resets monthly, unlike competitors' one-time credits (Deepgram's $200, AssemblyAI's $50). The Pro tier costs $0.24/hour ($0.004/min) for speech-to-text, includes the same 480 free minutes/month, and offers automatic 20% discounts above 500 hours/month per transcription type. Enterprise plans have custom pricing (typically $25,000-$50,000+ annually) and include custom models, on-premises deployment, unlimited scale, and dedicated support.
A critical consideration: Speechmatics Pro at $0.004/min is 60% more expensive than AssemblyAI Universal ($0.0025/min) but 48% cheaper than Deepgram Nova-3 ($0.0077/min) for basic transcription. The recurring 480 minutes/month free tier (5,760 min/year) is the most generous for sustained low-volume usage, but Pro tier usage is capped at 6,000 hours/month, requiring Enterprise upgrades for higher volumes. Text-to-speech is English-only on Free and Pro tiers, with multilingual TTS requiring Enterprise pricing. On-premises deployment (Enterprise) adds significant infrastructure overhead ($2,000-$10,000/month) but is essential for data sovereignty in regulated industries.
In this 2026 pricing guide, we break down Speechmatics' three tiers (Free, Pro, Enterprise), calculate real-world costs for multilingual transcription workflows, expose hidden usage caps and infrastructure costs, and compare Speechmatics to alternatives like AssemblyAI, Deepgram, and Rev AI to help you determine if it is the most cost-effective solution for your multilingual transcription needs.
How Speechmatics Pricing Compares
Compare Speechmatics pricing against top alternatives in AI Transcription APIs.
All Speechmatics Plans & Pricing
| Plan | Monthly | Annual | Best For |
|---|---|---|---|
| Free Real-time STT: 240 minutes/month (resets monthly)Batch STT: 240 minutes/month (resets monthly) | Free | Free | Developers prototyping applications or teams with low-volume transcription needs (<8 hours/month combined real-time + batch) |
| Pro Free monthly credits: 240 min Real-time STT + 240 min Batch STT + 1M chars TTSConcurrency: 50 real-time sessions | Contact Sales | Contact Sales | Growing companies needing higher concurrency, volume discounts, and bolt-on features (translation, summaries, sentiment) for production transcription workloads |
| Enterprise Concurrency: Unlimited (negotiable)Batch rate: Unlimited (negotiable) | Contact Sales | Contact Sales | Large enterprises needing custom models, on-premises/air-gapped deployment, unlimited scale, audio alignment, custom voice/language development, and dedicated support |
View all features by plan
Free
- 240 minutes/month free Real-time Speech-to-Text (resets monthly)
- 240 minutes/month free Batch Speech-to-Text (resets monthly)
- 1 million characters/month free Text-to-Speech (~20 hours, English only)
- 55+ languages including English, Spanish, French, German, Mandarin
- 2 concurrent real-time sessions
- 1 batch job per second
- 3 concurrent Voice Agent conversations
- Per-second billing precision (no rounding up)
- No credit card required to start
- Multi-region cloud (US, EU, or Australia)
- Standard and Enhanced accuracy models
- Speaker diarization, custom dictionary, language identification
- Community support via documentation
Pro
- Batch Standard accuracy STT: $0.24/hr
- Batch Enhanced accuracy STT: $0.40/hr
- Real-time Standard accuracy STT: $0.24/hr
- Real-time Enhanced accuracy STT: $0.56/hr
- Text-to-Speech: $0.011 per 1,000 characters
- Translation bolt-on: $0.65/hr
- Summaries bolt-on: $0.12/hr
- Chapters bolt-on: $0.40/hr
- Sentiment bolt-on: $0.12/hr
- Topics bolt-on: $0.20/hr
- 240 minutes/month free Real-time STT included
- 240 minutes/month free Batch STT included
- 1 million characters/month free TTS included
- 55+ languages with multilingual support
- 50 concurrent real-time sessions
- 10 file jobs per second for batch processing
- 6 concurrent Voice Agent conversations
- Per-second billing precision
- 20% automatic volume discount above 500 hours/month per STT type
- 33% additional discount available with Model Training enabled
- Usage capped at 6,000 hours/month (contact sales for higher)
- Online email support
- Audio events, multi-channel support, subtitle formatting
- Additional discounts available for 24,000+ hours annually
Enterprise
- All Pro features with custom volume pricing
- Unlimited scale with no rate limits
- Unlimited real-time session concurrency
- Unlimited batch job creation rate
- Unlimited Voice Agent conversation concurrency
- Custom-trained models for domain-specific accuracy
- Custom voice development for TTS
- Custom language development
- On-premises deployment options (Container, Virtual Appliance, On-Device)
- Private Cloud deployment with GPU and CPU based models
- Multi-region cloud (US, EU, or Australia)
- Audio alignment feature (Enterprise only)
- Early access to new features and capabilities
- Dedicated Customer Success Manager
- Dedicated Solutions Engineer
- Prioritized email support with custom SLA
- Customer community access
- Best volume discounts available
Usage-Based Rates
Per-unit pricing for Speechmatics API usage.
Pro
| Item | Dimension | Unit | Rate |
|---|---|---|---|
| speechmatics-batch-standard | minute | hour | $0.240 |
| speechmatics-batch-enhanced | minute | hour | $0.400 |
| speechmatics-realtime-standard | minute | hour | $0.240 |
| speechmatics-realtime-enhanced | minute | hour | $0.560 |
| speechmatics-tts | character | 1k characters | $0.011 |
| speechmatics-translation | minute | hour | $0.650 |
| speechmatics-summaries | minute | hour | $0.120 |
| speechmatics-chapters | minute | hour | $0.400 |
| speechmatics-sentiment | minute | hour | $0.120 |
| speechmatics-topics | minute | hour | $0.200 |
- 20% automatic volume discount above 500 hours/month per STT type
- 33% additional discount available with Model Training enabled (anonymized data sharing)
- Per-second billing precision
- Pro tier capped at 6,000 hours/month combined
- Additional negotiated discounts above 24,000 hours/year
Compare Speechmatics vs Alternatives
Before committing to Speechmatics, compare pricing with these 3 alternatives in the same category.
Batch transcription with rich audio intelligence features like entity detection, topic detection, and auto chapters
Full comparisonReal-time streaming applications needing ultra-low latency and per-second billing
Full comparisonBudget-conscious batch transcription or when human transcription fallback is required
Full comparisonWhat Companies Actually Pay for Speechmatics
Speechmatics Year 1 Total Cost by Company Size
Real deployment costs including licenses, implementation, training, and admin — not just the sticker price.
A content creator transcribing 300 minutes/month of multilingual video content (English, Spanish, French) using the Free tier. Stays within 480 minutes/month free limit.
A customer support platform transcribing 1,500 hours of multilingual calls monthly using Pro tier with per-second billing and automatic 20% volume discount (exceeds 500 hours/month threshold). Processing ~18,000 hours annually.
A large enterprise transcribing 10,000 hours of multilingual media content monthly on an Enterprise plan with custom models, on-premises deployment in EU for GDPR compliance, and dedicated support. Processing ~120,000 hours annually.
How Speechmatics Pricing Compares
| Software | Starting Price | Top Price |
|---|---|---|
| Speechmatics | Custom | Custom |
| AssemblyAI | Free | $0.21/hour |
| AWS Transcribe | Free | $6.75/minute |
| Deepgram | Free | $4000/minute |
| Google Cloud Speech-to-Text | Custom | Custom |
| Whisper (OpenAI) | $0.003/minute | $0.006/minute |
Detailed pricing comparisons:
Speechmatics Pricing FAQ
01 How much does Speechmatics cost?
Speechmatics offers a Free tier with 480 minutes/month of speech-to-text (recurring monthly) and 1 million characters/month of text-to-speech (English only), requiring no credit card. The Pro tier costs $0.24/hour ($0.004/min) for speech-to-text, includes the same 480 free minutes/month, and offers a 20% automatic discount above 500 hours/month per transcription type. Text-to-speech on Pro is English-only; multilingual TTS requires Enterprise. Enterprise plans have custom pricing (typically $25,000-$50,000+ annually) and include custom models, on-premises deployment, unlimited scale, and dedicated support.
02 Is Speechmatics free?
Yes, Speechmatics offers a Free tier with 480 minutes/month of speech-to-text and 1 million characters/month of text-to-speech (English only), with no credit card required. This free tier resets monthly, unlike competitors' one-time credits (Deepgram's $200, AssemblyAI's $50). At 480 minutes/month, the Free tier provides 5,760 minutes/year (96 hours) of ongoing transcription. For teams processing more than 8 hours/month, the Pro tier at $0.004/min is cost-effective. For larger free testing budgets, Deepgram's $200 credit (~433 hours one-time) is more generous upfront.
03 What is Speechmatics?
Speechmatics is a speech recognition platform providing speech-to-text and text-to-speech APIs for developers. It specializes in multilingual transcription with 55+ languages, on-premises deployment options for data sovereignty, and per-second billing precision. Speechmatics differentiates itself through its recurring free tier (480 minutes/month, unlike competitors' one-time credits), automatic volume discounts (20% above 500 hours/month), and enterprise-grade deployment flexibility (cloud, on-premises, or air-gapped). The platform is used by companies like BBC, Sky, and Vodafone for live captioning, call analytics, and media transcription.
04 Speechmatics vs AssemblyAI: which is better?
Speechmatics Pro at $0.004/min ($0.24/hr) is 60% more expensive than AssemblyAI Universal at $0.0025/min ($0.15/hr) for basic transcription. However, Speechmatics includes 480 minutes/month free that resets monthly (5,760 min/year ongoing), while AssemblyAI's $50 credit is one-time (~185 hours). Speechmatics supports 55+ languages natively, while AssemblyAI focuses on English and common languages. AssemblyAI offers richer audio intelligence features (entity detection, topic detection, auto chapters), while Speechmatics excels at multilingual transcription and on-premises deployment. Choose Speechmatics for ongoing free tier, multilingual support, or on-premises needs; choose AssemblyAI for audio intelligence features and lower base pricing.
05 Speechmatics vs Deepgram: which should I choose?
Speechmatics Pro at $0.004/min is 48% cheaper than Deepgram Nova-3 at $0.0077/min for batch transcription. Speechmatics includes 480 minutes/month free that resets monthly (5,760 min/year), while Deepgram offers $200 in one-time credits (~433 hours). Deepgram excels at real-time streaming with <300ms latency, which Speechmatics also supports but with higher base pricing. Speechmatics supports 55+ languages vs Deepgram's 30+. Speechmatics offers on-premises deployment (Enterprise), while Deepgram is cloud-only. Choose Speechmatics for lower per-minute cost, multilingual support, recurring free tier, or on-premises deployment; choose Deepgram for ultra-low latency real-time streaming, larger upfront free credit, or Voice Agent API.
06 What features are included in Speechmatics pricing?
Speechmatics' Free tier includes 480 minutes/month of speech-to-text (55+ languages, batch and real-time), 1 million characters/month of text-to-speech (English only), and 2 concurrent real-time sessions. Pro tier ($0.24/hr or $0.004/min) includes the same 480 free minutes/month plus 50 concurrent real-time sessions, 10 file jobs/second for batch processing, email support, and automatic 20% discount above 500 hours/month per transcription type. Pro usage is capped at 6,000 hours/month. Enterprise includes custom models, on-premises deployment, unlimited scale, dedicated support, and custom SLA. All tiers bill per-second with no rounding.
07 Does Speechmatics charge for silence or non-speech audio?
Yes, Speechmatics charges for the full duration of submitted audio files, including silence, music, and non-speech segments, based on audio file length. However, Speechmatics bills per-second (not per-minute or per-hour), so a 37-second audio file costs exactly 37 seconds worth (37 × $0.004/min ÷ 60 = $0.00247) with no rounding up to a full minute. To minimize costs, preprocess audio to remove long silences using voice activity detection (VAD) or FFmpeg before sending to Speechmatics' API, especially if you frequently process audio with extended silence periods.
08 What is Speechmatics' refund policy?
Speechmatics operates on a usage-based billing model for Pro tier with no subscriptions or advance payments, so there are no refunds -- you are billed only for audio processed beyond the 480 free minutes/month. The Free tier includes 480 minutes/month that resets monthly with no charges. Enterprise customers with prepaid annual commitments should negotiate refund terms directly in their contracts, as prepaid credits may expire annually and are typically non-refundable for unused balances. If you encounter a service issue or are overcharged due to a bug, contact Speechmatics support to request a credit adjustment.
09 Can I use Speechmatics for free long-term?
Yes, Speechmatics offers a recurring free tier with 480 minutes/month (8 hours/month) of speech-to-text and 1 million characters/month of text-to-speech (English only) that resets monthly. This provides 5,760 minutes/year (96 hours) of ongoing transcription at no cost, making it the best free tier for sustained low-volume usage compared to competitors. Deepgram's $200 credit (~433 hours) is more generous upfront but does not refresh. AssemblyAI's $50 credit (~185 hours) is one-time. For teams processing under 8 hours/month long-term, Speechmatics Free tier is the most cost-effective option. For production workloads exceeding 8 hours/month, Pro tier at $0.004/min is competitive.
Is this pricing incorrect? — we'll verify and update it.