Try our new research platform with insights from 80,000+ expert users

Deepgram vs Google Cloud Speech-to-Text comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Apr 6, 2025

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Deepgram
Ranking in Speech-To-Text Services
3rd
Average Rating
8.4
Reviews Sentiment
6.5
Number of Reviews
9
Ranking in other categories
Text-To-Speech Services (4th)
Google Cloud Speech-to-Text
Ranking in Speech-To-Text Services
2nd
Average Rating
7.8
Reviews Sentiment
6.2
Number of Reviews
8
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of October 2025, in the Speech-To-Text Services category, the mindshare of Deepgram is 19.4%, up from 2.1% compared to the previous year. The mindshare of Google Cloud Speech-to-Text is 15.9%, down from 23.1% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Speech-To-Text Services Market Share Distribution
ProductMarket Share (%)
Google Cloud Speech-to-Text15.9%
Deepgram19.4%
Other64.7%
Speech-To-Text Services
 

Featured Reviews

Arunkumar HG - PeerSpot reviewer
A Powerful, Adaptable, and Constantly Evolving STT Solution for Voice Automation
Honestly, Deepgram has been exceptionally proactive in addressing the primary area that needed improvement. My main challenge was with the real-time detection of when a user has finished speaking in a live conversation, which is critical for a responsive voice bot. They directly solved this by releasing their Flux model. Because Flux is a recent release, I haven't yet had enough time to thoroughly test it and identify new limitations. At this stage, any "improvement" would be more of a "nice-to-have" feature rather than a fix for an existing problem. The core service is already very robust and meets all of our current needs. What additional features should be included in the next release? ---------------------------------------------------------------- Looking toward the future, here are a few features that could add even more value to an already excellent platform: * Advanced Built-in Analytics: While I can get the raw transcript and build my own analytics pipeline, it would be powerful to have features like sentiment analysis, emotion detection, or automatic summarization offered directly through the API. This would save significant development time. * More Granular Speaker Diarization: For calls with multiple participants, enhancing the real-time speaker diarization (labeling who is speaking) to be even more precise would be a fantastic addition for creating detailed call analyses. * Tighter Integration with TTS: Since Deepgram is also expanding into Text-to-Speech (TTS), offering a more seamlessly integrated STT-to-TTS pipeline could simplify the development stack for creating voice agents from start to finish. * Specialized, Pre-Trained Industry Models: While the general models are highly accurate, offering even more specialized, pre-trained models for specific industries like finance, healthcare, or legal-which are heavy on specific jargon-could push the accuracy even higher for those niche use cases.
Venkatesh C S - PeerSpot reviewer
Easy to learn but needs to improve in the area of the multi-language support offered
Speaking about the tool's multi-language support, I can say that Google supports more languages than any other cloud provider. I have not experienced any difficulties or challenges integrating Google Cloud Speech-to-Text into our company's workflow. I would suggest others choose the model correctly. For example, you must use a telephony model whenever it is a phone call or something that has been recorded. You can just go to the console and create it first, and then you'll have the entire code on the right side so that you can directly use it in your workflow. The tool is easy to learn. Considering that the tool is not accurate when it comes to native language, especially if you are going for some regional languages in India where there are more than 100 languages, I feel that the tool doesn't support regional languages, but it supports the most widely spoken languages, so only certain areas are accurate. If the call has been placed on hold, there are some deviations. I rate the tool a seven out of ten.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"Deepgram's low latency transcription has greatly impacted my ability to deliver reliable voice agents and provided very good transcription."
"The solution's Speech-to-Text conversion feature is really awesome."
"The features that I have been using in the tool have been very stable."
"Deepgram's transcription stands out compared to other solutions primarily due to its speed and accuracy; those are important points for me because not all providers or tools handled Spanish well, but Deepgram adjusted perfectly for that use case, and we also chose 11Labs voice, a South American voice, which worked very well with Deepgram."
"Deepgram is able to handle large volumes of audio data without compromising accuracy."
"The recognition of industry-specific terminology phrases and abbreviations is really important for us. We were able to get a good level of industry specificity with Deepgram."
"The speed of the solution for transcribing videos is good."
"The best thing with Deepgram is they are continually evolving and doing a lot of market research, and they take feedback seriously."
"Google Cloud Speech-to-Text helps to keep my team more productive."
"You could dictate a bunch of stuff, and then you can get ChatGPT or something to clean it up."
"The product's initial setup phase is very easy."
"We've found the solution scales well."
"I would suggest Google Cloud Speech-to-Text to others, primarily for the speaker diarization feature."
"The implementation is simple, and the outputs are very accurate and crisp."
"Google Cloud Speech-to-Text sounds incredibly natural, which is impressive."
"During the time I used Google Cloud Speech-to-Text, it was very impactful to the organization as it made our tasks much easier to perform."
 

Cons

"We've had issues in the past where it generates the transcript, and a lot of the text is duplicated."
"I would like it to be more accurate."
"When I had an AI interview for coding, Deepgram didn't capture the names of programming languages or well-known LLMs accurately all the time."
"The traditional Speech-to-Text doesn't understand when the user is done speaking in bot conversations."
"The solution does not properly identify the number of speakers."
"Regarding improvements for Deepgram, I think the quality of the transcriptions could be enhanced, as the Spanish accent poses challenges, making it harder to transcribe some words, and considering additional accents from Chilean or Argentine speakers could improve the model's performance with local words."
"We haven't seen a return on investment with Deepgram so far; we have been building POCs for the last two years but recently switched to AWS in the last two months due to scalability issues with the pay-as-you-go model."
"Deepgram is currently restricted to only the English variants, but it should include other languages, such as German or French."
"Google Cloud Speech-to-Text's trial experience could be improved by adding some extra minutes in the trial version."
"Given the numerous accents and dialects in India, Google Cloud Speech-to-Text could improve its handling of Indian accents."
"The one thing that I find is when I often use specialized terms, and the solution doesn't know them."
"The tool's telephony model does not produce accurate results."
"Since it is a paid service, it is very difficult to access if a user does not have the credentials. Also, we have to create the API keys and secret keys repeatedly to maintain authentication and privacy."
"Sometimes, speaker diarization is affected, leading to incorrect speaker identification."
"Google Cloud Speech-to-Text is 100 out of 100 when it works, and when it doesn't work, which is fairly often, it gets a zero. It doesn't fail gracefully; it fails in an unexpected way."
"The multilanguage support for the chatbot needs to be better."
 

Pricing and Cost Advice

"When using Deepgram, one needs to pay for the hours or minutes for which the transcription is needed."
"The solution’s pricing is cheap."
"Deepgram is a cheap solution."
"The pricing is moderate."
"Cost-wise, I would say it is all-inclusive in the payment made to Google."
"The tool's cost is also very low. The tool is cheaply priced. It charges around 0.13 INR per call with a duration of five minutes."
report
Use our free recommendation engine to learn which Speech-To-Text Services solutions are best for your needs.
872,778 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
10%
Comms Service Provider
10%
Computer Software Company
10%
Retailer
10%
Computer Software Company
12%
Comms Service Provider
8%
Manufacturing Company
7%
Healthcare Company
7%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
By reviewers
Company SizeCount
Small Business8
Large Enterprise1
No data available
 

Questions from the Community

What is your experience regarding pricing and costs for Deepgram?
My experience with pricing, setup cost, and licensing was good, as I found it to be cheaper without any problems.
What needs improvement with Deepgram?
Regarding improvements for Deepgram, I think the quality of the transcriptions could be enhanced, as the Spanish accent poses challenges, making it harder to transcribe some words, and considering ...
What is your primary use case for Deepgram?
I use Deepgram for a company that requested me to implement an AI voice agent for a security application that warns other neighbors of near alerts of some incidents that may occur in their neighbor...
What is your experience regarding pricing and costs for Google Cloud Speech-to-Text?
Our experience with pricing and licensing for Google Cloud Speech-to-Text is that we didn't have any other viable choices, so we cannot effectively evaluate if it's well-priced or badly priced.
What needs improvement with Google Cloud Speech-to-Text?
Google Cloud Speech-to-Text is not entirely accurate, so we have to correct for those errors in our AI software. It uses neural networks, and that stochastic processing is 70% to 75% accurate. It g...
What is your primary use case for Google Cloud Speech-to-Text?
I can answer questions about my experience with SQL Server as we are trying to capture reviews for SQL Server. We don't use the reporting services within SQL Server; we're using this for heavy-duty...
 

Overview

 

Sample Customers

Information Not Available
Home Depot, Paypal, Target, HSBC, McKesson
Find out what your peers are saying about Deepgram vs. Google Cloud Speech-to-Text and other solutions. Updated: September 2025.
872,778 professionals have used our research since 2012.