No more typing reviews! Try our Samantha, our new voice AI agent.

Deepgram vs IBM Watson Speech To Text comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Deepgram
Ranking in Speech-To-Text Services
1st
Average Rating
8.4
Reviews Sentiment
5.9
Number of Reviews
11
Ranking in other categories
Text-To-Speech Services (2nd), AI Customer Support (3rd), AI Sales & Marketing (6th), AI Scheduling & Coordination (2nd)
IBM Watson Speech To Text
Ranking in Speech-To-Text Services
6th
Average Rating
8.0
Reviews Sentiment
8.0
Number of Reviews
1
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of April 2026, in the Speech-To-Text Services category, the mindshare of Deepgram is 19.8%, up from 10.0% compared to the previous year. The mindshare of IBM Watson Speech To Text is 3.7%, down from 5.1% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Speech-To-Text Services Mindshare Distribution
ProductMindshare (%)
Deepgram19.8%
IBM Watson Speech To Text3.7%
Other76.5%
Speech-To-Text Services
 

Featured Reviews

Arunkumar HG - PeerSpot reviewer
Technology Architect & Hands-On Leader | Prototyping, Automation, AI/LLM Integration | 20+ Years in at Regalix
A Powerful, Adaptable, and Constantly Evolving STT Solution for Voice Automation
Honestly, Deepgram has been exceptionally proactive in addressing the primary area that needed improvement. My main challenge was with the real-time detection of when a user has finished speaking in a live conversation, which is critical for a responsive voice bot. They directly solved this by releasing their Flux model. Because Flux is a recent release, I haven't yet had enough time to thoroughly test it and identify new limitations. At this stage, any "improvement" would be more of a "nice-to-have" feature rather than a fix for an existing problem. The core service is already very robust and meets all of our current needs. What additional features should be included in the next release? ---------------------------------------------------------------- Looking toward the future, here are a few features that could add even more value to an already excellent platform: * Advanced Built-in Analytics: While I can get the raw transcript and build my own analytics pipeline, it would be powerful to have features like sentiment analysis, emotion detection, or automatic summarization offered directly through the API. This would save significant development time. * More Granular Speaker Diarization: For calls with multiple participants, enhancing the real-time speaker diarization (labeling who is speaking) to be even more precise would be a fantastic addition for creating detailed call analyses. * Tighter Integration with TTS: Since Deepgram is also expanding into Text-to-Speech (TTS), offering a more seamlessly integrated STT-to-TTS pipeline could simplify the development stack for creating voice agents from start to finish. * Specialized, Pre-Trained Industry Models: While the general models are highly accurate, offering even more specialized, pre-trained models for specific industries like finance, healthcare, or legal-which are heavy on specific jargon-could push the accuracy even higher for those niche use cases.
it_user964722 - PeerSpot reviewer
Business Transformation and Automation Manager at a tech services company with 201-500 employees
Easy to understand, configure, and use
I would recommend it. IBM has several other solutions that can connect to it, so no need to buy different pieces from several providers. If you want to find a good solution for the customer and put some translation tool or machine learning for text understanding and so on, you can get this from IBM. It can be a one-stop shop for a good solution. I would rate this solution an eight out of ten. It has good quality, and it's easy to work with.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The best features of Deepgram for me are the level of transcription accuracy it provides and the amount of time it saves."
"The solution's most valuable feature is its speed of transcription, as it is one of the fastest tools, especially if you compare it to the second fastest solution that you can get, which is 20 times faster, so it is not just a marginally faster product."
"The best thing with Deepgram is they are continually evolving and doing a lot of market research, and they take feedback seriously."
"Deepgram is able to handle large volumes of audio data without compromising accuracy."
"The most valuable capabilities of Deepgram that I've found so far include low latency, as it offers less than 200 milliseconds, which is not provided by any other text-to-speech models."
"Deepgram's low latency transcription has greatly impacted my ability to deliver reliable voice agents and provided very good transcription."
"We have tracked a reduction of around 70% in the support cost and direct human interaction for support."
"The recognition of industry-specific terminology phrases and abbreviations is really important for us. We were able to get a good level of industry specificity with Deepgram."
"It was easy to understand, easy to configure, and easy to use."
"IBM has several other solutions that can connect to it, so no need to buy different pieces from several providers."
 

Cons

"The solution does not properly identify the number of speakers."
"When I had an AI interview for coding, Deepgram didn't capture the names of programming languages or well-known LLMs accurately all the time."
"Deepgram has a vast UI and a vast range of models, but there could be a simpler version for creating AI agents rather than providing a full-fledged platform for minimal use cases."
"I would not recommend Deepgram to other users because it does not properly identify video communication."
"Deepgram is currently restricted to only the English variants, but it should include other languages, such as German or French."
"We've had issues in the past where it generates the transcript, and a lot of the text is duplicated."
"In comparison to Deepgram, I would say that the transcript accuracy offered by other products is much higher."
"We've had issues in the past where it generates the transcript, and a lot of the text is duplicated."
"The quality needs to be updated. For speech to text, support for additional languages can be included. For example, support for the large markets in Eastern Europe, such as Polish or Romanian, would be nice."
"The quality needs to be updated. For speech to text, support for additional languages can be included."
 

Pricing and Cost Advice

"When using Deepgram, one needs to pay for the hours or minutes for which the transcription is needed."
"Deepgram is a cheap solution."
"The pricing is moderate."
"The solution’s pricing is cheap."
Information not available
report
Use our free recommendation engine to learn which Speech-To-Text Services solutions are best for your needs.
885,444 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Educational Organization
10%
Construction Company
8%
University
8%
Financial Services Firm
8%
No data available
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
By reviewers
Company SizeCount
Small Business9
Midsize Enterprise1
Large Enterprise1
No data available
 

Questions from the Community

What is your experience regarding pricing and costs for Deepgram?
My experience with pricing, setup cost, and licensing was good, as I found it to be cheaper without any problems.
What needs improvement with Deepgram?
Even though Deepgram has many customization options, I wish that Deepgram had voice cloning customization to a much larger extent. I also wish that the price were a bit lower if possible.
What is your primary use case for Deepgram?
My main purpose for Deepgram was to convert meeting voices to text very easily, and the other purpose was for content creation. I mostly use Deepgram for those two purposes.
Ask a question
Earn 20 points
 

Overview

 

Sample Customers

Information Not Available
American Airlines, UBank, Bitly, Eurobits
Find out what your peers are saying about Deepgram, Microsoft, Google and others in Speech-To-Text Services. Updated: March 2026.
885,444 professionals have used our research since 2012.