Google Cloud Speech-to-Text vs IBM Watson Speech To Text comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Google Cloud Speech-to-Text
Ranking in Speech-To-Text Services: 2nd
Average Rating: 7.8
Reviews Sentiment: 6.2
Number of Reviews: 8
Ranking in other categories: None

IBM Watson Speech To Text
Ranking in Speech-To-Text Services: 6th
Average Rating: 8.0
Reviews Sentiment: 8.0
Number of Reviews: 1
Ranking in other categories: None
 

Mindshare comparison

As of October 2025, in the Speech-To-Text Services category, the mindshare of Google Cloud Speech-to-Text is 15.9%, down from 23.1% the previous year. The mindshare of IBM Watson Speech To Text is 3.7%, down from 6.9% the previous year. Mindshare is calculated from PeerSpot user engagement data.
Speech-To-Text Services Market Share Distribution
Product                        Market Share (%)
Google Cloud Speech-to-Text    15.9%
IBM Watson Speech To Text      3.7%
Other                          80.4%
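The year-over-year movement in the mindshare figures can be made explicit with a little arithmetic. The following is a minimal sketch; the function name `mindshare_change` is hypothetical, and the only inputs are the percentages quoted in this section.

```python
def mindshare_change(current: float, previous: float) -> tuple[float, float]:
    """Return (change in percentage points, relative change in percent)."""
    points = current - previous
    relative = points / previous * 100
    # Round to one decimal place, matching the precision of the quoted figures.
    return round(points, 1), round(relative, 1)

# Figures quoted above: (October 2025 mindshare, previous year's mindshare).
shares = {
    "Google Cloud Speech-to-Text": (15.9, 23.1),
    "IBM Watson Speech To Text": (3.7, 6.9),
}

for product, (current, previous) in shares.items():
    points, relative = mindshare_change(current, previous)
    print(f"{product}: {points:+.1f} points ({relative:+.1f}% relative)")
```

On the quoted numbers, this works out to a drop of 7.2 percentage points for Google Cloud Speech-to-Text and 3.2 percentage points for IBM Watson Speech To Text.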
 

Featured Reviews

Venkatesh C S - PeerSpot reviewer
Easy to learn but needs to improve in the area of the multi-language support offered
Regarding multi-language support, Google supports more languages than any other cloud provider. I have not had any difficulty integrating Google Cloud Speech-to-Text into our company's workflow. I would advise others to choose the model carefully. For example, you must use a telephony model whenever the audio is a phone call or something that has been recorded. You can create the configuration in the console first, and the complete code then appears on the right side, so you can use it directly in your workflow. The tool is easy to learn. It is less accurate for native languages, especially regional languages in India, where there are more than 100 languages. The tool supports the most widely spoken languages but not the regional ones, so it is accurate only in certain areas. There are also some deviations when a call has been placed on hold. I rate the tool a seven out of ten.
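The reviewer's advice about model selection can be sketched as follows. This is a minimal illustration, not the reviewer's code: it builds a JSON request body in the shape used by the Speech-to-Text v1 REST `speech:recognize` method, and it assumes that `telephony` is the model identifier for phone audio and `default` is the general-purpose fallback. The helper name `build_recognize_request`, the sample rates, and the bucket URI are hypothetical; check the current Google Cloud documentation before relying on any field.

```python
import json

def build_recognize_request(gcs_uri: str, is_phone_call: bool,
                            language_code: str = "en-US") -> dict:
    """Build a recognize request body, choosing the model from the audio source."""
    # Assumption: "telephony" targets phone-call audio; "default" is the general model.
    model = "telephony" if is_phone_call else "default"
    return {
        "config": {
            "languageCode": language_code,
            "model": model,
            # Assumption: phone recordings are often 8 kHz; adjust to the real audio.
            "sampleRateHertz": 8000 if is_phone_call else 16000,
            "encoding": "LINEAR16",
        },
        "audio": {"uri": gcs_uri},
    }

# Hypothetical bucket and file name, for illustration only.
request = build_recognize_request("gs://example-bucket/call.wav", is_phone_call=True)
print(json.dumps(request, indent=2))
```

Keeping the model choice in one small function makes it harder to send call recordings to a general-purpose model by accident.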
reviewer964722 - PeerSpot reviewer
Easy to understand, configure, and use
I would recommend it. IBM has several other solutions that can connect to it, so there is no need to buy different pieces from several providers. If you want to build a good solution for a customer and add a translation tool, machine learning for text understanding, and so on, you can get all of this from IBM. It can be a one-stop shop for a good solution. I would rate this solution an eight out of ten. It has good quality, and it's easy to work with.

Quotes from Members


Pros

"I would suggest Google Cloud Speech-to-Text to others, primarily for the speaker diarization feature."
"During the time I used Google Cloud Speech-to-Text, it was very impactful to the organization as it made our tasks much easier to perform."
"Google Cloud Speech-to-Text sounds incredibly natural, which is impressive."
"Google Cloud Speech-to-Text helps to keep my team more productive."
"We've found the solution scales well."
"The product's initial setup phase is very easy."
"The implementation is simple, and the outputs are very accurate and crisp."
"You could dictate a bunch of stuff, and then you can get ChatGPT or something to clean it up."
"It was easy to understand, easy to configure, and easy to use."
 

Cons

"Given the numerous accents and dialects in India, Google Cloud Speech-to-Text could improve its handling of Indian accents."
"Google Cloud Speech-to-Text is 100 out of 100 when it works, and when it doesn't work, which is fairly often, it gets a zero. It doesn't fail gracefully; it fails in an unexpected way."
"Google Cloud Speech-to-Text's trial experience could be improved by adding some extra minutes in the trial version."
"The tool's telephony model does not produce accurate results."
"Since it is a paid service, it is very difficult to access if a user does not have the credentials. Also, we have to create the API keys and secret keys repeatedly to maintain authentication and privacy."
"Sometimes, speaker diarization is affected, leading to incorrect speaker identification."
"The one thing that I find is when I often use specialized terms, and the solution doesn't know them."
"The multilanguage support for the chatbot needs to be better."
"The quality needs to be updated. For speech to text, support for additional languages can be included. For example, support for the large markets in Eastern Europe, such as Polish or Romanian, would be nice."
 

Pricing and Cost Advice

"The tool's cost is also very low. The tool is cheaply priced. It charges around 0.13 INR per call with a duration of five minutes."
"Cost-wise, I would say it is all-inclusive in the payment made to Google."
Information not available
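To put the quoted Google Cloud Speech-to-Text figure in perspective, the sketch below converts it to a per-minute rate and scales it to a batch of calls. It assumes that cost scales linearly with call duration, which the review does not state, and the function names are hypothetical.

```python
QUOTED_COST_INR = 0.13     # cost per call, as quoted by the reviewer
QUOTED_CALL_MINUTES = 5    # call duration the quote refers to

def cost_per_minute() -> float:
    """Per-minute rate implied by the quoted per-call cost."""
    return QUOTED_COST_INR / QUOTED_CALL_MINUTES

def estimate_cost(num_calls: int, minutes_per_call: float) -> float:
    """Estimated cost in INR, assuming linear scaling with duration."""
    return num_calls * minutes_per_call * cost_per_minute()

print(f"{cost_per_minute():.3f} INR per minute")
print(f"{estimate_cost(1000, 5):.2f} INR for 1,000 five-minute calls")
```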
868,706 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Computer Software Company: 12%
Manufacturing Company: 7%
Comms Service Provider: 7%
Energy/Utilities Company: 6%
No data available
 

Company Size

By reviewers
Large Enterprise, Midsize Enterprise, Small Business
No data available
 

Questions from the Community

What is your experience regarding pricing and costs for Google Cloud Speech-to-Text?
Our experience with pricing and licensing for Google Cloud Speech-to-Text is that we didn't have any other viable choices, so we cannot effectively evaluate if it's well-priced or badly priced.
What needs improvement with Google Cloud Speech-to-Text?
Google Cloud Speech-to-Text is not entirely accurate, so we have to correct for those errors in our AI software. It uses neural networks, and that stochastic processing is 70% to 75% accurate. It g...
What is your primary use case for Google Cloud Speech-to-Text?
I can answer questions about my experience with SQL Server as we are trying to capture reviews for SQL Server. We don't use the reporting services within SQL Server; we're using this for heavy-duty...
 

Sample Customers

Home Depot, Paypal, Target, HSBC, McKesson
American Airlines, UBank, Bitly, Eurobits
Updated: August 2025.