IT Central Station is now PeerSpot: Here's why

Amazon Polly vs Google Cloud Text-to-Speech comparison

Cancel
You must select at least 2 products to compare!
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"Amazon Polly is useful because it's helpful to hear the words on top of it when I can't take in information in a general way. Sometimes, it's very taxing if I'm trying to read cases. They have the neural voices, and they're so realistic. You don't even know that a person is not reading to you, making things much better. I know that they do have the ability to provide you with your own lexicon that's personal to you. I like that you can adjust the pitch and the speed of the voice because some people talk way too fast. Or if you're reading, I read slowly, so that's always helpful. One of the functions that I find helpful is that when reading material on the web, it's like it has its own browser. You go to the URL, and you don't have to read the whole thing, and you can stick the cursor on the place where you want it to start. Then if you want it to skip over something, you put it somewhere else, and that's ideal for reading case law because you skip around a lot. You don't really read it from start to finish. It helps if someone's going to read all those citations because they definitely want to be able to skip that."

More Amazon Polly Pros →

"It's not complex to set up."

More Google Cloud Text-to-Speech Pros →

Cons
"The price could be better. I wish it weren't so expensive to do because it's really cool. I would love to see them have lexicon packages of them like, this is for lawyers, this is for accountants, and it's going to have a lot of things in it. I also think they could do a better job at showing use cases other than telemarketing or contact center stuff like bots that are very commercial. I know that's where the money is, but it's such a huge hole that's missing for people with disabilities that are even worse than mine. Some people cannot see or hear at all, but they're not just cognitively impaired."

More Amazon Polly Cons →

"We had some problems with Dialogflow."

More Google Cloud Text-to-Speech Cons →

Pricing and Cost Advice
  • "The price could be better. Neural voices are so realistic, and I want to say that they have it so that you can try to tell where the voice is coming from or something like that. But if I have more than one, it's so expensive to have to listen to a bunch of cases on my phone and have the neural voice read to me. It really wouldn't be worth it. It'd be paying probably more than what I make in the case. Right now, I'm on the free tier, and I think the number of minutes that you get is reasonable as long as you're not doing this all the time and you're using it judiciously. I have some credits that I think I can use, but I don't know how fast they'll go through."
  • More Amazon Polly Pricing and Cost Advice →

    Information Not Available
    report
    Use our free recommendation engine to learn which Text-To-Speech Services solutions are best for your needs.
    608,713 professionals have used our research since 2012.
    Questions from the Community
    Top Answer:Amazon Polly is useful because it's helpful to hear the words on top of it when I can't take in information in a general way. Sometimes, it's very taxing if I'm trying to read cases. They have the… more »
    Top Answer:The price could be better. Neural voices are so realistic, and I want to say that they have it so that you can try to tell where the voice is coming from or something like that. But if I have more… more »
    Top Answer:The price could be better. I wish it weren't so expensive to do because it's really cool. I would love to see them have lexicon packages of them like, this is for lawyers, this is for accountants, and… more »
    Top Answer:It's not complex to set up.
    Top Answer:I don't have a realistic outlook on the cost. We are working on the migration from server to data center in Jira, and that is a lot of work.
    Top Answer:When you make a chatbot, you need to do it for only one language, typically. We now work in multilanguage companies, however, and we need a chatbot with smarter skills. We need a chat with the… more »
    Ranking
    Views
    3,836
    Comparisons
    3,328
    Reviews
    1
    Average Words per Review
    813
    Rating
    7.0
    Views
    3,005
    Comparisons
    2,700
    Reviews
    1
    Average Words per Review
    491
    Rating
    7.0
    Comparisons
    Learn More
    Overview

    Amazon Polly is a service that turns text into lifelike speech, allowing you to create applications that talk, and build entirely new categories of speech-enabled products. Polly's Text-to-Speech (TTS) service uses advanced deep learning technologies to synthesize natural sounding human speech. With dozens of lifelike voices across a broad set of languages, you can build speech-enabled applications that work in many different countries.

    In addition to Standard TTS voices, Amazon Polly offers Neural Text-to-Speech (NTTS) voices that deliver advanced improvements in speech quality through a new machine learning approach. Polly’s Neural TTS technology also supports two speaking styles that allow you to better match the delivery style of the speaker to the application: a Newscaster reading style that is tailored to news narration use cases, and a Conversational speaking style that is ideal for two-way communication like telephony applications.

    Finally, Amazon Polly Brand Voice can create a custom voice for your organization. This is a custom engagement where you will work with the Amazon Polly team to build an NTTS voice for the exclusive use of your organization.

    Google Cloud Text-to-Speech converts text into human-like speech in more than 180 voices across 30+ languages and variants. It applies groundbreaking research in speech synthesis (WaveNet) and Google's powerful neural networks to deliver high-fidelity audio. With this easy-to-use API, you can create lifelike interactions with your users that transform customer service, device interaction, and other applications.

    Offer
    Learn more about Amazon Polly
    Learn more about Google Cloud Text-to-Speech
    Sample Customers
    GoAnimate, Duolingo, Bandwidth
    Home Depot, Paypal, Target, HSBC, McKesson
    Top Industries
    VISITORS READING REVIEWS
    Comms Service Provider29%
    Computer Software Company20%
    Media Company7%
    Government7%
    VISITORS READING REVIEWS
    Comms Service Provider31%
    Computer Software Company19%
    Educational Organization6%
    Media Company5%
    Company Size
    VISITORS READING REVIEWS
    Small Business23%
    Midsize Enterprise24%
    Large Enterprise53%
    VISITORS READING REVIEWS
    Small Business25%
    Midsize Enterprise22%
    Large Enterprise54%

    Amazon Polly is ranked 1st in Text-To-Speech Services with 1 review while Google Cloud Text-to-Speech is ranked 2nd in Text-To-Speech Services with 1 review. Amazon Polly is rated 7.0, while Google Cloud Text-to-Speech is rated 7.0. The top reviewer of Amazon Polly writes "A text to spoken audio solution with a realistic neural voice feature, but the price could be better". On the other hand, the top reviewer of Google Cloud Text-to-Speech writes "Easy to set up and great for various languages but the sentiment analysis needs work". Amazon Polly is most compared with Microsoft Azure Speech Service and IBM Watson Text To Speech, whereas Google Cloud Text-to-Speech is most compared with Microsoft Azure Speech Service and IBM Watson Text To Speech.

    See our list of best Text-To-Speech Services vendors.

    We monitor all Text-To-Speech Services reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.