IT Central Station is now PeerSpot: Here's why

Amazon Polly vs Microsoft Azure Speech Service comparison

You must select at least 2 products to compare!
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pricing and Cost Advice
  • "The price could be better. Neural voices are so realistic, and I want to say that they have it so that you can try to tell where the voice is coming from or something like that. But if I have more than one, it's so expensive to have to listen to a bunch of cases on my phone and have the neural voice read to me. It really wouldn't be worth it. It'd be paying probably more than what I make in the case. Right now, I'm on the free tier, and I think the number of minutes that you get is reasonable as long as you're not doing this all the time and you're using it judiciously. I have some credits that I think I can use, but I don't know how fast they'll go through."
  • More Amazon Polly Pricing and Cost Advice →

    Information Not Available
    Use our free recommendation engine to learn which Text-To-Speech Services solutions are best for your needs.
    610,229 professionals have used our research since 2012.
    Questions from the Community
    Top Answer:Amazon Polly is useful because it's helpful to hear the words on top of it when I can't take in information in a general way. Sometimes, it's very taxing if I'm trying to read cases. They have the… more »
    Top Answer:The price could be better. Neural voices are so realistic, and I want to say that they have it so that you can try to tell where the voice is coming from or something like that. But if I have more… more »
    Top Answer:The price could be better. I wish it weren't so expensive to do because it's really cool. I would love to see them have lexicon packages of them like, this is for lawyers, this is for accountants, and… more »
    Ask a question

    Earn 20 points

    Average Words per Review
    Average Words per Review
    Also Known As
    Azure Speech Service, MS Azure Speech Service
    Learn More

    Amazon Polly is a service that turns text into lifelike speech, allowing you to create applications that talk, and build entirely new categories of speech-enabled products. Polly's Text-to-Speech (TTS) service uses advanced deep learning technologies to synthesize natural sounding human speech. With dozens of lifelike voices across a broad set of languages, you can build speech-enabled applications that work in many different countries.

    In addition to Standard TTS voices, Amazon Polly offers Neural Text-to-Speech (NTTS) voices that deliver advanced improvements in speech quality through a new machine learning approach. Polly’s Neural TTS technology also supports two speaking styles that allow you to better match the delivery style of the speaker to the application: a Newscaster reading style that is tailored to news narration use cases, and a Conversational speaking style that is ideal for two-way communication like telephony applications.

    Finally, Amazon Polly Brand Voice can create a custom voice for your organization. This is a custom engagement where you will work with the Amazon Polly team to build an NTTS voice for the exclusive use of your organization.

    Easily add real-time speech-to-text capabilities to your applications for scenarios like voice commands, conversation transcription, and call center log analysis.

    Tailor your speech recognition models to adapt to users’ speaking styles, expressions, and unique vocabularies, and to accommodate background noises, accents, and voice patterns.

    Build smart apps and services that speak to users naturally with the Text to Speech service. Convert text to audio in near real time, tailor to change the speed of speech, pitch, volume, and more.

    Give your application a one-of-a-kind, recognizable brand voice using custom voice models. Simply record and upload training data, and the service will create a unique voice font tuned to your recording.

    Learn more about Amazon Polly
    Learn more about Microsoft Azure Speech Service
    Sample Customers
    GoAnimate, Duolingo, Bandwidth
    Top Industries
    Comms Service Provider30%
    Computer Software Company20%
    Media Company7%
    Comms Service Provider25%
    Computer Software Company22%
    Educational Organization5%
    Company Size
    Small Business23%
    Midsize Enterprise24%
    Large Enterprise53%
    Small Business21%
    Midsize Enterprise19%
    Large Enterprise61%

    Amazon Polly is ranked 1st in Text-To-Speech Services with 1 review while Microsoft Azure Speech Service is ranked 3rd in Text-To-Speech Services. Amazon Polly is rated 7.0, while Microsoft Azure Speech Service is rated 0.0. The top reviewer of Amazon Polly writes "A text to spoken audio solution with a realistic neural voice feature, but the price could be better". On the other hand, Amazon Polly is most compared with Google Cloud Text-to-Speech and IBM Watson Text To Speech, whereas Microsoft Azure Speech Service is most compared with Google Cloud Speech-to-Text, Amazon Transcribe, Google Cloud Text-to-Speech and IBM Watson Speech To Text.

    See our list of best Text-To-Speech Services vendors.

    We monitor all Text-To-Speech Services reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.