What is our primary use case?
We are using
Amazon Transcribe to convert voice to text. For example, we communicate over the phone, record the call, and then convert the conversation into text messages. We use
AWS Bedrock LLM models to analyze the conversation, understand the reason for it, and determine any questions asked and answers given. In another project for an insurance company, we use
Amazon Transcribe to convert conversation recordings into text. This text is then shared with another team for processing, allowing us to create more advertisements for the user and increase profit. It's about recording the user-agent conversation related to insurance claims and processing it to make targeted advertisements.
What is most valuable?
The valuable aspect of Amazon Transcribe is its ability to perform speech recognition and convert it into text. It's highly compatible with a serverless environment, making it easy to trigger the service and get results. Although no specific features handle diverse accents or dialects effectively, the scalability and ease of use are notable. It provides the best results for our needs, is highly scalable, and easy to manage. The service also benefits from cost savings, being a pay-as-you-go model with very reasonable pricing for audio transcription at $0.004 per second.
What needs improvement?
There is a need to improve the processing of background noise. Sometimes, surrounding sounds are recorded and Amazon Transcribe does not process these well, creating clutter. Adding functionality to remove background noise would be beneficial.
For how long have I used the solution?
I have been using Amazon Transcribe for two years.
What do I think about the stability of the solution?
There have been no performance issues with Amazon Transcribe.
What do I think about the scalability of the solution?
Amazon Transcribe is very easy to scale. Using DynamoDB and Lambda within
AWS makes the process seamless and efficient.
How are customer service and support?
There has been no need to contact technical support since everything is available in AWS documentation.
How would you rate customer service and support?
How was the initial setup?
We have configured some Lambda functions using EventBridge, which triggers the Lambda at specific times to fetch and process voice messages from an
S3 bucket through Amazon Transcribe. It takes approximately twenty to thirty minutes to set up.
What's my experience with pricing, setup cost, and licensing?
The pay-as-you-go model is cost-effective, with pricing for audio transcription around $0.004 per second.
What other advice do I have?
It totally depends on the requirement, so I have no specific advice. I rate Amazon Transcribe as eight out of ten.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Amazon Web Services (AWS)
*Disclosure: I am a real user, and this review is based on my own experience and opinions.