Deepgram Reviews

Name: Deepgram
Brand: Deepgram
Rating: 4.2 (9 reviews)

Vendor: Deepgram

4.2 out of 5

9 reviews
77% willing to recommend

Leave a review

What is Deepgram?

Deepgram stands out for its speed in transcribing videos and speech to text, leveraging cutting-edge models like Whisper and Nova for exceptional performance and accuracy. Its latency is remarkably low, enabling swift transcription that users find superior to alternatives.

Get the Deepgram Buyer's Guide and find out what your peers are saying about Deepgram, Microsoft Azure Speech Service, Amazon Polly and more!

Deepgram is the #1 ranked solution in top Speech-To-Text Services, #1 ranked solution in top AI Scheduling & Coordination solutions, #3 ranked solution in top AI Sales & Marketing solutions, #4 ranked solution in top Text-To-Speech Services, and #8 ranked solution in top AI Customer Support solutions. PeerSpot users give Deepgram an average rating of 8.4 out of 10. Deepgram is most commonly compared to Microsoft Azure Speech Service: Deepgram vs Microsoft Azure Speech Service. Deepgram is popular among the large enterprise segment, accounting for 49% of users researching this solution on PeerSpot. The top industry researching this solution are professionals from a financial services firm, accounting for 10% of all views.

Helped 879,310 peers since 2012

Featured Deepgram reviews

Arunkumar HG

Technology Architect & Hands-On Leader | Prototyping, Automation, AI/LLM Integration | 20+ Years in at a consultancy with 1,001-5,000 employees

Honestly, Deepgram has been exceptionally proactive in addressing the primary area that needed improvement. My main challenge was with the real-time detection of when a user has finished speaking in a live conversation, which is critical for a responsive voice bot. They directly solved this by releasing their Flux model. Because Flux is a recent release, I haven't yet had enough time to thoroughly test it and identify new limitations. At this stage, any "improvement" would be more of a "nice-to-have" feature rather than a fix for an existing problem. The core service is already very robust and meets all of our current needs. What additional features should be included in the next release? ---------------------------------------------------------------- Looking toward the future, here are a few features that could add even more value to an already excellent platform: * Advanced Built-in Analytics: While I can get the raw transcript and build my own analytics pipeline, it would be powerful to have features like sentiment analysis, emotion detection, or automatic summarization offered directly through the API. This would save significant development time. * More Granular Speaker Diarization: For calls with multiple participants, enhancing the real-time speaker diarization (labeling who is speaking) to be even more precise would be a fantastic addition for creating detailed call analyses. * Tighter Integration with TTS: Since Deepgram is also expanding into Text-to-Speech (TTS), offering a more seamlessly integrated STT-to-TTS pipeline could simplify the development stack for creating voice agents from start to finish. * Specialized, Pre-Trained Industry Models: While the general models are highly accurate, offering even more specialized, pre-trained models for specific industries like finance, healthcare, or legal-which are heavy on specific jargon-could push the accuracy even higher for those niche use cases.

Read full review

Oliver Spitzkat

Software Engineer at BIFROTEK

Deepgram is very reliable for normal conversation, but when it comes to special names or information, I'm not entirely sure how to implement it perfectly. I did use a feature to highlight special words based on the topic, but I've seen overweighting of this topic, where normal words weren't recognized properly. The handling of different topics needs to be optimized to be more accurate. I could upload lists with special words for specific topics. In my experience, it was not perfect, as there were only a few words that were understood properly. When I had an AI interview for coding, Deepgram didn't capture the names of programming languages or well-known LLMs accurately all the time. For example, when a person said "I'm experienced in Python development," Deepgram didn't get the word Python correctly. When Deepgram Nova 3 was released, I experienced problems with the API, but it could be on the Vapi side; I'm not sure, which is why I stick with version two, but I think currently there is no problem anymore. The setup could be easier, as everything is moving towards no-code solutions. You can configure everything properly on the dashboard and get a JSON file with the structured output to insert into other programs. If there were a way to configure special words for the API more readily, that would be helpful. The initial setup needs to be optimized to be more intuitive.

Read full review

Naveen Chowdary

AI Applied Engineer at Flexon Technologies Talent360.ai

One issue we've faced relates to the pricing structure; with the pay-as-you-go model, we only get eight concurrent connections for web sockets for text-to-speech, which makes it difficult to scale. For enterprise, the annual fee is around $25,000 to $30,000 USD, regardless of usage, which allows for 100 concurrent connections, but still doesn't provide enough scalability when we're using a lot. I have noticed that the web socket connection sometimes breaks due to inactivity, and increasing the timeout period would be beneficial. Additionally, if you leave the web socket connection with the TTS model for a certain period, it loses the connection. Even when used continuously for long periods, it occasionally gives an error. During a call with the support team, they acknowledged that there is an issue on their side. We have the intention to improve or optimize stability for future releases, but since Deepgram is not open-source, we don't have much control over that.

Read full review

Deepgram mindshare

Product category:

As of December 2025, the mindshare of Deepgram in the Text-To-Speech Services category stands at 10.0%, up from 3.1% compared to the previous year, according to calculations based on PeerSpot user engagement data.

Text-To-Speech Services Market Share Distribution
Product	Market Share (%)
Deepgram	10.0%
Microsoft Azure Speech Service	23.5%
Amazon Polly	23.1%
Other	43.4%

Text-To-Speech Services

PeerResearch reports based on Deepgram reviews

Type	Title	Date
Category	Text-To-Speech Services	Dec 31, 2025	Download
Product	Reviews, tips, and advice from real users	Dec 31, 2025	Download
Comparison	Deepgram vs Amazon Polly	Dec 31, 2025	Download
Comparison	Deepgram vs Google Cloud Text-to-Speech	Dec 31, 2025	Download
Comparison	Deepgram vs Microsoft Azure Speech Service	Dec 31, 2025	Download

Valuable Features

Deepgram excels with fast transcription, industry-specific terminology recognition, and high accuracy, even under challenging conditions. It offers low latency, excellent configurability, and ease of integration. Users value its continuous innovation, cost-effectiveness, and outstanding customer support. It handles various languages and accents effectively and reliably delivers human-like voice quality. Deepgram positively impacts organizations by enhancing customer service and reducing human agent demands with superior performance and stability.

"Deepgram's low latency transcription has greatly impacted my ability to deliver reliable voice agents and provided very good transcription."
"The most valuable capabilities of Deepgram that I've found so far include low latency, as it offers less than 200 milliseconds, which is not provided by any other text-to-speech models."
"The best thing with Deepgram is they are continually evolving and doing a lot of market research, and they take feedback seriously."

Room for Improvement

Deepgram struggles with accurately identifying speakers in multi-speaker settings and faces difficulties with language accuracy, including spelling and grammar. It needs better support for different accents and languages and improved handling of unique terminology. Users report issues with duplication in transcripts from dual-channel audio and challenges during live transcription. Some users encounter limitations in scalability and occasional instability in WebSocket connections, highlighting the need for enhanced features like advanced analytics and better integration with Text-to-Speech.

"When I had an AI interview for coding, Deepgram didn't capture the names of programming languages or well-known LLMs accurately all the time."
"We haven't seen a return on investment with Deepgram so far; we have been building POCs for the last two years but recently switched to AWS in the last two months due to scalability issues with the pay-as-you-go model."
"The traditional Speech-to-Text doesn't understand when the user is done speaking in bot conversations."

ROI

Companies report excellent returns from Deepgram, citing significant cost savings compared to human transcription. Enhanced accuracy and speed lead to more efficient workflows, while low implementation costs and transparent pricing align with business needs. High accuracy and reliability enable quality service delivery. Though some have switched to AWS due to scalability issues, others experience financial gains, selling services with Deepgram configurations and receiving favorable feedback. The integration facilitates business growth and user satisfaction.

Pricing

Enterprise users find Deepgram's pricing competitive, considering it's cheaper than many alternatives. The cost depends on transcription time usage, becoming more affordable with increased commitment. Initial account setup can be around $200. Although live transcription may have ongoing charges, these often decrease. The pricing is ideal for those valuing quality yet mindful of budget efficiency. Efficient transcription speeds can lead to additional savings on cloud resources or on-premises deployments.

"When using Deepgram, one needs to pay for the hours or minutes for which the transcription is needed."
"Deepgram is a cheap solution."
"The pricing is moderate."

Popular Use Cases

Organizations use Deepgram primarily for transcribing speech to text, powering AI-driven voice bots, and creating speech recognition features. Teams develop applications like voice agents for security alerts, feedback survey apps, and bilingual transcription setups. Deepgram's technology supports call recordings, meetings, interactive AI chatbots, and web integrations. Companies leverage its reliability, speed, and accuracy to convert audio to text, which is then used for further processing in Large Language Models and other AI-driven applications.

Service and Support

Deepgram's customer service is praised for being responsive and helpful, providing timely resolutions. Support is accessible through Slack channels and emails, with additional resources like forums and webinars. Some find technical support average, facing issues with specific features. Documentation is well-structured, aiding users without needing frequent support interaction. Overall, the service is considered excellent, rated highly by multiple users for its efficiency and comprehensive resource availability. Use of AI with documentation is effective for implementation.

Deployment

Deepgram's initial setup is consistently described as user-friendly and straightforward. Users highlight the ease of installation and the support from Deepgram's resources, mentioning helpful documentation and code samples. Some note the stability and modularity of the system. Despite varying needs, it doesn't generally require large teams for setup. Suggestions include advancing toward no-code solutions and enhancing API configurability. Most users implemented Deepgram in both cloud and on-prem environments without external vendors.

Scalability

Deepgram shows significant scalability and efficiently handles large audio volumes. Users frequently mention its adaptability and ability to operate without many personnel. It supports simultaneous usage and maintains accuracy. There are positive mentions about its pay-as-you-go model, though some entities reported limits, opting for AWS for greater scalability. Despite issues like GDPR in Europe, Deepgram generally performs well without downtime or performance problems.

Stability

Deepgram demonstrates high stability, with no significant breakdowns or bugs reported. Users have not encountered downtime, and Deepgram consistently adjusts to workloads. While some initially faced minor issues, improvements in architecture have resolved these. Regular updates introduce new features without compromising existing functionality. Access-related issues occur only during the testing of new models, unrelated to stability. Deepgram provides a transparent platform with a status page, reinforcing its reliability for business-critical applications.

These insights are based on the in-depth reviews provided by peers to help you make a better buying decision.

Download our Deepgram Buyer's Guide for additional reliable information.

Review data by company size

By reviewers
Company Size	Count
Small Business	8

By reviewers

By visitors reading reviews
Company Size	Count
Small Business	49
Midsize Enterprise	25
Large Enterprise	71

By visitors reading reviews

Top industries

By visitors reading reviews

University

10%

Comms Service Provider

10%

Computer Software Company

10%

Financial Services Firm

10%

Retailer

Manufacturing Company

Government

Educational Organization

Marketing Services Firm

Media Company

Healthcare Company

Non Profit

Construction Company

Transportation Company

Outsourcing Company

Legal Firm

Real Estate/Law Firm

Insurance Company

Logistics Company

Consumer Goods Company

Performing Arts

Pharma/Biotech Company

Wholesaler/Distributor

Hospitality Company

Wellness & Fitness Company

Energy/Utilities Company

Compare Deepgram with alternative products

Learn more about Deepgram

Deepgram provides an efficient solution for transforming video and audio content into text, benefiting from its advanced ability to recognize industry-specific terminology. Users experience faster results compared to IBM Watson and OpenAI's Whisper model, with low latency contributing to its appeal. However, challenges in speaker recognition and language support remain areas for improvement. Additionally, stronger spelling and grammar accuracy could enhance its performance. Some seek expanded multi-language capabilities and improved manageability during testing phases, noting its slightly less accuracy compared to other tools.

What are Deepgram's most notable features?

Rapid Transcription: Utilizes cutting-edge models for quick speech-to-text conversion.
Industry Terminology Recognition: Excels in comprehending specific jargon and abbreviations.
Low Latency: Offers transcription with minimal delay, approximately 0.5 to 1 second.
Model Integration: Employs Whisper model combined with Nova for high accuracy.

What benefits should users look for when evaluating Deepgram?

High Speed: Significant improvement in processing time over competitors.
Performance Satisfaction: Users appreciate faster and more fluid transcription.
Textual Accuracy: Enhancements can lead to more reliable outputs in transcripts.
Streamlined Processes: Features like punctuation and Smart Format boost efficiency.

Deepgram is widely implemented across industries for transcribing speech to text, often used by organizations for generating machine transcripts of legal proceedings and other vital communications. Teams deploy it on local systems to convert videos and phone calls, integrating speech recognition seamlessly into applications.

Product Categories

Text-To-Speech Services

Speech-To-Text Services

AI Customer Support

AI Sales & Marketing

AI Scheduling & Coordination

Popular Comparisons

Microsoft Azure Speech Service vs Deepgram

Amazon Polly vs Deepgram

Google Cloud Text-to-Speech vs Deepgram

Google Cloud Speech-to-Text vs Deepgram

Amazon Transcribe vs Deepgram

AssemblyAI vs Deepgram

ElevenLabs vs Deepgram

Gladia vs Deepgram

Rev.ai vs Deepgram

Speechmatics vs Deepgram

See all alternatives

Deepgram Reviews Summary
Author info	Rating	Review Summary
Technology Architect & Hands-On Leader \| Prototyping, Automation, AI/LLM Integration \| 20+ Years in at a consultancy with 1,001-5,000 employees	5.0	I've used Deepgram for four years in voice bot projects, valuing its configurability, evolving models, and strong support. It's accurate across accents, cost-effective, highly scalable, and integrates smoothly, with Flux addressing earlier limitations in conversational speech detection.
Software Engineer at BIFROTEK	4.5	I use Deepgram for reliable, low-latency transcription in AI voice integrations, mainly for German. It performs well overall, though handling special terms needs improvement. Setup could be more intuitive, but it's contributed significantly to my business success.
AI Applied Engineer at Flexon Technologies Talent360.ai	4.0	I've used Deepgram for nearly two years for text-to-speech in AI chatbots; its low latency and realistic voices impressed me, but limited scalability and pricing led us to switch to AWS despite its higher latency.
Co-founder at a tech services company with 1-10 employees	4.5	I use Deepgram to transcribe bilingual conversations for a security AI voice agent, and I’m impressed by its accuracy, speed, and ease of integration, though improvements in handling Spanish accents would make it even better.
VP Product at PeerSpot	4.5	No summary available
AWS \| Back-End Team Lead at eScribers, LLC	4.5	No summary available
Back End Developer at AskHumans	4.0	No summary available
Full Stack Developer at Global IT App Info Solution	3.0	No summary available

Title	Rating	Mindshare	Recommending
Microsoft Azure Speech Service	4.5	23.5%	100%	3 interviews Add to research
Amazon Polly	3.7	23.1%	100%	5 interviews Add to research

Deepgram Reviews

What is Deepgram?

Featured Deepgram reviews

Deepgram mindshare

PeerResearch reports based on Deepgram reviews

Valuable Features

Room for Improvement

ROI

Pricing

Popular Use Cases

Service and Support

Deployment

Scalability

Stability

Review data by company size

Top industries

Compare Deepgram with alternative products

Learn more about Deepgram

Related questions

Product Categories

Popular Comparisons