Deepgram Reviews

Name: Deepgram
Brand: Deepgram
Rating: 4.3 (10 reviews)

Vendor: Deepgram

4.3 out of 5

10 reviews
80% willing to recommend

Leave a review

What is Deepgram?

Deepgram stands out for its speed in transcribing videos and speech to text, leveraging cutting-edge models like Whisper and Nova for exceptional performance and accuracy. Its latency is remarkably low, enabling swift transcription that users find superior to alternatives.

Get the Deepgram Buyer's Guide and find out what your peers are saying about Deepgram, Microsoft Azure Speech Service, Amazon Polly and more!

Deepgram is the #1 ranked solution in top Speech-To-Text Services, #2 ranked solution in top Text-To-Speech Services, #2 ranked solution in top AI Scheduling & Coordination solutions, #3 ranked solution in top AI Customer Support solutions, and #6 ranked solution in top AI Sales & Marketing solutions. PeerSpot users give Deepgram an average rating of 8.6 out of 10. Deepgram is most commonly compared to Microsoft Azure Speech Service: Deepgram vs Microsoft Azure Speech Service. Deepgram is popular among the large enterprise segment, accounting for 42% of users researching this solution on PeerSpot. The top industry researching this solution are professionals from a educational organization, accounting for 9% of all views.

Helped 884,328 peers since 2012

Featured Deepgram reviews

Arunkumar HG

Technology Architect & Hands-On Leader | Prototyping, Automation, AI/LLM Integration | 20+ Years in at Regalix

Honestly, Deepgram has been exceptionally proactive in addressing the primary area that needed improvement. My main challenge was with the real-time detection of when a user has finished speaking in a live conversation, which is critical for a responsive voice bot. They directly solved this by releasing their Flux model. Because Flux is a recent release, I haven't yet had enough time to thoroughly test it and identify new limitations. At this stage, any "improvement" would be more of a "nice-to-have" feature rather than a fix for an existing problem. The core service is already very robust and meets all of our current needs. What additional features should be included in the next release? ---------------------------------------------------------------- Looking toward the future, here are a few features that could add even more value to an already excellent platform: * Advanced Built-in Analytics: While I can get the raw transcript and build my own analytics pipeline, it would be powerful to have features like sentiment analysis, emotion detection, or automatic summarization offered directly through the API. This would save significant development time. * More Granular Speaker Diarization: For calls with multiple participants, enhancing the real-time speaker diarization (labeling who is speaking) to be even more precise would be a fantastic addition for creating detailed call analyses. * Tighter Integration with TTS: Since Deepgram is also expanding into Text-to-Speech (TTS), offering a more seamlessly integrated STT-to-TTS pipeline could simplify the development stack for creating voice agents from start to finish. * Specialized, Pre-Trained Industry Models: While the general models are highly accurate, offering even more specialized, pre-trained models for specific industries like finance, healthcare, or legal-which are heavy on specific jargon-could push the accuracy even higher for those niche use cases.

Read full review

Oliver Spitzkat

Software Engineer at BIFROTEK

Deepgram is very reliable for normal conversation, but when it comes to special names or information, I'm not entirely sure how to implement it perfectly. I did use a feature to highlight special words based on the topic, but I've seen overweighting of this topic, where normal words weren't recognized properly. The handling of different topics needs to be optimized to be more accurate. I could upload lists with special words for specific topics. In my experience, it was not perfect, as there were only a few words that were understood properly. When I had an AI interview for coding, Deepgram didn't capture the names of programming languages or well-known LLMs accurately all the time. For example, when a person said "I'm experienced in Python development," Deepgram didn't get the word Python correctly. When Deepgram Nova 3 was released, I experienced problems with the API, but it could be on the Vapi side; I'm not sure, which is why I stick with version two, but I think currently there is no problem anymore. The setup could be easier, as everything is moving towards no-code solutions. You can configure everything properly on the dashboard and get a JSON file with the structured output to insert into other programs. If there were a way to configure special words for the API more readily, that would be helpful. The initial setup needs to be optimized to be more intuitive.

Read full review

Naveen Chowdary

AI Applied Engineer at Flexon Technologies Talent360.ai

One issue we've faced relates to the pricing structure; with the pay-as-you-go model, we only get eight concurrent connections for web sockets for text-to-speech, which makes it difficult to scale. For enterprise, the annual fee is around $25,000 to $30,000 USD, regardless of usage, which allows for 100 concurrent connections, but still doesn't provide enough scalability when we're using a lot. I have noticed that the web socket connection sometimes breaks due to inactivity, and increasing the timeout period would be beneficial. Additionally, if you leave the web socket connection with the TTS model for a certain period, it loses the connection. Even when used continuously for long periods, it occasionally gives an error. During a call with the support team, they acknowledged that there is an issue on their side. We have the intention to improve or optimize stability for future releases, but since Deepgram is not open-source, we don't have much control over that.

Read full review

Deepgram mindshare

Product category:

As of March 2026, the mindshare of Deepgram in the Text-To-Speech Services category stands at 10.8%, up from 5.1% compared to the previous year, according to calculations based on PeerSpot user engagement data.

Text-To-Speech Services Mindshare Distribution
Product	Mindshare (%)
Deepgram	10.8%
Microsoft Azure Speech Service	21.2%
Amazon Polly	19.5%
Other	48.5%

Text-To-Speech Services

PeerResearch reports based on Deepgram reviews

Type	Title	Date
Category	Text-To-Speech Services	Mar 18, 2026	Download
Product	Reviews, tips, and advice from real users	Mar 18, 2026	Download
Comparison	Deepgram vs Amazon Polly	Mar 18, 2026	Download
Comparison	Deepgram vs Google Cloud Text-to-Speech	Mar 18, 2026	Download
Comparison	Deepgram vs Microsoft Azure Speech Service	Mar 18, 2026	Download

Valuable Features

Deepgram excels in high-speed transcription and industry-specific accuracy. Users appreciate its low latency, real-time speech-to-text capabilities, and adaptability with international accents. Its continuous innovation with models like Nova and Flux, ease of integration, and cost-effectiveness are highlighted as key strengths. Deepgram’s reliability, outstanding customer support, and impressive voice quality enhance productivity and efficiency, making it a preferred choice over rivals for organizations needing high accuracy and stable performance.

"The best features of Deepgram for me are the level of transcription accuracy it provides and the amount of time it saves."
"Deepgram's low latency transcription has greatly impacted my ability to deliver reliable voice agents and provided very good transcription."
"The most valuable capabilities of Deepgram that I've found so far include low latency, as it offers less than 200 milliseconds, which is not provided by any other text-to-speech models."

Room for Improvement

Deepgram struggles with accurate speaker identification and grammar in transcripts. It challenges with dual-channel audio and lacks features for managing new updates easily. Its accuracy lags behind competitors, especially with non-English accents. Limited language support and unstable live transcription impact user experience. Pricing issues affect scalability for large users. Enhancements needed include advanced analytics, specialized industry models, better speaker diarization, and seamless STT-to-TTS integration. Customizing and accurately detecting specialized terms and accent variations require improvement.

"Even though Deepgram has many customization options, I wish that Deepgram had voice cloning customization to a much larger extent."
"When I had an AI interview for coding, Deepgram didn't capture the names of programming languages or well-known LLMs accurately all the time."
"We haven't seen a return on investment with Deepgram so far; we have been building POCs for the last two years but recently switched to AWS in the last two months due to scalability issues with the pay-as-you-go model."

ROI

The ROI from Deepgram has been excellent for many users, significantly reducing transcription costs and improving efficiency. High accuracy and speed enhance workflows and user satisfaction. Cost-effective pricing aligns expenses with business needs. Users highlight the simplicity of integration without external help and value its foundation for services. Although some report scalability challenges leading to a switch, others attribute revenue increases to improved client satisfaction and reliable performance.

Pricing

Deepgram offers competitive pricing, generally cheaper than other transcription solutions. Users pay based on usage, with discounts for pre-committed hours, making it cost-effective for high-volume needs. Initial account setup is accessible at $200, which covers initial service use. While live transcription incurs some ongoing costs, they diminish over time. Enterprise users can benefit from affordable rates, with substantial savings in both usage and cloud resource costs, making Deepgram a financially viable choice.

"When using Deepgram, one needs to pay for the hours or minutes for which the transcription is needed."
"Deepgram is a cheap solution."
"The pricing is moderate."

Popular Use Cases

Deepgram's primary use involves transcribing speech to text for purposes such as transcribing videos, phone calls, meetings, and legal transcripts. Organizations utilize it for creating AI-driven voice bots, bilingual transcriptions, and text-to-speech processes. It records meeting voices, integrates into AI voice agents for safety applications, and supports feedback survey apps. Companies in diverse sectors rely on Deepgram for its speed, reliability, and ability to handle multilingual tasks with high accuracy.

Service and Support

Customer service and support for Deepgram receive mixed feedback. Some users appreciate the responsive team and effective communication via Slack and email, while others find technical assistance average due to unclear issue resolution. Many value the comprehensive resources, including forums and webinars, and note that detailed documentation often reduces the need for direct support. Some users rate technical support highly, highlighting prompt help and positivity, despite a few challenges with speaker identification.

Deployment

Users found Deepgram's initial setup to be straightforward and similar to other platforms. With comprehensive documentation and code samples, integration was efficient. Development teams appreciated the modularity, stable updates, and ease for on-premises and cloud deployment. A supportive community enhanced the setup experience. Some suggested more intuitive configuration and an easier method for API customization. The DIY approach was favored for its independence from external vendors.

Scalability

Users found Deepgram highly scalable, allowing significant growth with minimal personnel. The tool handles large audio and user volumes effectively, maintaining accuracy. While its architecture supports real-time transcription and offers flexibility through a usage-based pricing model, some switched to AWS for higher scalability despite latency differences. There were no performance issues reported, though some encountered GDPR compliance challenges. The hands-off nature and financial viability were noted positively despite newer alternatives.

Stability

Users have not faced breakdowns or bugs with Deepgram. Though occasionally some access issues arise during new model testing, these are unrelated to stability. Most users agree features are stable with no crashes. Despite initial challenges, updates have improved performance significantly. No downtime reports, and the service offers a transparent status page, confirming its reliability for voice bot applications. Connection losses are rare and performance matches workload demands.

These insights are based on the in-depth reviews provided by peers to help you make a better buying decision.

Download our Deepgram Buyer's Guide for additional reliable information.

Review data by company size

By reviewers
Company Size	Count
Small Business	8
Midsize Enterprise	1
Large Enterprise	1

By reviewers

By visitors reading reviews
Company Size	Count
Small Business	69
Midsize Enterprise	45
Large Enterprise	84

By visitors reading reviews

Top industries

By visitors reading reviews

Educational Organization

Financial Services Firm

University

Computer Software Company

Comms Service Provider

Manufacturing Company

Retailer

Government

Marketing Services Firm

Construction Company

Healthcare Company

Outsourcing Company

Media Company

Energy/Utilities Company

Legal Firm

Transportation Company

Non Profit

Recreational Facilities/Services Company

Insurance Company

Wholesaler/Distributor

Real Estate/Law Firm

Consumer Goods Company

Logistics Company

Performing Arts

Wellness & Fitness Company

Hospitality Company

Pharma/Biotech Company

Compare Deepgram with alternative products

Learn more about Deepgram

Deepgram provides an efficient solution for transforming video and audio content into text, benefiting from its advanced ability to recognize industry-specific terminology. Users experience faster results compared to IBM Watson and OpenAI's Whisper model, with low latency contributing to its appeal. However, challenges in speaker recognition and language support remain areas for improvement. Additionally, stronger spelling and grammar accuracy could enhance its performance. Some seek expanded multi-language capabilities and improved manageability during testing phases, noting its slightly less accuracy compared to other tools.

What are Deepgram's most notable features?

Rapid Transcription: Utilizes cutting-edge models for quick speech-to-text conversion.
Industry Terminology Recognition: Excels in comprehending specific jargon and abbreviations.
Low Latency: Offers transcription with minimal delay, approximately 0.5 to 1 second.
Model Integration: Employs Whisper model combined with Nova for high accuracy.

What benefits should users look for when evaluating Deepgram?

High Speed: Significant improvement in processing time over competitors.
Performance Satisfaction: Users appreciate faster and more fluid transcription.
Textual Accuracy: Enhancements can lead to more reliable outputs in transcripts.
Streamlined Processes: Features like punctuation and Smart Format boost efficiency.

Deepgram is widely implemented across industries for transcribing speech to text, often used by organizations for generating machine transcripts of legal proceedings and other vital communications. Teams deploy it on local systems to convert videos and phone calls, integrating speech recognition seamlessly into applications.

Product Categories

Text-To-Speech Services

Speech-To-Text Services

AI Customer Support

AI Sales & Marketing

AI Scheduling & Coordination

Popular Comparisons

Microsoft Azure Speech Service vs Deepgram

Amazon Polly vs Deepgram

Google Cloud Text-to-Speech vs Deepgram

Google Cloud Speech-to-Text vs Deepgram

Amazon Transcribe vs Deepgram

ElevenLabs vs Deepgram

AssemblyAI vs Deepgram

Gladia vs Deepgram

Speechmatics vs Deepgram

Sarvam AI Sarvam Samvaad vs Deepgram

Rev.ai vs Deepgram

See all alternatives

Deepgram Reviews Summary
Author info	Rating	Review Summary
Technology Architect & Hands-On Leader \| Prototyping, Automation, AI/LLM Integration \| 20+ Years in at Regalix	5.0	I've used Deepgram for four years in voice bot projects, valuing its configurability, evolving models, and strong support. It's accurate across accents, cost-effective, highly scalable, and integrates smoothly, with Flux addressing earlier limitations in conversational speech detection.
Software Engineer at BIFROTEK	4.5	I use Deepgram for reliable, low-latency transcription in AI voice integrations, mainly for German. It performs well overall, though handling special terms needs improvement. Setup could be more intuitive, but it's contributed significantly to my business success.
AI Applied Engineer at Flexon Technologies Talent360.ai	4.0	I've used Deepgram for nearly two years for text-to-speech in AI chatbots; its low latency and realistic voices impressed me, but limited scalability and pricing led us to switch to AWS despite its higher latency.
Co-founder at a tech services company with 1-10 employees	4.5	I use Deepgram to transcribe bilingual conversations for a security AI voice agent, and I’m impressed by its accuracy, speed, and ease of integration, though improvements in handling Spanish accents would make it even better.
Business Development Representative at a educational organization with 201-500 employees	4.5	I've used Deepgram for about a month to transcribe meetings and create content, appreciating its accuracy and time-saving features, though I wish it offered more advanced voice cloning and slightly lower pricing.
VP Product at PeerSpot	4.5	No summary available
AWS \| Back-End Team Lead at eScribers, LLC	4.5	No summary available
Back End Developer at AskHumans	4.0	No summary available

Title	Rating	Mindshare	Recommending
Microsoft Azure Speech Service	4.5	21.2%	100%	3 interviews Add to research
Amazon Polly	3.7	19.5%	100%	5 interviews Add to research

Deepgram Reviews

What is Deepgram?

Featured Deepgram reviews

Deepgram mindshare

PeerResearch reports based on Deepgram reviews

Valuable Features

Room for Improvement

ROI

Pricing

Popular Use Cases

Service and Support

Deployment

Scalability

Stability

Review data by company size

Top industries

Compare Deepgram with alternative products

Learn more about Deepgram

Related questions

Product Categories

Popular Comparisons