No more typing reviews! Try our Samantha, our new voice AI agent.

AWS Glue vs Amazon Data Firehose comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Amazon Data Firehose
Ranking in Cloud Data Integration
17th
Average Rating
9.0
Reviews Sentiment
8.1
Number of Reviews
1
Ranking in other categories
No ranking in other categories
AWS Glue
Ranking in Cloud Data Integration
1st
Average Rating
7.8
Reviews Sentiment
6.9
Number of Reviews
50
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of June 2026, in the Cloud Data Integration category, the mindshare of Amazon Data Firehose is 1.0%, up from 1.0% compared to the previous year. The mindshare of AWS Glue is 7.6%, down from 18.8% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Cloud Data Integration Mindshare Distribution
ProductMindshare (%)
AWS Glue7.6%
Amazon Data Firehose1.0%
Other91.4%
Cloud Data Integration
 

Featured Reviews

Johnny Suleiman - PeerSpot reviewer
MS AWS expert at Bespin Global
Enhances our AI-driven analytics projects by providing a means to manage data streaming and delivery at any scale
The primary use case of Amazon Data Firehose is for real-time streaming data, specifically for data analysis and collection purposes. It is used to extract useful data and export it for machine learning algorithms to analyze, providing real-time data streaming Amazon Data Firehose enhances our…
SC
application security engineer at Hyperspace IT India
Efficient data integration reduces operational time and enhances metadata management
For the initial setup with AWS Glue, I find it easy to set up the data catalog and create Glue jobs using the visual editor or the visual code. Setting permission sets via IAM rules can be a bit tricky at the start, but we ensure Glue has access to AWS S3, Redshift, and other services. Once the role is configured, it runs smoothly. For advanced configurations, connecting to VPCs and setting up connections with JDBC sources takes more time compared to my cloud experience, but overall, for someone with cloud and ETL experience, the setup is manageable and well done.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The most valuable feature is its capability for real-time data streaming."
"We have found it beneficial when moving data from one source to another."
"If I'm working with big data, common languages like Python work quite nicely, which is advantageous."
"The facility to integrate with S3 and the possibility to use Jupyter Notebook inside the pipeline are the most valuable features."
"The AWS Glue Data Catalog provides metadata management and schema discovery. AWS Glue simplifies data transformation with automatic schema detection, incremental data updates, and integration with other AWS services."
"The two features I find most valuable in AWS Glue are its user interface and ease of use."
"One aspect that I would like to highlight is the Glue Crawler, which we utilize when working with large datasets to ensure the schema updates seamlessly without requiring end-team knowledge."
"I appreciate AWS Glue for its cost-effectiveness."
"The price is very good; it's enticing people to move to the cloud."
 

Cons

"Amazon Data Firehose enhances our AI-driven analytics projects by providing a means to manage data streaming and delivery at any scale."
"While working on AWS Glue, I could not find any training material for it."
"There is a learning curve to this tool."
"The setup and installation is a bit complex without advanced knowledge or training."
"Cost-wise, AWS Glue is expensive, so that's an area for improvement. The process for setting up the solution was also complex, which is another area for improvement."
"It would be better if it were more user-friendly. The interesting thing we found is that it was a little strange at the beginning. The way Glue works is not very straightforward. After trying different things, for example, we used just the console to create jobs. Then we realized that things were not working as expected. After researching and learning more, we realized that even though the console creates the script for the ETL processes, you need to modify or write your own script in Spark to do everything you want it to do. For example, we are pulling data from our source database and our application database, which is in Aurora. From there, we are doing the ETL to transform the data and write the results into Redshift. But what was surprising is that it's almost like whatever you want to do, you can do it with Glue because you have the option to put together your own script. Even though there are many functionalities and many connections, you have the opportunity to write your own queries to do whatever transformations you need to do. It's a little deceiving that some options are supposed to work in a certain way when you set them up in the console, but then they are not exactly working the right way or not as expected. It would be better if they provided more examples and more documentation on options."
"On occasion, the solution's dashboard reports that a project failed due to runtime but it actually succeeded."
"In the building and deployment aspects, there is room for improvement. The current process is a bit complicated and could benefit from being more user-friendly and simpler, which would help speed up the deployment process."
"It is very difficult to learn the tool and remember the syntaxes comparatively."
 

Pricing and Cost Advice

Information not available
"The pricing is a bit higher than other solutions like Athena and EC2. If the pricing becomes more scaled or flexible, it will be good because you have to pay 44 cents just for one DPU for an hour. If you increase DPUs to 5 or 10, the pricing gets multiplied. There are also some time limits like 0 to 10 minutes or 10 to 20 minutes. If the pricing is according to the minutes, it would be better because you have to limit your job to 10 minutes or 20 minutes."
"I would rate the solution a six or seven on a scale of one to ten, with ten being very expensive. Specifically, I rate its pricing a six out of ten."
"It is not expensive. AWS Glue works on the serverless architecture. We get charged for the time the server is up. For our use case, we have to use it once in a day, and it is not expensive for us."
"The current cost is around forty to fifty thousand a month."
"AWS Glue is quite costly, especially for small organizations."
"Technical support is a paid service, and which subscription you have is dependent on that. You must pay one of them, and it ranges from $15,000 to $25,000 per year."
"This solution is affordable and there is an option to pay for the solution based on your usage."
"It is an expensive product. I rate its pricing a nine out of ten."
report
Use our free recommendation engine to learn which Cloud Data Integration solutions are best for your needs.
902,270 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
No data available
Financial Services Firm
20%
Manufacturing Company
8%
Computer Software Company
7%
Comms Service Provider
6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
By reviewers
Company SizeCount
Small Business11
Midsize Enterprise6
Large Enterprise34
 

Questions from the Community

What is your experience regarding pricing and costs for Amazon Data Firehose?
The pricing is fair and balanced for the capabilities provided by Amazon Data Firehose.
What needs improvement with Amazon Data Firehose?
There is no specific improvement mentioned for Amazon Data Firehose itself. However, it was noted that there could be room for a better understanding of real-time data streaming concepts for junior...
What is your primary use case for Amazon Data Firehose?
The primary use case of Amazon Data Firehose is for real-time streaming data, specifically for data analysis and collection purposes. It is used to extract useful data and export it for machine lea...
How do you select the right cloud ETL tool?
AWS Glue and Azure Data factory for ELT best performance cloud services.
How does Talend Open Studio compare with AWS Glue?
We reviewed AWS Glue before choosing Talend Open Studio. AWS Glue is the managed ETL (extract, transform, and load) from Amazon Web Services. AWS Glue enables AWS users to create and manage jobs in...
What are the most common use cases for AWS Glue?
AWS Glue's main use case is for allowing users to discover, prepare, move, and integrate data from multiple sources. The product lets you use this data for analytics, application development, or ma...
 

Overview

 

Sample Customers

Information Not Available
bp, Cerner, Expedia, Finra, HESS, intuit, Kellog's, Philips, TIME, workday
Find out what your peers are saying about Amazon Web Services (AWS), Informatica, Palantir and others in Cloud Data Integration. Updated: June 2026.
902,270 professionals have used our research since 2012.