No more typing reviews! Try our Samantha, our new voice AI agent.

AWS Glue vs Amazon Data Firehose comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Amazon Data Firehose
Ranking in Cloud Data Integration
17th
Average Rating
9.0
Reviews Sentiment
8.1
Number of Reviews
1
Ranking in other categories
No ranking in other categories
AWS Glue
Ranking in Cloud Data Integration
1st
Average Rating
7.8
Reviews Sentiment
6.9
Number of Reviews
50
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of June 2026, in the Cloud Data Integration category, the mindshare of Amazon Data Firehose is 1.0%, up from 1.0% compared to the previous year. The mindshare of AWS Glue is 7.6%, down from 18.8% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Cloud Data Integration Mindshare Distribution
ProductMindshare (%)
AWS Glue7.6%
Amazon Data Firehose1.0%
Other91.4%
Cloud Data Integration
 

Featured Reviews

Johnny Suleiman - PeerSpot reviewer
MS AWS expert at Bespin Global
Enhances our AI-driven analytics projects by providing a means to manage data streaming and delivery at any scale
The primary use case of Amazon Data Firehose is for real-time streaming data, specifically for data analysis and collection purposes. It is used to extract useful data and export it for machine learning algorithms to analyze, providing real-time data streaming Amazon Data Firehose enhances our…
SC
application security engineer at Hyperspace IT India
Efficient data integration reduces operational time and enhances metadata management
For the initial setup with AWS Glue, I find it easy to set up the data catalog and create Glue jobs using the visual editor or the visual code. Setting permission sets via IAM rules can be a bit tricky at the start, but we ensure Glue has access to AWS S3, Redshift, and other services. Once the role is configured, it runs smoothly. For advanced configurations, connecting to VPCs and setting up connections with JDBC sources takes more time compared to my cloud experience, but overall, for someone with cloud and ETL experience, the setup is manageable and well done.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The most valuable feature is its capability for real-time data streaming."
"Transformations are valuable because you can modify or override complex data logic from an open source or Spark to solve issues."
"AWS Glue's best features are scalability and cloud-based features."
"The solution helps organizations gain flexibility in defining the structure of the data."
"Our entire use case was very easily handled or solved using this solution."
"Its ease of use, cost-effectiveness, and highly secure architecture are some of the most valuable features."
"The solution is serverless so it allows us to transform data while optimizing the cost and performance of Spark jobs."
"We no longer had to worry much about infrastructure management because AWS Glue is serverless, and Amazon takes care of the underlying infrastructure."
"The facility to integrate with S3 and the possibility to use Jupyter Notebook inside the pipeline are the most valuable features."
 

Cons

"Amazon Data Firehose enhances our AI-driven analytics projects by providing a means to manage data streaming and delivery at any scale."
"In terms of improvement, the performance of AWS Glue could be faster."
"It would be better if it were more user-friendly. The interesting thing we found is that it was a little strange at the beginning. The way Glue works is not very straightforward. After trying different things, for example, we used just the console to create jobs. Then we realized that things were not working as expected. After researching and learning more, we realized that even though the console creates the script for the ETL processes, you need to modify or write your own script in Spark to do everything you want it to do. For example, we are pulling data from our source database and our application database, which is in Aurora. From there, we are doing the ETL to transform the data and write the results into Redshift. But what was surprising is that it's almost like whatever you want to do, you can do it with Glue because you have the option to put together your own script. Even though there are many functionalities and many connections, you have the opportunity to write your own queries to do whatever transformations you need to do. It's a little deceiving that some options are supposed to work in a certain way when you set them up in the console, but then they are not exactly working the right way or not as expected. It would be better if they provided more examples and more documentation on options."
"The drawbacks associated with the product stem from the fact that, based on the data volume, it can become very costly."
"There should be more connectors for different databases."
"The overall cost of AWS Glue could be better. It cost approximately $1,000 a month."
"The crucial problem with AWS Glue is that it only works with AWS. It is not an agnostic tool like Pentaho. In PowerCenter, we can install the forms from Google and other vendors, but in the case of AWS Glue, we can only use AWS."
"The technical support for this solution could be improved. In future, we would like to connect more services like Athena or Kinesis to help control more loads of data."
"Overall, I consider the technical support to be fine, although the response time could be faster in certain cases."
 

Pricing and Cost Advice

Information not available
"Its price is good. We pay as we go or based on the usage, which is a good thing for us because it is simple to forecast for the tool. It is good in terms of the financial planning of the company, and it is a good way to estimate the cost. It is also simple for our clients. In my opinion, it is one of the best tools in the market for ETL processes because of the fact that you pay as you use, which separates it from other big tools such as PowerCenter, Pentaho Data Integration, and Talend."
"I rate the tool's pricing a four out of ten."
"AWS Glue is a high-priced solution that bills the client $150,000 to $250,000 annually."
"The solution's pricing is based on DPUs so it is a good idea to optimize use or it can get expensive."
"I rate pricing an eight out of ten."
"I would rate the solution a six or seven on a scale of one to ten, with ten being very expensive. Specifically, I rate its pricing a six out of ten."
"The current cost is around forty to fifty thousand a month."
"The pricing is a bit higher than other solutions like Athena and EC2. If the pricing becomes more scaled or flexible, it will be good because you have to pay 44 cents just for one DPU for an hour. If you increase DPUs to 5 or 10, the pricing gets multiplied. There are also some time limits like 0 to 10 minutes or 10 to 20 minutes. If the pricing is according to the minutes, it would be better because you have to limit your job to 10 minutes or 20 minutes."
report
Use our free recommendation engine to learn which Cloud Data Integration solutions are best for your needs.
899,125 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
No data available
Financial Services Firm
19%
Manufacturing Company
8%
Computer Software Company
8%
Comms Service Provider
5%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
By reviewers
Company SizeCount
Small Business11
Midsize Enterprise6
Large Enterprise34
 

Questions from the Community

What is your experience regarding pricing and costs for Amazon Data Firehose?
The pricing is fair and balanced for the capabilities provided by Amazon Data Firehose.
What needs improvement with Amazon Data Firehose?
There is no specific improvement mentioned for Amazon Data Firehose itself. However, it was noted that there could be room for a better understanding of real-time data streaming concepts for junior...
What is your primary use case for Amazon Data Firehose?
The primary use case of Amazon Data Firehose is for real-time streaming data, specifically for data analysis and collection purposes. It is used to extract useful data and export it for machine lea...
How do you select the right cloud ETL tool?
AWS Glue and Azure Data factory for ELT best performance cloud services.
How does Talend Open Studio compare with AWS Glue?
We reviewed AWS Glue before choosing Talend Open Studio. AWS Glue is the managed ETL (extract, transform, and load) from Amazon Web Services. AWS Glue enables AWS users to create and manage jobs in...
What are the most common use cases for AWS Glue?
AWS Glue's main use case is for allowing users to discover, prepare, move, and integrate data from multiple sources. The product lets you use this data for analytics, application development, or ma...
 

Overview

 

Sample Customers

Information Not Available
bp, Cerner, Expedia, Finra, HESS, intuit, Kellog's, Philips, TIME, workday
Find out what your peers are saying about Amazon Web Services (AWS), Informatica, Palantir and others in Cloud Data Integration. Updated: June 2026.
899,125 professionals have used our research since 2012.