No more typing reviews! Try our Samantha, our new voice AI agent.

AWS Glue vs Amazon Data Firehose comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Amazon Data Firehose
Ranking in Cloud Data Integration
19th
Average Rating
9.0
Reviews Sentiment
8.1
Number of Reviews
1
Ranking in other categories
No ranking in other categories
AWS Glue
Ranking in Cloud Data Integration
1st
Average Rating
7.8
Reviews Sentiment
6.9
Number of Reviews
50
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of July 2026, in the Cloud Data Integration category, the mindshare of Amazon Data Firehose is 1.0%, down from 1.1% compared to the previous year. The mindshare of AWS Glue is 7.7%, down from 17.9% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Cloud Data Integration Mindshare Distribution
ProductMindshare (%)
AWS Glue7.7%
Amazon Data Firehose1.0%
Other91.3%
Cloud Data Integration
 

Featured Reviews

Johnny Suleiman - PeerSpot reviewer
MS AWS expert at Bespin Global
Enhances our AI-driven analytics projects by providing a means to manage data streaming and delivery at any scale
The primary use case of Amazon Data Firehose is for real-time streaming data, specifically for data analysis and collection purposes. It is used to extract useful data and export it for machine learning algorithms to analyze, providing real-time data streaming Amazon Data Firehose enhances our…
SC
application security engineer at Hyperspace IT India
Efficient data integration reduces operational time and enhances metadata management
For the initial setup with AWS Glue, I find it easy to set up the data catalog and create Glue jobs using the visual editor or the visual code. Setting permission sets via IAM rules can be a bit tricky at the start, but we ensure Glue has access to AWS S3, Redshift, and other services. Once the role is configured, it runs smoothly. For advanced configurations, connecting to VPCs and setting up connections with JDBC sources takes more time compared to my cloud experience, but overall, for someone with cloud and ETL experience, the setup is manageable and well done.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The most valuable feature is its capability for real-time data streaming."
"It is AWS-integrated, there is end-to-end integration with the other AWS services, and it is also user-friendly."
"AWS Glue's best features are scalability and cloud-based features."
"What I like best about AWS Glue is its real-time data backup feature. Last week, there was a production push, and what used to take almost ten days to send out around fifty-six thousand emails now takes only two hours."
"One of the best features of the solution is its ability to easily integrate with other AWS services."
"One aspect that I would like to highlight is the Glue Crawler, which we utilize when working with large datasets to ensure the schema updates seamlessly without requiring end-team knowledge."
"I also like that you can add custom libraries like JAR files and use them. So, the ability to use a fast processing engine and embed basic jobs easily are significant advantages."
"The key role for Glue is that it hosts our metadata before rolling out our actual data. This is the major advantage of using this solution and our clients client have been very satisfied with it."
"AWS Glue is quite better than other tools, but you have to learn it properly before you start using it."
 

Cons

"Amazon Data Firehose enhances our AI-driven analytics projects by providing a means to manage data streaming and delivery at any scale."
"It is very difficult to learn the tool and remember the syntaxes comparatively."
"While working on AWS Glue, I could not find any training material for it."
"There could be an enhanced way of managing pure metadata management or data cataloging."
"The solution needs to expand its 30-minute query or runtime."
"It would be better if it were more user-friendly. The interesting thing we found is that it was a little strange at the beginning. The way Glue works is not very straightforward. After trying different things, for example, we used just the console to create jobs. Then we realized that things were not working as expected. After researching and learning more, we realized that even though the console creates the script for the ETL processes, you need to modify or write your own script in Spark to do everything you want it to do. For example, we are pulling data from our source database and our application database, which is in Aurora. From there, we are doing the ETL to transform the data and write the results into Redshift. But what was surprising is that it's almost like whatever you want to do, you can do it with Glue because you have the option to put together your own script. Even though there are many functionalities and many connections, you have the opportunity to write your own queries to do whatever transformations you need to do. It's a little deceiving that some options are supposed to work in a certain way when you set them up in the console, but then they are not exactly working the right way or not as expected. It would be better if they provided more examples and more documentation on options."
"The solution’s stability could be improved."
"In terms of improvement, the performance of AWS Glue could be faster."
"Only people who can code, either in Java or Python, can use the product freely. Those who don't know Java or Python might find using AWS Glue difficult."
 

Pricing and Cost Advice

Information not available
"The pricing is a bit higher than other solutions like Athena and EC2. If the pricing becomes more scaled or flexible, it will be good because you have to pay 44 cents just for one DPU for an hour. If you increase DPUs to 5 or 10, the pricing gets multiplied. There are also some time limits like 0 to 10 minutes or 10 to 20 minutes. If the pricing is according to the minutes, it would be better because you have to limit your job to 10 minutes or 20 minutes."
"AWS Glue is a paid service that doesn't come under the free trial of AWS."
"The solution's pricing is based on DPUs so it is a good idea to optimize use or it can get expensive."
"Its price is good. We pay as we go or based on the usage, which is a good thing for us because it is simple to forecast for the tool. It is good in terms of the financial planning of the company, and it is a good way to estimate the cost. It is also simple for our clients. In my opinion, it is one of the best tools in the market for ETL processes because of the fact that you pay as you use, which separates it from other big tools such as PowerCenter, Pentaho Data Integration, and Talend."
"It is not expensive. AWS Glue works on the serverless architecture. We get charged for the time the server is up. For our use case, we have to use it once in a day, and it is not expensive for us."
"If you are using the solution for an enterprise business, it will be expensive."
"AWS Glue is quite costly, especially for small organizations."
"I rate the tool an eight on a scale of one to ten, where one is expensive, and ten is expensive."
report
Use our free recommendation engine to learn which Cloud Data Integration solutions are best for your needs.
902,894 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
No data available
Financial Services Firm
20%
Manufacturing Company
8%
Computer Software Company
7%
Comms Service Provider
6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
By reviewers
Company SizeCount
Small Business11
Midsize Enterprise6
Large Enterprise34
 

Questions from the Community

What is your experience regarding pricing and costs for Amazon Data Firehose?
The pricing is fair and balanced for the capabilities provided by Amazon Data Firehose.
What needs improvement with Amazon Data Firehose?
There is no specific improvement mentioned for Amazon Data Firehose itself. However, it was noted that there could be room for a better understanding of real-time data streaming concepts for junior...
What is your primary use case for Amazon Data Firehose?
The primary use case of Amazon Data Firehose is for real-time streaming data, specifically for data analysis and collection purposes. It is used to extract useful data and export it for machine lea...
How do you select the right cloud ETL tool?
AWS Glue and Azure Data factory for ELT best performance cloud services.
How does Talend Open Studio compare with AWS Glue?
We reviewed AWS Glue before choosing Talend Open Studio. AWS Glue is the managed ETL (extract, transform, and load) from Amazon Web Services. AWS Glue enables AWS users to create and manage jobs in...
What are the most common use cases for AWS Glue?
AWS Glue's main use case is for allowing users to discover, prepare, move, and integrate data from multiple sources. The product lets you use this data for analytics, application development, or ma...
 

Overview

 

Sample Customers

Information Not Available
bp, Cerner, Expedia, Finra, HESS, intuit, Kellog's, Philips, TIME, workday
Find out what your peers are saying about Amazon Web Services (AWS), Informatica, Palantir and others in Cloud Data Integration. Updated: June 2026.
902,894 professionals have used our research since 2012.