No more typing reviews! Try our Samantha, our new voice AI agent.

AWS Data Pipeline [EOL] vs AWS Glue comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

AWS Data Pipeline [EOL]
Average Rating
8.0
Number of Reviews
2
Ranking in other categories
No ranking in other categories
AWS Glue
Average Rating
7.8
Reviews Sentiment
6.9
Number of Reviews
50
Ranking in other categories
Cloud Data Integration (1st)
 

Featured Reviews

BR
Senior Director Data Architecture at Managed Markets Insight & Technology, LLC
A tool with great orchestration and development capabilities but needs to improve its user-defined functions
In the tool, parallel processing is an area that is contingent, in the sense that you have to be watchful for the cap that you have in terms of computing behind AWS Data Pipeline. You need to always watch for some reason. I am capped with 200 nodes, and if I get to use more than 200 nodes, the AWS Data Pipeline will fail. AWS doesn't state that I have almost gone beyond my limits, and it is allowing me now to go beyond the set limits if I talk to a representative and figure it out. Such aforementioned warnings are not let out by AWS, and they end up failing the nodes if I go beyond the set cap limits.
SC
application security engineer at Hyperspace IT India
Efficient data integration reduces operational time and enhances metadata management
For the initial setup with AWS Glue, I find it easy to set up the data catalog and create Glue jobs using the visual editor or the visual code. Setting permission sets via IAM rules can be a bit tricky at the start, but we ensure Glue has access to AWS S3, Redshift, and other services. Once the role is configured, it runs smoothly. For advanced configurations, connecting to VPCs and setting up connections with JDBC sources takes more time compared to my cloud experience, but overall, for someone with cloud and ETL experience, the setup is manageable and well done.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"It is a stable solution...It is a scalable solution."
"The most valuable feature of the solution is that orchestration and development capabilities are easier with the tool."
"I like that it's flexible, powerful, and allows you to write your own queries and scripts to get the needed transformations."
"You do not need many frameworks to run Glue."
"It is AWS-integrated, there is end-to-end integration with the other AWS services, and it is also user-friendly."
"The solution helps organizations gain flexibility in defining the structure of the data."
"It is a stable and scalable solution."
"In my opinion, it is one of the best tools in the market for ETL processes because of the fact that you pay as you use, which separates it from other big tools such as PowerCenter, Pentaho Data Integration, and Talend."
"The most valuable feature for me is the visual interface of AWS Glue."
"It's fairly straightforward as a product; it's not very complicated."
 

Cons

"The user-defined functions have shortcomings in AWS Data Pipeline."
"It's almost semi-automatic because you must review and approve code push, which works well. Still, we had many problems getting there during the deployment process, but we got there."
"There could be an enhanced way of managing pure metadata management or data cataloging."
"The setup and installation is a bit complex without advanced knowledge or training."
"It is not clear how the partition discovery would have been affected by more data coming in."
"If there's a cluster-related configuration, we have to make worker notes, which is quite a headache when processing a large amount of data."
"I would like to see stable libraries at the moment they are not there."
"It would be better if it were more user-friendly. The interesting thing we found is that it was a little strange at the beginning. The way Glue works is not very straightforward. After trying different things, for example, we used just the console to create jobs. Then we realized that things were not working as expected. After researching and learning more, we realized that even though the console creates the script for the ETL processes, you need to modify or write your own script in Spark to do everything you want it to do. For example, we are pulling data from our source database and our application database, which is in Aurora. From there, we are doing the ETL to transform the data and write the results into Redshift. But what was surprising is that it's almost like whatever you want to do, you can do it with Glue because you have the option to put together your own script. Even though there are many functionalities and many connections, you have the opportunity to write your own queries to do whatever transformations you need to do. It's a little deceiving that some options are supposed to work in a certain way when you set them up in the console, but then they are not exactly working the right way or not as expected. It would be better if they provided more examples and more documentation on options."
"The solution’s technical support could be improved."
"Improvements in the UI are needed, as it is challenging to understand some functionalities."
 

Pricing and Cost Advice

"The way we use it, I think it is fair as we're getting a good value for money compared to having a server or some other data pipeline."
"I rate the pricing between six to eight on a scale from one to ten, where one is low price, and ten is high price."
"Its price is good. We pay as we go or based on the usage, which is a good thing for us because it is simple to forecast for the tool. It is good in terms of the financial planning of the company, and it is a good way to estimate the cost. It is also simple for our clients. In my opinion, it is one of the best tools in the market for ETL processes because of the fact that you pay as you use, which separates it from other big tools such as PowerCenter, Pentaho Data Integration, and Talend."
"If you are using the solution for an enterprise business, it will be expensive."
"I rate the tool's pricing a four out of ten."
"The solution's pricing is based on DPUs so it is a good idea to optimize use or it can get expensive."
"I rate the product's pricing a five on a scale of one to ten, where one is a high price, and ten is a low price."
"It is not expensive. AWS Glue works on the serverless architecture. We get charged for the time the server is up. For our use case, we have to use it once in a day, and it is not expensive for us."
"The pricing is a bit higher than other solutions like Athena and EC2. If the pricing becomes more scaled or flexible, it will be good because you have to pay 44 cents just for one DPU for an hour. If you increase DPUs to 5 or 10, the pricing gets multiplied. There are also some time limits like 0 to 10 minutes or 10 to 20 minutes. If the pricing is according to the minutes, it would be better because you have to limit your job to 10 minutes or 20 minutes."
"The overall cost of AWS Glue could be better. It cost approximately $1,000 a month. There is paid support available from AWS Glue."
report
Use our free recommendation engine to learn which Cloud Data Integration solutions are best for your needs.
893,244 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
No data available
Financial Services Firm
19%
Computer Software Company
8%
Manufacturing Company
8%
Comms Service Provider
6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
By reviewers
Company SizeCount
Small Business11
Midsize Enterprise6
Large Enterprise34
 

Questions from the Community

Ask a question
Earn 20 points
How do you select the right cloud ETL tool?
AWS Glue and Azure Data factory for ELT best performance cloud services.
How does Talend Open Studio compare with AWS Glue?
We reviewed AWS Glue before choosing Talend Open Studio. AWS Glue is the managed ETL (extract, transform, and load) from Amazon Web Services. AWS Glue enables AWS users to create and manage jobs in...
What are the most common use cases for AWS Glue?
AWS Glue's main use case is for allowing users to discover, prepare, move, and integrate data from multiple sources. The product lets you use this data for analytics, application development, or ma...
 

Overview

 

Sample Customers

bp, Cerner, Expedia, Finra, HESS, intuit, Kellog's, Philips, TIME, workday
bp, Cerner, Expedia, Finra, HESS, intuit, Kellog's, Philips, TIME, workday
Find out what your peers are saying about Amazon Web Services (AWS), Informatica, Salesforce and others in Cloud Data Integration. Updated: May 2026.
893,244 professionals have used our research since 2012.