Try our new research platform with insights from 80,000+ expert users

AWS Data Pipeline [EOL] vs AWS Glue comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

AWS Data Pipeline [EOL]
Average Rating
8.0
Number of Reviews
2
Ranking in other categories
No ranking in other categories
AWS Glue
Average Rating
7.8
Reviews Sentiment
6.9
Number of Reviews
50
Ranking in other categories
Cloud Data Integration (1st)
 

Featured Reviews

BR
Senior Director Data Architecture at Managed Markets Insight & Technology, LLC
A tool with great orchestration and development capabilities but needs to improve its user-defined functions
In the tool, parallel processing is an area that is contingent, in the sense that you have to be watchful for the cap that you have in terms of computing behind AWS Data Pipeline. You need to always watch for some reason. I am capped with 200 nodes, and if I get to use more than 200 nodes, the AWS Data Pipeline will fail. AWS doesn't state that I have almost gone beyond my limits, and it is allowing me now to go beyond the set limits if I talk to a representative and figure it out. Such aforementioned warnings are not let out by AWS, and they end up failing the nodes if I go beyond the set cap limits.
SC
application security engineer at Hyperspace IT India
Efficient data integration reduces operational time and enhances metadata management
For the initial setup with AWS Glue, I find it easy to set up the data catalog and create Glue jobs using the visual editor or the visual code. Setting permission sets via IAM rules can be a bit tricky at the start, but we ensure Glue has access to AWS S3, Redshift, and other services. Once the role is configured, it runs smoothly. For advanced configurations, connecting to VPCs and setting up connections with JDBC sources takes more time compared to my cloud experience, but overall, for someone with cloud and ETL experience, the setup is manageable and well done.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The most valuable feature of the solution is that orchestration and development capabilities are easier with the tool."
"It is a stable solution...It is a scalable solution."
"Glue is a NoSQL-based data ETL tool that has some advantages over IIS and ISAs."
"One aspect that I would like to highlight is the Glue Crawler, which we utilize when working with large datasets to ensure the schema updates seamlessly without requiring end-team knowledge."
"AWS Glue's best features are scalability and cloud-based features."
"AWS Glue is a good solution for developers, they have the ability to write code in different languages and other software."
"I appreciate AWS Glue for its cost-effectiveness."
"You do not need many frameworks to run Glue."
"It is AWS-integrated. There is end-to-end integration with the other AWS services. It is also user-friendly."
"The key role for Glue is that it hosts our metadata before rolling out our actual data. This is the major advantage of using this solution and our clients client have been very satisfied with it."
 

Cons

"It's almost semi-automatic because you must review and approve code push, which works well. Still, we had many problems getting there during the deployment process, but we got there."
"The user-defined functions have shortcomings in AWS Data Pipeline."
"The interface for AWS Glue could improve, they do not put a lot of details. You can write the code, in PySpark or in Scala, which is a big advantage, it is only easy to use for a developer. It will be difficult for new users to enter the cloud environment."
"The start-up time is really high right now. For instance, when you start up a new job, you have to wait for five or eight minutes before it starts. If the start-up time is reduced to one or two minutes, it will be great. It will be better to have a direct linkage to Redshift in AWS. If we can use data catalogs from Redshift, it will be so easy to create some data catalogs. Currently, we can only use data catalogs from S3."
"In the building and deployment aspects, there is room for improvement. The current process is a bit complicated and could benefit from being more user-friendly and simpler, which would help speed up the deployment process."
"I have encountered challenges with multi-region support."
"We face performance issues when using AWS Glue for data transformation and integration."
"The product has only a few built-in transformations."
"The price of the solution could improve."
"Currently, it supports only two languages in the background: Python and Scala. From our customization point of view, it would be helpful if it can also support Java in the background."
 

Pricing and Cost Advice

"The way we use it, I think it is fair as we're getting a good value for money compared to having a server or some other data pipeline."
"I rate the pricing between six to eight on a scale from one to ten, where one is low price, and ten is high price."
"It is an expensive product. I rate its pricing a nine out of ten."
"AWS Glue uses a pay-as-you-go approach which is helpful. The price of the overall solution is low and is a great advantage."
"AWS Glue is a paid service that doesn't come under the free trial of AWS."
"The overall cost of AWS Glue could be better. It cost approximately $1,000 a month. There is paid support available from AWS Glue."
"I rate the tool an eight on a scale of one to ten, where one is expensive, and ten is expensive."
"I would rate the solution a six or seven on a scale of one to ten, with ten being very expensive. Specifically, I rate its pricing a six out of ten."
"I rate pricing an eight out of ten."
"It is not expensive. AWS Glue works on the serverless architecture. We get charged for the time the server is up. For our use case, we have to use it once in a day, and it is not expensive for us."
report
Use our free recommendation engine to learn which Cloud Data Integration solutions are best for your needs.
884,933 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
No data available
Financial Services Firm
19%
Computer Software Company
10%
Manufacturing Company
8%
Government
5%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
By reviewers
Company SizeCount
Small Business11
Midsize Enterprise6
Large Enterprise32
 

Questions from the Community

Ask a question
Earn 20 points
How do you select the right cloud ETL tool?
AWS Glue and Azure Data factory for ELT best performance cloud services.
How does Talend Open Studio compare with AWS Glue?
We reviewed AWS Glue before choosing Talend Open Studio. AWS Glue is the managed ETL (extract, transform, and load) from Amazon Web Services. AWS Glue enables AWS users to create and manage jobs in...
What are the most common use cases for AWS Glue?
AWS Glue's main use case is for allowing users to discover, prepare, move, and integrate data from multiple sources. The product lets you use this data for analytics, application development, or ma...
 

Overview

 

Sample Customers

bp, Cerner, Expedia, Finra, HESS, intuit, Kellog's, Philips, TIME, workday
bp, Cerner, Expedia, Finra, HESS, intuit, Kellog's, Philips, TIME, workday
Find out what your peers are saying about Amazon Web Services (AWS), Informatica, Salesforce and others in Cloud Data Integration. Updated: February 2026.
884,933 professionals have used our research since 2012.