Try our new research platform with insights from 80,000+ expert users

AWS Data Pipeline [EOL] vs AWS Glue comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

AWS Data Pipeline [EOL]
Average Rating
8.0
Number of Reviews
2
Ranking in other categories
No ranking in other categories
AWS Glue
Average Rating
7.8
Reviews Sentiment
7.0
Number of Reviews
49
Ranking in other categories
Cloud Data Integration (1st)
 

Featured Reviews

Geoffrey Leigh - PeerSpot reviewer
A stable, scalable, and reliable solution for moving and processing data
We're only considering enhancing the presentation layer to give a more multidimensional OLAP view that AWS seems to have decided on. Redshift with the data mart structure is like an OLAP cube. Oracle Analytics Cloud is an over-code killer and is not what we need. I was looking at Mondrian, which used to be part of the open-source stack from another vendor that works. Still, I am also looking at some of the other OLAP environments like Kaiser and perhaps decided to go to Azure with Microsoft Azure analysis cloud, but that's not multidimensional either as SSAS used to be. We tried the Mondrian, and that didn't perform how we expected. So, we are looking at resetting something to perform as an OLAP in the cloud, particularly AWS, so that we might consider an Azure solution.
Saurabh Jaiswal - PeerSpot reviewer
Enables seamless integration and data preparation with robust transformation capabilities
AWS Glue's most valuable features include its transformation capabilities, which provide data quality and shape for processing in ML or AI models. It offers transformation options on canvas or through ETL pipelines, notebooks, and code. Additionally, it supports data preparation, cleaning, and filtering seamlessly. AWS Glue also enhances job scheduling and orchestration capabilities, integrating with AWS Glue Studio for comprehensive data workflow management.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The most valuable feature of the solution is that orchestration and development capabilities are easier with the tool."
"It is a stable solution...It is a scalable solution."
"The solution is serverless so it allows us to transform data while optimizing the cost and performance of Spark jobs."
"The product has a valuable feature for data catalog."
"AWS Glue's best features are scalability and cloud-based features."
"The AWS Glue Data Catalog provides metadata management and schema discovery. AWS Glue simplifies data transformation with automatic schema detection, incremental data updates, and integration with other AWS services."
"Our entire use case was very easily handled or solved using this solution."
"Transformations are valuable because you can modify or override complex data logic from an open source or Spark to solve issues."
"The solution is highly user-friendly, and its features are easy to use. The new addition of AWS Glue Data Catalog is also very beneficial, making the tool even more helpful for its users."
"The most valuable features currently are glue studio, jobs, and triggers."
 

Cons

"The user-defined functions have shortcomings in AWS Data Pipeline."
"It's almost semi-automatic because you must review and approve code push, which works well. Still, we had many problems getting there during the deployment process, but we got there."
"The interface for AWS Glue could improve, they do not put a lot of details. You can write the code, in PySpark or in Scala, which is a big advantage, it is only easy to use for a developer. It will be difficult for new users to enter the cloud environment."
"While working on AWS Glue, I could not find any training material for it."
"If there's a cluster-related configuration, we have to make worker notes, which is quite a headache when processing a large amount of data."
"Setting up pipelines is challenging, especially with version control and testing requirements."
"It is quite clunky and code-heavy, which is my biggest problem."
"In the building and deployment aspects, there is room for improvement. The current process is a bit complicated and could benefit from being more user-friendly and simpler, which would help speed up the deployment process."
"The solution's visual ETL tool is of no use for actual implementation."
"It fails to handle massive databases acquired from various sources."
 

Pricing and Cost Advice

"The way we use it, I think it is fair as we're getting a good value for money compared to having a server or some other data pipeline."
"I rate the pricing between six to eight on a scale from one to ten, where one is low price, and ten is high price."
"I rate the tool's pricing a four out of ten."
"The solution's pricing is based on DPUs so it is a good idea to optimize use or it can get expensive."
"AWS Glue uses a pay-as-you-go approach which is helpful. The price of the overall solution is low and is a great advantage."
"The overall cost of AWS Glue could be better. It cost approximately $1,000 a month. There is paid support available from AWS Glue."
"Its price is good. We pay as we go or based on the usage, which is a good thing for us because it is simple to forecast for the tool. It is good in terms of the financial planning of the company, and it is a good way to estimate the cost. It is also simple for our clients. In my opinion, it is one of the best tools in the market for ETL processes because of the fact that you pay as you use, which separates it from other big tools such as PowerCenter, Pentaho Data Integration, and Talend."
"I rate the product's pricing a five on a scale of one to ten, where one is a high price, and ten is a low price."
"AWS Glue is a high-priced solution that bills the client $150,000 to $250,000 annually."
"The pricing is a bit higher than other solutions like Athena and EC2. If the pricing becomes more scaled or flexible, it will be good because you have to pay 44 cents just for one DPU for an hour. If you increase DPUs to 5 or 10, the pricing gets multiplied. There are also some time limits like 0 to 10 minutes or 10 to 20 minutes. If the pricing is according to the minutes, it would be better because you have to limit your job to 10 minutes or 20 minutes."
report
Use our free recommendation engine to learn which Cloud Data Integration solutions are best for your needs.
857,028 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Computer Software Company
25%
Financial Services Firm
21%
Educational Organization
6%
Insurance Company
6%
Financial Services Firm
21%
Computer Software Company
13%
Manufacturing Company
8%
Government
7%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
 

Questions from the Community

What do you like most about AWS Data Pipeline?
The most valuable feature of the solution is that orchestration and development capabilities are easier with the tool.
What is your experience regarding pricing and costs for AWS Data Pipeline?
I rate the pricing between six to eight on a scale from one to ten, where one is low price, and ten is high price.
What needs improvement with AWS Data Pipeline?
The user-defined functions have shortcomings in AWS Data Pipeline. The user-defined functions could be one of the areas where I can write a custom function and embed it as a part of AWS Data Pipeli...
How do you select the right cloud ETL tool?
AWS Glue and Azure Data factory for ELT best performance cloud services.
How does Talend Open Studio compare with AWS Glue?
We reviewed AWS Glue before choosing Talend Open Studio. AWS Glue is the managed ETL (extract, transform, and load) from Amazon Web Services. AWS Glue enables AWS users to create and manage jobs in...
What are the most common use cases for AWS Glue?
AWS Glue's main use case is for allowing users to discover, prepare, move, and integrate data from multiple sources. The product lets you use this data for analytics, application development, or ma...
 

Comparisons

No data available
 

Overview

 

Sample Customers

bp, Cerner, Expedia, Finra, HESS, intuit, Kellog's, Philips, TIME, workday
bp, Cerner, Expedia, Finra, HESS, intuit, Kellog's, Philips, TIME, workday
Find out what your peers are saying about Amazon Web Services (AWS), Informatica, Salesforce and others in Cloud Data Integration. Updated: June 2025.
857,028 professionals have used our research since 2012.