We performed a comparison between AWS Data Pipeline [EOL] and AWS Glue based on real PeerSpot user reviews.
Find out what your peers are saying about Amazon Web Services (AWS), MuleSoft, Matillion and others in Cloud Data Integration."It is a stable solution...It is a scalable solution."
"The most valuable feature of the solution is that orchestration and development capabilities are easier with the tool."
"The solution is serverless so it allows us to transform data while optimizing the cost and performance of Spark jobs."
"I also like that you can add custom libraries like JAR files and use them. So, the ability to use a fast processing engine and embed basic jobs easily are significant advantages."
"It's fairly straightforward as a product; it's not very complicated."
"The facility to integrate with S3 and the possibility to use Jupyter Notebook inside the pipeline are the most valuable features."
"The most valuable feature of AWS Glue is scalability."
"Glue is a NoSQL-based data ETL tool that has some advantages over IIS and ISAs."
"Our entire use case was very easily handled or solved using this solution."
"AWS Glue's best features are scalability and cloud-based features."
"It's almost semi-automatic because you must review and approve code push, which works well. Still, we had many problems getting there during the deployment process, but we got there."
"The user-defined functions have shortcomings in AWS Data Pipeline."
"The price of the solution could improve."
"Currently, it supports only two languages in the background: Python and Scala. From our customization point of view, it would be helpful if it can also support Java in the background."
"Glue could perform better. It sometimes takes too long to test a Glue job. Google Cloud Platform offers more Python scripts than AWS."
"If there's a cluster-related configuration, we have to make worker notes, which is quite a headache when processing a large amount of data."
"In terms of performance, if they can further optimize the execution time for serverless jobs, it would be a welcome improvement."
"The interface for AWS Glue could improve, they do not put a lot of details. You can write the code, in PySpark or in Scala, which is a big advantage, it is only easy to use for a developer. It will be difficult for new users to enter the cloud environment."
"The technical support for this solution could be improved. In future, we would like to connect more services like Athena or Kinesis to help control more loads of data."
"AWS Glue is more costly compared to other tools like Airflow."
AWS Data Pipeline [EOL] doesn't meet the minimum requirements to be ranked in Cloud Data Integration with 2 reviews while AWS Glue is ranked 1st in Cloud Data Integration with 37 reviews. AWS Data Pipeline [EOL] is rated 8.0, while AWS Glue is rated 7.8. The top reviewer of AWS Data Pipeline [EOL] writes "A tool with great orchestration and development capabilities but needs to improve its user-defined functions". On the other hand, the top reviewer of AWS Glue writes "Provides serverless mechanism, easy data transformation and automated infrastructure management". AWS Data Pipeline [EOL] is most compared with AWS Database Migration Service, Oracle Data Integrator (ODI), FME, Perspectium DataSync and IBM InfoSphere DataStage, whereas AWS Glue is most compared with AWS Database Migration Service, Informatica PowerCenter, SSIS, Informatica Cloud Data Integration and Talend Open Studio.
See our list of best Cloud Data Integration vendors.
We monitor all Cloud Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.