No more typing reviews! Try our Samantha, our new voice AI agent.

AWS Glue vs IBM InfoSphere DataStage comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Feb 8, 2026

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

ROI

Sentiment score
5.9
Organizations find AWS Glue efficient and cost-effective despite overhead costs, though some consider alternatives due to budget constraints.
Sentiment score
5.9
IBM InfoSphere DataStage increases ROI with improved performance, reduced maintenance, efficient management, and ongoing developer support despite some manual needs.
I advocate using Glue in such cases.
Data Architect at a financial services firm with 10,001+ employees
 

Customer Service

Sentiment score
6.5
AWS Glue customer service is praised for responsiveness and effectiveness, with mixed feedback on support speed, costs, and consistency.
Sentiment score
6.2
IBM InfoSphere DataStage support is generally well-rated for availability and responsiveness, but some report regional and efficiency issues.
Upgrades occur every four months, and new developments coincide with version updates.
Principal Consultant at a retailer with 1,001-5,000 employees
I would rate AWS support eight out of ten because they are technically strong and helpful in debugging complex Glue and cloud issues, and they are very responsive.
application security engineer at Hyperspace IT India
We also have the flexibility to submit a feature request to be included as part of the wishlist, potentially becoming a product feature in subsequent releases.
Sr Product Manager at a computer software company with 501-1,000 employees
I rate their support as nine on a scale from one to ten.
Senior Data Warehouse Developer at itcinfotech
IBM tech support has allocated dedicated resources, making it satisfactory.
Senior Officer at State Bank of India
 

Scalability Issues

Sentiment score
7.8
AWS Glue is highly scalable and serverless, praised for easy resource management, but needs better parallel computation.
Sentiment score
7.5
IBM InfoSphere DataStage scales well but may require hardware adjustments under heavy loads, with ratings between 7-9.
It can easily handle data from one terabyte to 100 terabytes or more, scaling nicely with larger datasets.
Python AWS & AI Expert at a tech consulting company
For jobs requiring multiple RAM usage, we increase the number of workers accordingly.
Principal Consultant at a retailer with 1,001-5,000 employees
If the job provided suggestions about running this kind of parallel processing and how many virtual nodes are required, it would help.
Senior Data Warehouse Developer at itcinfotech
 

Stability Issues

Sentiment score
7.9
AWS Glue is stable and reliable with minor issues, scaling well, and efficient due to serverless architecture and tool integration.
Sentiment score
7.6
IBM InfoSphere DataStage is stable, especially on Linux, but experiences some instability on Windows due to memory issues.
As a managed service, it reduces management burdens.
Python AWS & AI Expert at a tech consulting company
 

Room For Improvement

AWS Glue faces challenges with startup times, interface complexity, language limitations, cost, performance, integration, and multi-cloud compatibility.
IBM InfoSphere DataStage requires enhanced interfaces, modern integration, better support, user-friendliness, and adaptability with improved performance and cloud capabilities.
Learning the latest functionalities is crucial, and while challenging, it is a vital part of staying current and ensuring an efficient ETL process.
Principal Consultant at a retailer with 1,001-5,000 employees
With AWS, I gather data from multiple sources, clean it up, normalize it, de-duplicate it, and make it presentable.
Data Architect at a financial services firm with 10,001+ employees
A more user-friendly and simpler process would help speed up the deployment process.
Python AWS & AI Expert at a tech consulting company
If the job itself gave some guidance, such as running this parallel processing with this many nodes, it would help; I think that is missing.
Senior Data Warehouse Developer at itcinfotech
I wonder if it supports other areas, such as cloud environments with open source support, or EdgeShift.
Sr Product Manager at a computer software company with 501-1,000 employees
The solution needs improvement in connectivity with big data technologies such as Spark.
Senior Officer at State Bank of India
 

Setup Cost

AWS Glue offers flexible, efficient serverless architecture but can be costly and unpredictable, especially for smaller organizations.
IBM InfoSphere DataStage pricing varies widely and can be costly, particularly for small businesses, despite being cheaper than competitors.
AWS charges based on runtime, which can be quite pricey.
Data Architect at a financial services firm with 10,001+ employees
The smallest cost for a project is around €700, while the largest can reach up to €7,000 based on the scale of the usage.
Python AWS & AI Expert at a tech consulting company
Costing depends on resource usage, and cost optimization may involve redesigning jobs for flexibility.
Principal Consultant at a retailer with 1,001-5,000 employees
Pricing for IBM InfoSphere DataStage is moderate and not much expensive.
Senior Officer at State Bank of India
 

Valuable Features

AWS Glue excels with its easy interface, scalable ETL processing, seamless AWS integration, affordability, and serverless architecture.
IBM InfoSphere DataStage offers robust ETL capabilities, scalability, excellent integration, user-friendly design, and strong performance for large data volumes.
AWS Glue is very efficient and integrates well with the AWS ecosystem.
application security engineer at Hyperspace IT India
AWS Glue also enhances job scheduling and orchestration capabilities, integrating with AWS Glue Studio for comprehensive data workflow management.
Python AWS & AI Expert at a tech consulting company
For ETL, I feel the performance is excellent. If I create jobs in a standard way, the performance is great, and maintenance is also seamless.
Principal Consultant at a retailer with 1,001-5,000 employees
It is straightforward from a design and development perspective, and also for deployment.
Sr Product Manager at a computer software company with 501-1,000 employees
IBM InfoSphere DataStage is very scalable, allowing us to extend it according to our processing needs.
Senior Officer at State Bank of India
I have leveraged IBM InfoSphere DataStage's integration with IBM's Information Server suite, and it is indeed beneficial.
Senior Data Warehouse Developer at itcinfotech
 

Categories and Ranking

AWS Glue
Average Rating
7.8
Reviews Sentiment
6.9
Number of Reviews
50
Ranking in other categories
Cloud Data Integration (1st)
IBM InfoSphere DataStage
Average Rating
7.8
Reviews Sentiment
6.7
Number of Reviews
43
Ranking in other categories
Data Integration (9th)
 

Featured Reviews

SC
application security engineer at Hyperspace IT India
Efficient data integration reduces operational time and enhances metadata management
For the initial setup with AWS Glue, I find it easy to set up the data catalog and create Glue jobs using the visual editor or the visual code. Setting permission sets via IAM rules can be a bit tricky at the start, but we ensure Glue has access to AWS S3, Redshift, and other services. Once the role is configured, it runs smoothly. For advanced configurations, connecting to VPCs and setting up connections with JDBC sources takes more time compared to my cloud experience, but overall, for someone with cloud and ETL experience, the setup is manageable and well done.
Prasad Bodduluri - PeerSpot reviewer
Senior Data Warehouse Developer at itcinfotech
Has required complex workarounds for scripts and struggles with unstructured data processing
There is no issue with IBM InfoSphere DataStage's graphical interface for designing data flows, but I will provide feedback that we are gathering the source from the Oracle database mainly, as well as from some spreadsheets. With respect to the Oracle DB Connector, if you write any PL/SQL or SQL with the connectors, there aren't many options, such as executing procedures in the PL/SQL, executing functions, or executing packages. The Oracle connector doesn't have many features and needs improvement. Nowadays many people are writing programs in Python or in PL/SQL with respect to Oracle, so especially in IBM InfoSphere DataStage, there are no features to call programs directly instead of calling them as a script. What I am facing, especially with parallel processing, is that a developer and admin have to sit together. They have to run the job multiple times with different combinations of parallel processing to get the best performance. Instead of that, if the job itself gave some guidance, such as running this parallel processing with this many nodes, it would help; I think that is missing. An additional feature I would want to see in the next release is the ability to work on logs, especially machine logs or artificial logs, to pull semi-structured or unstructured data without having to write extensive code in Python and integrate it. If IBM InfoSphere DataStage provided some feature for this, it would help.
report
Use our free recommendation engine to learn which Cloud Data Integration solutions are best for your needs.
885,789 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
19%
Computer Software Company
9%
Manufacturing Company
8%
Government
5%
Financial Services Firm
24%
Government
9%
Manufacturing Company
8%
Computer Software Company
7%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
By reviewers
Company SizeCount
Small Business11
Midsize Enterprise6
Large Enterprise32
By reviewers
Company SizeCount
Small Business23
Midsize Enterprise4
Large Enterprise26
 

Questions from the Community

How do you select the right cloud ETL tool?
AWS Glue and Azure Data factory for ELT best performance cloud services.
How does Talend Open Studio compare with AWS Glue?
We reviewed AWS Glue before choosing Talend Open Studio. AWS Glue is the managed ETL (extract, transform, and load) from Amazon Web Services. AWS Glue enables AWS users to create and manage jobs in...
What are the most common use cases for AWS Glue?
AWS Glue's main use case is for allowing users to discover, prepare, move, and integrate data from multiple sources. The product lets you use this data for analytics, application development, or ma...
Would you upgrade to more premium versions of IBM InfoSphere DataStage?
My company currently uses the free version of the product, and we are definitely switching to a paid one. We needed a tool that can help us not only integrate our data but use it effectively. For ...
Is IBM InfoSphere DataStage more difficult to use compared to other tools in the field?
I think the tool may cause some difficulties if you have not used other data integration solutions before. I have worked at companies that used different tools for data integration, and they work ...
Do you rely on IBM Cloud Paks for your data? Have you utilized this product, or do you use IBM InfoSphere DataStage without it?
IBM Cloud Paks makes a big difference in your data integration. My company has been using it alongside IBM InfoSphere DataStage and while the main product is good on its own, this one truly expands...
 

Overview

 

Sample Customers

bp, Cerner, Expedia, Finra, HESS, intuit, Kellog's, Philips, TIME, workday
Dubai Statistics Center, Etisalat Egypt
Find out what your peers are saying about AWS Glue vs. IBM InfoSphere DataStage and other solutions. Updated: March 2026.
885,789 professionals have used our research since 2012.