Complex data integration projects which require integration from multiple data sources.
Partner at Avydium Data LLC
Its parallel processing capability allows you to go through extremely large data sets in no time at all
Pros and Cons
- "Highly customizable: Allowing you to handle multiple data latencies (scheduled batch, on-demand, and real-time) in the same job."
- "Working with some of the big data components is good, but I can see improvements are needed."
What is our primary use case?
How has it helped my organization?
I have worked on many implementations using DataStage. All of the projects that I worked on have been successful. This is mainly due to strict discipline around best practices and to following a set of standards and templates designed to reduce complexity and improve automation, including a strong reference architecture.
What is most valuable?
- Its parallel processing capability allows you to go through extremely large data sets in no time at all, if you do your job right.
- Highly customizable: Allowing you to handle multiple data latencies (scheduled batch, on-demand, and real-time) in the same job.
- High scalability: Start small and go big with the same job. You just need to adjust the configuration file, no need to recompile.
- Strong metadata management: Business, technical, and process metadata can all be managed from a single place.
- Ease of integration with other tool sets: Easily supports APIs (or build your own) to support data streaming (or batched) from other systems.
- Data Quality Management from within the tool: Supporting data sampling, including profiling of data, directly from the development canvas.
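For readers who have not worked with a parallel engine, the scalability point above can be illustrated with a minimal Python sketch. This is not DataStage code and does not use its configuration file format; the `config` dictionary, the `nodes` setting, and the function names are hypothetical and chosen only for illustration. It shows the general idea that the job logic stays the same while the degree of parallelism comes from configuration.

```python
from concurrent.futures import ThreadPoolExecutor
import zlib


def partition(records, key, nodes):
    """Hash-partition records into `nodes` buckets by the given key."""
    buckets = [[] for _ in range(nodes)]
    for rec in records:
        # crc32 gives a stable hash, so a key always lands in the same bucket
        idx = zlib.crc32(str(rec[key]).encode("utf-8")) % nodes
        buckets[idx].append(rec)
    return buckets


def job(bucket):
    """The unchanged 'job' logic: total amount per customer in one partition."""
    totals = {}
    for rec in bucket:
        totals[rec["customer"]] = totals.get(rec["customer"], 0) + rec["amount"]
    return totals


def run(records, config):
    """Run the same job at the degree of parallelism the config asks for."""
    nodes = config["nodes"]  # hypothetical setting, stands in for a config file
    buckets = partition(records, "customer", nodes)
    with ThreadPoolExecutor(max_workers=nodes) as pool:
        partials = list(pool.map(job, buckets))
    merged = {}
    for part in partials:
        # Keys are disjoint across partitions because of the hash partitioning
        merged.update(part)
    return merged


records = [
    {"customer": "a", "amount": 10},
    {"customer": "b", "amount": 5},
    {"customer": "a", "amount": 7},
    {"customer": "c", "amount": 1},
]

small = run(records, {"nodes": 1})  # start small
big = run(records, {"nodes": 4})    # go big: only the configuration changed
```

Both runs produce the same totals. Only the number of partitions differs, which is the property the reviewer describes when changing the configuration without recompiling the job.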
What needs improvement?
High cost of ownership: They could take a page from open source software, such as Talend.
Working with some of the big data components is good, but I can see improvements are needed, such as native support for Spark and HBase.
Buyer's Guide
IBM InfoSphere DataStage
October 2025
Learn what your peers think about IBM InfoSphere DataStage. Get advice and tips from experienced pros sharing their opinions. Updated: October 2025.
872,778 professionals have used our research since 2012.
For how long have I used the solution?
More than five years.
What do I think about the stability of the solution?
No issues.
What do I think about the scalability of the solution?
No issues.
How are customer service and support?
Support is always good.
Which solution did I use previously and why did I switch?
Have used quite a few ETL tools in my job.
- Ab Initio: Even pricier, but a highly competent ETL tool. It is complete, but hard to use.
- Informatica: Not as flexible and does not support the same level of complexity in its maps.
- Talend: It is a good tool suite, extensive, but can be cumbersome to cite all its pieces.
- ODI: For the Oracle centric world.
- SSIS: Weak when compared to any of the above tool sets.
How was the initial setup?
It depends on the type of environment being installed. I have seen initial setups range from fairly simple to overly complex because of the environment, not because of the tool.
What about the implementation team?
Both vendor and in-house team implementations:
IBM has top-notch support and tool services, as do other partners. Depending on the partner, this can range from installation and configuration to solution development.
Most in-house teams that I have seen tend to have good developers, but not always good architects. As with almost every data integration project, if you do not have a strong architecture, your solution will eventually fail.
What was our ROI?
Depends on the project.
Which other solutions did I evaluate?
I have done many ETL tool evaluations based on client requirements. DataStage has consistently been in the top three. It was not always selected, because different clients weighted the sections of the evaluation differently.
What other advice do I have?
If you have the budget and your solution requires industrial/enterprise strength data integration, this product is always a good choice.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Architect at a tech services company with 51-200 employees
It has unlimited database connectors and free-of-charge application connectors.
Valuable Features
- Unlimited database connectors and free-of-charge application connectors - Siebel in our case
- A lot of transformation components
- High scalability
Improvements to My Organization
It's standardized our batch integration.
Room for Improvement
It needs a better scheduling mechanism.
Use of Solution
I've been using it as a customer for six years.
Deployment Issues
We had no issues with the deployment.
Stability Issues
We had issues with Java on the AIX platform (version 8.7). We have since migrated to the Linux platform without issues.
Scalability Issues
It's highly scalable.
Customer Service and Technical Support
We have close contact with support in their Polish lab.
Initial Setup
The initial setup is easy as it's done through a web installation process with an HA setup option.
Pricing, Setup Cost and Licensing
Check the Information Server bundle offering, especially with InfoSphere Information Server for Data Integration.
Other Solutions Considered
Our customer checked Ab Initio and Informatica PowerCenter. DataStage was best, as it could be delivered quickest.
Other Advice
You should also look at Redbooks and developerWorks articles for knowledge gathering.
Disclosure: My company has a business relationship with this vendor other than being a customer. We are an IBM Business Partner (BP).
Solutions Specialist at a tech services company with 501-1,000 employees
It has valuable administrative features, particularly since I don't program in this environment, but avoid the Netezza adapter as it's poor.
Valuable Features
Anything that is administrative related, as I don't program in this environment.
Room for Improvement
The recovery feature. Our DataStage repository was in a bad condition, and IBM couldn't recover it.
Use of Solution
I've been using it for one year.
Deployment Issues
No issues encountered.
Stability Issues
No issues encountered.
Scalability Issues
No issues encountered.
Customer Service and Technical Support
Customer Service:
Mostly, I would give 9/10. I did have one bad experience, so that leaves a bad impression.
Technical Support:
It's generally good, although sometimes I see a lot of confusion about how to resolve issues.
Initial Setup
It was complex.
Implementation Team
It was set up by an outside vendor.
Other Solutions Considered
No other options were evaluated.
Other Advice
Don't use the Netezza adapter, as it's poor.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.