We are a consulting company and we use this solution for our clients. We set up the data for them. We have various healthcare-related information from their vendor and business partners. They have integrated them and get data reports from it.
Systems Integration Associate Director at a computer software company with 10,001+ employees
Helpful support, and the Hierarchical Data Stage is good
Pros and Cons
- "The Hierarchical Data Stage is good."
- "It improves how our client's organization functions."
- "The interface needs improvement."
- "Many companies are moving away from DataStage because it is expensive."
What is our primary use case?
How has it helped my organization?
It improves how our client's organization functions.
What is most valuable?
We mainly use the designer and developer qualities. We use the basic features that we have.
They have many good features. The Hierarchical Data Stage is good.
What needs improvement?
The interface needs improvement. The interface in Informatica is easier than in DataStage.
The licensing can be improved. Many companies are moving away from DataStage because it is expensive.
The biggest issue that is unclear is how are they integrating into DevOps when they are binary files.
We would like to see DataStage integrated with DevOps so that a pipeline can be created for auto-deployment. Right now we are all doing it manually.
Buyer's Guide
IBM InfoSphere DataStage
June 2026
Learn what your peers think about IBM InfoSphere DataStage. Get advice and tips from experienced pros sharing their opinions. Updated: June 2026.
902,417 professionals have used our research since 2012.
For how long have I used the solution?
I have been working with IBM InfoSphere DataStage for seven years.
We have the 11.3 version but have recently migrated to the 11.7 version.
What do I think about the stability of the solution?
It's a stable product, it's not new.
What do I think about the scalability of the solution?
It's very scalable. Our clients are medium-sized companies with a 1.5 billion turnover.
How are customer service and support?
We reached out to IBM because the file was not readable, and they resolved the issue.
Technical support is good. I have not found any issues with technical support. I would rate them an eight out of ten.
In some cases, they have a delay in giving suggestions for the configuration.
Which solution did I use previously and why did I switch?
Previously, in another company, I worked with Informatica. There are not a lot of differences but the interface is easier than it is in DataStage.
How was the initial setup?
I don't do the setup, but I think that they have many challenges.
Initially, we had challenges with the configuration. We were trying to use the comparison for Excel, and reading the Excel files from the source, but the files were not readable.
What's my experience with pricing, setup cost, and licensing?
It's very expensive.
Which other solutions did I evaluate?
What other advice do I have?
I am not a developer, I have a team within our company for that.
There is a cloud migration strategy going on, so they are thinking of moving to the cloud. They want a tool that is not heavy and suitable for their budget.
The recommendation for using this tool would depend on the requirements.
I don't have anything bad to say about this product.
I would rate this solution an eight out of ten.
Which deployment model are you using for this solution?
On-premises
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
IT Analyst at vvolve management consultants
Simplified data transformation and reporting with business logic implementation
Pros and Cons
- "It's useful for reporting and selecting different extract files."
- "Currently, the solution does not support cloud migration."
What is our primary use case?
We use IBM InfoSphere DataStage to extract data from different sources and perform business logic. It helps us in data transformation and loading into our data warehouse. The tool is also used for reporting purposes and selecting different extract files.
What is most valuable?
The IBM InfoSphere DataStage solution is user-friendly and easy to learn, which makes it convenient to work on. It supports business logic implementation.
Additionally, it's useful for reporting and selecting different extract files.
What needs improvement?
Currently, the solution does not support cloud migration. We cannot connect to cloud tools using IBM InfoSphere DataStage. This is an area where improvement is needed.
For how long have I used the solution?
I have been using IBM InfoSphere DataStage for ten plus years.
What do I think about the stability of the solution?
IBM InfoSphere DataStage is stable.
What do I think about the scalability of the solution?
IBM InfoSphere DataStage is scalable.
How are customer service and support?
I haven't faced any challenges with the technical support in version eleven point one. Previously, we faced challenges in version nine point one, but these were addressed after migrating to version eleven point one.
I would rate the technical support ten out of ten.
How would you rate customer service and support?
Positive
Which solution did I use previously and why did I switch?
I have not worked with any other solutions for data integration. My career has been focused on using InfoSphere DataStage only.
How was the initial setup?
The initial setup was straightforward.
What about the implementation team?
Our setup and implementation were done in-house by using the DevOps processes within our team. We rely on the DevOps and Jenkins tool for deployment.
What other advice do I have?
If dealing with complex data, I recommend IBM InfoSphere DataStage. For less complexity, other tools might be suitable.
On a scale of one to ten, I rate IBM InfoSphere DataStage as nine.
Which deployment model are you using for this solution?
Hybrid Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Other
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Buyer's Guide
IBM InfoSphere DataStage
June 2026
Learn what your peers think about IBM InfoSphere DataStage. Get advice and tips from experienced pros sharing their opinions. Updated: June 2026.
902,417 professionals have used our research since 2012.
Managing Partner at a tech services company with 11-50 employees
Easy to understand to monitor the data lineage from source to target but pricing could be better
Pros and Cons
- "IBM is stable and accurate to monitor. It's easy to understand to monitor the data lineage from source to target."
- "DataStage is quite expensive. It is too hard to find a consultant using DataStage in Turkey."
What is our primary use case?
IBM InfoSphere DataStage is a core ETL tool. We use it with source systems like mainframes. DataStage is perfectly suited for extracting data from mainframes.
What is most valuable?
IBM is stable and accurate to monitor. It's easy to understand to monitor the data lineage from source to target.
What needs improvement?
DataStage is quite expensive. It is too hard to find a consultant using DataStage in Turkey.
For how long have I used the solution?
I have been using IBM InfoSphere DataStage for three years. I also used this solution for two years back in 2009-10.
What do I think about the stability of the solution?
The product is stable.
I rate the solution’s stability a nine out of ten.
What do I think about the scalability of the solution?
I rate the solution’s scalability an eight out of ten.
How are customer service and support?
The quality and response time of support is fine. It's pretty quick.
How would you rate customer service and support?
Positive
Which solution did I use previously and why did I switch?
Informatica is the first choice for me. It's easy to use and not so expensive compared to DataStage.
How was the initial setup?
You install IBM InfoSphere DataStage once you've set it up properly. It's robust and reliable, but initially configuring it can be challenging.
What other advice do I have?
The first consideration is the type of source system they have, whether it is a mainframe or not. Another key indicator for me to suggest DataStage is if the client has other IBM ecosystems, such as data quality or IBM governance tools. This makes it highly suitable because you can easily establish data lineage.
Overall, I rate the solution a seven out of ten.
Which deployment model are you using for this solution?
On-premises
Disclosure: My company has a business relationship with this vendor other than being a customer. Partner
System Engineer at a energy/utilities company with 51-200 employees
Easy-to-deploy product with good scalability
Pros and Cons
- "The product is easy to deploy."
- "There could be more customization options for the product."
What needs improvement?
There could be more customization options for the product.
For how long have I used the solution?
We have been using IBM InfoSphere DataStage for 20 years. At present, we are using version 11.7.
What do I think about the stability of the solution?
I rate IBM InfoSphere DataStage’s stability a five out of ten.
What do I think about the scalability of the solution?
The product is suitable for enterprise companies. We have 100 users for it. I rate the platform’s scalability a seven out of ten. It is easily scalable compared to other systems.
How are customer service and support?
The complexity of the technical support services depends on the contact person. Sometimes, it is a good experience, while sometimes a poor experience communicating with their executives.
How would you rate customer service and support?
Neutral
How was the initial setup?
The product is easy to deploy. I rate the process an eight or nine. The deployment time depends on the specific requirements of customers. It takes approximately three months to complete. It requires a team of five to 100 people to execute it, depending on the company size.
What's my experience with pricing, setup cost, and licensing?
The product is expensive. I rate its pricing a ten out of ten.
What other advice do I have?
I rate IBM InfoSphere DataStage an eight out of ten.
Which deployment model are you using for this solution?
On-premises
Disclosure: My company has a business relationship with this vendor other than being a customer. Partner
Owner at 7Spring Consult
A stable and scalable ETL tool that needs to integrate basic data quality check features
Pros and Cons
- "I am impressed with the tool's ETL tracing."
- "It would be great if they can include some basic version of data quality checking features."
What is our primary use case?
I work as a consultant and I have several projects with the Russian banks. My main expertise is building data warehouses and I use the product as a ETL.
What is most valuable?
I am impressed with the tool's ETL tracing.
What needs improvement?
It would be great if they can include some basic version of data quality checking features.
What do I think about the stability of the solution?
The solution is quite stable.
What do I think about the scalability of the solution?
I would rate the product's scalability an eight out of ten.
What other advice do I have?
I would rate the product a nine out of ten. You need to get a balance between batch ETL processing and streaming.
Which deployment model are you using for this solution?
On-premises
Disclosure: My company has a business relationship with this vendor other than being a customer.
Consultant - Data Engineering at South Asian Technologies
Data integration tool that is scalable and offers good customer support for premium customers
Pros and Cons
- "When we have needed help from the IBM team, they were helpful. Our company is a premium partner so we get fast responses."
- "Their web interface is good but the on-prem sites are outdated. The solution could also be improved if they could integrate the data pipeline scheduling part of their interface."
What is our primary use case?
We started using this solution as we needed a system upgrade. We had to move the Db2 data to a AS400 system.
What is most valuable?
What needs improvement?
Their web interface is good but the on-prem sites are outdated. The solution could also be improved if they could integrate the data pipeline scheduling part of their interface.
For how long have I used the solution?
We have been using this solution for a couple of months.
What do I think about the scalability of the solution?
This is a scalable solution.
How are customer service and support?
When we have needed help from the IBM team, they were helpful. Our company is a premium partner so we get fast responses.
How would you rate customer service and support?
Positive
How was the initial setup?
The infrastructure and the software configuration part was done by one of my teammates. We were able to complete it in two working days. This was only for the installation of DataStage.
What other advice do I have?
I would advise others to identify the communication between servers and the client tools correctly. If working from a client environment and connecting to the server, configuration should be done correctly, otherwise you may encounter some issues.
I would rate this solution an eight out of ten.
Which deployment model are you using for this solution?
On-premises
Disclosure: My company has a business relationship with this vendor other than being a customer.
Owner at 7Spring Consult
Reliable, simple to install, and useful
Pros and Cons
- "IBM InfoSphere DataStage is a good product; it is quite useful and powerful."
- "It would be useful to provide support for Python, AR, and Java."
- "From a practice point of view, solutions such as IBM InfoSphere DataStage and Oracle Data Integrator are losing ground, whereas open-source solutions are becoming increasingly powerful."
What is our primary use case?
I am a consultant. I provide product information for our clients.
What is most valuable?
IBM InfoSphere DataStage is a good product.
It is quite useful and powerful.
What needs improvement?
From a practice point of view, solutions such as IBM InfoSphere DataStage and Oracle Data Integrator are losing ground, whereas open-source solutions are becoming increasingly powerful.
For example, we are currently working hard on several examples, and in a few years, open-source solutions will take the lead in the market. It will be used by large enterprises.
Clients are looking for open-source solutions more and more.
It would be useful to provide support for Python, R, and Java.
For how long have I used the solution?
I have more than 22 years of experience with many different products.
It has been three to four years that we have been using IBM InfoSphere DataStage.
What do I think about the stability of the solution?
I have no issues with the stability of IBM InfoSphere DataStage.
How are customer service and support?
Clients are quite dependant on support from the vendor. For example, if you want to activate a new feature on the product, you must create a ticket. You have no information on when it will be implemented, and the vendor does not know because they have a stream of tickets that are completed by the priority given to the ticket.
Which solution did I use previously and why did I switch?
I am a consultant. I have different projects with different platforms. We are constantly going back and forth to different solutions for different projects.
I have had clients who have used Amazon Redshift.
Over the years, my clients have used many different products. For example, they use IBM Landscape and we use IBM InfoSphere.
How was the initial setup?
The initial setup was straightforward. We did not have issues.
What's my experience with pricing, setup cost, and licensing?
Comparable solutions will have common disadvantages, which is the total cost of the project.
It's quite expensive.
Which other solutions did I evaluate?
From time to time, I evaluate different products for my clients.
What other advice do I have?
We have had different projects with three of four clients. The average term per project has been nine months and one year.
If you are working with an open-source solution or another solution, you can implement some features by yourself. For example, in the case of Amazon, which has Amazon Lambda, you can easily write your code in Python or Java, and it will orchestrate it. You can create your features yourself easily and gives you more abilities to make your solution run quicker, eliminating the dependence from the vendor.
I would rate IBM InfoSphere DataStage an eight out of ten.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Head of IT Integration & Finance Transformation at a financial services firm with 5,001-10,000 employees
Great flexibility with full delivery integration
Pros and Cons
- "Offers great flexibility."
- "The key feature for me is the flexibility this solution offers."
- "Currently lacking virtualization ability."
- "Technical support could be better - more knowledgeable and customer friendly."
What is our primary use case?
The most powerful use of this solution is for full delivery integration. Aside from the ETL aspect, it makes data lineage. We are customers of IBM InfoSphere and I'm head of IT integration and finance transformation.
What is most valuable?
The key feature for me is the flexibility this solution offers.
What needs improvement?
The solution is currently lacking virtualization ability. If they were to include it, it could be a good evolution on this framework. I'd like to see an improvement in support and a more customer friendly and knowledgeable support staff.
For how long have I used the solution?
I've been using this solution for over 20 years.
What do I think about the scalability of the solution?
Scalability is a question of licensing, but it can definitely be done.
How are customer service and technical support?
Technical support could be better - more knowledgeable and customer friendly.
How was the initial setup?
We outsourced deployment to IBM and didn't have any issues. Implementation took about two weeks. We have between 20 and 30 users in the company and our maintenance is carried out by IBM.
What's my experience with pricing, setup cost, and licensing?
We pay an annual licensing fee.
Which other solutions did I evaluate?
Yes, we evaluated other tools, and this tool is competing internally with our PySQL development. It's a battle between code and local.
What other advice do I have?
I rate this solution an eight out of 10.
Which deployment model are you using for this solution?
On-premises
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Data/Solution Architect at a computer software company with 51-200 employees
Robust, easy to use, has a simple error logging mechanism, and works very well for huge volumes of data
Pros and Cons
- "As a data integration platform, it is easy to use. It is quite robust and useful for volumetric analysis when you have huge volumes of data. We have tested it for up to ten million rows, and it is robust enough to process ten million rows internally with its parallel processing. Its error logging mechanism is far simpler and easier to understand than other data integration tools. The newer version of InfoSphere has the data catalog and IDC lineage. They are helpful in the easy traceability of columns and tables."
- "As a data integration platform, it is easy to use, quite robust, and useful for volumetric analysis when you have huge volumes of data."
- "Its documentation is not up to the mark. While building APIs, we had a lot of problems trying to get around it because it is not very user-friendly. We tried to get hold of API documentation, but the documentation is not very well thought out. It should be more structured and elaborate. In terms of additional features, I would like to see good reporting on performance and performance-tuning recommendations that can be based on AI. I would also like to see better data profiling information being reported on InfoSphere."
- "Its documentation is not up to the mark. While building APIs, we had a lot of problems trying to get around it because it is not very user-friendly."
What is our primary use case?
We use it for creating a pattern for data integration with our data vault. We have also used it for creating APIs.
What is most valuable?
As a data integration platform, it is easy to use. It is quite robust and useful for volumetric analysis when you have huge volumes of data. We have tested it for up to ten million rows, and it is robust enough to process ten million rows internally with its parallel processing.
Its error logging mechanism is far simpler and easier to understand than other data integration tools.
The newer version of InfoSphere has the data catalog and IDC lineage. They are helpful in the easy traceability of columns and tables.
What needs improvement?
Its documentation is not up to the mark. While building APIs, we had a lot of problems trying to get around it because it is not very user-friendly. We tried to get hold of API documentation, but the documentation is not very well thought out. It should be more structured and elaborate.
In terms of additional features, I would like to see good reporting on performance and performance-tuning recommendations that can be based on AI. I would also like to see better data profiling information being reported on InfoSphere.
For how long have I used the solution?
It was DataStage previously, and then it became InfoSphere. I have used DataStage for ten years and InfoSphere for one year.
What do I think about the stability of the solution?
It is quite stable. In the newer components of InfoSphere, you have a mapping tool called FastTrack and a metadata generator, which can have issues from time to time, but they get resolved.
What do I think about the scalability of the solution?
It is not that easy to scale on-premises. I have worked on the ones deployed on Windows or Unix, and scalability is often dependent on whether you can add more CPUs or boxes. On the cloud, it would have been easier to scale. However, the current version can only be deployed on Windows or Unix.
How are customer service and technical support?
I have not been in touch with them recently. Earlier, I was in touch with their technical support and had raised tickets because some weird errors, such as fantom error, were being logged in the error log, which made no sense. We used to get in touch with their support team to understand these.
Which solution did I use previously and why did I switch?
I have used Informatica and SAS CA. IBM InfoSphere has the highest cost of licensing as compared to others. It is not very widely used, and it is very difficult to find people who have this sort of knowledge.
The newer version of Informatica is on the cloud and is much more user-friendly than InfoSphere because it provides profiling information in nice graphs and charts. It also provides a lot of templates. For example, if I want to build a whole dimensional kind of structure, Informatica has a template. I just need to use that template. So, the ease of use is far better in Informatica, and it has everything that InfoSphere has. The only thing is that Informatica comes in bundles. That's the reason sometimes organizations don't go for it. For example, the data integration is a separate section, and the data quality is a separate section. They have separate pricing.
How was the initial setup?
The initial setup is quite simple. It didn't take more than half an hour to set it up on my laptop.
What about the implementation team?
I implemented it myself. In terms of maintenance, a particular version might not require any maintenance. There could be bug fixes and minor versions going in for some versions.
What's my experience with pricing, setup cost, and licensing?
It is quite expensive.
What other advice do I have?
I would recommend this solution for large-scale implementation where you need a complex transformation and data integration to happen according to a structured format, either a data vault or a dimension model. It is suitable for big companies because of the cost. It is a very valuable platform for data in large volumes. For small volumes, you have other open-source tools that can do the same thing for you.
I am part of a consultancy, and I have deployed this product for companies. We have five to eight developers. Because InfoSphere is a licensed product, and its licenses cost a lot, there are not many InfoSphere developers.
I would rate IBM InfoSphere DataStage an eight out of ten.
Which deployment model are you using for this solution?
On-premises
Disclosure: My company has a business relationship with this vendor other than being a customer. Partner
Manager at a consultancy with 1,001-5,000 employees
Robust and scalable but the initial setup is not straightforward and the price is high
Pros and Cons
- "It's a robust solution."
- "This solution has an end-to-end process used for data integration."
- "The initial setup could be more straightforward."
What is our primary use case?
We are a solution provider and this is one of the products that we implement for our clients.
This solution has an end-to-end process used for data integration.
What is most valuable?
It's a robust solution.
What needs improvement?
The initial setup could be more straightforward.
For how long have I used the solution?
We have been providing IBM InfoSphere DataStage for one year.
What do I think about the stability of the solution?
I believe this solution is stable. We have not received any feedback from our clients.
What do I think about the scalability of the solution?
To my understanding, this solution is scalable.
We have several customers who are currently using it.
How are customer service and technical support?
I have not contacted technical support.
Which solution did I use previously and why did I switch?
We have a long list of different providers such as Informatica, IBM, Oracle, Microsoft SSIS, Pentaho, and Talend.
How was the initial setup?
The installation was not straightforward and I would rate it at medium complexity.
What about the implementation team?
The installation required assistance from an expert from IBM.
What's my experience with pricing, setup cost, and licensing?
The price is expensive but there are no licensing fees.
What other advice do I have?
Informatica provides a cloud-based deployment but we only work with the on-premises version. This is a product that I can recommend.
I would rate this solution a six out of ten.
Which deployment model are you using for this solution?
On-premises
Disclosure: My company has a business relationship with this vendor other than being a customer. Partner
Buyer's Guide
Download our free IBM InfoSphere DataStage Report and get advice and tips from experienced pros
sharing their opinions.
Updated: June 2026
Product Categories
Data IntegrationPopular Comparisons
Informatica Intelligent Data Management Cloud (IDMC)
Qlik Talend Cloud
Palantir Foundry
Informatica PowerCenter
Azure Data Factory
Oracle Data Integrator (ODI)
Oracle GoldenGate
SAP Data Services
Pentaho Data Integration and Analytics
Buyer's Guide
Download our free IBM InfoSphere DataStage Report and get advice and tips from experienced pros
sharing their opinions.
Quick Links
Learn More: Questions:
- How do you compare Informatica PowerCenter with IBM DataStage?
- Would you upgrade to more premium versions of IBM InfoSphere DataStage?
- Is IBM InfoSphere DataStage more difficult to use compared to other tools in the field?
- Do you rely on IBM Cloud Paks for your data? Have you utilized this product, or do you use IBM InfoSphere DataStage without it?
- When evaluating Data Integration, what aspect do you think is the most important to look for?
- Microsoft SSIS vs. Informatica PowerCenter - which solution has better features?
- What are the best on-prem ETL tools?
- Which integration solution is best for a company that wants to integrate systems between sales, marketing, and project development operations systems?
- Experiences with Oracle GoldenGate vs. Oracle Data Integrator?
- What are the must-have features for a Data integration system?
















