
IBM InfoSphere DataStage vs WinPure Data Quality Platform comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

IBM InfoSphere DataStage
Ranking in Data Integration: 9th
Average Rating: 7.8
Reviews Sentiment: 6.7
Number of Reviews: 43
Ranking in other categories: No ranking in other categories

WinPure Data Quality Platform
Ranking in Data Integration: 112th
Average Rating: 0.0
Reviews Sentiment: 8.3
Number of Reviews: 1
Ranking in other categories: Master Data Management (MDM) Software (21st)
 

Mindshare comparison

As of April 2026, in the Data Integration category, the mindshare of IBM InfoSphere DataStage is 1.9%, down from 5.4% a year earlier. The mindshare of WinPure Data Quality Platform is 0.3%, up from 0.0% a year earlier. Mindshare is calculated from PeerSpot user engagement data.
Data Integration Mindshare Distribution
Product | Mindshare (%)
IBM InfoSphere DataStage | 1.9%
WinPure Data Quality Platform | 0.3%
Other | 97.8%
 

Featured Reviews

Prasad Bodduluri - PeerSpot reviewer
Senior Data Warehouse Developer at itcinfotech
Has required complex workarounds for scripts and struggles with unstructured data processing
There is no issue with IBM InfoSphere DataStage's graphical interface for designing data flows, but I will provide feedback that we are gathering the source from the Oracle database mainly, as well as from some spreadsheets. With respect to the Oracle DB Connector, if you write any PL/SQL or SQL with the connectors, there aren't many options, such as executing procedures in the PL/SQL, executing functions, or executing packages. The Oracle connector doesn't have many features and needs improvement. Nowadays many people are writing programs in Python or in PL/SQL with respect to Oracle, so especially in IBM InfoSphere DataStage, there are no features to call programs directly instead of calling them as a script. What I am facing, especially with parallel processing, is that a developer and admin have to sit together. They have to run the job multiple times with different combinations of parallel processing to get the best performance. Instead of that, if the job itself gave some guidance, such as running this parallel processing with this many nodes, it would help; I think that is missing. An additional feature I would want to see in the next release is the ability to work on logs, especially machine logs or artificial logs, to pull semi-structured or unstructured data without having to write extensive code in Python and integrate it. If IBM InfoSphere DataStage provided some feature for this, it would help.
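The manual tuning loop this reviewer describes (rerunning a job with different degrees of parallelism and keeping the fastest) can be sketched generically. This is a minimal illustration, not a DataStage API: `pick_best_node_count` and `fake_run_job` are hypothetical names, and the cost model inside `fake_run_job` is invented purely so the example runs. In practice `run_job` would be replaced by whatever launches the real job with a given node count and returns its elapsed time in seconds.

```python
from statistics import median
from typing import Callable, Dict, Iterable, Tuple


def pick_best_node_count(
    run_job: Callable[[int], float],
    candidates: Iterable[int],
    repeats: int = 3,
) -> Tuple[int, Dict[int, float]]:
    """Run the job `repeats` times per node count and return the fastest setting.

    `run_job(nodes)` must return the elapsed time in seconds for one run.
    The median is used so a single slow run does not skew the comparison.
    """
    timings: Dict[int, float] = {}
    for nodes in candidates:
        samples = [run_job(nodes) for _ in range(repeats)]
        timings[nodes] = median(samples)
    best = min(timings, key=timings.get)
    return best, timings


def fake_run_job(nodes: int) -> float:
    # Hypothetical stand-in for a real job launch: the work divides across
    # nodes, but each extra node adds a fixed coordination overhead.
    return 120.0 / nodes + 5.0 * nodes


best, timings = pick_best_node_count(fake_run_job, [1, 2, 4, 8])
print(f"best node count: {best}")
for nodes, seconds in sorted(timings.items()):
    print(f"{nodes} nodes: {seconds:.1f}s")
```

With the invented cost model, the sweep shows why more nodes are not always faster: the overhead term eventually outweighs the gain from splitting the work.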
reviewer1126923 - PeerSpot reviewer
Systems Architect at a tech services company with 1,001-5,000 employees
A solution that offers good performance, good stability, and a straightforward setup
We are a reseller of the solution. We handle on-premises deployment. The solution is very similar to IBM. It may have a bit more integration capabilities that IBM doesn't have yet, but they are similar. In terms of which would be better, it would come down to the customer's requirements. I'd rate the solution nine out of ten. All of our customers are very happy and very impressed with the product.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"Compared to other ETL tools, DataStage has excellent debugging and development capabilities. And the availability of connectors, even though we sometimes have to opt for specific ones. Also, the availability of patches is good."
"The solution is stable."
"If you have the budget and your solution requires industrial/enterprise strength data integration, this product is always a good choice."
"The best feature of IBM InfoSphere DataStage for me was that it was very much user-friendly, the solution did not require that much raw coding because most of its features were drag and drop, plus it had a large number of functionalities."
"Once you have Infosphere up and running properly, it is stable."
"The solution is very easy to use."
"It generates highly efficient backend code to write data onto IBM systems, which I find valuable."
"It works with multiple servers and offers high availability."
"With regards to choosing flash versus traditional storage, it's just so much simpler to manage and users get so much better performance from flash."
 

Cons

"From a practice point of view, solutions such as IBM InfoSphere DataStage and Oracle Data Integrator are losing ground, whereas open-source solutions are becoming increasingly powerful."
"Reduced cost would allow more customers to choose the product. It's quite expensive in relation to the cost of other similar solutions."
"The error messaging needs to be improved."
"Many companies are moving away from DataStage because it is expensive."
"Its loading process is very slow. It takes a lot of time for around 5 or 6 million records, and we are not able to provide real-time data to the vendors due to this delay."
"Working with some of the big data components is good, but I can see improvements are needed."
"What needs improvement in IBM InfoSphere DataStage is its pricing. The pricing for the solution is higher than its competitors, so a lot of the clients my company has worked with prefer other tools over IBM InfoSphere DataStage because of the high price tag."
"High-cost of ownership: They could take a page from open source software, such as Talend."
"The pricing model could use improvement."
 

Pricing and Cost Advice

"It's very expensive."
"The cost is too high."
"The pricing depends on the setup. However, we paid $100,000 as a one-time cost for an on-premises setup."
"The solution is cheap."
"The price is expensive but there are no licensing fees."
"Small and medium-sized companies cannot afford to pay for this solution."
"I have no information on the exact pricing for IBM InfoSphere DataStage because the solution is usually procured by the clients my company works with, though the pricing is higher compared to other solutions, so many clients choose to go with a different solution rather than IBM InfoSphere DataStage."
"Our internal team takes care of group licensing and cost. We don't have individual licenses. We have group licensing at the company level. Usually, IBM doesn't charge anything separately on the licensing side. For storage and everything else, we are paying around $6,000 per month, which is not very high. It includes Linux data storage, execution, and licensing. They're charging $40 for one-hour execution. Based on that, we are spending around $2,000 on the production environment and $1,000 on the lower environment for testing and development-side executions. For the mainframe, we are using the Db2 mainframe database, and we are spending around $1,000 on the Db2 mainframe database as well. All this comes out to be around $6,000. We, however, would like to have some cost reduction."
Information not available
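The monthly figures in the last IBM InfoSphere DataStage pricing quote above can be cross-checked with a little arithmetic. The sketch below uses only the approximate numbers the reviewer gives. It assumes, as the quote's "based on that" suggests but does not confirm, that the production and lower-environment amounts are entirely hourly execution charges. The variable names are illustrative.

```python
# Figures quoted by the reviewer (all approximate, USD per month).
HOURLY_EXECUTION_RATE = 40
PRODUCTION_SPEND = 2_000
LOWER_ENV_SPEND = 1_000
DB2_MAINFRAME_SPEND = 1_000
STATED_MONTHLY_TOTAL = 6_000

# Assumption: production and lower-environment spend is purely hourly
# execution charges, so dividing by the rate gives implied execution hours.
production_hours = PRODUCTION_SPEND / HOURLY_EXECUTION_RATE
lower_env_hours = LOWER_ENV_SPEND / HOURLY_EXECUTION_RATE

# Sum of the amounts the reviewer itemizes, and what is left unexplained.
itemized = PRODUCTION_SPEND + LOWER_ENV_SPEND + DB2_MAINFRAME_SPEND
unitemized = STATED_MONTHLY_TOTAL - itemized

print(f"implied production execution hours: {production_hours:.0f}")
print(f"implied lower-environment execution hours: {lower_env_hours:.0f}")
print(f"itemized spend: ${itemized:,}")
print(f"not itemized in the quote: ${unitemized:,}")
```

The itemized amounts come to about $4,000 of the stated $6,000. The quote does not break down the remaining $2,000 or so, though it mentions that the total includes storage and licensing.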
885,444 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm: 24%
Government: 9%
Manufacturing Company: 8%
Computer Software Company: 6%
No data available
 

Company Size

By reviewers
Company Size | Count
Small Business | 23
Midsize Enterprise | 4
Large Enterprise | 26
No data available
 

Questions from the Community

Would you upgrade to more premium versions of IBM InfoSphere DataStage?
My company currently uses the free version of the product, and we are definitely switching to a paid one. We needed a tool that can help us not only integrate our data but use it effectively. For ...
Is IBM InfoSphere DataStage more difficult to use compared to other tools in the field?
I think the tool may cause some difficulties if you have not used other data integration solutions before. I have worked at companies that used different tools for data integration, and they work ...
Do you rely on IBM Cloud Paks for your data? Have you utilized this product, or do you use IBM InfoSphere DataStage without it?
IBM Cloud Paks makes a big difference in your data integration. My company has been using it alongside IBM InfoSphere DataStage and while the main product is good on its own, this one truly expands...
 

Overview

 

Sample Customers

Dubai Statistics Center, Etisalat Egypt
Birmingham Hippodrome, Luton Borough Council, Apex Innovations, Sandia National Laboratories, Clear Blue, Vodafone
Find out what your peers are saying about Microsoft, Informatica, Qlik and others in Data Integration. Updated: March 2026.