Try our new research platform with insights from 80,000+ expert users
Partner at Avydium Data LLC
User
Its parallel processing capability allows you to go through extremely large data sets in no time at all
Pros and Cons
  • "Highly customizable: Allowing you to handle multiple data latencies (scheduled batch, on-demand, and real-time) in the same job."
  • "Working with some of the big data components is good, but I can see improvements are needed."

What is our primary use case?

Complex data integration projects which require integration from multiple data sources.

How has it helped my organization?

I have worked during many implementations using DataStage. All of the projects that I worked on have been successful. This is due mainly to the strict discipline around best practices, and by following a set of standards and templates designed to reduce complexity and improve automation, including strong reference architecture.

What is most valuable?

  • Its parallel processing capability allows you to go through extremely large data sets in no time at all, if you do your job right. 
  • Highly customizable: Allowing you to handle multiple data latencies (scheduled batch, on-demand, and real-time) in the same job. 
  • High scalability: Start small and go big with the same job. You just need to adjust the configuration file, no need to recompile.
  • Strong metadata management: Business, technical, and process metadata can all be managed from a single place.
  • Ease of integration with other tool sets: Easily supports APIs (or build your own) to support data streaming (or batched) from other systems.
  • Data Quality Management from within the tool: Supporting data sampling, including profiling of data, directly from the development canvas.

What needs improvement?

High-cost of ownership: They could take a page from open source software, such as Talend.

Working with some of the big data components is good, but I can see improvements are needed, such as native support for Spark and HBase.

Buyer's Guide
IBM InfoSphere DataStage
October 2025
Learn what your peers think about IBM InfoSphere DataStage. Get advice and tips from experienced pros sharing their opinions. Updated: October 2025.
872,778 professionals have used our research since 2012.

For how long have I used the solution?

More than five years.

What do I think about the stability of the solution?

No issues.

What do I think about the scalability of the solution?

No issues.

How are customer service and support?

Support is always good.

Which solution did I use previously and why did I switch?

Have used quite a few ETL tools in my job.

  • Ab Initio: Even pricier, but has a highly competent ETL tool. It is complete, but hard to use. 
  • Informatica: Not as flexible and does not support the same level of complexity in its maps.
  • Talend: It is a good tool suite, extensive, but can be cumbersome to cite all its pieces.
  • ODI: For the Oracle centric world.
  • SSIS: Week when compared to any of the above tool sets.

How was the initial setup?

Depends on type of environment that is being installed. I have seen fairly simple to overly complex initial setups due to the environment, not due to the tool.

What about the implementation team?

Both vendor and in-house team implementations:

IBM has top-notch support and tool services along with other partners as well. Depending on the partner, this can go from installation and configuration to solution development, etc.)

Most in-house teams that I have seen tend to have have good developers, but not always good architects. Like most every data integration project, if you do not have a strong architecture, your solution will eventually fail.

What was our ROI?

Depends on the project.

Which other solutions did I evaluate?

Have done many ETL tool evaluations based on client requirements. DataStage has always been in the top-three. It may not have been selected due to different weights being used for different sections of the evaluation for different clients, but it has always been in the top-three consistently.

What other advice do I have?

If you have the budget and your solution requires industrial/enterprise strength data integration, this product is always a good choice.

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
PeerSpot user
Architect at a tech services company with 51-200 employees
Consultant
Top 20
It has unlimited database connectors and free-of-charge application connectors.

Valuable Features

  • Unlimited database connectors and free-of-charge application connectors - Siebel in our case
  • Lot of transformation components
  • High scalability

Improvements to My Organization

It's standardized our batch integration.

Room for Improvement

It needs a better scheduling mechanism.

Use of Solution

I've been using it as a customer for six years.

Deployment Issues

We had no issues with the deployment.

Stability Issues

We had issues with Java on AIX platform (version 8.7) - currently migrated to Linux platform without issues.

Scalability Issues

It's highly scalable.

Customer Service and Technical Support

We have close contact with support in their Polish lab.

Initial Setup

The initial setup is easy as it's done through a web installation process with an HA setup option.

Pricing, Setup Cost and Licensing

Check the Information Server bundle offering, especially with InfoSphere Information Server for Data Integration.

Other Solutions Considered

Our customer checked AbInitio and Informatica PowerCenter – DataStage was best as it could be delivered quickest.

Other Advice

You should also look at Redbooks and DeveloperWorks articles for knowledge gathering.

Disclosure: My company has a business relationship with this vendor other than being a customer. We are a BP of IBM.
PeerSpot user
Buyer's Guide
IBM InfoSphere DataStage
October 2025
Learn what your peers think about IBM InfoSphere DataStage. Get advice and tips from experienced pros sharing their opinions. Updated: October 2025.
872,778 professionals have used our research since 2012.
it_user273756 - PeerSpot reviewer
Solutions Specialist at a tech services company with 501-1,000 employees
Consultant
It has valuable administrative features, particularly since I don't program in this environment, but avoid the Netezza adapter as it's poor.

Valuable Features

Anything that is administrative related, as I don't program in this environment.

Room for Improvement

The recovery feature. We had DS repos in a bad condition, but IBM couldn't recover it.

Use of Solution

I've been using it for one year.

Deployment Issues

No issues encountered.

Stability Issues

No issues encountered.

Scalability Issues

No issues encountered.

Customer Service and Technical Support

Customer Service:

Mostly, I would give 9/10. I did have one bad experience, so that leaves a bad impression.

Technical Support:

It's generally good, although sometimes I see a lot of confusion about how to resolve issues.

Initial Setup

It was complex.

Implementation Team

It was setup by an outside vendor.

Other Solutions Considered

No other options were evaluated.

Other Advice

Don't use the Netezza adapter, as it's poor

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Buyer's Guide
Download our free IBM InfoSphere DataStage Report and get advice and tips from experienced pros sharing their opinions.
Updated: October 2025
Product Categories
Data Integration
Buyer's Guide
Download our free IBM InfoSphere DataStage Report and get advice and tips from experienced pros sharing their opinions.