IBM InfoSphere DataStage Overview

IBM InfoSphere DataStage is ranked #13 among the top Data Integration Tools. PeerSpot users give it an average rating of 7.8 out of 10. It is most commonly compared to SSIS. The solution is popular among large enterprises, which account for 76% of users researching it on PeerSpot. The top industry researching this solution is financial services, accounting for 23% of all views.
IBM InfoSphere DataStage Buyer's Guide

Download the IBM InfoSphere DataStage Buyer's Guide including reviews and more. Updated: May 2023

What is IBM InfoSphere DataStage?

IBM InfoSphere DataStage is a data integration tool for designing, developing, and running jobs that move and transform data for organizations of different sizes. The product integrates data across multiple systems through a high-performance parallel framework. It supports extended metadata management, enterprise connectivity, and integration of all types of data.

The solution is the data integration component of IBM InfoSphere Information Server, providing a graphical framework for moving data from source systems to target systems. IBM InfoSphere DataStage can deliver data to data warehouses, data marts, operational data sources, and other enterprise applications. The tool supports both extract, transform, and load (ETL) and extract, load, and transform (ELT) patterns. The platform achieves its scalability through parallel processing and enterprise connectivity.
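
As a rough illustration of how the ETL and ELT patterns differ, the following Python sketch runs the same records through both orderings. It is a generic, minimal sketch and does not use any DataStage API: the function names (extract, transform, run_etl, run_elt), the sample rows, and the in-memory lists standing in for staging and target tables are all assumptions made for this example.

```python
from typing import Iterable

Row = dict


def extract(source: Iterable[Row]) -> list[Row]:
    """Copy rows out of the (hypothetical) source system unchanged."""
    return [dict(row) for row in source]


def transform(rows: Iterable[Row]) -> list[Row]:
    """Drop rows with a missing amount and convert the amount to a float."""
    cleaned = []
    for row in rows:
        if row.get("amount") in (None, ""):
            continue  # skip incomplete records
        cleaned.append({**row, "amount": float(row["amount"])})
    return cleaned


def run_etl(source: Iterable[Row], target: list[Row]) -> None:
    """ETL: transform in the integration layer, then load the clean rows."""
    target.extend(transform(extract(source)))


def run_elt(source: Iterable[Row], staging: list[Row], target: list[Row]) -> None:
    """ELT: load raw rows into staging first, then transform them there."""
    staging.extend(extract(source))
    target.extend(transform(staging))


# Hypothetical sample data; the second row is incomplete on purpose.
source_rows = [
    {"id": 1, "amount": "10.50"},
    {"id": 2, "amount": ""},
    {"id": 3, "amount": "7"},
]

etl_target: list[Row] = []
run_etl(source_rows, etl_target)

elt_staging: list[Row] = []
elt_target: list[Row] = []
run_elt(source_rows, elt_staging, elt_target)
```

In this sketch both orderings end with the same cleaned rows in the target. The difference is that the ELT run also keeps the raw, untransformed rows in staging, which mirrors the idea of loading first and transforming inside the target system.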

The solution has various versions, catering to different types of companies, which include the Server Edition, the Enterprise Edition, and the MVS Edition. Depending on which version a company has bought, different goals can be achieved. They include the following:

  • Designing data flows to extract information from multiple sources, transform the data, and deliver it to target databases or applications.

  • Delivery of relevant and accurate data through direct connections to enterprise applications.

  • Reduction of development time and improvement of consistency through prebuilt functions.

  • Utilization of InfoSphere Information Server tools for accelerating the project delivery cycle.

IBM InfoSphere DataStage can be deployed in various ways, including:

  • As a service: The tool can be accessed through a subscription model, where its capabilities are part of IBM DataStage on IBM Cloud Pak for Data as a Service. This option offers full management on IBM Cloud.

  • On premises or in any cloud: The two editions - IBM DataStage Enterprise and IBM DataStage Enterprise Plus - can run workloads on premises or in any cloud when added to IBM DataStage on IBM Cloud Pak for Data as a Service.

  • On premises: The basic jobs of the tool can be run on premises using IBM DataStage.

IBM InfoSphere DataStage Features

The tool has various features through which users can integrate and utilize their data effectively. The components of IBM InfoSphere DataStage include:

  • AI services: The tool offers services such as data science, event messaging, data warehousing, and data virtualization. It accelerates processes through artificial intelligence (AI) and offers a connection with IBM Cloud Paks - the cloud-native insight platform of the solution.

  • Parallel engine: Through this feature, ETL performance can be optimized to process data at scale. This is achieved through the parallel engine and load balancing, which together maximize throughput.

  • Metadata support: This feature of the product uses the IBM Watson Knowledge Catalog to protect companies' sensitive data and monitor who can access it and at what levels.

  • Automated delivery pipelines: IBM InfoSphere DataStage reduces costs by automating continuous integration and continuous delivery (CI/CD) pipelines.

  • Prebuilt connectors: The feature for prebuilt connectivity and stages allows users to move data between multiple cloud sources and data warehouses, including IBM native products.

  • IBM DataStage Flow Designer: This feature offers assistance through machine learning design. The product offers its clients a user-friendly interface which facilitates the work process.

  • IBM InfoSphere QualityStage: The tool provides a feature that automatically resolves data quality issues and increases the reliability of the delivered data.

  • Automated failure detection: Through this feature, companies can reduce infrastructure management efforts, relying on the automated detection that the tool offers.

  • Distributed data processing: Cloud runtimes can be executed remotely through this feature while maintaining data sovereignty and decreasing costs.

IBM InfoSphere DataStage Benefits

This solution offers many benefits for the companies that utilize it for data integration. Some of these benefits include:

  • Increased speed of workload execution due to better balancing and a parallel engine.

  • Reduction of data movement costs through integrations and seamless design of jobs.

  • Modernization of data integration by extending the capabilities of companies' data.

  • Delivery of reliable data through IBM Cloud Pak for Data.

  • Utilization of a drag-and-drop interface which assists in the delivery of data without the need for code.

  • Effective data manipulation allows data to be merged before being mapped and transformed.

  • Easier user access to data through visual maps of the process and the delivered data.

Reviews from Real Users

A data/solution architect at a computer software company says the product is robust, easy to use, has a simple error logging mechanism, and works very well for huge volumes of data.

Tirthankar Roy Chowdhury, team leader at Tata Consultancy Services, feels the tool is user-friendly with a lot of functionalities, and doesn't require much coding because of its drag-and-drop features.

IBM InfoSphere DataStage Customers

Dubai Statistics Center, Etisalat Egypt

IBM InfoSphere DataStage Pricing Advice

What users are saying about IBM InfoSphere DataStage pricing:
  • "I have no information on the exact pricing for IBM InfoSphere DataStage because the solution is usually procured by the clients my company works with, though the pricing is higher compared to other solutions, so many clients choose to go with a different solution rather than IBM InfoSphere DataStage."
  • "The pricing depends on the setup. However, we paid $100,000 as a one-time cost for an on-premises setup."
  • "It's quite expensive."
IBM InfoSphere DataStage Reviews

    Tirthankar Roy Chowdhury - PeerSpot reviewer
    Team lead at Tata Consultancy Services
    Real User
    Top 5, Leaderboard
    User-friendly with a lot of functionalities, and doesn't require much coding because of its drag-and-drop features
    Pros and Cons
    • "The best feature of IBM InfoSphere DataStage for me was that it was very much user-friendly. The solution didn't require that much raw coding because most of its features were drag and drop, plus it had a large number of functionalities."
    • "What needs improvement in IBM InfoSphere DataStage is its pricing. The pricing for the solution is higher than its competitors, so a lot of the clients my company has worked with prefer other tools over IBM InfoSphere DataStage because of the high price tag. Another area for improvement in the solution stems from a lot of new types of databases, for example, databases in the cloud and big data have become available, and IBM InfoSphere DataStage is working on various connectors for different data sources, but that still isn't up-to-date, meaning that some connectors are missing for modern data sources. The latest version of IBM InfoSphere DataStage also has a complex architecture, so my team faced frequent outages and that should be improved as well."

    What is our primary use case?

    IBM InfoSphere DataStage was mostly used for ETL (extract, transform, and load) and data integration purposes, including some data quality use cases. My team used the solution to extract data from various sources, do some business transformations, and then load the data into a target database or generate files.

    What is most valuable?

    The best feature of IBM InfoSphere DataStage for me was that it was very much user-friendly. The solution didn't require that much raw coding because most of its features were drag and drop, plus it had a large number of functionalities.

    What needs improvement?

    What needs improvement in IBM InfoSphere DataStage is its pricing. The pricing for the solution is higher than its competitors, so a lot of the clients my company has worked with prefer other tools over IBM InfoSphere DataStage because of the high price tag.

    Another area for improvement in the solution stems from a lot of new types of databases, for example, databases in the cloud and big data have become available, and IBM InfoSphere DataStage is working on various connectors for different data sources, but that still isn't up-to-date, meaning that some connectors are missing for modern data sources.

    The latest version of IBM InfoSphere DataStage also has a complex architecture, so my team faced frequent outages and that should be improved as well.

    For how long have I used the solution?

    I've been working with IBM InfoSphere DataStage for more than seven years.

    What do I think about the stability of the solution?

    IBM InfoSphere DataStage is a stable product and it's been in the market for quite some time, but in its latest version, there's been some instability caused by the new features introduced in the solution. The architecture was changed a lot, and that caused issues and frequent outages for which my company had to go back to IBM for troubleshooting. My team didn't face issues in the earlier version of IBM InfoSphere DataStage. It was the latest version that had instability issues.

    What do I think about the scalability of the solution?

    IBM InfoSphere DataStage is a very scalable product.

    How are customer service and support?

    IBM InfoSphere DataStage has pretty good technical support. With the new version, though, particularly the new architecture and the microservice concept, it sometimes takes a while even for the IBM team to figure out what's wrong. Once that's been figured out, the team comes up with a solution or a patch.

    How was the initial setup?

    Setting up IBM InfoSphere DataStage was easy.

    How long the deployment takes would depend on certain factors, but it usually takes just two to three hours.

    What's my experience with pricing, setup cost, and licensing?

    I have no information on the exact pricing for IBM InfoSphere DataStage because the solution is usually procured by the clients my company works with, though the pricing is higher compared to other solutions, so many clients choose to go with a different solution rather than IBM InfoSphere DataStage.

    What other advice do I have?

    The last version of IBM InfoSphere DataStage which I've worked with was version 11.7.

    I work for an IT service company that works with multiple clients on multiple projects, so close to two hundred people use IBM InfoSphere DataStage for various clients.

    Per project, on average, three people take care of IBM InfoSphere DataStage deployment, maintenance, and support-related activities.

    My advice to people looking into implementing IBM InfoSphere DataStage is that it's a very good product. A lot of similar products have come up nowadays, but this product has a pretty good reputation as it's been in the market for quite a while. I do think other products such as Talend, Informatica PowerCenter, and Informatica Data Quality are better than IBM InfoSphere DataStage.

    My rating for IBM InfoSphere DataStage is eight out of ten.

    My company has a partnership with IBM.

    Which deployment model are you using for this solution?

    On-premises
    Disclosure: My company has a business relationship with this vendor other than being a customer: Partner
    CEO at DELOMID IT
    Real User
    Top 20
    Powerful and agile with good support
    Pros and Cons
    • "It works with multiple servers and offers high availability."
    • "I'd like to be able to do more with the data and metadata, including copying and pasting, et cetera."

    What is our primary use case?

    We use the solution for data warehousing and data migration. 

    What is most valuable?

    The solution is one of the best solutions. 

    It is very powerful. It's quite agile. 

    It works with multiple servers and offers high availability. It can handle very complex architecture. It has good active-passive capabilities. It makes migrations very easy. 

    It is stable.

    The solution can scale. 

    It offers pretty good support services, at least in France.

    What needs improvement?

    A lot about the solution could be improved. 

    I'd like to be able to do more with the data and metadata, including copying and pasting, et cetera. It has become easier with the cloud, however. 

    I'd like to have the ability to customize code. 

    For how long have I used the solution?

    I've been using the solution for more than ten years. I've used it since 2006. I started with version 7.

    What do I think about the stability of the solution?

    The product is stable and reliable. I'd rate it eight out of ten. There are no bugs or glitches. It doesn't crash. 

    What do I think about the scalability of the solution?

    The solution is scalable. I'd rate the scalability eight out of ten. 

    We have six or seven developers on the solution. 

    How are customer service and support?

    I tend to do most of the troubleshooting. I do not need the assistance of technical support as I am quite knowledgeable. That said, my understanding is that support in France is very good. 

    Which solution did I use previously and why did I switch?

    I've also used SQL and Talend. I've also used Informatica, Spark, and AWS Glue. I use a variety of solutions for various clients. 

    How was the initial setup?

    The installation is pretty fast. It doesn't take too long. We did a deployment a few years ago, and it only took maybe two to three days. We updated it to version 11 at that point. The length of time depends on the architecture. It can vary a bit. 

    First, we have to install it on the web server. After that, we have to set up the repository with Oracle or DB2. That takes a lot of time. When you are a big organization, it takes a lot of people. There are configurations and prerequisites that have to be considered. 

    Only one person is needed to manage the maintenance. 

    What about the implementation team?

    We handle the installation ourselves. I handle it mostly on my own. 

    What was our ROI?

    I have not noted any ROI statistics.

    What's my experience with pricing, setup cost, and licensing?

    The pricing depends on the setup. However, we paid $100,000 as a one-time cost for an on-premises setup. 

    You do have extra costs when using the product on-premises. For example, you need to have servers to host it. 

    What other advice do I have?

    I used to be a partner with IBM. I have to reset the partnership. 

    I would recommend the solution for on-premises setups. 

    I would rate the solution eight out of ten. 

    Which deployment model are you using for this solution?

    On-premises
    Disclosure: I am a real user, and this review is based on my own experience and opinions.
    Shifa Shah - PeerSpot reviewer
    Data engineer at nust
    Real User
    Top 10
    A scalable ETL tool with a slow connection that can make it time-consuming to work on
    Pros and Cons
    • "The solution's scalability is really good...we are using multi-instance jobs where you can scale them easily."
    • "It takes a lot of time to actually trigger your job and then go into the logs and other stuff. So all of this is really time-consuming."

    What is our primary use case?

    Right now, I'm working for a telecom company. So, we are using IBM InfoSphere DataStage for constructing ETL jobs for them so that they can load data from their various different sources into their warehouse.


    What is most valuable?

    The valuable feature of the solution is, I think, its functionality. So, there are a lot of transformations that you can apply by just using a transformer. Also, you don't need to complicate your SQL queries while trying to transform your data. Hence, the transformer is something I like in the solution.


    What needs improvement?

    I don't know if it's just a problem with me, but the issue I see is that when we connect to the server from the client, especially when you're going to run a job or something, the whole connection is really slow. It takes a lot of time to actually trigger your job and then go into the logs and other stuff. So all of this is really time-consuming.


    For how long have I used the solution?

    I have been using IBM InfoSphere DataStage for five years. Also, I am using IBM InfoSphere DataStage Version 11.7. My company is a consultant for DataStage.


    What do I think about the stability of the solution?

    Most of the time, it is stable. Sometimes there are issues that you don't understand and that go away when you have a read-only job. But that is quite rare. Other times, it seems quite stable.

    What do I think about the scalability of the solution?

    The solution's scalability is really good. In terms of parallel jobs, we are using multi-instance jobs where you can scale them easily.

    In my company, my team is spread across multiple countries, including Pakistan and India.


    How are customer service and support?

    I haven't contacted IBM's technical support.


    How was the initial setup?

    The solution's initial setup is straightforward. Also, it's a one-time activity. It is better to have a competent person for deployment since newbies cannot do it themselves.

    When I started using IBM InfoSphere DataStage, it was already deployed on the server. So I did not have to go through the installation phase.


    What was our ROI?

    ROI is something that the client takes care of, and I think they must be keeping track of it and getting a certain result indicating a good ROI. So, that's why they may have continued using it over the years.

    Which other solutions did I evaluate?

    Before DataStage, I did not evaluate other options. Our client was already comfortable with DataStage, so that's what we had to use.


    What other advice do I have?

    I recommend that other people who want to use it go for DataStage on the cloud. The on-prem version of the solution looks and feels old. It's also time-consuming to work with. Overall, I rate the solution a six out of ten.


    Disclosure: My company has a business relationship with this vendor other than being a customer: Partner
    Program Manager at a consultancy with 10,001+ employees
    Real User
    Top 20
    The solution can incorporate very complex business rules, is moderately scalable, and is stable
    Pros and Cons
    • "The most valuable feature of the solution is the ability to incorporate very complex business rules in DataStage."
    • "The solution can be a bit more user-friendly, similar to Informatica."

    What is our primary use case?

    The solution is mainly used for marketing campaigns, customer segmentation, and home loans.

    What is most valuable?

    The most valuable feature of the solution is the ability to incorporate very complex business rules in DataStage.

    What needs improvement?

    The solution can be a bit more user-friendly, similar to Informatica.

    I would like the solution to have some basic streaming functionality added.

    For how long have I used the solution?

    I have been using the solution for one year.

    What do I think about the stability of the solution?

    We don't currently have much in our production environment. We are gradually moving into production, so whatever small setup we have is okay for now. Taking the overall perspective into account, we do have dependencies on other jobs. Based on the feedback we receive, we are sometimes unable to run our process because of those dependencies, and the jobs don't complete on time. We also received feedback that there was a problem handling data, which caused jobs to fail and need to be rerun. This could be product-specific, design-specific, or something else, but I think there is room for improvement in terms of stability. I would give the solution a seven out of ten.

    What do I think about the scalability of the solution?

    I think that scalable systems should also have good performance. The scalability of this solution in my opinion may not be on the same level as Informatica Power Exchange Data Integration.

    I give the scalability of the solution a seven out of ten. We are facing problems whenever we have huge amounts of data and there are job failures. We need to take care of how to tackle that situation. 

    How was the initial setup?

    We didn't need to do anything because the customer with whom we are working on the project had already set everything up for us. The initial setup was not in our purview.

    What other advice do I have?

    I give the solution a seven out of ten.

    We have a separate platform team or support team. Any query was routed to this team, which dealt internally with the DataStage people.

    I'm not a technical expert because I haven't been a developer for 12 years. This is what I understand from the feedback I've received. Informatica Power Exchange Data Integration is much better from a scalability perspective, compared to IBM InfoSphere DataStage. Scalability, user-friendliness, and inclusion of different business rules are all important, but I think Informatica Power Exchange Data Integration goes one step further on that.

    Which deployment model are you using for this solution?

    Hybrid Cloud

    If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

    Google
    Disclosure: My company has a business relationship with this vendor other than being a customer:
    Data analyst at ASR Nederland N.V.
    Real User
    Top 5, Leaderboard
    Reliable with good performance but has some data quality issues
    Pros and Cons
    • "The solution is stable."
    • "The initial setup can be complex."

    What is our primary use case?

    I'm working for an insurance provider. They have applications where they register claims, insurance, et cetera. We get a flat file from the vendor and put those flat files into our Oracle Data Warehouse and report on the data. We publish those reports to our institutional investors, partners, and business users, including banks. 

    What is most valuable?

    The performance is good.

    It is working fine.

    The solution is stable.

    It can scale well.

    What needs improvement?

    There can be data quality issues sometimes. It might not be the application. It may be a human error or an issue with the users or developers as well. 

    The initial setup can be complex. 

    For how long have I used the solution?

    I've been using the solution for a year.

    What do I think about the stability of the solution?

    The solution is very stable and reliable. We use it in production without issue.

    What do I think about the scalability of the solution?

    It is scalable and easy to extend. We have had issues with our data pipelines and were able to scale up within a few days. 

    There are around 150 users on the solution right now. 

    We do have plans to increase the number of users. We can easily increase or decrease as needed. 

    How are customer service and support?

    I've never used IBM technical support. 

    Which solution did I use previously and why did I switch?

    Another product was used previously. We are a financial services organization. We've used Oracle as the financial business suite. With Oracle, there is DB2 in the back end. There are SAP data services used as well, and there's integration between the applications. 

    How was the initial setup?

    I did not directly set up the solution. 

    Right now, we have the product on-premises.

    It is an IBM tool, and there are a lot of steps to deploy it. It's not straightforward. You need to have experience. You need to create replication objects, and you need to learn the commands and scripts. It can be quite complex. 

    What about the implementation team?

    You do need to have some third-party assistance. A consultant needs to assist with the setup. 

    What's my experience with pricing, setup cost, and licensing?

    The company has a contract with IBM. I'm not sure of the exact pricing. 

    What other advice do I have?

    I'd rate the solution seven out of ten. 

    I haven't used it too much. I need more time with the solution. 

    Whether another user should try it or not depends on the environment. If you already have a lot of Oracle applications, it might make sense. It can do everything any other ETL can do.

    Which deployment model are you using for this solution?

    On-premises
    Disclosure: I am a real user, and this review is based on my own experience and opinions.
    Owner at 7Spring Consult
    Real User
    Top 20
    Reliable, simple to install, and useful
    Pros and Cons
    • "It is quite useful and powerful."
    • "It would be useful to provide support for Python, R, and Java."

    What is our primary use case?

    I am a consultant. I provide product information for our clients.

    What is most valuable?

    IBM InfoSphere DataStage is a good product.

    It is quite useful and powerful.

    What needs improvement?

    From a practice point of view, solutions such as IBM InfoSphere DataStage and Oracle Data Integrator are losing ground, whereas open-source solutions are becoming increasingly powerful.

    For example, we are currently working hard on several examples, and in a few years, open-source solutions will take the lead in the market. They will be used by large enterprises. 

    Clients are looking for open-source solutions more and more.

    It would be useful to provide support for Python, R, and Java.

    For how long have I used the solution?

    I have more than 22 years of experience with many different products. 

    It has been three to four years that we have been using IBM InfoSphere DataStage.

    What do I think about the stability of the solution?

    I have no issues with the stability of IBM InfoSphere DataStage.

    How are customer service and support?

    Clients are quite dependent on support from the vendor. For example, if you want to activate a new feature on the product, you must create a ticket. You have no information on when it will be implemented, and the vendor does not know either because they have a stream of tickets that are completed in order of the priority given to each ticket.

    Which solution did I use previously and why did I switch?

    I am a consultant. I have different projects with different platforms. We are constantly going back and forth to different solutions for different projects.

    I have had clients who have used Amazon Redshift.

    Over the years, my clients have used many different products. For example, they use IBM Landscape and we use IBM InfoSphere.

    How was the initial setup?

    The initial setup was straightforward. We did not have issues.

    What's my experience with pricing, setup cost, and licensing?

    Comparable solutions share a common disadvantage, which is the total cost of the project.

    It's quite expensive.

    Which other solutions did I evaluate?

    From time to time, I evaluate different products for my clients.

    What other advice do I have?

    We have had different projects with three or four clients. The average term per project has been between nine months and one year.

    If you are working with an open-source solution or another solution, you can implement some features by yourself. For example, in the case of Amazon, which has AWS Lambda, you can easily write your code in Python or Java, and it will orchestrate it. You can easily create your features yourself, which gives you more ways to make your solution run quicker and eliminates the dependence on the vendor.

    I would rate IBM InfoSphere DataStage an eight out of ten.

    Disclosure: I am a real user, and this review is based on my own experience and opinions.
    Artur Kowalczyk - PeerSpot reviewer
    Technology Innovation Leader at Netrix S.A.
    Real User
    Top 5, Leaderboard
    Flexible with good connectivity and good modeling
    Pros and Cons
    • "We like the flexibility of modeling."
    • "The error messaging needs to be improved."

    What is our primary use case?

    The product is primarily used for intense data transformation. It's part of risk management and dataflow, and it sources data from the data warehouse on the SAP Sybase platform.

    What is most valuable?

    The connectivity with the databases and the speed and flexibility of modeling are excellent. We like the flexibility of modeling.

    The solution is stable.

    It can scale.

    What needs improvement?

    We'd like better integration with source control and error and diagnostic information. The error messaging needs to be improved. 

    The solution is a bit complicated. 

    For how long have I used the solution?

    I've been using the solution for four years. 

    What do I think about the stability of the solution?

    It's stable. It's reliable. There are no bugs or glitches. It doesn't crash or freeze. 

    What do I think about the scalability of the solution?

    We can scale the solution as needed. 

    There are about 50 users on the solution right now. 

    How are customer service and support?

    While technical support may have been used, I have never personally dealt with them.

    Which solution did I use previously and why did I switch?

    I've used SSIS as well and find this product to be more difficult to set up.

    How was the initial setup?

    The initial setup can be challenging. It's harder to set up than, for example, SSIS.

    I'm not sure how long it took to set up, as it was already in place when I joined the team. However, I would say it took a week to deploy.

    We have five people on hand that can handle deployment and maintenance tasks. They are all engineers. 

    What about the implementation team?

    The initial setup can be handled in-house. 

    What's my experience with pricing, setup cost, and licensing?

    The licensing we have is permanent. 

    What other advice do I have?

    I'd recommend the product to others. 

    I'd rate it a nine out of ten. We've been pleased with its capabilities overall. 

    Which deployment model are you using for this solution?

    On-premises
    Disclosure: I am a real user, and this review is based on my own experience and opinions.
    Utkarsh Shrivastava - PeerSpot reviewer
    ETL/Solution Architect at Crux
    Real User
    Top 20
    Good performance optimization and useful for ETL purposes when we're building data warehouses or data marts
    Pros and Cons
    • "The performance optimization is quite good in DataStage. It provides parallelism and pipelining mechanisms."
    • "In the future, I would like to see more integration with cloud technologies."

    What is our primary use case?

    The primary use case is ETL when we're building data warehouses or data marts. We use DataStage to get the data from disparate sources, do some ETL on it, and then load it into the data warehouse, database, or data mart.

    This solution used to be on-premises, but they've recently come out with a hybrid offering.

    What is most valuable?

    The performance optimization is quite good in DataStage. It provides parallelism and pipelining mechanisms. I have not found those in Informatica or Talend.

    What needs improvement?

    As a product, it needs to be more stable. It's a legacy product, so even though it's high-performing, it's not very stable compared to other products like Informatica or Talend. The UI also looks dated.

    In the future, I would like to see more integration with cloud technologies. Technical support could be improved.

    For how long have I used the solution?

    I've worked with DataStage for about 9 years.

    What do I think about the stability of the solution?

    The stability could be better.

    What do I think about the scalability of the solution?

    It's scalable.

    How are customer service and support?

    I would rate technical support 6 out of 10.

    How was the initial setup?

    For the on-prem solution, it was moderately complex. I'm not sure about the hybrid version.

    What other advice do I have?

    I would rate this solution 8 out of 10.

    Which deployment model are you using for this solution?

    Hybrid Cloud
    Disclosure: I am a real user, and this review is based on my own experience and opinions.