IBM InfoSphere DataStage vs Informatica PowerCenter comparison

Cancel
You must select at least 2 products to compare!
IBM Logo
11,157 views|9,214 comparisons
82% willing to recommend
Informatica Logo
19,928 views|16,697 comparisons
90% willing to recommend
Comparison Buyer's Guide
Executive Summary

We performed a comparison between IBM InfoSphere DataStage and Informatica PowerCenter based on real PeerSpot user reviews.

Find out in this report how the two Data Integration solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI.
To learn more, read our detailed IBM InfoSphere DataStage vs. Informatica PowerCenter Report (Updated: March 2024).
768,246 professionals have used our research since 2012.
Q&A Highlights
Question: How do you compare Informatica PowerCenter with IBM DataStage?
Answer: My experience with both of these tools is that differences are not that meaningful. Informatica has nicer UI but that's it. For 95% of projects one will not feel a difference nor from a performance point of view nor from development speed (assuming that you have developers experienced equally in both tools). So I would take into consideration the following factors: 1. Do you have specific requirements that can be addressed by each of these tools (you can check it by conducting PoC) 2. What will be a cost of the tool: 2.1 What technology is known to your developers? (training costs, time required to master technology by your developers, resources availability ) 2.2 What is the license cost? I do not want to diminish both technologies, but from a cost perspective, I know that it is worth to consider them only when you have daily more than 200 GB data with high to medium transformation complexity. Otherwise, there are cheaper tools on the market.
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"Offers great flexibility.""The solution has improved the time it takes to perform tasks related to batch applications.""The solution's scalability is really good...we are using multi-instance jobs where you can scale them easily.""Finding logs is very easy on the solution.""The performance optimization is quite good in DataStage. It provides parallelism and pipelining mechanisms""The best feature of IBM InfoSphere DataStage for me was that it was very much user-friendly. The solution didn't require that much raw coding because most of its features were drag and drop, plus it had a large number of functionalities.""The ETL tools are probably the most valuable feature. It has an IBM tool, a friendly UI and it makes things more comfortable.""I am impressed with the tool's ETL tracing."

More IBM InfoSphere DataStage Pros →

"It reduces a lot of legacy coding.""Good product if you are trying implement data quality, data integration, and data management projects.""The partitioning and optimization to help enhance our development is a very valuable aspect of Informatica PowerCenter.""The setup is straightforward.""The technical support is excellent.""The reliability of the product and the way of orchestration of different services is valuable to us.""The most complex task, in this case, was to read and transform BLOB data, and Java transformation in Informatica Power Center was a great solution.""Informatica PowerCenter is very good for integrating a huge amount of data in a very short duration, such as a minute. It is also very easy to use. After you provide the source and the target, mappings are automatically done, which makes it easy to use for the development team."

More Informatica PowerCenter Pros →

Cons
"What needs improvement in IBM InfoSphere DataStage is its pricing. The pricing for the solution is higher than its competitors, so a lot of the clients my company has worked with prefer other tools over IBM InfoSphere DataStage because of the high price tag. Another area for improvement in the solution stems from a lot of new types of databases, for example, databases in the cloud and big data have become available, and IBM InfoSphere DataStage is working on various connectors for different data sources, but that still isn't up-to-date, meaning that some connectors are missing for modern data sources. The latest version of IBM InfoSphere DataStage also has a complex architecture, so my team faced frequent outages and that should be improved as well.""Working with some of the big data components is good, but I can see improvements are needed.""Reduced cost would allow more customers to choose the product. It's quite expensive in relation to the cost of other similar solutions.""The pricing should be lower.""Their web interface is good but the on-prem sites are outdated. The solution could also be improved if they could integrate the data pipeline scheduling part of their interface.""In the future, I would like to see more integration with cloud technologies.""I really like this tool, but the administration should be on the same client application because a lot of administration features are not on the client-side, and they usually need to have administrative access. It's quite complicated to force IT teams to have separate administrative access from the developers.""We would be happy to see in next versions the ability to return several parameters from jobs. Now, jobs can return just one parameter. If they could return several parameters, that would be great."

More IBM InfoSphere DataStage Cons →

"The UI is outdated and old-fashioned, at least in our current version. Also, we have experienced some stability issues with the Workflow Monitor application.""Integration with Artificial Intelligence would benefit this solution.""What I didn't like about it is that the platform itself is not great at distributed processing. When you need high parallel processing, it has some inherent issues. We had to use Java transformation, and it did not go very well. I have heard that it is going to the cloud, but we haven't tried that.""Now they are migrating to a new version, and they have something that is called Informatica Developer. Previously, they just had PowerCenter. Now, when they move everything to Informatica Developer it's not as good or stable like it was when it was PowerCenter, though it has some nice features. This Developer tool could be better.""The licensing is difficult.""If you want to transfer a ZIP file, it is a pain. You need to use Command-Line. Sometimes we just want to transfer a file. It should be easy to move them from A to B.""What needs improvement in Informatica PowerCenter is the cloud experience because, nowadays, other companies, such as AWS, Azure, and Google, have more experience in the cloud. The pricing for Informatica PowerCenter on the cloud is also very expensive for customers, so some customers prefer open-source tools or lower-priced tools, such as Azure. From my point of view, Informatica must work on the pricing policy and review the policy on the cloud for Informatica PowerCenter or propose more tools with lower pricing. Clients want the automatic integration of Informatica PowerCenter with other tools. Currently, the integration process is manual, and you have to add other tools to facilitate the integration, especially with the DevOps methodology. You need scripts and tools for the integration, and you'll need to use other integration tools if you want automatic deployment for Informatica PowerCenter, so this is another area for improvement in the solution. What I'd like to see in the next release of the solution is for the integration with APIs to be simpler, because currently, the API integration feature of Informatica PowerCenter is very difficult. It's not intuitive. You have to facilitate API integration and the real-time streaming of messages in Kafka, for example, so that should be improved.""Integrated Reporting service should be more smoothly transitioned from view to function to be in sync with the main design."

More Informatica PowerCenter Cons →

Pricing and Cost Advice
  • "High-cost of ownership: They could take a page from open source software."
  • "Pricing varies based on use, and it is not as costly as some competing enterprise solutions."
  • "Small and medium-sized companies cannot afford to pay for this solution."
  • "The cost is too high."
  • "It's very expensive."
  • "Our internal team takes care of group licensing and cost. We don't have individual licenses. We have group licensing at the company level. Usually, IBM doesn't charge anything separately on the licensing side. For storage and everything else, we are paying around $6,000 per month, which is not very high. It includes Linux data storage, execution, and licensing. They're charging $40 for one-hour execution. Based on that, we are spending around $2,000 on the production environment and $1,000 on the lower environment for testing and development-side executions. For the mainframe, we are using the Db2 mainframe database, and we are spending around $1,000 on the Db2 mainframe database as well. All this comes out to be around $6,000. We, however, would like to have some cost reduction."
  • "The price is expensive but there are no licensing fees."
  • "It is quite expensive."
  • More IBM InfoSphere DataStage Pricing and Cost Advice →

  • "We have found the pricing very cost-effective. The licensing is CPU and data source-based."
  • "Cost could be improved."
  • "Licensing is a one time cost. But maintenance costs depend on what you want, how long you need it. Maintenance is a kind of insurance. With health insurance, you don't know whether you will get sick or need to go to hospital or not but you have to have insurance. It's the same thing with support. If you have that expertise in resolving issues, if you have enough experience in your IT department, I would say you don't need the support. But in practice, they recommend you go with the support. If you want support you have to pay for it."
  • "Price-wise, it's more expensive than SSIS, but it's a better tool, so it has more features. Licensing is on a yearly basis."
  • "Its maintenance is expensive."
  • "It's much more expensive, almost three times more expensive than most other solutions."
  • "We are satisfied with the pricing."
  • "I consider this to be an expensive product."
  • More Informatica PowerCenter Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which Data Integration solutions are best for your needs.
    768,246 professionals have used our research since 2012.
    Comparison Review
    Anonymous User
    Technology has made it easier for businesses to organize and manipulate data to get a clearer picture of what’s going on with their business. Notably, ETL tools have made managing huge amounts of data significantly easier and faster, boosting many organizations’ business intelligence operations There are many third-party vendors offering ETL solutions, but two of the most popular are PowerCenter Informatica and Microsoft SSIS (SQL Server Integration Services). Each technology has its advantages but there are also similarities on how they carry out the extract-transform-load processes and only differ in terminologies. If you’re in the process of choosing ETL tools and PowerCenter Informatica and Microsoft SSIS made it to your shortlist, here is a short comparative discussion detailing the differences between the two, as well as their benefits. Package Configuration Most enterprise data integration projects would require the capacity to develop a solution in one platform and test and deploy it in a separate environment without having to manually change the established workflow. In order to achieve this seamless movement between two environments, your ETL technology should allow the dynamic update of the project’s properties using the content or a parameter file or configuration. Both Informatica and SSIS support this functionality using different methodologies. In Informatica, every session can have more than one source and one or more destination connections. There are… Read more →
    Answers from the Community
    Miriam Tover
    Kirill Slivchikov - PeerSpot reviewerKirill Slivchikov
    Real User

    My experience with these products is telling me that:

    - Informatica is much more flexible, it has more points, where different types of codding and tuning are available. If the landscape is heterogeneous and complex - it’s the right choice. On the other hand, it works more slowly.

    - IBM DS is very strong in code efficiency and flows parallelism. If the landscape is IBM-oriented or not so complex, but data volumes are huge - it’s the right choice. On the other hand, coding and tuning abilities are more ascetic.

    - Informatica needs dedicated admin in the project team, IBM DS - does not.
    - Informatica has an evolving cloud version, IBM DS hasn’t yet.
    - Informatica is not proper working with Hadoop, IBM DS is.

    The pricing of both is more or less equal.

    Questions from the Community
    Top Answer: My company currently uses the free version of the product, and we are definitely switching to a paid one. We needed a tool that can help us not only integrate our data but use it effectively. For the… more »
    Top Answer: I think the tool may cause some difficulties if you have not used other data integration solutions before. I have worked at companies that used different tools for data integration, and they work… more »
    Top Answer:IBM Cloud Paks makes a big difference in your data integration. My company has been using it alongside IBM InfoSphere DataStage and while the main product is good on its own, this one truly expands… more »
    Top Answer:Azure Data Factory is flexible, modular, and works well. In terms of cost, it is not too pricey. It offers the stability and reliability I am looking for, good scalability, and is easy to set up and… more »
    Top Answer:SSIS PowerPack is a group of drag and drop connectors for Microsoft SQL Server Integration Services, commonly called SSIS. The collection helps organizations boost productivity with code-free… more »
    Top Answer:Complex transformations can easily be achieved using PowerCenter, which has all the features and tools to establish a real data governance strategy. Additionally, PowerCenter is able to manage huge… more »
    Ranking
    7th
    out of 100 in Data Integration
    Views
    11,157
    Comparisons
    9,214
    Reviews
    15
    Average Words per Review
    452
    Rating
    7.9
    3rd
    out of 100 in Data Integration
    Views
    19,928
    Comparisons
    16,697
    Reviews
    29
    Average Words per Review
    471
    Rating
    7.6
    Comparisons
    Also Known As
    PowerCenter
    Learn More
    Overview

    IBM InfoSphere DataStage is a high-quality data integration tool that aims to design, develop, and run jobs that move and transform data for organizations of different sizes. The product works by integrating data across multiple systems through a high-performance parallel framework. It supports extended metadata management, enterprise connectivity, and integration of all types of data.

    The solution is the data integration component of IBM InfoSphere Information Server, providing a graphical framework for moving data from source systems to target systems. IBM InfoSphere DataStage can deliver data to data warehouses, data marts, operational data sources, and other enterprise applications. The tool works with various types of patterns - extract, transform and load (ETL), and extract, load, and transform (ELT). The scalability of the platform is achieved by using parallel processing and enterprise connectivity.

    The solution has various versions, catering to different types of companies, which include the Server Edition, the Enterprise Edition, and the MVS Edition. Depending on which version a company has bought, different goals can be achieved. They include the following:

    • Designing data flows to extract information from multiple sources, transform the data, and deliver it to target databases or applications.

    • Delivery of relevant and accurate data through direct connections to enterprise applications.

    • Reduction of development time and improvement of consistency through prebuilt functions.

    • Utilization of InfoSphere Information Server tools for accelerating the project delivery cycle.

    IBM InfoSphere DataStage can be deployed in various ways, including:

    • As a service: The tool can be accessed from a subscription model, where its capabilities are a part of IBM DataStage on IBM Cloud Park for Data as a Service. This option offers full management on IBM Cloud.

    • On premises or in any cloud: The two editions - IBM DataStage Enterprise and IBM DataStage Enterprise Plus - can run workloads on premises or in any cloud when added to IBM DataStage on IBM Cloud Pak for Data as a Service.

    • On premises: The basic jobs of the tool can be run on premises using IBM DataStage.

    IBM InfoSphere DataStage Features

    The tool has various features through which users can integrate and utilize their data effectively. The components of IBM InfoSphere DataStage include:

    • AI services: The tool offers services such as data science, event messaging, data warehousing, and data virtualization. It accelerates processes through artificial intelligence (AI) and offers a connection with IBM Cloud Paks - the cloud-native insight platform of the solution.

    • Parallel engine: Through this feature, ETL performance can be optimized to process data at scale. This is achieved through parallel engine and load balancing, which maximizes throughput.

    • Metadata support: This feature of the product uses the IBM Watson Knowledge Catalog to protect companies' sensitive data and monitor who can access it and at what levels.

    • Automated delivery pipelines: IBM InfoSphere DataStage reduces costs by automating continuous integration and delivery of pipelines.

    • Prebuilt connectors: The feature for prebuilt connectivity and stages allows users to move data between multiple cloud sources and data warehouses, including IBM native products.

    • IBM DataStage Flow Designer: This feature offers assistance through machine learning design. The product offers its clients a user-friendly interface which facilitates the work process.

    • IBM InfoSphere QualityStage: The tool provides a feature that automatically resolves data quality issues and increases the reliability of the delivered data.

    • Automated failure detection: Through this feature, companies can reduce infrastructure management efforts, relying on the automated detection that the tool offers.

    • Distributed data processing: Cloud runtimes can be executed remotely through this feature while maintaining its sovereignty and decreasing costs.

    IBM InfoSphere DataStage Benefits

    This solution offers many benefits for the companies that utilize it for data integration. Some of these benefits include:

    • Increased speed of workload execution due to better balancing and a parallel engine.

    • Reduction of data movement costs through integrations and seamless design of jobs.

    • Modernization of data integration by extending the capabilities of companies' data.

    • Delivery of reliable data through IBM Cloud Pak for Data.

    • Utilization of a drag-and-drop interface which assists in the delivery of data without the need for code.

    • Effective data manipulation allows data to be merged before being mapped and transformed.

    • Creating easier access of users to their data by providing visual maps of the process and the delivered data.

    Reviews from Real Users

    A data/solution architect at a computer software company says the product is robust, easy to use, has a simple error logging mechanism, and works very well for huge volumes of data.

    Tirthankar Roy Chowdhury, team leader at Tata Consultancy Services, feels the tool is user-friendly with a lot of functionalities, and doesn't require much coding because of its drag-and-drop features.

    Informatica PowerCenter is a data integration and data visualization tool. The solution works as an enterprise data integration platform that helps organizations access, transform, and integrate data from various systems. The product is designed to support companies in the full cycle of a project, from its initial rollout to critical deployments. Informatica PowerCenter allows developers and analysts to collaborate while accelerating the work process to deploy projects within days instead of months.

    The Advanced edition of the product provides an additional real-time engine which allows companies to have always-on enterprise data integration. This ensures seamless collaboration and increment of data lineage visibility and impacts analysis.

    The Premium edition of the solution offers an early warning system that detects unexpected behaviors or incorrect utilization of resources in the workflows and alerts companies in the case that these occur. This version of the product also offers automatic data validation, which ensures data accuracy and reduces testing time and expenditure of resources for by up to 90%.

    Informatica PowerCenter Features

    The product provides users with various features which allow them to execute data integration initiatives such as analytics, data warehousing, data governance, consolidation, and application migration. The features of the solution include:

    • Collaboration: Informatica PowerCenter offers role-based tools and processes which enable business self-service while benefiting from high-quality IT resources.

    • Automation: Through various automations and easy-to-use software, users can utilize graphical and codeless tools and initiate effective data integration without additional knowledge.

    • Scalability: The tool provides high scalability to users, which ensures seamless performance and minimum downtime. PowerCenter also has adaptive load balancing, pushdown optimization, and dynamic partitioning.

    • Monitoring: Through the extensive monitoring feature, the operations and governance of the solution are easily overseen by users. The tool also provides alerts that can prevent damage to the system.

    • Real-time data: Through real-time data, users can monitor applications and analytics, ensuring their efficient operation.

    • Prototyping: Informatica lets its users collaborate with information technology to prototype, profile, and validate results in a timely manner.

    • Connectivity: Users can access and integrate data from different types of sources through high-performance connectors.

    • Automated data validation testing: The product offers script-free automated and repeatable audit and validation of data.

    • Data transformation: This feature allows users to use comprehensive parsing of JSON, PDF, XML, Microsoft Office, and the Internet of Things (IoT) for non-relation data.

    • Cloud applications connectivity: The product allows for seamless connection to cloud application sources and targets.

    Informatica PowerCenter Benefits

    The benefits of using Informatica PowerCenter include:

    • The tool can work over a wide range of systems and platforms and also allows for lean integration.

    • It enhances the quality and speed of performance and optimizes the cost of the process for your organization.

    • PowerCenter supports multiple databases, including TPump, Parallel Transporter Fastload, and Teradata MLoad.

    • The tool is very easy to monitor and maintain, which simplifies the data integration process for companies.

    • The centralized error logging system allows users to locate errors in a timely manner and correct them.

    • The tool can convert data from an application to another format, as it serves as one of the most powerful data transformation solutions.

    • PowerCenter can also serve as middleware between two applications.

    • The solution offers both parallel processing and load balancing.

    • PowerCenter is a tool with a high level of security, which also minimizes essential administration activities.

    • The solution ensures the quality of information, as it does not allow invalid or unwanted data to be uploaded to the source.

    Reviews from Real Users

    Yahya T., a developer and architect at L'Oreal, says the product is stable, provides good support, and integrating it with other systems is very fast.

    Mohamed E., a senior manager for Data management and data governance at a tech company, says PowerCenter is stable, mature, and offers flexibility in building the pipeline and has a drag-and-drop mode because it's GUI-based; technical support is brilliant.

    Sample Customers
    Dubai Statistics Center, Etisalat Egypt
    University of Texas MD Anderson Cancer Center, LexisNexis, Rabobank
    Top Industries
    REVIEWERS
    Computer Software Company50%
    Insurance Company14%
    Transportation Company7%
    Healthcare Company7%
    VISITORS READING REVIEWS
    Financial Services Firm26%
    Manufacturing Company11%
    Computer Software Company10%
    Insurance Company7%
    REVIEWERS
    Computer Software Company22%
    Financial Services Firm20%
    Insurance Company7%
    Retailer7%
    VISITORS READING REVIEWS
    Financial Services Firm18%
    Computer Software Company12%
    Manufacturing Company8%
    Insurance Company8%
    Company Size
    REVIEWERS
    Small Business45%
    Midsize Enterprise6%
    Large Enterprise49%
    VISITORS READING REVIEWS
    Small Business16%
    Midsize Enterprise9%
    Large Enterprise75%
    REVIEWERS
    Small Business16%
    Midsize Enterprise11%
    Large Enterprise73%
    VISITORS READING REVIEWS
    Small Business15%
    Midsize Enterprise11%
    Large Enterprise74%
    Buyer's Guide
    IBM InfoSphere DataStage vs. Informatica PowerCenter
    March 2024
    Find out what your peers are saying about IBM InfoSphere DataStage vs. Informatica PowerCenter and other solutions. Updated: March 2024.
    768,246 professionals have used our research since 2012.

    IBM InfoSphere DataStage is ranked 7th in Data Integration with 37 reviews while Informatica PowerCenter is ranked 3rd in Data Integration with 78 reviews. IBM InfoSphere DataStage is rated 7.8, while Informatica PowerCenter is rated 8.0. The top reviewer of IBM InfoSphere DataStage writes "User-friendly with a lot of functions for transmission rules, but has slow performance and not suitable for a huge volume of data". On the other hand, the top reviewer of Informatica PowerCenter writes "Stable, provides good support, and integrating it with other systems is very fast, but its pricing is expensive". IBM InfoSphere DataStage is most compared with IBM Cloud Pak for Data, SSIS, Azure Data Factory, Talend Open Studio and IBM InfoSphere Information Server, whereas Informatica PowerCenter is most compared with Informatica Cloud Data Integration, Azure Data Factory, SSIS, Databricks and SAP Data Services. See our IBM InfoSphere DataStage vs. Informatica PowerCenter report.

    See our list of best Data Integration vendors.

    We monitor all Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.