Coming October 25: PeerSpot Awards will be announced! Learn more

IBM InfoSphere DataStage vs Informatica Cloud Data Integration comparison

Cancel
You must select at least 2 products to compare!
Comparison Buyer's Guide
Executive Summary

We performed a comparison between IBM InfoSphere DataStage and Informatica Cloud Data Integration based on real PeerSpot user reviews.

Find out in this report how the two Data Integration Tools solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI.

To learn more, read our detailed IBM InfoSphere DataStage vs. Informatica Cloud Data Integration report (Updated: August 2022).
633,952 professionals have used our research since 2012.
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"It is really easy to set up and the interface is easy to use.""In StreamSets, everything is in one place.""StreamSets data drift feature gives us an alert upfront so we know that the data can be ingested. Whatever the schema or data type changes, it lands automatically into the data lake without any intervention from us, but then that information is crucial to fix for downstream pipelines, which process the data into models, like Tableau and Power BI models. This is actually very useful for us. We are already seeing benefits. Our pipelines used to break when there were data drift changes, then we needed to spend about a week fixing it. Right now, we are saving one to two weeks. Though, it depends on the complexity of the pipeline, we are definitely seeing a lot of time being saved.""It is a very powerful, modern data analytics solution, in which you can integrate a large volume of data from different sources. It integrates all of the data and you can design, create, and monitor pipelines according to your requirements. It is an all-in-one day data ops solution.""StreamSets’ data drift resilience has reduced the time it takes us to fix data drift breakages. For example, in our previous Hadoop scenario, when we were creating the Sqoop-based processes to move data from source to destinations, we were getting the job done. That took approximately an hour to an hour and a half when we did it with Hadoop. However, with the StreamSets, since it works on a data collector-based mechanism, it completes the same process in 15 minutes of time. Therefore, it has saved us around 45 minutes per data pipeline or table that we migrate. Thus, it reduced the data transfer, including the drift part, by 45 minutes.""I have used Data Collector, Transformer, and Control Hub products from StreamSets. What I really like about these products is that they're very user-friendly. People who are not from a technological or core development background find it easy to get started and build data pipelines and connect to the databases. They would be comfortable like any technical person within a couple of weeks."

More StreamSets Pros →

"We are mostly using transmission rules. It has a lot of functions and logic related to transmission. It is a user-friendly tool with in-built functions.""The most valuable feature is the data integration for data warehousing.""When we have needed help from the IBM team, they were helpful. Our company is a premium partner so we get fast responses.""The performance optimization is quite good in DataStage. It provides parallelism and pipelining mechanisms""Offers great flexibility.""The best feature of IBM InfoSphere DataStage for me was that it was very much user-friendly. The solution didn't require that much raw coding because most of its features were drag and drop, plus it had a large number of functionalities.""It is quite useful and powerful.""The Hierarchical Data Stage is good."

More IBM InfoSphere DataStage Pros →

"It has all the advantages of the Cloud in that you can use it without worrying about infrastructure, upkeep, or upgrades.""Informatica Cloud Data Integration is stable.""The solution's initial setup is quite straightforward.""The mass ingestion functionality and the elasticity of the solution are great.""We have a lot of integrations, and it's very easy to create integrations. They have a lot of connectors.""Informatica Cloud Data Integration is stable.""Their new licensing is very flexible. With Informatica Cloud, you have plenty of items under the same umbrella, such as services, offerings, data quality, and data masking. You have also got master data management and API management. What I really like about them is that you don't need to go to Informatica and say that you need a data integration module. You would say that you need iPaaS or Informatica Cloud. They'll then try to understand your needs and give you IPUs, which are the processing units. If I purchased a hundred IPUs from Informatica as a customer, I can use 70 IPUs for data integration. I would also need data quality, so I can use 10 IPUs for data quality. I can use the remaining 20 IPUs for API management. Down the line, if I see that my initial data integration needs for the development phase are met, then out of the 70 IPUs assigned for data integration, I can use 30 IPUs for data masking. I can shuffle these numbers in any way within the Informatica Cloud umbrella for the tenure for which I have subscribed to these IPUs. I can use all services the way I want. This flexibility is what I really love about Informatica. It also has got good connectors.""The product is scalable and excellent for enterprise-level organizations."

More Informatica Cloud Data Integration Pros →

Cons
"We've seen a couple of cases where it appears to have a memory leak or a similar problem.""We create pipelines or jobs in StreamSets Control Hub. It is a great feature, but if there is a way to have a folder structure or organize the pipelines and jobs in Control Hub, it would be great. I submitted a ticket for this some time back.""Sometimes, when we have large amounts of data that is very efficiently stored in Hadoop or Kafka, it is not very efficient to run it through StreamSets, due to the lack of efficiency or the resources that StreamSets is using.""If you use JDBC Lookup, for example, it generally takes a long time to process data.""Currently, we can only use the query to read data from SAP HANA. What we would like to see, as soon as possible, is the ability to read from multiple tables from SAP HANA. That would be a really good thing that we could use immediately. For example, if you have 100 tables in SQL Server or Oracle, then you could just point it to the schema or the 100 tables and ingestion information. However, you can't do that in SAP HANA since StreamSets currently is lacking in this. They do not have a multi-table feature for SAP HANA. Therefore, a multi-table origin for SAP HANA would be helpful.""The logging mechanism could be improved. If I am working on a pipeline, then create a job out of it and it is running, it will generate constant logs. So, the logging mechanism could be simplified. Now, it is a bit difficult to understand and filter the logs. It takes some time."

More StreamSets Cons →

"In the future, I would like to see more integration with cloud technologies.""The interface needs improvement.""Currently lacking virtualization ability.""What needs improvement in IBM InfoSphere DataStage is its pricing. The pricing for the solution is higher than its competitors, so a lot of the clients my company has worked with prefer other tools over IBM InfoSphere DataStage because of the high price tag. Another area for improvement in the solution stems from a lot of new types of databases, for example, databases in the cloud and big data have become available, and IBM InfoSphere DataStage is working on various connectors for different data sources, but that still isn't up-to-date, meaning that some connectors are missing for modern data sources. The latest version of IBM InfoSphere DataStage also has a complex architecture, so my team faced frequent outages and that should be improved as well.""The response time from support is slow and needs to be improved.""The initial setup could be more straightforward.""Their web interface is good but the on-prem sites are outdated. The solution could also be improved if they could integrate the data pipeline scheduling part of their interface.""It would be useful to provide support for Python, AR, and Java."

More IBM InfoSphere DataStage Cons →

"The cloud version of the Informatica, it's a very substandard product. They might say it's enterprise-ready but it's not at all ready. They need to add more features, such as improved data replication features. If you look at other tools, such as Matillion they are now cloud-native and flexible. Additionally, Informatica Cloud Data Integration should have a good migration strategy from Informatica PowerCenter to Informatica Cloud Data Integration.""There may be some types of limitations with the performance.""Informatica Cloud Data Integration could improve the price by making it less expensive.""The current features are a bit complicated, and we need to write big scripts and test.""I have received feedback from certain teams and there is a steep learning curve to use this solution.""Cost-wise, it could be better.""The regions in which the data resides are still limited. This could be an issue in terms of the data residency laws of some of the countries. They should get more regions.""Its pricing model can be improved. The response time from technical support can also be improved."

More Informatica Cloud Data Integration Cons →

Pricing and Cost Advice
  • "We are running the community version right now, which can be used free of charge."
  • "StreamSets Data Collector is open source. One can utilize the StreamSets Data Collector, but the Control Hub is the main repository where all the jobs are present. Everything happens in Control Hub."
  • "It has a CPU core-based licensing, which works for us and is quite good."
  • "There are different versions of the product. One is the corporate license version, and the other one is the open-source or free version. I have been using the corporate license version, but they have recently launched a new open-source version so that anybody can create an account and use it. The licensing cost varies from customer to customer. I don't have a lot of input on that. It is taken care of by PMO, and they seem fine with its pricing model. It is being used enterprise-wide. They seem to have got a good deal for StreamSets."
  • "The pricing is good, but not the best. They have some customized plans you can opt for."
  • More StreamSets Pricing and Cost Advice →

  • "It's very expensive."
  • "Our internal team takes care of group licensing and cost. We don't have individual licenses. We have group licensing at the company level. Usually, IBM doesn't charge anything separately on the licensing side. For storage and everything else, we are paying around $6,000 per month, which is not very high. It includes Linux data storage, execution, and licensing. They're charging $40 for one-hour execution. Based on that, we are spending around $2,000 on the production environment and $1,000 on the lower environment for testing and development-side executions. For the mainframe, we are using the Db2 mainframe database, and we are spending around $1,000 on the Db2 mainframe database as well. All this comes out to be around $6,000. We, however, would like to have some cost reduction."
  • "The price is expensive but there are no licensing fees."
  • "It is quite expensive."
  • "It's quite expensive."
  • "I have no information on the exact pricing for IBM InfoSphere DataStage because the solution is usually procured by the clients my company works with, though the pricing is higher compared to other solutions, so many clients choose to go with a different solution rather than IBM InfoSphere DataStage."
  • More IBM InfoSphere DataStage Pricing and Cost Advice →

  • "Our customers sometimes are able to negotiate a much better price for Informatica Cloud Data Integration based on their relationship with the vendor."
  • "Its pricing model can be improved."
  • "I'm not sure about the most recent pricing trends, but I don't believe it's significantly different from PowerCenter. I believe it is nearly the same."
  • "The price of Informatica Cloud Data Integration could be reduced."
  • More Informatica Cloud Data Integration Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which Data Integration Tools solutions are best for your needs.
    633,952 professionals have used our research since 2012.
    Questions from the Community
    Top Answer:It is really easy to set up and the interface is easy to use.
    Top Answer:We've seen a couple of cases where it appears to have a memory leak or a similar problem. It grows for a bit and then… more »
    Top Answer:We typically use it to transport our Oracle raw datasets up to Microsoft Azure, and then into SQL databases there.
    Top Answer:The best feature of IBM InfoSphere DataStage for me was that it was very much user-friendly. The solution didn't require… more »
    Top Answer:I have no information on the exact pricing for IBM InfoSphere DataStage because the solution is usually procured by the… more »
    Top Answer:What needs improvement in IBM InfoSphere DataStage is its pricing. The pricing for the solution is higher than its… more »
    Top Answer:Azure Data Factory is a solid product offering many transformation functions; It has pre-load and post-load… more »
    Top Answer:Complex transformations can easily be achieved using PowerCenter, which has all the features and tools to establish a… more »
    Comparisons
    Learn More
    StreamSets
    Video Not Available
    Overview

    StreamSets offers an end-to-end data integration platform to build, run, monitor and manage smart data pipelines that deliver continuous data for DataOps, and power the modern data ecosystem and hybrid integration.

    Only StreamSets provides a single design experience for all design patterns for 10x greater developer productivity; smart data pipelines that are resilient to change for 80% less breakages; and a single pane of glass for managing and monitoring all pipelines across hybrid and cloud architectures to eliminate blind spots and control gaps.

    With StreamSets, you can deliver the continuous data that drives the connected enterprise.

    IBM InfoSphere DataStage integrates data across multiple systems using a high performance parallel framework, and it supports extended metadata management and enterprise connectivity. The scalable platform provides more flexible integration of all types of data, including big data at rest (Hadoop-based) or in motion (stream-based), on distributed and mainframe platforms.
    Informatica Cloud Data Integration is a unified and modern user experience across all iPaaS patterns, supported by a meta-data driven, modular, artificial intelligence-based microservices architecture tailored for IT, developers, and business users. It help delivers accessible, trusted, and secure data to facilitate more valuable business decisions and identify competitive advantages. Informatica Cloud Data Integration can help to distribute data warehouse and analytics projects. Features include the following: Deliver integrated and relevant data, Scale with data synchronization and replication, Integrate asset management, Support high-volume workloads, Design cloud mappings and Execute visual data integration flows.
    Offer
    Learn more about StreamSets
    Learn more about IBM InfoSphere DataStage
    Learn more about Informatica Cloud Data Integration
    Sample Customers
    Availity, BT Group, Humana, Deluxe, GSK, RingCentral, IBM, Shell, SamTrans, State of Ohio, TalentFulfilled, TechBridge
    Dubai Statistics Center, Etisalat Egypt
    Chicago Cubs, Telegraph Media Group
    Top Industries
    VISITORS READING REVIEWS
    Financial Services Firm16%
    Computer Software Company15%
    Insurance Company8%
    Government7%
    REVIEWERS
    Computer Software Company70%
    Aerospace/Defense Firm10%
    Healthcare Company10%
    Financial Services Firm10%
    VISITORS READING REVIEWS
    Financial Services Firm19%
    Computer Software Company17%
    Comms Service Provider11%
    Insurance Company7%
    REVIEWERS
    Computer Software Company47%
    Pharma/Biotech Company13%
    Individual & Family Service7%
    Manufacturing Company7%
    VISITORS READING REVIEWS
    Computer Software Company21%
    Financial Services Firm12%
    Energy/Utilities Company8%
    Insurance Company8%
    Company Size
    REVIEWERS
    Small Business14%
    Midsize Enterprise29%
    Large Enterprise57%
    VISITORS READING REVIEWS
    Small Business15%
    Midsize Enterprise11%
    Large Enterprise74%
    REVIEWERS
    Small Business45%
    Midsize Enterprise3%
    Large Enterprise52%
    VISITORS READING REVIEWS
    Small Business13%
    Midsize Enterprise11%
    Large Enterprise76%
    REVIEWERS
    Small Business21%
    Midsize Enterprise25%
    Large Enterprise54%
    VISITORS READING REVIEWS
    Small Business14%
    Midsize Enterprise11%
    Large Enterprise75%
    Buyer's Guide
    IBM InfoSphere DataStage vs. Informatica Cloud Data Integration
    August 2022
    Find out what your peers are saying about IBM InfoSphere DataStage vs. Informatica Cloud Data Integration and other solutions. Updated: August 2022.
    633,952 professionals have used our research since 2012.

    IBM InfoSphere DataStage is ranked 7th in Data Integration Tools with 10 reviews while Informatica Cloud Data Integration is ranked 3rd in Cloud Data Integration with 10 reviews. IBM InfoSphere DataStage is rated 7.8, while Informatica Cloud Data Integration is rated 8.0. The top reviewer of IBM InfoSphere DataStage writes "Robust, easy to use, has a simple error logging mechanism, and works very well for huge volumes of data". On the other hand, the top reviewer of Informatica Cloud Data Integration writes "A UI-based tool with great scripting ". IBM InfoSphere DataStage is most compared with SSIS, Talend Open Studio, AWS Glue and Informatica PowerCenter, whereas Informatica Cloud Data Integration is most compared with Informatica PowerCenter, Azure Data Factory, AWS Glue, Mule Anypoint Platform and Qlik Replicate. See our IBM InfoSphere DataStage vs. Informatica Cloud Data Integration report.

    We monitor all Data Integration Tools reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.