IBM InfoSphere DataStage vs Talend Data integration comparison

IBM InfoSphere DataStage: 10,952 views | 9,105 comparisons | 82% willing to recommend
Talend Data integration: 267 views | 198 comparisons | 100% willing to recommend
Comparison Buyer's Guide
Executive Summary

We performed a comparison between IBM InfoSphere DataStage and Talend Data integration based on real PeerSpot user reviews.

Find out in this report how the two Cloud Data Integration solutions compare in terms of features, pricing, service and support, ease of deployment, and ROI.
To learn more, read our detailed IBM InfoSphere DataStage vs. Talend Data integration Report (Updated: March 2024).
769,630 professionals have used our research since 2012.
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
IBM InfoSphere DataStage:
  • "The solution has improved the time it takes to perform tasks related to batch applications."
  • "The most valuable feature is the data integration for data warehousing."
  • "Highly customizable: Allowing you to handle multiple data latencies (scheduled batch, on-demand, and real-time) in the same job."
  • "The performance optimization is quite good in DataStage. It provides parallelism and pipelining mechanisms"
  • "When we have needed help from the IBM team, they were helpful. Our company is a premium partner so we get fast responses."
  • "The solution is very easy to use."
  • "The solution is stable."
  • "The most valuable feature of the solution is the ability to incorporate very complex business rules in Data Stage."

Talend Data integration:
  • "We have multiple use cases for this solution. We integrate with Salesforce, SAP and Oracle databases to build business logic and provide reporting."
  • "I'm very passionate about this solution because if you look at any other tool that costs around $200 - $300,000, like Delphix which costs you a million dollars, Talend is very cheap and is almost is at par with what others can do. There is one thing which Delphix does which Talend cannot do, but overall, I would say apart from that, if you're looking for a solution, you should give it a try."
  • "The product's integration with PostgreSQL and Jira has been helpful for us. Its performance is good. However, we do not use it for large data sets."
  • "Talend Data integration has a wide library of connectors."

Cons
IBM InfoSphere DataStage:
  • "In terms of intermediate storage, we have some challenges, especially with customers who store data in intermediate locations."
  • "Reduced cost would allow more customers to choose the product. It's quite expensive in relation to the cost of other similar solutions."
  • "The initial setup could be more straightforward."
  • "It takes a lot of time to actually trigger your job and then go into the logs and other stuff. So all of this is really time-consuming."
  • "I'd like to be able to do more with the data and metadata, including copy and pasting, et cetera."
  • "Improvements for DataStage could include better integration with modern data sources like cloud solutions and documents, along with enhancing its capability to handle non-structured data."
  • "Their web interface is good but the on-prem sites are outdated. The solution could also be improved if they could integrate the data pipeline scheduling part of their interface."
  • "The graphical user interface (GUI) feels a lot like the interfaces from the 1980s."

Talend Data integration:
  • "Sometimes there are bugs which are unidentified and we have to follow-up with the Talend team to resolve them. In a critical situation, it takes time for them to update patches."
  • "The tool's technical support needs to be better. It doesn't have a local data center but pushes everything to the cloud. They need to check in with customers to see if they're happy and how well the solutions work. They need to assign a customer success manager for the accounts they sell."
  • "Due to using the open-source version of Talend Data Integration, which lacks a scheduler, our current approach involves developing jobs in Talend, exporting them as Java packages, and utilizing an external scheduler, such as Windows Scheduler, to manage the scheduling process."
  • "There are no concurrent licenses, they only have seat licenses on cloud. That's the whole challenge. For example, if in any project your headcount increases or decreases, you do not have that concurrence and you have a seat license, you run into challenges because you have to procure a few more licenses for getting the job done."
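The external-scheduler workaround described in the Talend cons can be sketched as a small launcher loop. This is a minimal illustration, not Talend's own tooling: it assumes the exported job can be started from the command line, and the launcher path mentioned in the comment is hypothetical. A stand-in command is used so the sketch runs anywhere.

```python
import subprocess
import sys
import time


def run_job(command):
    """Run one job launch and return (exit_code, captured_stdout)."""
    completed = subprocess.run(command, capture_output=True, text=True)
    return completed.returncode, completed.stdout.strip()


def run_on_interval(command, interval_seconds, runs):
    """Launch the job `runs` times, sleeping `interval_seconds` between launches."""
    results = []
    for attempt in range(runs):
        results.append(run_job(command))
        if attempt < runs - 1:
            time.sleep(interval_seconds)
    return results


if __name__ == "__main__":
    # Stand-in command so the sketch is runnable as-is. In practice this would
    # be the launcher of the exported job (hypothetical path:
    # ["./my_job/my_job_run.sh"]), which is an assumption, not a confirmed layout.
    demo = [sys.executable, "-c", "print('job finished')"]
    for code, output in run_on_interval(demo, interval_seconds=1, runs=2):
        print(code, output)
```

In a real deployment an operating-system scheduler (such as the Windows scheduler the reviewer mentions) would replace the loop; the sketch only shows that scheduling lives outside the job itself, and that the exit code is what the scheduler has to monitor.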

Pricing and Cost Advice
  IBM InfoSphere DataStage:
  • "High-cost of ownership: They could take a page from open source software."
  • "Pricing varies based on use, and it is not as costly as some competing enterprise solutions."
  • "Small and medium-sized companies cannot afford to pay for this solution."
  • "The cost is too high."
  • "It's very expensive."
  • "Our internal team takes care of group licensing and cost. We don't have individual licenses. We have group licensing at the company level. Usually, IBM doesn't charge anything separately on the licensing side. For storage and everything else, we are paying around $6,000 per month, which is not very high. It includes Linux data storage, execution, and licensing. They're charging $40 for one-hour execution. Based on that, we are spending around $2,000 on the production environment and $1,000 on the lower environment for testing and development-side executions. For the mainframe, we are using the Db2 mainframe database, and we are spending around $1,000 on the Db2 mainframe database as well. All this comes out to be around $6,000. We, however, would like to have some cost reduction."
  • "The price is expensive but there are no licensing fees."
  • "It is quite expensive."

  Talend Data integration:
  • "I have been using the open-source version."
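The figures in the longest IBM pricing quote can be checked with a few lines of arithmetic. This is a rough sketch: it assumes the production and lower-environment amounts are made up entirely of execution hours at the quoted $40 rate, which the reviewer implies but does not state outright. All amounts are the reviewer's approximations.

```python
# Monthly figures in USD, taken from the reviewer's quote (all approximate).
RATE_PER_HOUR = 40
PRODUCTION_SPEND = 2_000
LOWER_ENV_SPEND = 1_000
DB2_SPEND = 1_000
TOTAL_MONTHLY = 6_000


def implied_hours(spend, rate=RATE_PER_HOUR):
    """Execution hours implied by a spend, assuming it is all hourly execution."""
    return spend / rate


itemized = PRODUCTION_SPEND + LOWER_ENV_SPEND + DB2_SPEND
unitemized = TOTAL_MONTHLY - itemized  # portion the quote does not break down

if __name__ == "__main__":
    print("Production hours per month:", implied_hours(PRODUCTION_SPEND))
    print("Lower-environment hours per month:", implied_hours(LOWER_ENV_SPEND))
    print("Itemized spend:", itemized)
    print("Not itemized in the quote:", unitemized)
```

Under that assumption the quote implies roughly 50 production hours and 25 test and development hours per month. The three itemized amounts add up to about $4,000, so around $2,000 of the stated $6,000 is not broken down in the review.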

    Questions from the Community
    Top Answer: My company currently uses the free version of the product, and we are definitely switching to a paid one. We needed a tool that can help us not only integrate our data but use it effectively. For the…
    Top Answer: I think the tool may cause some difficulties if you have not used other data integration solutions before. I have worked at companies that used different tools for data integration, and they work…
    Top Answer: IBM Cloud Paks makes a big difference in your data integration. My company has been using it alongside IBM InfoSphere DataStage and while the main product is good on its own, this one truly expands…
    Top Answer: The product's integration with PostgreSQL and Jira has been helpful for us. Its performance is good. However, we do not use it for large data sets.
    Top Answer: Due to using the open-source version of Talend Data Integration, which lacks a scheduler, our current approach involves developing jobs in Talend, exporting them as Java packages, and utilizing an…
    Ranking
    IBM InfoSphere DataStage
    Ranking: 7th out of 101 in Data Integration
    Views: 10,952
    Comparisons: 9,105
    Reviews: 16
    Average Words per Review: 467
    Rating: 7.9
    Talend Data integration
    Ranking: 23rd
    Views: 267
    Comparisons: 198
    Reviews: 2
    Average Words per Review: 316
    Rating: 8.0
    Also Known As
    Talend Data integration: Talend Cloud Integration, Talend Integration Cloud, Talend Cloud Remote Engine for AWS
    Overview

    IBM InfoSphere DataStage is a high-quality data integration tool that aims to design, develop, and run jobs that move and transform data for organizations of different sizes. The product works by integrating data across multiple systems through a high-performance parallel framework. It supports extended metadata management, enterprise connectivity, and integration of all types of data.

    The solution is the data integration component of IBM InfoSphere Information Server, providing a graphical framework for moving data from source systems to target systems. IBM InfoSphere DataStage can deliver data to data warehouses, data marts, operational data sources, and other enterprise applications. The tool supports two integration patterns: extract, transform, and load (ETL), in which data is transformed before it reaches the target, and extract, load, and transform (ELT), in which raw data is loaded first and transformed inside the target. The platform scales through parallel processing and enterprise connectivity.
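The difference between the two patterns named above can be illustrated with a small, tool-independent sketch. It does not use DataStage or any DataStage API; an in-memory SQLite database stands in for the target system, and the table names and sample rows are hypothetical.

```python
import sqlite3

# Hypothetical source rows: (customer, amount as text, currency).
SOURCE_ROWS = [("acme", "10.50", "usd"), ("globex", "7.25", "usd"), ("acme", "4.50", "usd")]

TOTALS_QUERY = "SELECT customer, SUM(amount) FROM {table} GROUP BY customer ORDER BY customer"


def run_etl(conn, rows):
    """ETL: transform in the integration layer, then load the finished rows."""
    transformed = [(name.upper(), float(amount)) for name, amount, _ in rows]
    conn.execute("CREATE TABLE sales_etl (customer TEXT, amount REAL)")
    conn.executemany("INSERT INTO sales_etl VALUES (?, ?)", transformed)
    return conn.execute(TOTALS_QUERY.format(table="sales_etl")).fetchall()


def run_elt(conn, rows):
    """ELT: load raw rows first, then transform inside the target with SQL."""
    conn.execute("CREATE TABLE raw_sales (customer TEXT, amount TEXT, currency TEXT)")
    conn.executemany("INSERT INTO raw_sales VALUES (?, ?, ?)", rows)
    conn.execute(
        "CREATE TABLE sales_elt AS "
        "SELECT UPPER(customer) AS customer, CAST(amount AS REAL) AS amount FROM raw_sales"
    )
    return conn.execute(TOTALS_QUERY.format(table="sales_elt")).fetchall()


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    print(run_etl(conn, SOURCE_ROWS))
    print(run_elt(conn, SOURCE_ROWS))
```

Both functions produce the same totals. What differs is where the transformation runs: in the integration layer for ETL, or in the target database for ELT, which also keeps the raw rows available there.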

    The solution has various versions, catering to different types of companies, which include the Server Edition, the Enterprise Edition, and the MVS Edition. Depending on which version a company has bought, different goals can be achieved. They include the following:

    • Designing data flows to extract information from multiple sources, transform the data, and deliver it to target databases or applications.

    • Delivery of relevant and accurate data through direct connections to enterprise applications.

    • Reduction of development time and improvement of consistency through prebuilt functions.

    • Utilization of InfoSphere Information Server tools for accelerating the project delivery cycle.

    IBM InfoSphere DataStage can be deployed in various ways, including:

    • As a service: The tool can be accessed through a subscription model, where its capabilities are a part of IBM DataStage on IBM Cloud Pak for Data as a Service. This option offers full management on IBM Cloud.

    • On premises or in any cloud: The two editions - IBM DataStage Enterprise and IBM DataStage Enterprise Plus - can run workloads on premises or in any cloud when added to IBM DataStage on IBM Cloud Pak for Data as a Service.

    • On premises: The basic jobs of the tool can be run on premises using IBM DataStage.

    IBM InfoSphere DataStage Features

    The tool has various features through which users can integrate and utilize their data effectively. The components of IBM InfoSphere DataStage include:

    • AI services: The tool offers services such as data science, event messaging, data warehousing, and data virtualization. It accelerates processes through artificial intelligence (AI) and offers a connection with IBM Cloud Paks, the solution's cloud-native insight platform.

    • Parallel engine: Through this feature, ETL performance can be optimized to process data at scale. This is achieved through a parallel engine and load balancing, which together maximize throughput.

    • Metadata support: This feature of the product uses the IBM Watson Knowledge Catalog to protect companies' sensitive data and monitor who can access it and at what levels.

    • Automated delivery pipelines: IBM InfoSphere DataStage reduces costs by automating continuous integration and delivery of pipelines.

    • Prebuilt connectors: The feature for prebuilt connectivity and stages allows users to move data between multiple cloud sources and data warehouses, including IBM native products.

    • IBM DataStage Flow Designer: This feature offers assistance through machine learning design. The product offers its clients a user-friendly interface which facilitates the work process.

    • IBM InfoSphere QualityStage: The tool provides a feature that automatically resolves data quality issues and increases the reliability of the delivered data.

    • Automated failure detection: Through this feature, companies can reduce infrastructure management efforts, relying on the automated detection that the tool offers.

    • Distributed data processing: Cloud runtimes can be executed remotely through this feature, which helps maintain data sovereignty and decrease costs.
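The parallel engine entry in the list above, together with the reviewer remark about "parallelism and pipelining mechanisms", refers to two general techniques that can be sketched independently of the product. The code below is a conceptual illustration only and says nothing about how DataStage implements its engine; the function names and the doubling transform are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor


def extract(rows):
    """Stage 1: yield rows one at a time so downstream stages can start early."""
    for row in rows:
        yield row


def transform(rows):
    """Stage 2: consume rows as they arrive (pipelining) and emit new values."""
    for row in rows:
        yield row * 2


def run_partition(partition):
    """Run the extract -> transform pipeline over one partition of the data."""
    return list(transform(extract(partition)))


def partition_rows(rows, n):
    """Round-robin split of rows into n partitions."""
    return [rows[i::n] for i in range(n)]


def run_parallel(rows, workers=4):
    """Process each partition on its own worker, then merge the results."""
    partitions = partition_rows(rows, workers)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        results = list(pool.map(run_partition, partitions))
    return sorted(value for part in results for value in part)


if __name__ == "__main__":
    print(run_parallel(list(range(10)), workers=3))
```

Pipelining here means each row flows through both stages without waiting for the whole input to be extracted. Partitioning means several such pipelines run side by side on slices of the data. Threads are used only to keep the sketch short; in CPython they do not speed up CPU-bound work, so a real engine would use separate processes or nodes.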

    IBM InfoSphere DataStage Benefits

    This solution offers many benefits for the companies that utilize it for data integration. Some of these benefits include:

    • Increased speed of workload execution due to better balancing and a parallel engine.

    • Reduction of data movement costs through integrations and seamless design of jobs.

    • Modernization of data integration by extending the capabilities of companies' data.

    • Delivery of reliable data through IBM Cloud Pak for Data.

    • Utilization of a drag-and-drop interface which assists in the delivery of data without the need for code.

    • Effective data manipulation, which allows data to be merged before being mapped and transformed.

    • Easier user access to data through visual maps of the process and the delivered data.

    Reviews from Real Users

    A data/solution architect at a computer software company says the product is robust, easy to use, has a simple error logging mechanism, and works very well for huge volumes of data.

    Tirthankar Roy Chowdhury, team leader at Tata Consultancy Services, feels the tool is user-friendly with a lot of functionalities, and doesn't require much coding because of its drag-and-drop features.

    Talend Data Integration is a solution that enables integration with various applications and databases, including Salesforce, SAP, and Oracle, to build business logic and provide reporting. It offers Talend Management Console (TMC) for creating sub-admins, which is useful for organizations with multiple applications using Talend, as well as profiling and project creation. The cloud solution also provides automated upgrades, while on-premises solutions require manual upgrades.

    Talend Cloud Integration has helped organizations by allowing teams to use the Remote Engine concept on AWS, spin up their own job servers, process their own data in the cloud, and seamlessly integrate cloud-to-cloud environments.

    Sample Customers
    Dubai Statistics Center, Etisalat Egypt
    ACCOR, ADR, L'OREAL, AstraZeneca
    Top Industries
    REVIEWERS
    Computer Software Company: 50%
    Insurance Company: 14%
    Transportation Company: 7%
    Healthcare Company: 7%
    VISITORS READING REVIEWS
    Financial Services Firm: 26%
    Manufacturing Company: 11%
    Computer Software Company: 10%
    Insurance Company: 7%
    VISITORS READING REVIEWS
    Computer Software Company: 20%
    Financial Services Firm: 10%
    Manufacturing Company: 9%
    Retailer: 7%
    Company Size
    REVIEWERS
    Small Business: 45%
    Midsize Enterprise: 6%
    Large Enterprise: 49%
    VISITORS READING REVIEWS
    Small Business: 16%
    Midsize Enterprise: 9%
    Large Enterprise: 74%
    VISITORS READING REVIEWS
    Small Business: 20%
    Midsize Enterprise: 14%
    Large Enterprise: 67%

    IBM InfoSphere DataStage is ranked 7th in Data Integration with 37 reviews, while Talend Data integration is ranked 23rd in Cloud Data Integration with 4 reviews. IBM InfoSphere DataStage is rated 7.8, while Talend Data integration is rated 8.0. The top reviewer of IBM InfoSphere DataStage writes "User-friendly with a lot of functions for transmission rules, but has slow performance and not suitable for a huge volume of data". On the other hand, the top reviewer of Talend Data integration writes "Very affordable and on par with much more expensive solutions". IBM InfoSphere DataStage is most compared with IBM Cloud Pak for Data, SSIS, Azure Data Factory, Talend Open Studio, and Informatica PowerCenter, whereas Talend Data integration is most compared with Talend Open Studio, SAP Cloud Platform, Oracle Data Integrator (ODI), AWS Glue, and Microsoft Azure Logic Apps.


    We monitor all Cloud Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.