IBM InfoSphere DataStage vs Matillion ETL comparison

Cancel
You must select at least 2 products to compare!
Comparison Buyer's Guide
Executive Summary

We performed a comparison between IBM InfoSphere DataStage and Matillion ETL based on real PeerSpot user reviews.

Find out in this report how the two Cloud Data Integration solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI.
To learn more, read our detailed IBM InfoSphere DataStage vs. Matillion ETL Report (Updated: March 2024).
765,386 professionals have used our research since 2012.
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"The product is easy to deploy.""We are mostly using transmission rules. It has a lot of functions and logic related to transmission. It is a user-friendly tool with in-built functions.""The Hierarchical Data Stage is good.""The ETL tools are probably the most valuable feature. It has an IBM tool, a friendly UI and it makes things more comfortable.""Offers great flexibility.""Once you have Infosphere up and running properly, it is stable.""We can view what we want to do. We can transform data and put them on tables.""The solution has improved the time it takes to perform tasks related to batch applications."

More IBM InfoSphere DataStage Pros →

"The most valuable feature of Matillion ETL is its user-friendly graphical interface.""The tool's middle-dimensional structure significantly simplifies obtaining the right data at the appropriate level. This feature makes deploying our applications easier since we utilize a single source without publishing data from various sources.""It is an incredibly user-friendly and intuitive tool, making the learning curve quite smooth""It can scale to a great extent. It can handle the load that we are putting on it, which is about 5TBs.""It has good integrations with Amazon Redshift and other AWS services.""It has improved the costs of managing my customer’s data.""Matillion ETL is one hundred percent stable.""The most valuable feature of Matillion ETL is the UI experience in which you can drag and drop most of the transformation."

More Matillion ETL Pros →

Cons
"The initial setup can be complex.""So, there are some features that are missing. If I compare DataStage to Talend, Talend allows you to write custom code in Java or use these tools in your applications as well if you are building a job application. But in DataStage, it does not allow you to write custom code for any component.""I want the tool to continue with the on-prem version, not the cloud one.""There are three things that could improve - the cloud, monitoring and cloud integration. It's a solid product but not a modern one and of course it depends what you're looking for.""The solution can be a bit more user-friendly, similar to Informatica.""What needs improvement in IBM InfoSphere DataStage is its pricing. The pricing for the solution is higher than its competitors, so a lot of the clients my company has worked with prefer other tools over IBM InfoSphere DataStage because of the high price tag. Another area for improvement in the solution stems from a lot of new types of databases, for example, databases in the cloud and big data have become available, and IBM InfoSphere DataStage is working on various connectors for different data sources, but that still isn't up-to-date, meaning that some connectors are missing for modern data sources. The latest version of IBM InfoSphere DataStage also has a complex architecture, so my team faced frequent outages and that should be improved as well.""We would be happy to see in next versions the ability to return several parameters from jobs. Now, jobs can return just one parameter. If they could return several parameters, that would be great.""The response time from support is slow and needs to be improved."

More IBM InfoSphere DataStage Cons →

"The tool's lineage is very weak.""Sometimes, we have issues with the solution's stability and need to restart it for three weeks or more.""The product must enhance its near-real-time data capture feature.""The cost of the solution is high and could be reduced.""I found some of the more complex aspects of ETL challenging, but I grasped the concepts fairly quickly.""Going forward, I would like them to add custom jobs, since we still have to run these outside of Matillion.""Performance can be improved for efficiency, and it can be made faster.""In the next release, we would like to have connections to more databases."

More Matillion ETL Cons →

Pricing and Cost Advice
  • "High-cost of ownership: They could take a page from open source software."
  • "Pricing varies based on use, and it is not as costly as some competing enterprise solutions."
  • "Small and medium-sized companies cannot afford to pay for this solution."
  • "The cost is too high."
  • "It's very expensive."
  • "Our internal team takes care of group licensing and cost. We don't have individual licenses. We have group licensing at the company level. Usually, IBM doesn't charge anything separately on the licensing side. For storage and everything else, we are paying around $6,000 per month, which is not very high. It includes Linux data storage, execution, and licensing. They're charging $40 for one-hour execution. Based on that, we are spending around $2,000 on the production environment and $1,000 on the lower environment for testing and development-side executions. For the mainframe, we are using the Db2 mainframe database, and we are spending around $1,000 on the Db2 mainframe database as well. All this comes out to be around $6,000. We, however, would like to have some cost reduction."
  • "The price is expensive but there are no licensing fees."
  • "It is quite expensive."
  • More IBM InfoSphere DataStage Pricing and Cost Advice →

  • "I have heard from my manager and other higher ups, "This product is cheaper than other things on the market," and they have done the research."
  • "It is cost-effective. Based on our use case, it's efficient and cheap. It saves a lot of money and our upfront costs are less."
  • "The prices needs to be lower."
  • "It was very easy to purchase through the AWS Marketplace, but it was also expensive."
  • "Purchasing it through the AWS Marketplace is pretty convenient. There is a little bit of back and forth in terms of the licensing based on the machine size, but it seems to have worked out well. it is convenient to have it all as part of our AWS billing."
  • "It is not necessarily a cheap solution. However, it's reasonable priced, especially with the smaller machines that we run it on."
  • "The AWS pricing and licensing are a cost-effective solution for data integration needs."
  • "It was procured through the AWS Marketplace because it keeps things simple. They offer retail-like checkout and bill through your existing Amazon Web Services account."
  • More Matillion ETL Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which Cloud Data Integration solutions are best for your needs.
    765,386 professionals have used our research since 2012.
    Questions from the Community
    Top Answer: My company currently uses the free version of the product, and we are definitely switching to a paid one. We needed a tool that can help us not only integrate our data but use it effectively. For the… more »
    Top Answer: I think the tool may cause some difficulties if you have not used other data integration solutions before. I have worked at companies that used different tools for data integration, and they work… more »
    Top Answer:IBM Cloud Paks makes a big difference in your data integration. My company has been using it alongside IBM InfoSphere DataStage and while the main product is good on its own, this one truly expands… more »
    Top Answer:It is an incredibly user-friendly and intuitive tool, making the learning curve quite smooth
    Top Answer:We pay $5.40 per EC2 running hour, and we can reduce costs by stopping and starting the EC2 instances strategically. For instance, in our production environment, we run it for sixteen hours a day… more »
    Top Answer:There's room for improvement in how it handles data streaming capabilities. Our main challenge currently is that Matillion runs on an EC2 instance, limiting us to running only two processes… more »
    Ranking
    7th
    out of 94 in Data Integration
    Views
    11,053
    Comparisons
    9,113
    Reviews
    14
    Average Words per Review
    439
    Rating
    7.9
    4th
    Views
    3,495
    Comparisons
    2,369
    Reviews
    11
    Average Words per Review
    701
    Rating
    8.6
    Comparisons
    Also Known As
    Matillion ETL for Redshift, Matillion ETL for Snowflake, Matillion ETL for BigQuery
    Learn More
    Overview

    IBM InfoSphere DataStage is a high-quality data integration tool that aims to design, develop, and run jobs that move and transform data for organizations of different sizes. The product works by integrating data across multiple systems through a high-performance parallel framework. It supports extended metadata management, enterprise connectivity, and integration of all types of data.

    The solution is the data integration component of IBM InfoSphere Information Server, providing a graphical framework for moving data from source systems to target systems. IBM InfoSphere DataStage can deliver data to data warehouses, data marts, operational data sources, and other enterprise applications. The tool works with various types of patterns - extract, transform and load (ETL), and extract, load, and transform (ELT). The scalability of the platform is achieved by using parallel processing and enterprise connectivity.

    The solution has various versions, catering to different types of companies, which include the Server Edition, the Enterprise Edition, and the MVS Edition. Depending on which version a company has bought, different goals can be achieved. They include the following:

    • Designing data flows to extract information from multiple sources, transform the data, and deliver it to target databases or applications.

    • Delivery of relevant and accurate data through direct connections to enterprise applications.

    • Reduction of development time and improvement of consistency through prebuilt functions.

    • Utilization of InfoSphere Information Server tools for accelerating the project delivery cycle.

    IBM InfoSphere DataStage can be deployed in various ways, including:

    • As a service: The tool can be accessed from a subscription model, where its capabilities are a part of IBM DataStage on IBM Cloud Park for Data as a Service. This option offers full management on IBM Cloud.

    • On premises or in any cloud: The two editions - IBM DataStage Enterprise and IBM DataStage Enterprise Plus - can run workloads on premises or in any cloud when added to IBM DataStage on IBM Cloud Pak for Data as a Service.

    • On premises: The basic jobs of the tool can be run on premises using IBM DataStage.

    IBM InfoSphere DataStage Features

    The tool has various features through which users can integrate and utilize their data effectively. The components of IBM InfoSphere DataStage include:

    • AI services: The tool offers services such as data science, event messaging, data warehousing, and data virtualization. It accelerates processes through artificial intelligence (AI) and offers a connection with IBM Cloud Paks - the cloud-native insight platform of the solution.

    • Parallel engine: Through this feature, ETL performance can be optimized to process data at scale. This is achieved through parallel engine and load balancing, which maximizes throughput.

    • Metadata support: This feature of the product uses the IBM Watson Knowledge Catalog to protect companies' sensitive data and monitor who can access it and at what levels.

    • Automated delivery pipelines: IBM InfoSphere DataStage reduces costs by automating continuous integration and delivery of pipelines.

    • Prebuilt connectors: The feature for prebuilt connectivity and stages allows users to move data between multiple cloud sources and data warehouses, including IBM native products.

    • IBM DataStage Flow Designer: This feature offers assistance through machine learning design. The product offers its clients a user-friendly interface which facilitates the work process.

    • IBM InfoSphere QualityStage: The tool provides a feature that automatically resolves data quality issues and increases the reliability of the delivered data.

    • Automated failure detection: Through this feature, companies can reduce infrastructure management efforts, relying on the automated detection that the tool offers.

    • Distributed data processing: Cloud runtimes can be executed remotely through this feature while maintaining its sovereignty and decreasing costs.

    IBM InfoSphere DataStage Benefits

    This solution offers many benefits for the companies that utilize it for data integration. Some of these benefits include:

    • Increased speed of workload execution due to better balancing and a parallel engine.

    • Reduction of data movement costs through integrations and seamless design of jobs.

    • Modernization of data integration by extending the capabilities of companies' data.

    • Delivery of reliable data through IBM Cloud Pak for Data.

    • Utilization of a drag-and-drop interface which assists in the delivery of data without the need for code.

    • Effective data manipulation allows data to be merged before being mapped and transformed.

    • Creating easier access of users to their data by providing visual maps of the process and the delivered data.

    Reviews from Real Users

    A data/solution architect at a computer software company says the product is robust, easy to use, has a simple error logging mechanism, and works very well for huge volumes of data.

    Tirthankar Roy Chowdhury, team leader at Tata Consultancy Services, feels the tool is user-friendly with a lot of functionalities, and doesn't require much coding because of its drag-and-drop features.

    Matillion ETL is a powerful tool for extracting, transforming, and loading large amounts of data from various sources into cloud data warehouses like Snowflake. Its ability to load data dynamically and efficiently using metadata is a standout feature, as is its open-source ETL with good performance and high efficiency. 

    The solution has a graphical interface for jobs, is easily adjustable and extensible, and allows for scheduling and error reporting. Matillion ETL has helped organizations move to a cloud-based solution, bridge the gap between on-premises and on-cloud, and perform complex migration projects.

    Sample Customers
    Dubai Statistics Center, Etisalat Egypt
    Thrive Market, MarketBot, PWC, Axtria, Field Nation, GE, Superdry, Quantcast, Lightbox, EDF Energy, Finn Air, IPRO, Twist, Penn National Gaming Inc
    Top Industries
    REVIEWERS
    Computer Software Company54%
    Transportation Company8%
    Healthcare Company8%
    Financial Services Firm8%
    VISITORS READING REVIEWS
    Financial Services Firm26%
    Manufacturing Company11%
    Computer Software Company11%
    Insurance Company8%
    REVIEWERS
    Manufacturing Company36%
    Financial Services Firm36%
    Healthcare Company9%
    Computer Software Company9%
    VISITORS READING REVIEWS
    Computer Software Company16%
    Financial Services Firm14%
    Government8%
    Manufacturing Company8%
    Company Size
    REVIEWERS
    Small Business46%
    Midsize Enterprise7%
    Large Enterprise48%
    VISITORS READING REVIEWS
    Small Business16%
    Midsize Enterprise9%
    Large Enterprise74%
    REVIEWERS
    Small Business23%
    Midsize Enterprise36%
    Large Enterprise41%
    VISITORS READING REVIEWS
    Small Business18%
    Midsize Enterprise13%
    Large Enterprise68%
    Buyer's Guide
    IBM InfoSphere DataStage vs. Matillion ETL
    March 2024
    Find out what your peers are saying about IBM InfoSphere DataStage vs. Matillion ETL and other solutions. Updated: March 2024.
    765,386 professionals have used our research since 2012.

    IBM InfoSphere DataStage is ranked 7th in Data Integration with 36 reviews while Matillion ETL is ranked 4th in Cloud Data Integration with 22 reviews. IBM InfoSphere DataStage is rated 7.8, while Matillion ETL is rated 8.6. The top reviewer of IBM InfoSphere DataStage writes "User-friendly with a lot of functions for transmission rules, but has slow performance and not suitable for a huge volume of data". On the other hand, the top reviewer of Matillion ETL writes "Efficient data integration and transformation with seamless cloud-native integration". IBM InfoSphere DataStage is most compared with IBM Cloud Pak for Data, SSIS, Azure Data Factory, Talend Open Studio and Alteryx Designer, whereas Matillion ETL is most compared with Azure Data Factory, Snowflake, AWS Glue, Informatica PowerCenter and Oracle Data Integrator (ODI). See our IBM InfoSphere DataStage vs. Matillion ETL report.

    See our list of best Cloud Data Integration vendors.

    We monitor all Cloud Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.