Coming October 25: PeerSpot Awards will be announced! Learn more

IBM InfoSphere DataStage vs SSIS comparison

Cancel
You must select at least 2 products to compare!
StreamSets Logo
5,397 views|3,634 comparisons
IBM Logo
14,883 views|12,339 comparisons
Microsoft Logo
Read 21 SSIS reviews
26,385 views|20,452 comparisons
Comparison Buyer's Guide
Executive Summary
Updated on September 7, 2022

We performed a comparison between IBM InfoSphere DataStage and SSIS based on our users’ reviews in five categories. After reading all of the collected data, you can find our conclusion below.

  • Ease of Deployment: IBM InfoSphere DataStage users feel its setup is moderately complex to complex. SSIS users feel its setup is straightforward and easy.
  • Features: Users of both products are generally happy with their stability and scalability, with a few mentioning that IBM InfoSphere DataStage could be more stable.

    IBM InfoSphere DataStage users note being particularly satisfied with its powerful data integration for data warehousing. Some note that it could improve its API documentation and have better integration with cloud technology.

    SSIS users say they’re impressed with the solution's user-friendly UI. Some users mention that the product’s integration with Power BI and Azure Data Factory could be improved.
  • Pricing: IBM InfoSphere DataStage users mostly note dissatisfaction with its pricing. SSIS users mostly say that it is affordable and fairly priced, but some mention annoyance with its licensing model.
  • ROI: IBM InfoSphere DataStage users do not mention ROI. SSIS users report seeing an ROI.
  • Service and Support: IBM InfoSphere DataStage users report below average to average support, many noting that support is slow in response. SSIS users mostly report excellent, 24 hour support.

Comparison Results: SSIS beat out IBM InfoSphere DataStage in all categories we compared.

To learn more, read our detailed IBM InfoSphere DataStage vs. SSIS report (Updated: August 2022).
633,952 professionals have used our research since 2012.
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"It is really easy to set up and the interface is easy to use.""In StreamSets, everything is in one place.""StreamSets’ data drift resilience has reduced the time it takes us to fix data drift breakages. For example, in our previous Hadoop scenario, when we were creating the Sqoop-based processes to move data from source to destinations, we were getting the job done. That took approximately an hour to an hour and a half when we did it with Hadoop. However, with the StreamSets, since it works on a data collector-based mechanism, it completes the same process in 15 minutes of time. Therefore, it has saved us around 45 minutes per data pipeline or table that we migrate. Thus, it reduced the data transfer, including the drift part, by 45 minutes.""It is a very powerful, modern data analytics solution, in which you can integrate a large volume of data from different sources. It integrates all of the data and you can design, create, and monitor pipelines according to your requirements. It is an all-in-one day data ops solution.""StreamSets data drift feature gives us an alert upfront so we know that the data can be ingested. Whatever the schema or data type changes, it lands automatically into the data lake without any intervention from us, but then that information is crucial to fix for downstream pipelines, which process the data into models, like Tableau and Power BI models. This is actually very useful for us. We are already seeing benefits. Our pipelines used to break when there were data drift changes, then we needed to spend about a week fixing it. Right now, we are saving one to two weeks. Though, it depends on the complexity of the pipeline, we are definitely seeing a lot of time being saved.""I have used Data Collector, Transformer, and Control Hub products from StreamSets. What I really like about these products is that they're very user-friendly. People who are not from a technological or core development background find it easy to get started and build data pipelines and connect to the databases. They would be comfortable like any technical person within a couple of weeks."

More StreamSets Pros →

"We are mostly using transmission rules. It has a lot of functions and logic related to transmission. It is a user-friendly tool with in-built functions.""As a data integration platform, it is easy to use. It is quite robust and useful for volumetric analysis when you have huge volumes of data. We have tested it for up to ten million rows, and it is robust enough to process ten million rows internally with its parallel processing. Its error logging mechanism is far simpler and easier to understand than other data integration tools. The newer version of InfoSphere has the data catalog and IDC lineage. They are helpful in the easy traceability of columns and tables.""The most valuable feature is the data integration for data warehousing.""It's a robust solution.""The Hierarchical Data Stage is good.""When we have needed help from the IBM team, they were helpful. Our company is a premium partner so we get fast responses.""Offers great flexibility.""The best feature of IBM InfoSphere DataStage for me was that it was very much user-friendly. The solution didn't require that much raw coding because most of its features were drag and drop, plus it had a large number of functionalities."

More IBM InfoSphere DataStage Pros →

"It's a competent product.""The UI is very user-friendly.""SSIS integrates well with SQL servers and Microsoft products.""The most valuable feature of SSIS is that you can take data from other servers which are not MS SQL Server or Oracle.""The script component is very powerful, things that you cannot normally do, is feasible through C#.""The most valuable features of SSIS are that it works with the query language and it can import data from different sources.""The performance is good.""You can get data from any data source with SSIS and dump it to any outside source. It is helpful. Getting, extracting, converting, and dumping data doesn't require much effort because we can do everything in the user interface. You drag and drop, then give the required input. It's intuitive."

More SSIS Pros →

Cons
"If you use JDBC Lookup, for example, it generally takes a long time to process data.""We create pipelines or jobs in StreamSets Control Hub. It is a great feature, but if there is a way to have a folder structure or organize the pipelines and jobs in Control Hub, it would be great. I submitted a ticket for this some time back.""The logging mechanism could be improved. If I am working on a pipeline, then create a job out of it and it is running, it will generate constant logs. So, the logging mechanism could be simplified. Now, it is a bit difficult to understand and filter the logs. It takes some time.""We've seen a couple of cases where it appears to have a memory leak or a similar problem.""Sometimes, when we have large amounts of data that is very efficiently stored in Hadoop or Kafka, it is not very efficient to run it through StreamSets, due to the lack of efficiency or the resources that StreamSets is using.""Currently, we can only use the query to read data from SAP HANA. What we would like to see, as soon as possible, is the ability to read from multiple tables from SAP HANA. That would be a really good thing that we could use immediately. For example, if you have 100 tables in SQL Server or Oracle, then you could just point it to the schema or the 100 tables and ingestion information. However, you can't do that in SAP HANA since StreamSets currently is lacking in this. They do not have a multi-table feature for SAP HANA. Therefore, a multi-table origin for SAP HANA would be helpful."

More StreamSets Cons →

"Currently lacking virtualization ability.""The interface needs improvement.""The initial setup could be more straightforward.""The response time from support is slow and needs to be improved.""It would be useful to provide support for Python, AR, and Java.""Their web interface is good but the on-prem sites are outdated. The solution could also be improved if they could integrate the data pipeline scheduling part of their interface.""What needs improvement in IBM InfoSphere DataStage is its pricing. The pricing for the solution is higher than its competitors, so a lot of the clients my company has worked with prefer other tools over IBM InfoSphere DataStage because of the high price tag. Another area for improvement in the solution stems from a lot of new types of databases, for example, databases in the cloud and big data have become available, and IBM InfoSphere DataStage is working on various connectors for different data sources, but that still isn't up-to-date, meaning that some connectors are missing for modern data sources. The latest version of IBM InfoSphere DataStage also has a complex architecture, so my team faced frequent outages and that should be improved as well.""Its documentation is not up to the mark. While building APIs, we had a lot of problems trying to get around it because it is not very user-friendly. We tried to get hold of API documentation, but the documentation is not very well thought out. It should be more structured and elaborate. In terms of additional features, I would like to see good reporting on performance and performance-tuning recommendations that can be based on AI. I would also like to see better data profiling information being reported on InfoSphere."

More IBM InfoSphere DataStage Cons →

"Options for scaling could be improved.""It's difficult to refactor SSIS. It gets cumbersome to reuse the solution.""SSIS can improve by the minimum code requirements in stored procedures and exporting data is difficult. They could make it easier, it should be as easy as it is to import data.""The creation of the measure in the DAC's model could be improved.""The solution could improve by having quicker release updates.""We purchase an add on called task factory primarily to allow bulk delete, update and upsert capability. I'd like to see this be part of the standard package.""It's a legacy tool, that is nearing the end of its useful life.""Microsoft should offer an on-premises support warranty for those using that deployment. They seem to be withdrawing from on-premises options."

More SSIS Cons →

Pricing and Cost Advice
  • "We are running the community version right now, which can be used free of charge."
  • "StreamSets Data Collector is open source. One can utilize the StreamSets Data Collector, but the Control Hub is the main repository where all the jobs are present. Everything happens in Control Hub."
  • "It has a CPU core-based licensing, which works for us and is quite good."
  • "There are different versions of the product. One is the corporate license version, and the other one is the open-source or free version. I have been using the corporate license version, but they have recently launched a new open-source version so that anybody can create an account and use it. The licensing cost varies from customer to customer. I don't have a lot of input on that. It is taken care of by PMO, and they seem fine with its pricing model. It is being used enterprise-wide. They seem to have got a good deal for StreamSets."
  • "The pricing is good, but not the best. They have some customized plans you can opt for."
  • More StreamSets Pricing and Cost Advice →

  • "It's very expensive."
  • "Our internal team takes care of group licensing and cost. We don't have individual licenses. We have group licensing at the company level. Usually, IBM doesn't charge anything separately on the licensing side. For storage and everything else, we are paying around $6,000 per month, which is not very high. It includes Linux data storage, execution, and licensing. They're charging $40 for one-hour execution. Based on that, we are spending around $2,000 on the production environment and $1,000 on the lower environment for testing and development-side executions. For the mainframe, we are using the Db2 mainframe database, and we are spending around $1,000 on the Db2 mainframe database as well. All this comes out to be around $6,000. We, however, would like to have some cost reduction."
  • "The price is expensive but there are no licensing fees."
  • "It is quite expensive."
  • "It's quite expensive."
  • "I have no information on the exact pricing for IBM InfoSphere DataStage because the solution is usually procured by the clients my company works with, though the pricing is higher compared to other solutions, so many clients choose to go with a different solution rather than IBM InfoSphere DataStage."
  • More IBM InfoSphere DataStage Pricing and Cost Advice →

  • "It would be beneficial if the solution had a less costly cloud offering."
  • "Based on my experience and understanding, Talend comes out to be a little bit expensive as compared to SSIS. The average cost of having Talend with Talend Management Console is around 72K per region, which is much higher than SSIS. SSIS works very well with Microsoft technologies, and if you have Microsoft technologies, it is not really expensive to have SSIS. If you have SQL Server, SSIS is free."
  • "We have an enterprise license for this solution."
  • "It comes bundled with other solutions, which makes it difficult to get the price on the specific product."
  • "All of my clients have this product included as part of their Microsoft license."
  • "SSIS is a cheaper option compared to the cost of other ETL tools."
  • "Our license with SSIS is annual."
  • "t's incredibly cost effective, easy to learn the basics quickly (although like all ETL tools requires the traditional learning curve to get good at) and has an immense user base."
  • More SSIS Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which Data Integration Tools solutions are best for your needs.
    633,952 professionals have used our research since 2012.
    Questions from the Community
    Top Answer:It is really easy to set up and the interface is easy to use.
    Top Answer:We've seen a couple of cases where it appears to have a memory leak or a similar problem. It grows for a bit and then… more »
    Top Answer:We typically use it to transport our Oracle raw datasets up to Microsoft Azure, and then into SQL databases there.
    Top Answer:The best feature of IBM InfoSphere DataStage for me was that it was very much user-friendly. The solution didn't require… more »
    Top Answer:I have no information on the exact pricing for IBM InfoSphere DataStage because the solution is usually procured by the… more »
    Top Answer:What needs improvement in IBM InfoSphere DataStage is its pricing. The pricing for the solution is higher than its… more »
    Top Answer:SSIS PowerPack is a group of drag and drop connectors for Microsoft SQL Server Integration Services, commonly called… more »
    Top Answer:What is the OLAP that you are using? Hosted in Cloud or on-premise?  The target DB should have its tool to extract… more »
    Top Answer:Like most Microsoft products, SSIS is user-friendly and easy to use.
    Comparisons
    Also Known As
    SQL Server Integration Services
    Learn More
    StreamSets
    Video Not Available
    Overview

    StreamSets offers an end-to-end data integration platform to build, run, monitor and manage smart data pipelines that deliver continuous data for DataOps, and power the modern data ecosystem and hybrid integration.

    Only StreamSets provides a single design experience for all design patterns for 10x greater developer productivity; smart data pipelines that are resilient to change for 80% less breakages; and a single pane of glass for managing and monitoring all pipelines across hybrid and cloud architectures to eliminate blind spots and control gaps.

    With StreamSets, you can deliver the continuous data that drives the connected enterprise.

    IBM InfoSphere DataStage integrates data across multiple systems using a high performance parallel framework, and it supports extended metadata management and enterprise connectivity. The scalable platform provides more flexible integration of all types of data, including big data at rest (Hadoop-based) or in motion (stream-based), on distributed and mainframe platforms.

    SQL Server Integration Services (SSIS) is a Microsoft platform designed to build enterprise-level data integration and data transformation solutions. Users now have the ability to solve intricate business queries by downloading and copying files, cleansing and mining data, loading data warehouses, and managing SQL Server objects and data through Integration Services. SSIS is a component of the Microsoft SQL Server data software used for many data migration tasks.

    SSIS loads data into one or more destinations by extracting and transforming data from a wide variety of sources such as XML data files, flat files, and relational data sources.

    Integration Services include:

    • An SSIS catalog database to manage, store, and run packages.
    • An ample set of built-in transformations and tasks.
    • Graphical tools for building packages.

    Create packages programmatically and code custom tasks through programming the extensive Integration Services. With graphical Integration Services, you can create solutions without writing any code.

    Benefits of SSIS

    There are many benefits of SSIS, such as:

    • Solve complex business problems - encrypt files with SSIS and send them to various network locations.

    • Migration of DTS packages to SSIS - users can migrate DTS packages to SSIS while choosing to run DTS packages using DTS runtime or incorporate DTS packages into SSIS.

    • Development of ETL processes - Microsoft’s SSIS packages provide the ability to extract, transform, and load data into data warehouses. This service takes data from various sources, like CSV files, XML files, flat files, and relational data sources and transforms and loads them to their destinations.

    • Data migration from other databases - with Integrations Services, users can transform data to make sure it complies with the rules of the database they are migrating to.

    • Managing and automating SQL server objects - SSIS packages can manage and automate SQL server objects which will help save you time and resources.

    Reviews from Real Users

    The SSIS platform stands out among its competitors for a variety of reasons. Two major ones are its debugging capabilities during data flow execution and its easy connectivity with other Microsoft tools.

    Muhammad J., a senior manager software developer at Techlogix, notes, "The debugging capabilities are great, particularly during data flow execution. You can look into the data and see what's going on in the pipeline."

    Ismail L., a data engineer at a tech service company, writes, "The most valuable thing is that it is easy to connect with Microsoft tools. In Europe, particularly in France, a lot of companies use Excel, SQL Server, and other Microsoft tools, and it is easier to connect SSIS with Microsoft tools than other products."

    Offer
    Learn more about StreamSets
    Learn more about IBM InfoSphere DataStage
    Learn more about SSIS
    Sample Customers
    Availity, BT Group, Humana, Deluxe, GSK, RingCentral, IBM, Shell, SamTrans, State of Ohio, TalentFulfilled, TechBridge
    Dubai Statistics Center, Etisalat Egypt
    PKP Energetyka, UniCredit Bank, Mostar, waldwasser, Ashok leyland, Florida Atlantic University, Stadt Frankfurt am Main
    Top Industries
    VISITORS READING REVIEWS
    Financial Services Firm16%
    Computer Software Company15%
    Insurance Company8%
    Government7%
    REVIEWERS
    Computer Software Company70%
    Aerospace/Defense Firm10%
    Healthcare Company10%
    Financial Services Firm10%
    VISITORS READING REVIEWS
    Financial Services Firm19%
    Computer Software Company17%
    Comms Service Provider11%
    Insurance Company7%
    REVIEWERS
    Financial Services Firm22%
    Healthcare Company8%
    Government8%
    Manufacturing Company8%
    VISITORS READING REVIEWS
    Computer Software Company17%
    Financial Services Firm15%
    Comms Service Provider11%
    Insurance Company7%
    Company Size
    REVIEWERS
    Small Business14%
    Midsize Enterprise29%
    Large Enterprise57%
    VISITORS READING REVIEWS
    Small Business15%
    Midsize Enterprise11%
    Large Enterprise74%
    REVIEWERS
    Small Business45%
    Midsize Enterprise3%
    Large Enterprise52%
    VISITORS READING REVIEWS
    Small Business13%
    Midsize Enterprise11%
    Large Enterprise76%
    REVIEWERS
    Small Business30%
    Midsize Enterprise21%
    Large Enterprise49%
    VISITORS READING REVIEWS
    Small Business17%
    Midsize Enterprise13%
    Large Enterprise69%
    Buyer's Guide
    IBM InfoSphere DataStage vs. SSIS
    August 2022
    Find out what your peers are saying about IBM InfoSphere DataStage vs. SSIS and other solutions. Updated: August 2022.
    633,952 professionals have used our research since 2012.

    IBM InfoSphere DataStage is ranked 7th in Data Integration Tools with 10 reviews while SSIS is ranked 3rd in Data Integration Tools with 21 reviews. IBM InfoSphere DataStage is rated 7.8, while SSIS is rated 7.8. The top reviewer of IBM InfoSphere DataStage writes "Robust, easy to use, has a simple error logging mechanism, and works very well for huge volumes of data". On the other hand, the top reviewer of SSIS writes "SSIS 2016 - The good, the bad, and the ugly". IBM InfoSphere DataStage is most compared with Talend Open Studio, AWS Glue, Informatica PowerCenter, Azure Data Factory and Oracle Data Integrator (ODI), whereas SSIS is most compared with Informatica PowerCenter, Talend Open Studio, Oracle Data Integrator (ODI), AWS Glue and Alteryx Designer. See our IBM InfoSphere DataStage vs. SSIS report.

    See our list of best Data Integration Tools vendors and best Cloud Data Integration vendors.

    We monitor all Data Integration Tools reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.