Try our new research platform with insights from 80,000+ expert users

Pentaho Data Integration and Analytics vs SSIS vs Spring Cloud Data Flow comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Mindshare comparison

As of September 2025, in the Data Integration category, the mindshare of Pentaho Data Integration and Analytics is 1.7%, up from 0.9% compared to the previous year. The mindshare of Spring Cloud Data Flow is 1.2%, up from 0.9% compared to the previous year. The mindshare of SSIS is 5.9%, down from 7.7% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Data Integration Market Share Distribution
ProductMarket Share (%)
SSIS5.9%
Pentaho Data Integration and Analytics1.7%
Spring Cloud Data Flow1.2%
Other91.2%
Data Integration
 

Featured Reviews

Aqeel UR Rehman - PeerSpot reviewer
Transform data efficiently with rich features but there's challenges with large datasets
Currently, I am using Pentaho Data Integration for transforming data and then loading it into different platforms. Sometimes, I use it in conjunction with AWS, particularly S3 and Redshift, to execute the copy command for data processing Pentaho Data Integration is easy to use, especially when…
Alokik Gupta - PeerSpot reviewer
Effective microservice and task management but needs more dashboard features
The dashboards in Spring Cloud Dataflow are quite valuable. By injecting the dependency of Spring Cloud Dataflow into our Spring Boot application and annotating it with 'enable task annotation', we can manage tasks effectively. Additionally, the platform allows us to create pipelines and use microservices like a logical AND gate, giving us greater control over our microservices.
Chris Farris - PeerSpot reviewer
User-friendly interface facilitates efficient data migration and reduces operational overhead
Its usability and ease of use are standout features. It is really easy to use - almost too easy at times. Users don't have to be experts to use their tool set and utilize the clicking mechanisms, drop down boxes, and other features to build a package and run it. The configuration is very easy and the system is very stable. I would rate it at a 10 as it is highly reliable; we have never had any problems with it. Very little maintenance is required on my side. While you need to keep the product up to date, the maintenance requirements are minimal. SSIS integrates with all Microsoft tools seamlessly. We have used it against Oracle, DB2 and SQL. It integrates and can move data around from any data platforms, making it very diverse in that standpoint. It is a very robust tool that can work with many data sources, and its main strength is that it is extremely easy to use.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The fact that it enables us to leverage metadata to automate data pipeline templates and reuse them is definitely one of the features that we like the best. The metadata injection is helpful because it reduces the need to create and maintain additional ETLs. If we didn't have that feature, we would have lots of duplicated ETLs that we would have to create and maintain. The data pipeline templates have definitely been helpful when looking at productivity and costs."
"It has improved our data integration capabilities​."
"Sometimes, it took a whole team about two weeks to get all the data to prepare and present it. After the optimization of the data, it took about one to two hours to do the whole process. Therefore, it has helped a lot when you talk about money, because it doesn't take a whole team to do it, just one person to do one project at a time and run it when you want to run it. So, it has helped a lot on that side."
"This solution allows us to create pipelines using a minimal amount of custom coding."
"I can create faster instructions than writing with SQL or code. Also, I am able to do some background control of the data process with this tool. Therefore, I use it as an ELT tool. I have a station area where I can work with all the information that I have in my production databases, then I can work with the data that I created."
"I find the drag and drop feature in Pentaho Data Integration very useful for integration."
"One of the valuable features is the ability to use PL/SQL statements inside the data transformations and jobs."
"We can schedule job execution in the BA Server, which is the front-end product we're using right now. That scheduling interface is nice."
"There are a lot of options in Spring Cloud. It's flexible in terms of how we can use it. It's a full infrastructure."
"The most valuable features of Spring Cloud Data Flow are the simple programming model, integration, dependency Injection, and ability to do any injection. Additionally, auto-configuration is another important feature because we don't have to configure the database and or set up the boilerplate in the database in every project. The composability is good, we can create small workloads and compose them in any way we like."
"The product is very user-friendly."
"The ease of deployment on Kubernetes, the seamless integration for orchestration of various pipelines, and the visual dashboard that simplifies operations even for non-specialists such as quality analysts."
"The most valuable feature is real-time streaming."
"The best thing I like about Spring Cloud Data Flow is its plug-and-play model."
"The solution's most valuable feature is that it allows us to use different batch data sources, retrieve the data, and then do the data processing, after which we can convert and store it in the target."
"The dashboards in Spring Cloud Dataflow are quite valuable."
"It is easily scheduled and integrates well with SQL Server and SQL Server Agent jobs."
"The solution is easy to use and developer friendly."
"The data reader is the most valuable feature."
"SSIS provides you with lookup and transformation functions, and you have the flexibility to write your own custom code."
"The most valuable feature of SSIS is that you can take data from other servers which are not MS SQL Server or Oracle."
"The interface is very user-friendly."
"The most valuables features are the relatively short learning curve, and the automation capabilities provided through the BIML add-in for SSDT."
"The setup was easy. All Microsoft products are easy to set up."
 

Cons

"Larger data jobs take more time to execute."
"The support for the Enterprise Edition is okay, but what they have done in the last three or four years is move more and more things to that edition. The result is that they are breaking the Community Edition. That's what our impression is."
"I would like to see improvements made for real-time data processing."
"Although it is a low-code solution with a graphical interface, often the error messages that you get are of the type that a developer would be happy with. You get a big stack of red text and Java errors displayed on the screen, and less technical people can get intimidated by that. It can be a bit intimidating to get a wall of red error messages displayed. Other graphical tools that are focused at the power user level provide a much more user-friendly experience in dealing with your exceptions and guiding the user into where they've made the mistake."
"I would like to see more improvements with AS400 DB2."
"A big problem after deploying something that we do in Lumada is with Git. You get a binary file to do a code review. So, if you need to do a review, you have to take pictures of the screen to show each step. That is the biggest bug if you are using Git."
"It could be better integrated with programming languages, like Python and R. Right now, if I want to run a Python code on one of my ETLs, it is a bit difficult to do. It would be great if we have some modules where we could code directly in a Python language. We don't really have a way to run Python code natively."
"As far as I remember, not all connectors worked very well. They can add more connectors and more drivers to the process to integrate with more flows."
"On the tool's online discussion forums, you may get stuck with an issue, making it an area where improvements are required."
"Spring Cloud Data Flow could improve the user interface. We can drag and drop in the application for the configuration and settings, and deploy it right from the UI, without having to run a CI/CD pipeline. However, that does not work with Kubernetes, it only works when we are working with jars as the Spring Cloud Data Flow applications."
"Spring Cloud Data Flow is not an easy-to-use tool, so improvements are required."
"Some of the features, like the monitoring tools, are not very mature and are still evolving."
"The solution's community support could be improved."
"I would improve the dashboard features as they are not very user-friendly."
"The configurations could be better. Some configurations are a little bit time-consuming in terms of trying to understand using the Spring Cloud documentation."
"There were instances of deployment pipelines getting stuck, and the dashboard not always accurately showing the application status, requiring manual intervention such as rerunning applications or refreshing the dashboard."
"SSIS is not stable."
"The solution could improve by having quicker release updates."
"It would be nice if you could run SSIS on other environments besides Windows."
"I would like to see more features in terms of the integration with Azure Data Factory."
"You have to write push down join & lookup SQL to the database yourself via stored procedures or use of the SQL Task to get very high performance. That said, this is a common complaint for nearly all ETL tools on the market and those that offer an alternative such as Informatica offer them at a very expensive add-on price."
"It should have other programming languages supported as well from a scripting perspective. Currently, only C# and VB.NET are supported, which limits it to .NET. It should have Java support as well."
"The interface could use improvement, as well as the administrative tools. Jobs fail from time to time for different reasons. It's not a problem with Microsoft, or SSIS itself. The problems are external, but to find the problems and analyze them it takes too much time."
"Options for scaling could be improved."
 

Pricing and Cost Advice

"I mostly used the open-source version. I didn't work with a license."
"If a company is looking for an ETL solution and wants to integrate it with their tech stack but doesn't want to spend a bunch of money, Pentaho is a good solution"
"The cost of these types of solutions are expensive. So, we really appreciate what we get for our money. Though, we don't think of the solution as a top-of-the-line solution or anything like that."
"When we first started with it, it was much cheaper. It has gone up drastically, especially since Hitachi bought out Pentaho."
"For most development tasks, the Enterprise edition should be sufficient. It depends on the type of support that you require for your production environment."
"I primarily work on the Community Version, which is available to use free of charge."
"I believe the pricing of the solution is more affordable than the competitors"
"The solution reduced our ETL development time by a lot because a whole project used to take about a month to get done previously. After having Lumada, it took just a week. For a big company in Brazil, it saves a team at least $10,000 a month."
"The solution provides value for money, and we are currently using its community edition."
"This is an open-source product that can be used free of charge."
"If you want support from Spring Cloud Data Flow there is a fee. The Spring Framework is open-source and this is a free solution."
"It would be beneficial if the solution had a less costly cloud offering."
"Based on my experience and understanding, Talend comes out to be a little bit expensive as compared to SSIS. The average cost of having Talend with Talend Management Console is around 72K per region, which is much higher than SSIS. SSIS works very well with Microsoft technologies, and if you have Microsoft technologies, it is not really expensive to have SSIS. If you have SQL Server, SSIS is free."
"SSIS is fairly well-priced - I would rate it at four out of five."
"We purchased the standard edition of SQL Server and SSIS came with it free of charge."
"This solution is a free of charge addition to our SQL licence. However, the only way this tool can be utilized is as a feature of the SQL licence, which may make it unattractive to organizations who don't wish to purchase the wider-ranging licence."
"We have an enterprise license for this solution."
"People have to opt for a perpetual-based licensing model."
"It comes bundled with other solutions, which makes it difficult to get the price on the specific product."
report
Use our free recommendation engine to learn which Data Integration solutions are best for your needs.
867,445 professionals have used our research since 2012.
 

Comparison Review

it_user90069 - PeerSpot reviewer
Feb 20, 2014
Informatica PowerCenter vs. Microsoft SSIS - each technology has its advantages but also have similarities
Technology has made it easier for businesses to organize and manipulate data to get a clearer picture of what’s going on with their business. Notably, ETL tools have made managing huge amounts of data significantly easier and faster, boosting many organizations’ business intelligence operations…
 

Top Industries

By visitors reading reviews
Financial Services Firm
18%
Computer Software Company
11%
Government
8%
Manufacturing Company
6%
Financial Services Firm
24%
Computer Software Company
17%
Retailer
7%
Insurance Company
6%
Financial Services Firm
18%
Computer Software Company
10%
Government
8%
Manufacturing Company
6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
By reviewers
Company SizeCount
Small Business17
Midsize Enterprise16
Large Enterprise25
By reviewers
Company SizeCount
Small Business3
Midsize Enterprise1
Large Enterprise5
By reviewers
Company SizeCount
Small Business26
Midsize Enterprise19
Large Enterprise57
 

Questions from the Community

Which ETL tool would you recommend to populate data from OLTP to OLAP?
Hi Rajneesh, yes here is the feature comparison between the community and enterprise edition : https://www.hitachivan...
What do you think can be improved with Hitachi Lumada Data Integrations?
In my opinion, the reporting side of this tool needs serious improvements. In my previous company, we worked with Hi...
What do you use Hitachi Lumada Data Integrations for most frequently?
My company has used this product to transform data from databases, CSV files, and flat files. It really does a good j...
What needs improvement with Spring Cloud Data Flow?
There were instances of deployment pipelines getting stuck, and the dashboard not always accurately showing the appli...
What is your primary use case for Spring Cloud Data Flow?
We had a project for content management, which involved multiple applications each handling content ingestion, transf...
What advice do you have for others considering Spring Cloud Data Flow?
I would definitely recommend Spring Cloud Data Flow. It requires minimal additional effort or time to understand how ...
Which is better - SSIS or Informatica PowerCenter?
SSIS PowerPack is a group of drag and drop connectors for Microsoft SQL Server Integration Services, commonly called ...
What do you like most about SSIS?
The product's deployment phase is easy.
What is your experience regarding pricing and costs for SSIS?
Utilizing SSIS involves no extra charges beyond the SQL Server license. It's an economical choice for my clients.
 

Also Known As

Hitachi Lumada Data Integration, Kettle, Pentaho Data Integration
No data available
SQL Server Integration Services
 

Overview

 

Sample Customers

66Controls, Providential Revenue Agency of Ro Negro, NOAA Information Systems, Swiss Real Estate Institute
Information Not Available
1. Amazon.com 2. Bank of America 3. Capital One 4. Coca-Cola 5. Dell 6. E*TRADE 7. FedEx 8. Ford Motor Company 9. Google 10. Home Depot 11. IBM 12. Intel 13. JPMorgan Chase 14. Kraft Foods 15. Lockheed Martin 16. McDonald's 17. Microsoft 18. Morgan Stanley 19. Nike 20. Oracle 21. PepsiCo 22. Procter & Gamble 23. Prudential Financial 24. RBC Capital Markets 25. SAP 26. Siemens 27. Sony 28. Toyota 29. UnitedHealth Group 30. Visa 31. Walmart 32. Wells Fargo
Find out what your peers are saying about Microsoft, Informatica, Talend and others in Data Integration. Updated: August 2025.
867,445 professionals have used our research since 2012.