
Pentaho Data Integration and Analytics vs SSIS vs Talend Open Studio comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Mindshare comparison

As of July 2025, in the Data Integration category, Pentaho Data Integration and Analytics holds a 1.8% mindshare, up from 0.8% the previous year. SSIS holds 7.4%, down from 7.7%, and Talend Open Studio holds 4.4%, down from 5.1%. Mindshare is calculated from PeerSpot user engagement data.
 

Featured Reviews

Aqeel UR Rehman - PeerSpot reviewer
Transform data efficiently with rich features, but there are challenges with large datasets
Currently, I am using Pentaho Data Integration for transforming data and then loading it into different platforms. Sometimes, I use it in conjunction with AWS, particularly S3 and Redshift, to execute the copy command for data processing. Pentaho Data Integration is easy to use, especially when…
Sean Achim - PeerSpot reviewer
Building impactful organizational KPIs with ease and precision
Stability is rated at 10. One other important aspect I appreciate is that SSAS is included in the base installation of SQL Server. Obviously, it requires installation, but it is readily available, which is a major strength. It's all about setting it up, configuring it, and then using it. If there were additional costs associated with it, or if it were separated out as a second product, that would be a disadvantage. The area for improvement is really education. Microsoft is trying to push everything as a Power BI solution, or trying to get people to solve the problems that SSAS solves in another space, such as Power BI or Power Pivot, and that is not enough. There's not enough marketing, conversation, and support around that space. As a result, we end up with people not understanding that you need to build your models correctly, and then they try to model everything inside of Power BI, or another visualization tool, without first building the data model. That leads people to consider alternate solutions because SAP and others argue that their whole thing is in memory, and they disseminate misleading information. Additionally, what would be very helpful is local user group developments, so getting people around the table and teaching them how to use it. That is the biggest problem; it's not the technology itself. The challenge lies in Microsoft withdrawing a lot of the qualifications and watering down its emphasis, leading to a perception that this is supposed to be an elite product.
Costin Marzea - PeerSpot reviewer
Allows you to develop your own components and can be used as an OEM
Sometimes, scalability is part of planning. It depends on what you mean by scalability. People talk a lot about it, but scalability is not always about system functionality. Sometimes, it may be planning the job you're doing. If you want to split it into several jobs or servers, you don't actually have to have it built in as a functionality. You can create a job using a loop, which runs and controls several jobs in a loop that may be controlled. Scaling should not always be part of the infrastructure based on whether the engine can scale or not. I think it's your plan or project that should scale and split, and you can define these parameters. These parameters include how many servers you want to run or how many executions you want to do on different parts of the data. It's not always an issue of the engine running. Sometimes, your database should be configured to support partitioning. The product may scale very well without partitioning, but if the basic response is very slow, you didn't solve the problem. You should solve the problems at a higher level, not just at the execution level. They should be solved at the database level and communication level, and you should have firewalls. We are trying to add to the open source the ability to generate code for containers and Kubernetes that exist in the subscription version. Once you do this, Kubernetes will take care of the scaling, so there is no problem.

Quotes from Members

 

Pros

"Sometimes, it took a whole team about two weeks to get all the data to prepare and present it. After the optimization of the data, it took about one to two hours to do the whole process. Therefore, it has helped a lot when you talk about money, because it doesn't take a whole team to do it, just one person to do one project at a time and run it when you want to run it. So, it has helped a lot on that side."
"Provides a good open source option."
"Lumada has allowed us to interact with our employees more effectively and compensate them properly. One of the cool things is that we use it to generate commissions for our salespeople and bonuses for our warehouse people. It allows us to get information out to them in a timely fashion. We can also see where they're at and how they're doing."
"The fact that it's a low-code solution is valuable. It's good for more junior people who may not be as experienced with programming."
"We use Lumada’s ability to develop and deploy data pipeline templates once and reuse them. This is very important. When the entire pipeline is automated, we do not have any issues in respect to deployment of code or with code working in one environment but not working in another environment. We have saved a lot of time and effort from that perspective because it is easy to build ETL pipelines."
"The abstraction is quite good."
"I find the drag and drop feature in Pentaho Data Integration very useful for integration."
"Flexible deployment, in any environment, is very important to us. That is the key reason why we ended up with these tools. Because we have a very highly secure environment, we must be able to install it in multiple environments on multiple different servers. The fact that we could use the same tool in all our environments, on-prem and in the cloud, was very important to us."
"It is also easy to learn and user-friendly. Microsoft is also good in terms of technical support. They have built a large community all over the world."
"The performance is good."
"There are many good features in this solution including the data fields, database integration, support for SQL views, and the lookups for matching information."
"It is easy to set up the solution."
"The most valuable features for our company are the flexibility and the quick turn around time in producing simple ETL solutions."
"The most valuable features are the relatively short learning curve and the automation capabilities provided through the BIML add-in for SSDT."
"This solution is easy to implement, has a wide variety of connectors, has support for Visual Basic, and supports the C language."
"The simplicity of the solution is great. The solution also offers excellent integration."
"This solution has improved our overall time to value for data ingestion."
"I can connect with different databases such as Oracle Database or SQL Server. It allows you to extract the data from one database to another. I can structure the data by filtering and mapping the fields. It is very user-friendly. You need to know the basics of SQL development or SQL queries, and you can use this tool."
"We're able to handle large amounts of data with ease."
"The data integration aspect of the solution is excellent."
"This product is very easy to use."
"The standout feature for me is the user-friendly nature of the components."
"It is easy to use and covers most of the functions needed. We can use the code without any extra effort. The open source is very good. They have the same commercials with additional connectors. The graphical design environment is also very easy."
"The main differentiator that I have seen between Talend and other data integration tools is the ability to view the data pipeline in the form of a program."
 

Cons

"I'm still in the very recent stage concerning Pentaho Data Integration, but it can't really handle what I describe as 'extreme data processing', i.e., when there is a huge amount of data to process. That is one area where Pentaho is still lacking."
"I have been facing some difficulties when working with large datasets. It seems that when there is a large amount of data, I experience memory errors."
"I was not happy with the Pentaho Report Designer because of the way it was set up. There was a zone and, under it, another zone, and under that another one, and under that another one. There were a lot of levels and places inside the report, and it was a little bit complicated. You have to search all these different places using a mouse, clicking everywhere... each report is coded in a binary file... You cannot search with a text search tool..."
"I would like to see improvement when it comes to integrating structured data with text data or anything that is unstructured. Sometimes we get all kinds of different files that we need to integrate into the warehouse."
"It could be better integrated with programming languages, like Python and R. Right now, if I want to run a Python code on one of my ETLs, it is a bit difficult to do. It would be great if we have some modules where we could code directly in a Python language. We don't really have a way to run Python code natively."
"I could not connect to our Hadoop environment in an easy and flexible way, and it was important to scale our data warehouse."
"Communicating with the vendor is challenging, and this hinders its performance in free tool setups."
"There is not a data quality or MDM solution in the Pentaho DI suite."
"Sometimes when we want to publish to other types of databases, it's not easy to do so. One example is the Jet Database Engine. Previously, SSIS supported the Jet Database Engine, but nowadays it doesn't. We connect to many databases, such as Access databases, SparkPros databases, and other types of databases, using the Jet Database Engine, and SSIS no longer seems to support it for our databases."
"I would like to see more features in terms of the integration with Azure Data Factory."
"Video training would be a helpful addition."
"The performance of this solution is not as good as other tools in the market."
"We'd like more integration capabilities."
"SSIS can improve in handling different data sources like Salesforce connectivity, Oracle Cloud's connectivity, etc."
"Improving the login procedure would make our reporting easier on monitoring our ETL processes."
"Microsoft's technical support has decreased in quality over the last few years, becoming less responsive and tending to pass problems on instead of solving them."
"The high price of the solution is an area of concern where improvements are required."
"We don't get continuous replication of the data."
"The stability of the solution could improve when running jobs. There can be errors when running projects but in the end, it works well and the errors do not impact the result."
"The product could be more intuitive."
"Talend Open Studio is in Java language, and right now, you can only use the debug functionality in Java. I see that people who know programming languages other than Java currently face difficulties."
"I would say that writing to JSON is kind of a pain. It reads from a JSON file pretty well, but writing to a JSON file is not so great because its components are not good."
"The technical support and documentation need a lot of work to come up to standard."
"Multiple products are there within the product suite. That can be actually trimmed down."
 

Pricing and Cost Advice

"You don't need the Enterprise Edition, you can go with the Community Edition. That way you can use it for free and, for free, it's a pretty good tool to use."
"The pricing has been pretty good. I'm used to using everything open-source or freeware-based. I understand that organizations need to make sure that the solutions are secure, and that's basically where I hit a roadblock in my current organization. They needed to ensure that we had a license and we had a secure way of accessing it so that no outside parties could get access to our data, but in terms of pricing, considering how much other teams are spending on cloud solutions or even their existing solutions, its price point is pretty good. At this time, there are no additional costs. We just have the licensing fees."
"We are using the Community Edition. We have been trying to use and sell the Enterprise version, but that hasn't been possible due to the budget required for it."
"There was a cost analysis done and Pentaho did favorably in terms of cost."
"I think Lumada's price is fair compared to some of the others, like BusinessObjects, which was the other thing that I used at my previous job. BusinessObjects' price was more reasonable before SAP acquired it. They jacked the price up significantly. Oracle's OBIEE tool was also prohibitively expensive."
"I primarily work on the Community Version, which is available to use free of charge."
"The cost of these types of solutions are expensive. So, we really appreciate what we get for our money. Though, we don't think of the solution as a top-of-the-line solution or anything like that."
"For most development tasks, the Enterprise edition should be sufficient. It depends on the type of support that you require for your production environment."
"Depending on the arrangement that a certain company has with Microsoft, it may supply the permanent license that is included in the SQL server license, or it may be a time-bound license if it is a partner license or other enterprise license."
"All of my clients have this product included as part of their Microsoft license."
"It's incredibly cost effective, easy to learn the basics quickly (although, like all ETL tools, it requires the traditional learning curve to get good at) and has an immense user base."
"We purchased the standard edition of SQL Server and SSIS came with it free of charge."
"Our license with SSIS is annual."
"If you don't want to pay a lot of money, you can go for SSIS, as its open-source version is available. When it comes to licensing, SSIS can be expensive."
"The solution comes free of cost."
"We have an enterprise license for this solution."
"Price could be lower. It is getting too expensive when compared to some other solutions, which is actually a little bit concerning."
"The paid version of this solution has a very high price, but even with the limitations, the Community version works fine."
"Talend Open Studio is priced too high."
"It is an open-source product."
"Talend Open Studio costs about 11,000 a year."
"It does the job well for nothing — without cost. That's the advantage of this product."
"For Talend Open Studio, there is a need to make yearly payments towards the licensing cost. Talend Open Studio is a bit expensive, in my opinion."
"Right now, because we're using the open-source version, there's no cost."
860,745 professionals have used our research since 2012.
 

Comparison Review

it_user90069 - PeerSpot reviewer
Feb 20, 2014
Informatica PowerCenter vs. Microsoft SSIS - each technology has its advantages, but they also have similarities
Technology has made it easier for businesses to organize and manipulate data to get a clearer picture of what’s going on with their business. Notably, ETL tools have made managing huge amounts of data significantly easier and faster, boosting many organizations’ business intelligence operations…
 

Top Industries

By visitors reading reviews
Pentaho Data Integration and Analytics: Financial Services Firm 21%, Computer Software Company 14%, Government 6%, Manufacturing Company 5%
SSIS: Financial Services Firm 19%, Computer Software Company 11%, Government 8%, Manufacturing Company 6%
Talend Open Studio: Financial Services Firm 17%, Computer Software Company 13%, Manufacturing Company 8%, Government 7%
 

Company Size

By reviewers (chart categories: Large Enterprise, Midsize Enterprise, Small Business)
 

Questions from the Community

Which ETL tool would you recommend to populate data from OLTP to OLAP?
Hi Rajneesh, yes, here is the feature comparison between the Community and Enterprise editions: https://www.hitachivan...
What do you think can be improved with Hitachi Lumada Data Integrations?
In my opinion, the reporting side of this tool needs serious improvements. In my previous company, we worked with Hi...
What do you use Hitachi Lumada Data Integrations for most frequently?
My company has used this product to transform data from databases, CSV files, and flat files. It really does a good j...
Which is better - SSIS or Informatica PowerCenter?
SSIS PowerPack is a group of drag and drop connectors for Microsoft SQL Server Integration Services, commonly called ...
What do you like most about SSIS?
The product's deployment phase is easy.
What is your experience regarding pricing and costs for SSIS?
Utilizing SSIS involves no extra charges beyond the SQL Server license. It's an economical choice for my clients.
How does Talend Open Studio compare with AWS Glue?
We reviewed AWS Glue before choosing Talend Open Studio. AWS Glue is the managed ETL (extract, transform, and load) f...
What do you like most about Talend Open Studio?
It is easy to use and covers most of the functions needed. We can use the code without any extra effort. The open sou...
 

Also Known As

Pentaho Data Integration and Analytics: Hitachi Lumada Data Integration, Kettle, Pentaho Data Integration
SSIS: SQL Server Integration Services
Talend Open Studio: Open Studio
 

Overview

 

Sample Customers

Pentaho Data Integration and Analytics: 66Controls, Providential Revenue Agency of Río Negro, NOAA Information Systems, Swiss Real Estate Institute
SSIS: Amazon.com, Bank of America, Capital One, Coca-Cola, Dell, E*TRADE, FedEx, Ford Motor Company, Google, Home Depot, IBM, Intel, JPMorgan Chase, Kraft Foods, Lockheed Martin, McDonald's, Microsoft, Morgan Stanley, Nike, Oracle, PepsiCo, Procter & Gamble, Prudential Financial, RBC Capital Markets, SAP, Siemens, Sony, Toyota, UnitedHealth Group, Visa, Walmart, Wells Fargo
Talend Open Studio: Almerys, BF&M, Findus
Find out what your peers are saying about Microsoft, Informatica, Talend and others in Data Integration. Updated: June 2025.