Try our new research platform with insights from 80,000+ expert users

Pentaho Data Integration and Analytics vs Talend Open Studio comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Dec 19, 2024

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Pentaho Data Integration an...
Ranking in Data Integration
19th
Average Rating
8.0
Reviews Sentiment
6.9
Number of Reviews
53
Ranking in other categories
No ranking in other categories
Talend Open Studio
Ranking in Data Integration
5th
Average Rating
8.0
Reviews Sentiment
6.8
Number of Reviews
50
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of June 2025, in the Data Integration category, the mindshare of Pentaho Data Integration and Analytics is 1.8%, up from 0.7% compared to the previous year. The mindshare of Talend Open Studio is 4.7%, down from 5.1% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Data Integration
 

Featured Reviews

Aqeel UR Rehman - PeerSpot reviewer
Transform data efficiently with rich features but there's challenges with large datasets
Currently, I am using Pentaho Data Integration for transforming data and then loading it into different platforms. Sometimes, I use it in conjunction with AWS, particularly S3 and Redshift, to execute the copy command for data processing Pentaho Data Integration is easy to use, especially when…
Costin Marzea - PeerSpot reviewer
Allows you to develop your own components and can be used as an OEM
Sometimes, scalability is part of planning. It depends on what you mean by scalability. People talk a lot about it, but scalability is not always about system functionality. Sometimes, it may be planning the job you're doing. If you want to split it into several jobs or servers, you don't actually have to have it built in as a functionality. You can create a job using a loop, which runs and controls several jobs in a loop that may be controlled. Scaling should not always be part of the infrastructure based on whether the engine can scale or not. I think it's your plan or project that should scale and split, and you can define these parameters. These parameters include how many servers you want to run or how many executions you want to do on different parts of the data. It's not always an issue of the engine running. Sometimes, your database should be configured to support partitioning. The product may scale very well without partitioning, but if the basic response is very slow, you didn't solve the problem. You should solve the problems at a higher level, not just at the execution level. They should be solved at the database level and communication level, and you should have firewalls. We are trying to add to the open source the ability to generate code for containers and Kubernetes that exist in the subscription version. Once you do this, Kubernetes will take care of the scaling, so there is no problem.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The fact that it's a low-code solution is valuable. It's good for more junior people who may not be as experienced with programming."
"The abstraction is quite good."
"Pentaho Data Integration is easy to use, especially when transforming data."
"It is easy to use, install, and start working with."
"We also haven't had to create any custom Java code. Almost everywhere it's SQL, so it's done in the pipeline and the configuration. That means you can offload the work to people who, while they are not less experienced, are less technical when it comes to logic."
"Flexible deployment, in any environment, is very important to us. That is the key reason why we ended up with these tools. Because we have a very highly secure environment, we must be able to install it in multiple environments on multiple different servers. The fact that we could use the same tool in all our environments, on-prem and in the cloud, was very important to us."
"I find the drag and drop feature in Pentaho Data Integration very useful for integration."
"It makes it pretty simple to do some fairly complicated things. Both I and some of our other BI developers have made stabs at using, for example, SQL Server Integration Services, and we found them a little bit frustrating compared to Data Integration. So, its ease of use is right up there."
"The solution's technical support is responsive and helpful."
"You can use Talend as a stand-alone application without customization to collect data and generate reports over dashboards. It's got great functionality."
"You can use Talend SDK to develop your own components and add them to the product."
"Talend can connect to multiple data sources, including relational data sources, ERP, CRM, and others."
"Talend Studio has the ability to use it to ensure data quality."
"A very user friendly solution."
"The product is easy to install and configure. It is one of the best tools for data integration."
"The data integration aspect of the solution is excellent."
 

Cons

"​I work with the Community Edition, therefore I do not have support. There was an issue that I could not resolve with community support.​"
"I'm still in the very recent stage concerning Pentaho Data Integration, but it can't really handle what I describe as "extreme data processing" i.e. when there is a huge amount of data to process. That is one area where Pentaho is still lacking."
"I would like to see more improvements with AS400 DB2."
"One thing that I don't like, just a little, is the backward compatibility."
"Since Hitachi took over, I don't feel that the documentation is as good within the solution. It used to have very good help built right in."
"The reporting definitely needs improvement. There are a lot of general, basic features that it doesn't have. A simple feature you would expect a reporting tool to have is the ability to search the repository for a report. It doesn't even have that capability. That's been a feature that we've been asking for since the beginning and it hasn't been implemented yet."
"Should provide additional control for the data warehouse"
"As far as I remember, not all connectors worked very well. They can add more connectors and more drivers to the process to integrate with more flows."
"Talend Open Studio is in Java language, and right now, you can only use the debug functionality in Java. I see that people who know programming languages other than Java currently face difficulties."
"We don't get continuous replication of the data."
"In version 6.2 we did encounter issues with the job servers and specifically with ESB. Version 6.3 is better but large jobs can cause the MDM server to fall over, requiring a reboot."
"Technical support and customer service need to be improved."
"The user interface could be made simpler."
"There is a need for mastery in some areas."
"Having additional training materials, such as a video tutorial, would be an improvement."
"The security features could be improved."
 

Pricing and Cost Advice

"We are using the Community Edition. We have been trying to use and sell the Enterprise version, but that hasn't been possible due to the budget required for it."
"There is a good open source option (Community Edition)​."
"The cost of these types of solutions are expensive. So, we really appreciate what we get for our money. Though, we don't think of the solution as a top-of-the-line solution or anything like that."
"I primarily work on the Community Version, which is available to use free of charge."
"You don't need the Enterprise Edition, you can go with the Community Edition. That way you can use it for free and, for free, it's a pretty good tool to use."
"The solution reduced our ETL development time by a lot because a whole project used to take about a month to get done previously. After having Lumada, it took just a week. For a big company in Brazil, it saves a team at least $10,000 a month."
"It does seem a bit expensive compared to the serverless product offering. Tools, such as Server Integration Services, are "almost" free with a database engine. It is comparable to products like Alteryx, which is also very expensive."
"Sometimes we provide the licenses or the customer can procure their own licenses. Previously, we had an enterprise license. Currently, we are on a community license as this is adequate for our needs."
"I am using the open-source version of the solution, so there are no extra costs for any feature."
"It is an open-source product."
"Pricing and licensing are fairly straightforward. It is reasonably priced and managed."
"Right now, because we're using the open-source version, there's no cost."
"The paid version of this solution has a very high price, but even with the limitations, the Community version works fine."
"We are using the free version of the tool, because the enterprise version is a little expensive."
"Talend Open Studio costs about 11,000 a year."
"It is an open-source tool which means it is a free solution."
report
Use our free recommendation engine to learn which Data Integration solutions are best for your needs.
859,579 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
21%
Computer Software Company
14%
Government
7%
Manufacturing Company
5%
Financial Services Firm
17%
Computer Software Company
13%
Manufacturing Company
8%
Government
7%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

Which ETL tool would you recommend to populate data from OLTP to OLAP?
Hi Rajneesh, yes here is the feature comparison between the community and enterprise edition : https://www.hitachivantara.com/en-us/pdf/brochure/leverage-open-source-benefits-with-assurance-of-hita...
What do you think can be improved with Hitachi Lumada Data Integrations?
In my opinion, the reporting side of this tool needs serious improvements. In my previous company, we worked with Hitachi Lumada Data Integration and while it does a good job for what it’s worth, ...
What do you use Hitachi Lumada Data Integrations for most frequently?
My company has used this product to transform data from databases, CSV files, and flat files. It really does a good job. We were most satisfied with the results in terms of how many people could us...
How does Talend Open Studio compare with AWS Glue?
We reviewed AWS Glue before choosing Talend Open Studio. AWS Glue is the managed ETL (extract, transform, and load) from Amazon Web Services. AWS Glue enables AWS users to create and manage jobs in...
What do you like most about Talend Open Studio?
It is easy to use and covers most of the functions needed. We can use the code without any extra effort. The open source is very good. They have the same commercials with additional connectors. The...
 

Also Known As

Hitachi Lumada Data Integration, Kettle, Pentaho Data Integration
Open Studio
 

Overview

 

Sample Customers

66Controls, Providential Revenue Agency of Ro Negro, NOAA Information Systems, Swiss Real Estate Institute
Almerys, BF&M, Findus
Find out what your peers are saying about Pentaho Data Integration and Analytics vs. Talend Open Studio and other solutions. Updated: May 2025.
859,579 professionals have used our research since 2012.