
Informatica Enterprise Data Lake vs Pentaho Data Integration and Analytics comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Informatica Enterprise Data...
Ranking in Data Integration: 39th
Average Rating: 8.0
Reviews Sentiment: 6.1
Number of Reviews: 2
Ranking in other categories: No ranking in other categories

Pentaho Data Integration an...
Ranking in Data Integration: 18th
Average Rating: 8.0
Reviews Sentiment: 6.9
Number of Reviews: 53
Ranking in other categories: No ranking in other categories
 

Mindshare comparison

As of October 2025, in the Data Integration category, Informatica Enterprise Data Lake has a mindshare of 0.4%, up from 0.2% the previous year. Pentaho Data Integration and Analytics has a mindshare of 1.7%, up from 1.1% the previous year. Mindshare is calculated from PeerSpot user engagement data.
Data Integration Market Share Distribution
Product: Market Share (%)
Pentaho Data Integration and Analytics: 1.7%
Informatica Enterprise Data Lake: 0.4%
Other: 97.9%
 

Featured Reviews

SB
Have built an enterprise-level cloud data pipeline with reliable governance and integration
People use Informatica Enterprise Data Lake because it covers the whole journey of a data migration project, from the legacy system to AWS and the Lake architecture: from the source to the landing zone, then to the curated zone, and finally to consumption by both downstream and upstream applications. The main benefit of Informatica Enterprise Data Lake is that it supports different data types, including blobs and storage; all types of data (structured, unstructured, and semi-structured) can exist in the Data Lake architecture, making it robust for retrieving data based on your filtration needs. Its value is actually governed by the client, who combined with our company and AWS, so we all work together to use the workspace and all these things for Informatica Enterprise Data Lake.
Aqeel UR Rehman - PeerSpot reviewer
Transform data efficiently with rich features but there's challenges with large datasets
Currently, I am using Pentaho Data Integration for transforming data and then loading it into different platforms. Sometimes, I use it in conjunction with AWS, particularly S3 and Redshift, to execute the copy command for data processing. Pentaho Data Integration is easy to use, especially when…

Quotes from Members

 

Pros

"The process of using the tool's scalability option is well documented."
"It's my understanding that the product can scale."
"We're using the PDI and the repository function, and they give us the ability to easily generate reporting and output, and to access data. We also like the ability to schedule."
"It makes it pretty simple to do some fairly complicated things. Both I and some of our other BI developers have made stabs at using, for example, SQL Server Integration Services, and we found them a little bit frustrating compared to Data Integration. So, its ease of use is right up there."
"It has improved our data integration capabilities."
"The graphical nature of the development interface is most useful because we've got people with quite mixed skills in the team. We've got some very junior, apprentice-level people, and we've got support analysts who don't have an IT background. It allows us to have quite complicated data flows and embed logic in them. Rather than having to troll through lines and lines of code and try and work out what it's doing, you get a visual representation, which makes it quite easy for people with mixed skills to support and maintain the product. That's one side of it."
"Sometimes, it took a whole team about two weeks to get all the data to prepare and present it. After the optimization of the data, it took about one to two hours to do the whole process. Therefore, it has helped a lot when you talk about money, because it doesn't take a whole team to do it, just one person to do one project at a time and run it when you want to run it. So, it has helped a lot on that side."
"The solution offers features for data integration and migration. Pentaho Data Integration and Analytics allows the integration of multiple data sources into one. The product is user-friendly and intuitive to use for almost any business."
"I can use Python, which is open-source, and I can run other scripts, including Linux scripts. It's user-friendly for running any object-based language. That's a very important feature because we live in a world of open-source."
 

Cons

"Informatica Enterprise Data Lake's setup process was complex since it doesn't support a lot of real-time systems."
"There is not a data quality or MDM solution in the Pentaho DI suite."
"I would like to see support for some additional cloud sources. It doesn't support Azure, for example. I was trying to do a PoC with Azure the other day but it seems they don't support it."
"I would like to see improvement when it comes to integrating structured data with text data or anything that is unstructured. Sometimes we get all kinds of different files that we need to integrate into the warehouse."
"I experience difficulties when handling millions of rows, as the data movement from one source to another becomes challenging."
"Should provide additional control for the data warehouse."
"I could not connect to our Hadoop environment in an easy and flexible way, and it was important to scale our data warehouse."
"It could be better integrated with programming languages, like Python and R. Right now, if I want to run a Python code on one of my ETLs, it is a bit difficult to do. It would be great if we have some modules where we could code directly in a Python language. We don't really have a way to run Python code natively."
"A big problem after deploying something that we do in Lumada is with Git. You get a binary file to do a code review. So, if you need to do a review, you have to take pictures of the screen to show each step. That is the biggest bug if you are using Git."
 

Pricing and Cost Advice

"The licenses attached to the solution are highly priced."
"The pricing has been pretty good. I'm used to using everything open-source or freeware-based. I understand that organizations need to make sure that the solutions are secure, and that's basically where I hit a roadblock in my current organization. They needed to ensure that we had a license and we had a secure way of accessing it so that no outside parties could get access to our data, but in terms of pricing, considering how much other teams are spending on cloud solutions or even their existing solutions, its price point is pretty good. At this time, there are no additional costs. We just have the licensing fees."
"The solution reduced our ETL development time by a lot because a whole project used to take about a month to get done previously. After having Lumada, it took just a week. For a big company in Brazil, it saves a team at least $10,000 a month."
"It does seem a bit expensive compared to the serverless product offering. Tools, such as Server Integration Services, are 'almost' free with a database engine. It is comparable to products like Alteryx, which is also very expensive."
"I use it because it is free. I download from their page for free. I don't have to pay for a license. With other tools, I have to pay for the licenses. That is why I use Pentaho."
"For most development tasks, the Enterprise edition should be sufficient. It depends on the type of support that you require for your production environment."
"I believe the pricing of the solution is more affordable than the competitors."
"There is a good open source option (Community Edition)."
"The price of the regular version is not reasonable and it should be lower."
869,760 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
No data available
Financial Services Firm: 18%
Computer Software Company: 11%
Government: 8%
Manufacturing Company: 7%
 

Company Size

By reviewers
No data available
By reviewers
Company Size: Count
Small Business: 17
Midsize Enterprise: 16
Large Enterprise: 25
 

Questions from the Community

What do you like most about Informatica Enterprise Data Lake?
The process of using the tool's scalability option is well documented.
What is your experience regarding pricing and costs for Informatica Enterprise Data Lake?
Informatica Enterprise Data Lake's cost is higher, but it provides good value for the price. It comes at a reasonable price, which I would rate at eight out of ten.
What needs improvement with Informatica Enterprise Data Lake?
Improvement depends upon how we can faster showcase our data in our reports, whether in Power BI, Tableau, or other formats. Improving the speed of data showcasing and filtration would be better, a...
Which ETL tool would you recommend to populate data from OLTP to OLAP?
Hi Rajneesh, yes, here is the feature comparison between the Community and Enterprise editions: https://www.hitachivantara.com/en-us/pdf/brochure/leverage-open-source-benefits-with-assurance-of-hita...
What do you think can be improved with Hitachi Lumada Data Integrations?
In my opinion, the reporting side of this tool needs serious improvements. In my previous company, we worked with Hitachi Lumada Data Integration and while it does a good job for what it’s worth, ...
What do you use Hitachi Lumada Data Integrations for most frequently?
My company has used this product to transform data from databases, CSV files, and flat files. It really does a good job. We were most satisfied with the results in terms of how many people could us...
 

Also Known As

Informatica Intelligent Data Lake, Intelligent Data Lake
Hitachi Lumada Data Integration, Kettle, Pentaho Data Integration
 

Overview

 

Sample Customers

Information Not Available
66Controls, Providential Revenue Agency of Río Negro, NOAA Information Systems, Swiss Real Estate Institute
Find out what your peers are saying about Informatica Enterprise Data Lake vs. Pentaho Data Integration and Analytics and other solutions. Updated: October 2025.