Try our new research platform with insights from 80,000+ expert users

Palantir Foundry vs Pentaho Data Integration and Analytics comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Dec 19, 2024

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Palantir Foundry
Ranking in Data Integration
19th
Average Rating
7.6
Reviews Sentiment
7.1
Number of Reviews
16
Ranking in other categories
IT Operations Analytics (9th), Supply Chain Analytics (1st), Cloud Data Integration (14th), Data Migration Appliances (4th), Data Management Platforms (DMP) (2nd), Data and Analytics Service Providers (1st)
Pentaho Data Integration an...
Ranking in Data Integration
22nd
Average Rating
8.0
Reviews Sentiment
6.9
Number of Reviews
53
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of May 2025, in the Data Integration category, the mindshare of Palantir Foundry is 2.8%, up from 2.6% compared to the previous year. The mindshare of Pentaho Data Integration and Analytics is 1.6%, up from 0.6% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Data Integration
 

Featured Reviews

Rama Subba Reddy Thavva - PeerSpot reviewer
A low-code/no-code platform with a user-friendly UI
We couldn't implement or use some of the latest functionalities, like Spark. Palantir Foundry is scalable, but it is costly compared to other cloud providers. The solution is more suitable for small and medium businesses. It might be difficult for large enterprises. I rate the solution’s scalability a seven out of ten.
Ryan Ferdon - PeerSpot reviewer
Low-code makes development faster than with Python, but there were caching issues
If you're working with a larger data set, I'm not so sure it would be the best solution. The larger things got the slower it was. It was kind of buggy sometimes. And when we ran the flow, it didn't go from a perceived start to end, node by node. Everything kicked off at once. That meant there were times when it would get ahead of itself and a job would fail. That was not because the job was wrong, but because Pentaho decided to go at everything at once, and something would process before it was supposed to. There were nodes you could add to make sure that, before this node kicks off, all these others have processed, but it was a bit tedious. There were also caching issues, and we had to write code to clear the cache every time we opened the program, because the cache would fill up and it wouldn't run. I don't know how hard that would be for them to fix, or if it was fixed in version 10. Also, the UI is a bit outdated, but I'm more of a fan of function over how something looks. One other thing that would have helped with Pentaho was documentation and support on the internet: how to do things, how to set up. I think there are some sites on how to install it, and Pentaho does have a help repository, but it wasn't always the most useful.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The virtualization tool is useful."
"I rate Palantir Foundry a ten out of ten."
"Live video sessions enhance the available documentation and allow you to ask questions directly."
"I like the data onboarding to Palantir Foundry and ETL creation."
"The solution provides an end-to-end integrated tech stack that takes care of all utility/infrastructure topics for you."
"The security is also excellent. It's highly granular, so the admins have a high degree of control, and there are many levels of security. That worked well. You won't have an EDC unless you put everything onto the platform because it is its own isolated thing."
"The ease of use is my favorite feature. We're able to build different models and projects or combine different projects to build one use case."
"Encapsulates all the components without the requirement to integrate or check compatibility."
"The fact that it's a low-code solution is valuable. It's good for more junior people who may not be as experienced with programming."
"One of the valuable features is the ability to use PL/SQL statements inside the data transformations and jobs."
"Pentaho Data Integration is easy to use, especially when transforming data."
"The amount of data that it loads and processes is good."
"This solution allows us to create pipelines using a minimal amount of custom coding."
"We use Lumada’s ability to develop and deploy data pipeline templates once and reuse them. This is very important. When the entire pipeline is automated, we do not have any issues in respect to deployment of code or with code working in one environment but not working in another environment. We have saved a lot of time and effort from that perspective because it is easy to build ETL pipelines."
"It's very simple compared to other products out there."
"Its drag-and-drop interface lets me and my team implement all the solutions that we need in our company very quickly. It's a very good tool for that."
 

Cons

"It requires a lot of manual work and is very time-consuming to get to a functional point."
"The solution’s data security could be improved."
"Some error messages can be very cryptic."
"The solution's visualization and analysis could be improved."
"Difficult to receive data from external sources."
"Cost of this solution is quite high."
"The data lineage was challenging. It's hard to track data from the sources as it moves through stages. Informatica EDC can easily capture and report it because it talks to the metadata. This is generated across those various staging points."
"The startup pricing is high, causing concern despite being cost-effective in terms of total cost of ownership."
"Some of the scheduling features about Lumada drive me buggy. The one issue that always drives me up the wall is when Daylight Savings Time changes. It doesn't take that into account elegantly. Every time it changes, I have to do something. It's not a big deal, but it's annoying."
"I would like to see improvements made for real-time data processing."
"It could be better integrated with programming languages, like Python and R. Right now, if I want to run a Python code on one of my ETLs, it is a bit difficult to do. It would be great if we have some modules where we could code directly in a Python language. We don't really have a way to run Python code natively."
"I was not happy with the Pentaho Report Designer because of the way it was set up. There was a zone and, under it, another zone, and under that another one, and under that another one. There were a lot of levels and places inside the report, and it was a little bit complicated. You have to search all these different places using a mouse, clicking everywhere... each report is coded in a binary file... You cannot search with a text search tool..."
"I work with different databases. I would like to work with more connectors to new databases, e.g., DynamoDB and MariaDB, and new cloud solutions, e.g., AWS, Azure, and GCP. If they had these connectors, that would be great. They could improve by building new connectors. If you have native connections to different databases, then you can make instructions more efficient and in a more natural way. You don't have to write any scripts to use that connector."
"It's not very stable, at least not in the case of the community edition. I'm working with the community edition right now and I think perhaps it is because of that it is not very stable, it causes the system to sometimes hang. I'm not sure if this is the case for pair tiers."
"I would like to see support for some additional cloud sources. It doesn't support Azure, for example. I was trying to do a PoC with Azure the other day but it seems they don't support it."
"Since Hitachi took over, I don't feel that the documentation is as good within the solution. It used to have very good help built right in."
 

Pricing and Cost Advice

"It's expensive."
"Palantir Foundry has different pricing models that can be negotiated."
"The solution’s pricing is high."
"Palantir Foundry is an expensive solution."
"For most development tasks, the Enterprise edition should be sufficient. It depends on the type of support that you require for your production environment."
"When we first started with it, it was much cheaper. It has gone up drastically, especially since Hitachi bought out Pentaho."
"The cost of these types of solutions are expensive. So, we really appreciate what we get for our money. Though, we don't think of the solution as a top-of-the-line solution or anything like that."
"You don't need the Enterprise Edition, you can go with the Community Edition. That way you can use it for free and, for free, it's a pretty good tool to use."
"It does seem a bit expensive compared to the serverless product offering. Tools, such as Server Integration Services, are "almost" free with a database engine. It is comparable to products like Alteryx, which is also very expensive."
"We are using the Community Edition. We have been trying to use and sell the Enterprise version, but that hasn't been possible due to the budget required for it."
"The price of the regular version is not reasonable and it should be lower."
"There was a cost analysis done and Pentaho did favorably in terms of cost."
report
Use our free recommendation engine to learn which Data Integration solutions are best for your needs.
849,686 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Manufacturing Company
13%
Financial Services Firm
10%
Computer Software Company
10%
Government
7%
Financial Services Firm
22%
Computer Software Company
15%
Government
8%
Manufacturing Company
5%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What do you like most about Palantir Foundry?
Palantir Foundry is a robust platform that has really strong plugin connectors and provides features for real-time integration.
What needs improvement with Palantir Foundry?
Palantir Foundry is missing marketing, which could help it grow. Additionally, the startup pricing is high, causing concern despite being cost-effective in terms of total cost of ownership. Palanti...
What is your primary use case for Palantir Foundry?
I am getting into the ontology space using Palantir Foundry. The primary use case is for developing a common business model that includes data, people, and processes, essentially describing how bus...
Which ETL tool would you recommend to populate data from OLTP to OLAP?
Hi Rajneesh, yes here is the feature comparison between the community and enterprise edition : https://www.hitachivantara.com/en-us/pdf/brochure/leverage-open-source-benefits-with-assurance-of-hita...
What do you think can be improved with Hitachi Lumada Data Integrations?
In my opinion, the reporting side of this tool needs serious improvements. In my previous company, we worked with Hitachi Lumada Data Integration and while it does a good job for what it’s worth, ...
What do you use Hitachi Lumada Data Integrations for most frequently?
My company has used this product to transform data from databases, CSV files, and flat files. It really does a good job. We were most satisfied with the results in terms of how many people could us...
 

Also Known As

No data available
Hitachi Lumada Data Integration, Kettle, Pentaho Data Integration
 

Overview

 

Sample Customers

Merck KGaA, Airbus, Ferrari,United States Intelligence Community, United States Department of Defense
66Controls, Providential Revenue Agency of Ro Negro, NOAA Information Systems, Swiss Real Estate Institute
Find out what your peers are saying about Palantir Foundry vs. Pentaho Data Integration and Analytics and other solutions. Updated: April 2025.
849,686 professionals have used our research since 2012.