No more typing reviews! Try our Samantha, our new voice AI agent.

Pentaho Data Integration and Analytics vs SharePlex comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Mar 1, 2026

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Pentaho Data Integration an...
Ranking in Data Integration
8th
Average Rating
8.0
Reviews Sentiment
6.7
Number of Reviews
60
Ranking in other categories
No ranking in other categories
SharePlex
Ranking in Data Integration
49th
Average Rating
9.0
Reviews Sentiment
7.3
Number of Reviews
5
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of March 2026, in the Data Integration category, the mindshare of Pentaho Data Integration and Analytics is 1.6%, up from 1.4% compared to the previous year. The mindshare of SharePlex is 0.9%, up from 0.7% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Data Integration Mindshare Distribution
ProductMindshare (%)
Pentaho Data Integration and Analytics1.6%
SharePlex0.9%
Other97.5%
Data Integration
 

Featured Reviews

Michelle Lawson - PeerSpot reviewer
Principal Software Engineer at a tech vendor with 10,001+ employees
Streamlines complex data workflows and has supported automated customer payment notifications
I haven't used Pentaho Data Integration and Analytics in a couple of years, so I don't know how it can be improved. I was pretty pleased with it and was self-taught on it, working a lot with their team at various times, but they were surprised that I was able to learn it all by myself. The documentation is not bad, and documentation is the main thing that any product can do to make themselves better because the easier it is to find examples of what you're trying to do improves the learning curve. I think it took me the longest to learn how to do the asynchronous processing and have things wait for other things to finish processing before continuing on in the workflow. I choose 8 out of 10 because the one reason that it's been rejected at T-Mobile is that everything has to go through a provisioning process and has to get approved, meaning the actual code base has to be investigated by T-Mobile before they'll allow us to use tools of that nature. For whatever reason, we just haven't been able to get that approval; I don't know if it's on Pentaho Data Integration and Analytics' side or if it's on our side. The more you can make it easier for companies to feel comfortable that your product is secure, robustly tested and bug-free, and free of any other kind of negative hacks, the more quickly it will get accepted.
KW
Oracle DBA at a financial services firm with 5,001-10,000 employees
It reduces the downtime and migration time exponentially
I would rate SharePlex's high availability, and disaster recovery features highly. It works as advertised in terms of rapid fill-over and switch-over opportunities. It reduces the migration time for a multi-terabyte database to an hour or less because it performs real-time replication. SharePlex reduces downtime and migration time exponentially. Something that could take 12 to 24 hours is cut down to one to two hours. By the time you start to migrate, the only remaining replication is a slight difference in data from the point when SharePlex has shut down, and the application has to also switch over to the new database. It allowed the company to replicate data more transparently. Some of the business executives probably don't even know it's there. It works 100 percent of the time, with little downtime or problems. It provides what is needed. There's little concern about whether or not it functions. It's allowed the company to achieve its objectives with its clients, reducing and minimizing downtime for substantial migrations and upgrades. It can reduce hardware and storage costs, but my company doesn't utilize it with high-availability architecture.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"I would fully recommend Pentaho."
"The way it has improved our product is by giving our users the ability to do ad hoc reports, which is very important to our users."
"The product is user-friendly and intuitive"
"It allows for rapid prototyping of a wide array of ETL workloads."
"One of the most valuable features is the ability to create many API integrations. I'm always working with advertising agents and using Facebook and Instagram to do campaigns. We use Pentaho to get the results from these campaigns and to create dashboards to analyze the results."
"Using Lumada compared to using SQL manually, ETL development time is half the time it took using a basic manual transformation."
"As a result of one of the projects that we did in the Middle East, we achieved the main goal of fully digitalizing their population census."
"Since we started using Pentaho Data Integration and Analytics, many of our manual tasks have become automatic, and we have increased our time for productive things."
"I like SharePlex's Compare and Repair tool."
"The core features of the solution we like are the reliability of the data transfer and the accuracy of data read and write. The stability of the solution is also excellent."
"The core replication and its performance. Performance is crucial, and SharePlex is by far the fastest. The way it handles replication to multiple targets along with basic filtering, as well as from multiple sources to a single target, is very efficient."
"Try it out in your production and you won't be disappointed."
"There are some capabilities within SharePlex where you can see how the data is migrating and if it still maintains good data integrity. For example, if there are some tables that get out of sync, there are ways to find them and fix the problem on the spot. Since these are very common issues, we can easily fix these types of problems using utilities, like compare and repair. So, if you find something is out of sync, then you can just repair that table. It basically syncs that table from source to target to see if there are any differences. It will then replicate those differences to the target."
"The core replication and its performance, which is by far the fastest, along with the way it handles replication to multiple targets with basic filtering and from multiple sources to a single target, is very efficient."
"It works 100 percent of the time, with little downtime or problems."
"Because of the volume of the transactions, we heavily use a feature that allows SharePlex to replicate thousands of transactions. It's called PEP, Post Enhancement Performance, and that has helped us scale tremendously."
 

Cons

"I would like to see improvement when it comes to integrating structured data with text data or anything that is unstructured. Sometimes we get all kinds of different files that we need to integrate into the warehouse."
"I have been facing some difficulties when working with large datasets. It seems that when there is a large amount of data, I experience memory errors."
"I was not happy with the Pentaho Report Designer because of the way it was set up."
"In the Community edition, it would be nice to have more modules that allow you to code directly within the application. It could have R or Python completely integrated into it, but this could also be because I'm using an older version."
"Searching repository for reports or dashboards"
"The initial version we were on was v3.2 and we had multiple issues, but currently don't find any issues as a blocker."
"For managing very large volumes of data, Pentaho Data Integration and Analytics is not the best tool, but it is in a good position to handle millions of records."
"I also found, in my case, that the statistical data input wasn't working (.sas7bdat input wasn't working)."
"The reporting features need improvement."
"I would like the solution to have some kind of machine learning and AI capabilities."
"I would like the solution to have some kind of machine learning and AI capabilities. Often, if we want to improve the performance of posting, we have to bump up a parameter. That means we need to stop the process, come up with a figure that we want to bump the parameter up to, and then start SharePlex. Machine learning and AI capabilities for these kinds of improvement would tremendously help boost productivity for us."
"For its function in relation to replication (i.e. filtering), I'd give it a six or seven out of 10. GoldenGate has much more functionality by comparison."
"I would like more ability to automate installation and configuration in line with some of the DevOps processes that are more mature in the market."
"For its function in relation to replication (i.e. filtering), I'd give it a six or seven out of 10. GoldenGate has much more functionality by comparison."
"I don't know how easy it would be to change the architecture in an already implemented replication. For example, if we have a certain way of architecting for a particular database migration and want to change that during a period of time, is that an easy or difficult change? There was a need for us to change the architecture in-between the migration, but we didn't do it. We thought, "This is possibly complicated. Let's not change it in the middle because we were approaching our cutover date." That was one thing that we should have checked with support about for training."
"I don't know how easy it would be to change the architecture in an already implemented replication."
 

Pricing and Cost Advice

"We are using the Community Edition. We have been trying to use and sell the Enterprise version, but that hasn't been possible due to the budget required for it."
"Sometimes we provide the licenses or the customer can procure their own licenses. Previously, we had an enterprise license. Currently, we are on a community license as this is adequate for our needs."
"For most development tasks, the Enterprise edition should be sufficient. It depends on the type of support that you require for your production environment."
"The cost of these types of solutions are expensive. So, we really appreciate what we get for our money. Though, we don't think of the solution as a top-of-the-line solution or anything like that."
"There was a cost analysis done and Pentaho did favorably in terms of cost."
"There is a good open source option (Community Edition)​."
"We did a two or three-year deal the last time we did it. As compared to other solutions, at least so far in our experience, it has been very affordable. The licensing is by component. So, you need to make sure you only license the components that you really intend to use. I am not sure if we have relicensed after the Hitachi acquisition, but previously, multi-year renewals resulted in a good discount. I'm not sure if this is still the case. We've had the full suite for a lot of years, and there is just the initial cost. I am not aware of any additional costs."
"The solution reduced our ETL development time by a lot because a whole project used to take about a month to get done previously. After having Lumada, it took just a week. For a big company in Brazil, it saves a team at least $10,000 a month."
"It is not as expensive as Oracle GoldenGate and has worked really well within our budgets."
"It's really good value for the money. There are some things they could improve on, but in terms of the pricing, features, and support, as a holistic package, we are not thinking of anything else at this point in time."
report
Use our free recommendation engine to learn which Data Integration solutions are best for your needs.
885,311 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
11%
Educational Organization
9%
Government
8%
Manufacturing Company
7%
Financial Services Firm
11%
Manufacturing Company
9%
Healthcare Company
8%
Computer Software Company
6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
By reviewers
Company SizeCount
Small Business18
Midsize Enterprise17
Large Enterprise31
No data available
 

Questions from the Community

Which ETL tool would you recommend to populate data from OLTP to OLAP?
Hi Rajneesh, yes here is the feature comparison between the community and enterprise edition : https://www.hitachivantara.com/en-us/pdf/brochure/leverage-open-source-benefits-with-assurance-of-hita...
What do you think can be improved with Hitachi Lumada Data Integrations?
In my opinion, the reporting side of this tool needs serious improvements. In my previous company, we worked with Hitachi Lumada Data Integration and while it does a good job for what it’s worth, ...
What do you use Hitachi Lumada Data Integrations for most frequently?
My company has used this product to transform data from databases, CSV files, and flat files. It really does a good job. We were most satisfied with the results in terms of how many people could us...
Ask a question
Earn 20 points
 

Also Known As

Hitachi Lumada Data Integration, Kettle, Pentaho Data Integration
Dell SharePlex, SharePlex
 

Overview

 

Sample Customers

66Controls, Providential Revenue Agency of Ro Negro, NOAA Information Systems, Swiss Real Estate Institute
Bodybuilding.com, Priceline.com, Ameco Beijing, Viasat, SK Broadband
Find out what your peers are saying about Pentaho Data Integration and Analytics vs. SharePlex and other solutions. Updated: March 2026.
885,311 professionals have used our research since 2012.