Try our new research platform with insights from 80,000+ expert users

Pentaho Data Integration and Analytics vs dbt comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Dec 19, 2024

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

dbt
Ranking in Data Integration
17th
Average Rating
7.8
Reviews Sentiment
6.8
Number of Reviews
7
Ranking in other categories
Data Quality (6th)
Pentaho Data Integration an...
Ranking in Data Integration
8th
Average Rating
8.0
Reviews Sentiment
6.7
Number of Reviews
60
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of March 2026, in the Data Integration category, the mindshare of dbt is 1.7%, up from 1.0% compared to the previous year. The mindshare of Pentaho Data Integration and Analytics is 1.6%, up from 1.4% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Data Integration Mindshare Distribution
ProductMindshare (%)
Pentaho Data Integration and Analytics1.6%
dbt1.7%
Other96.7%
Data Integration
 

Featured Reviews

AS
Principal Data Engineer at Integrant, Inc.
Data teams have streamlined code-driven pipelines and now collaborate faster on shared models
We are still experimenting with testing, but not that much. We are not using some features yet. We are trying to introduce them because we are coming from a background of SSIS. The team used to work with SSIS, Microsoft SQL Server Integration Services. We are still adapting one feature at a time. Currently, we are working with the SQL modules and with the Jinja templating. We are experimenting with testing, but I would say towards the end of this year, we are planning to explore more of the documentation and the data lineage options as well. I would say the benefits are coming from GUI-based tools like SSIS. We have more control on the codebase. We can create something of a system where we can use macros and templating, speeding up the development cycle. We are now trying to introduce a little testing, and also we are using some sort of a CI/CD cycle, so continuous integration and continuous deployment. I do not believe that these kinds of features are that common as a package as a whole package. dbt excels in that area. I used to have a couple of notes about the performance, but lately I have discovered something called dbt Fusion, which, according to dbt Labs, they proclaim is much faster during the parsing of dbt models. However, I would love to see even more of an out-of-the-box solution regarding the testing. They are treating the testing in a good way so far, but I would love to see even more improvement because the whole data testing field is not very mature. It is not the same as software testing; for example, you have test suites, test tools, and profilers, but for data testing, it is not yet that advanced. I would love for dbt to take the lead on that.
Michelle Lawson - PeerSpot reviewer
Principal Software Engineer at a tech vendor with 10,001+ employees
Streamlines complex data workflows and has supported automated customer payment notifications
I haven't used Pentaho Data Integration and Analytics in a couple of years, so I don't know how it can be improved. I was pretty pleased with it and was self-taught on it, working a lot with their team at various times, but they were surprised that I was able to learn it all by myself. The documentation is not bad, and documentation is the main thing that any product can do to make themselves better because the easier it is to find examples of what you're trying to do improves the learning curve. I think it took me the longest to learn how to do the asynchronous processing and have things wait for other things to finish processing before continuing on in the workflow. I choose 8 out of 10 because the one reason that it's been rejected at T-Mobile is that everything has to go through a provisioning process and has to get approved, meaning the actual code base has to be investigated by T-Mobile before they'll allow us to use tools of that nature. For whatever reason, we just haven't been able to get that approval; I don't know if it's on Pentaho Data Integration and Analytics' side or if it's on our side. The more you can make it easier for companies to feel comfortable that your product is secure, robustly tested and bug-free, and free of any other kind of negative hacks, the more quickly it will get accepted.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"There is operational efficiency achieved, and data quality and governance have also been achieved with modular SQL and version controlling, which reduced duplication of data and data errors."
"dbt has positively impacted my organization by allowing us to create our data pipelines much faster, going from ingestion of data to creating a data product in weeks instead of months, and we can do it in-house with the skillset we already have."
"I would say the best feature or the most desirable feature for dbt is the ability to write everything in code."
"The product is developer-friendly."
"Since we migrated from SSIS to dbt model architecture, it takes around four hours only to complete a full refresh, and the client is now happy because our downtime was drastically reduced when we perform a complete refresh of the data."
"From a developer point of view, I find the ease of development and the code to be the most useful capabilities of dbt."
"When I started doing it and did the migration so that all that ETL process would be done in Pentaho Data Integration and Analytics, I reduced the processing time from two and a half hours per run to seven minutes."
"You can get ETL, reporting, analysis, and analytics in a single shop."
"Provides a good open source option."
"The way it has improved our product is by giving our users the ability to do ad hoc reports, which is very important to our users. We can do predictive analysis on trends coming in for contracts, which is what our product does. The product helps users decide which way to go based on the predictive analysis done by Pentaho. Pentaho is not doing predictions, but reporting on the predictions that our product is doing. This is a big part of our product."
"Since we started using Pentaho Data Integration and Analytics, many of our manual tasks have become automatic, and we have increased our time for productive things."
"Easy to use, support for all databases (jdbc and odbc connection), xls, csv, files, txt, SAS, R."
"Pentaho Data Integration is quite simple to learn, and there is a lot of information available online."
"I love the fact that we haven't come up with a problem yet that we haven't been able to address with this tool."
 

Cons

"If I needed to name a few areas for improvement, I would mention the migration of code to Git and GitHub, which sometimes fails and can be confusing for developers during handover."
"The solution must add more Python-based implementations."
"dbt can be improved as I find the co-pilot in dbt is not very good, and my team has tried using it but opted to move off it and use other co-pilots such as GitHub."
"Dbt is not as stable as preferred, as it has had a few outages in the current year itself, so improvement should be made in the outages section as it is not stable."
"Since dbt has a license cost, if a company is small and does not have much budget, they can explore other tools because there are other tools that provide the same functionality at a lower cost."
"Every upgrade is a little bit of a risk for us because we do not know if the workarounds that we developed will be available for the next version."
"Although it is a low-code solution with a graphical interface, often the error messages that you get are of the type that a developer would be happy with. You get a big stack of red text and Java errors displayed on the screen, and less technical people can get intimidated by that. It can be a bit intimidating to get a wall of red error messages displayed. Other graphical tools that are focused at the power user level provide a much more user-friendly experience in dealing with your exceptions and guiding the user into where they've made the mistake."
"I'm still in the very recent stage concerning Pentaho Data Integration, but it can't really handle what I describe as "extreme data processing" i.e. when there is a huge amount of data to process. That is one area where Pentaho is still lacking."
"The one annoyance that troubled us a lot was the fact that every step in a transformation that needed data, created its own data connection."
"I experience difficulties when handling millions of rows, as the data movement from one source to another becomes challenging."
"The product needs more plugins."
"I would like to have more languages/scripts supported in user-defined classes."
"The only complaint that I have with Pentaho has been with scaling."
"​I could not connect to our Hadoop environment in an easy and flexible way, and it was important to scale our data warehouse​."
 

Pricing and Cost Advice

"The solution’s pricing is affordable."
"The pricing has been pretty good. I'm used to using everything open-source or freeware-based. I understand that organizations need to make sure that the solutions are secure, and that's basically where I hit a roadblock in my current organization. They needed to ensure that we had a license and we had a secure way of accessing it so that no outside parties could get access to our data, but in terms of pricing, considering how much other teams are spending on cloud solutions or even their existing solutions, its price point is pretty good. At this time, there are no additional costs. We just have the licensing fees."
"When we first started with it, it was much cheaper. It has gone up drastically, especially since Hitachi bought out Pentaho."
"There was a cost analysis done and Pentaho did favorably in terms of cost."
"You don't need the Enterprise Edition, you can go with the Community Edition. That way you can use it for free and, for free, it's a pretty good tool to use."
"I mostly used the open-source version. I didn't work with a license."
"The solution reduced our ETL development time by a lot because a whole project used to take about a month to get done previously. After having Lumada, it took just a week. For a big company in Brazil, it saves a team at least $10,000 a month."
"I believe the pricing of the solution is more affordable than the competitors"
"It does seem a bit expensive compared to the serverless product offering. Tools, such as Server Integration Services, are "almost" free with a database engine. It is comparable to products like Alteryx, which is also very expensive."
report
Use our free recommendation engine to learn which Data Integration solutions are best for your needs.
885,264 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
13%
Insurance Company
8%
Manufacturing Company
8%
Computer Software Company
7%
Financial Services Firm
12%
Educational Organization
10%
Government
8%
Manufacturing Company
7%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
By reviewers
Company SizeCount
Small Business1
Midsize Enterprise3
Large Enterprise3
By reviewers
Company SizeCount
Small Business18
Midsize Enterprise17
Large Enterprise31
 

Questions from the Community

What is your experience regarding pricing and costs for dbt?
The course content that dbt provides is free and excellent for anyone starting out.
What needs improvement with dbt?
We are still experimenting with testing, but not that much. We are not using some features yet. We are trying to introduce them because we are coming from a background of SSIS. The team used to wor...
What is your primary use case for dbt?
I am working with one of our enterprise customers, managing their newly established cloud warehouse. They are using Snowflake and we are using dbt to manage all the transformation and views and tab...
Which ETL tool would you recommend to populate data from OLTP to OLAP?
Hi Rajneesh, yes here is the feature comparison between the community and enterprise edition : https://www.hitachivantara.com/en-us/pdf/brochure/leverage-open-source-benefits-with-assurance-of-hita...
What do you think can be improved with Hitachi Lumada Data Integrations?
In my opinion, the reporting side of this tool needs serious improvements. In my previous company, we worked with Hitachi Lumada Data Integration and while it does a good job for what it’s worth, ...
What do you use Hitachi Lumada Data Integrations for most frequently?
My company has used this product to transform data from databases, CSV files, and flat files. It really does a good job. We were most satisfied with the results in terms of how many people could us...
 

Also Known As

No data available
Hitachi Lumada Data Integration, Kettle, Pentaho Data Integration
 

Overview

 

Sample Customers

Information Not Available
66Controls, Providential Revenue Agency of Ro Negro, NOAA Information Systems, Swiss Real Estate Institute
Find out what your peers are saying about Pentaho Data Integration and Analytics vs. dbt and other solutions. Updated: March 2026.
885,264 professionals have used our research since 2012.