
Collibra Catalog vs Pentaho Data Integration and Analytics comparison

 

Comparison Buyer's Guide

Executive Summary

 

Categories and Ranking

Collibra Catalog
Average Rating: 8.0
Reviews Sentiment: 7.3
Number of Reviews: 11
Ranking in other categories: Metadata Management (3rd)

Pentaho Data Integration an...
Average Rating: 8.0
Reviews Sentiment: 6.9
Number of Reviews: 53
Ranking in other categories: Data Integration (19th)
 

Mindshare comparison

Collibra Catalog and Pentaho Data Integration and Analytics aren’t in the same category and serve different purposes. Collibra Catalog is designed for Metadata Management and holds a mindshare of 11.8%, up 10.2% compared to last year.
Pentaho Data Integration and Analytics, on the other hand, focuses on Data Integration, holds 1.8% mindshare, up 0.8% since last year.
 

Featured Reviews

Tejbir Singh - PeerSpot reviewer
Facilitates data quality monitoring and AI governance with a complete suite of tools
When I initially started with Collibra, it was just a data cataloging platform with governance workflows around it. Now they have acquired a lot of other tools, or they have merged or acquired different platforms. It is a complete suite of tools for managing data. We can monitor data quality and take actions on the profiling results obtained by running data quality checks. Collibra helps catalog data assets, monitor the health of data assets, and take necessary actions. If we find data quality issues, it also provides a medium to capture those issues and how to remediate them. The workflows allow the creation of custom workflows based on needs. The newest addition in their tool suite is AI governance, which allows cataloging all AI models currently deployed or even in the pre-production stage. It helps document model meanings and the risks involved, thus managing all risks related to AI deployments.
Aqeel UR Rehman - PeerSpot reviewer
Transform data efficiently with rich features, but there are challenges with large datasets
Currently, I am using Pentaho Data Integration for transforming data and then loading it into different platforms. Sometimes, I use it in conjunction with AWS, particularly S3 and Redshift, to execute the copy command for data processing. Pentaho Data Integration is easy to use, especially when…

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"We have had no complaints about the stability."
"The data lineage capability is valuable as it shows how different sources are connected and how data flows, which is crucial for projects like migrations. Moreover, data lineage visualization in Collibra Catalog aids our data governance initiatives."
"Collibra Catalog is simple to use and user-friendly for those who are not technically inclined since it is easy to find while also easy to see data lineage diagrams."
"Except for data quality, everything is perfect."
"Collibra Catalog's best feature is the data quality checker."
"Collibra Catalog allows us to automate metadata management, significantly saving time, effort, and finances."
"Collibra Catalog has significantly enhanced data governance and compliance for our team, primarily through its valuable feature of endpoint lineage enabling visual representation of the data."
"Gartner identifies Collibra Catalog as the leader, which aligns with our observations."
"Its drag-and-drop interface lets me and my team implement all the solutions that we need in our company very quickly. It's a very good tool for that."
"We can schedule job execution in the BA Server, which is the front-end product we're using right now. That scheduling interface is nice."
"Data transformation within Pentaho is a nice feature that they have and that I value."
"The abstraction is quite good."
"It has improved our data integration capabilities."
"The fact that it enables us to leverage metadata to automate data pipeline templates and reuse them is definitely one of the features that we like the best. The metadata injection is helpful because it reduces the need to create and maintain additional ETLs. If we didn't have that feature, we would have lots of duplicated ETLs that we would have to create and maintain. The data pipeline templates have definitely been helpful when looking at productivity and costs."
"I can create faster instructions than writing with SQL or code. Also, I am able to do some background control of the data process with this tool. Therefore, I use it as an ELT tool. I have a station area where I can work with all the information that I have in my production databases, then I can work with the data that I created."
"Flexible deployment, in any environment, is very important to us. That is the key reason why we ended up with these tools. Because we have a very highly secure environment, we must be able to install it in multiple environments on multiple different servers. The fact that we could use the same tool in all our environments, on-prem and in the cloud, was very important to us."
 

Cons

"Collibra Catalog could improve its automation to increase the efficiency of the software."
"If the price is a bit reduced, that would be better."
"One of the very key drawbacks is that automation for access provisioning is not available. If I discover a data set or data product in the marketplace and want to access the data, this feature doesn't exist at all."
"More automation and artificial intelligence involvement are necessary. Reducing required employee involvement and enhancing ease of use are vital."
"A key area for improvement in Collibra Catalog lies in its integration capabilities, particularly with a broader range of sources."
"The tool's overall functionalities need to improve since, nowadays, many tools, from a business perspective, are easy to use."
"If it can become more user-intuitive and work on integrating with communication platforms like Slack or Teams, it would significantly help business users."
"I'd like to see more integration with other reporting sources."
"The web interface is rusty, and the biggest problem with Pentaho is debugging and troubleshooting. It isn't easy to build the pipeline incrementally. At least in our case, it's hard to find a way to execute step by step in the debugging mode."
"I could not connect to our Hadoop environment in an easy and flexible way, and it was important to scale our data warehouse."
"Since Hitachi took over, I don't feel that the documentation is as good within the solution. It used to have very good help built right in."
"If you're working with a larger data set, I'm not so sure it would be the best solution. The larger things got the slower it was."
"I have been facing some difficulties when working with large datasets. It seems that when there is a large amount of data, I experience memory errors."
"There is not a data quality or MDM solution in the Pentaho DI suite."
"One thing that I don't like, just a little, is the backward compatibility."
"The testing and quality could really improve. Every time that there is a major release, we are very nervous about what is going to get broken. We have had a lot of experience with that, as even the latest one was broken. Some basic things get broken. That doesn't look good for Hitachi at all. If there is one place I would advise them to spend some money and do some effort, it is with the quality. It is not that hard to start putting in some unit tests so basic things don't get broken when they do a new release. That just looks horrible, especially for an organization like Hitachi."
 

Pricing and Cost Advice

"Collibra Catalog is fairly priced - I would rate their pricing seven out of ten."
"The product is highly priced compared to other vendors."
"Collibra offers a per-user licensing model."
"I think they can bring a few more features and align better with other quality products."
"The pricing has been pretty good. I'm used to using everything open-source or freeware-based. I understand that organizations need to make sure that the solutions are secure, and that's basically where I hit a roadblock in my current organization. They needed to ensure that we had a license and we had a secure way of accessing it so that no outside parties could get access to our data, but in terms of pricing, considering how much other teams are spending on cloud solutions or even their existing solutions, its price point is pretty good. At this time, there are no additional costs. We just have the licensing fees."
"For most development tasks, the Enterprise edition should be sufficient. It depends on the type of support that you require for your production environment."
"I think Lumada's price is fair compared to some of the others, like BusinessObjects, which was the other thing that I used at my previous job. BusinessObjects' price was more reasonable before SAP acquired it. They jacked the price up significantly. Oracle's OBIEE tool was also prohibitively expensive."
"There is a good open source option (Community Edition)."
"You don't need the Enterprise Edition, you can go with the Community Edition. That way you can use it for free and, for free, it's a pretty good tool to use."
"The cost of these types of solutions are expensive. So, we really appreciate what we get for our money. Though, we don't think of the solution as a top-of-the-line solution or anything like that."
"I use it because it is free. I download from their page for free. I don't have to pay for a license. With other tools, I have to pay for the licenses. That is why I use Pentaho."
"If a company is looking for an ETL solution and wants to integrate it with their tech stack but doesn't want to spend a bunch of money, Pentaho is a good solution."
863,429 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews

Collibra Catalog
Financial Services Firm: 30%
Computer Software Company: 8%
Manufacturing Company: 7%
Government: 6%

Pentaho Data Integration and Analytics
Financial Services Firm: 18%
Computer Software Company: 12%
Government: 7%
Manufacturing Company: 5%
 

Company Size

By reviewers: Large Enterprise, Midsize Enterprise, Small Business
 

Questions from the Community

What do you like most about Collibra Catalog?
The data lineage capability is valuable as it shows how different sources are connected and how data flows, which is crucial for projects like migrations. Moreover, data lineage visualization in C...
What is your experience regarding pricing and costs for Collibra Catalog?
Pricing is not under my purview as I am an architect. The platform team handles the licensing aspects.
What needs improvement with Collibra Catalog?
I have utilized the sophisticated search capability in Collibra Catalog, and it can be improved by implementing more natural language search capabilities. Currently, we need to enter the asset name...
Which ETL tool would you recommend to populate data from OLTP to OLAP?
Hi Rajneesh, yes, here is the feature comparison between the Community and Enterprise editions: https://www.hitachivantara.com/en-us/pdf/brochure/leverage-open-source-benefits-with-assurance-of-hita...
What do you think can be improved with Hitachi Lumada Data Integrations?
In my opinion, the reporting side of this tool needs serious improvements. In my previous company, we worked with Hitachi Lumada Data Integration and while it does a good job for what it’s worth, ...
What do you use Hitachi Lumada Data Integrations for most frequently?
My company has used this product to transform data from databases, CSV files, and flat files. It really does a good job. We were most satisfied with the results in terms of how many people could us...
 

Also Known As

Collibra Catalog: No data available
Pentaho Data Integration and Analytics: Hitachi Lumada Data Integration, Kettle, Pentaho Data Integration
 


Sample Customers

Collibra Catalog: AXA XL, DNB, Adobe, PMI, Holland America Line, UC Davis Health, Cox Automotive
Pentaho Data Integration and Analytics: 66Controls, Providential Revenue Agency of Río Negro, NOAA Information Systems, Swiss Real Estate Institute
Find out what your peers are saying about Informatica, Alation, Collibra and others in Metadata Management. Updated: July 2025.