We performed a comparison between Collibra Catalog and Pentaho Data Integration and Analytics based on real PeerSpot user reviews.
"Collibra Catalog's best feature is the data quality checker."
"The data lineage capability is valuable as it shows how different sources are connected and how data flows, which is crucial for projects like migrations. Moreover, data lineage visualization in Collibra Catalog aids our data governance initiatives."
"Collibra Catalog is simple to use and user-friendly for those who are not technically inclined since it is easy to find while also easy to see data lineage diagrams."
"We have had no complaints about the stability."
"Collibra Catalog has significantly enhanced data governance and compliance for our team, primarily through its valuable feature of endpoint lineage enabling visual representation of the data."
"It makes it pretty simple to do some fairly complicated things. Both I and some of our other BI developers have made stabs at using, for example, SQL Server Integration Services, and we found them a little bit frustrating compared to Data Integration. So, its ease of use is right up there."
"The amount of data that it loads and processes is good."
"This solution allows us to create pipelines using a minimal amount of custom coding."
"We're using the PDI and the repository function, and they give us the ability to easily generate reporting and output, and to access data. We also like the ability to schedule."
"I can use Python, which is open-source, and I can run other scripts, including Linux scripts. It's user-friendly for running any object-based language. That's a very important feature because we live in a world of open-source."
"It's very simple compared to other products out there."
"The fact that it's a low-code solution is valuable. It's good for more junior people who may not be as experienced with programming."
"We can schedule job execution in the BA Server, which is the front-end product we're using right now. That scheduling interface is nice."
"A key area for improvement in Collibra Catalog lies in its integration capabilities, particularly with a broader range of sources."
"Collibra Catalog could improve its automation to increase the efficiency of the software."
"The tool's overall functionalities need to improve since, nowadays, many tools, from a business perspective, are easy to use."
"I'd like to see more integration with other reporting sources."
"It could be better integrated with programming languages, like Python and R. Right now, if I want to run a Python code on one of my ETLs, it is a bit difficult to do. It would be great if we have some modules where we could code directly in a Python language. We don't really have a way to run Python code natively."
"I work with different databases. I would like to work with more connectors to new databases, e.g., DynamoDB and MariaDB, and new cloud solutions, e.g., AWS, Azure, and GCP. If they had these connectors, that would be great. They could improve by building new connectors. If you have native connections to different databases, then you can make instructions more efficient and in a more natural way. You don't have to write any scripts to use that connector."
"Some of the scheduling features about Lumada drive me buggy. The one issue that always drives me up the wall is when Daylight Savings Time changes. It doesn't take that into account elegantly. Every time it changes, I have to do something. It's not a big deal, but it's annoying."
"The performance could be improved. If they could have analytics perform well on large volumes, that would be a big deal for our products."
"There is not a data quality or MDM solution in the Pentaho DI suite."
"I work with the Community Edition, therefore I do not have support. There was an issue that I could not resolve with community support."
"I was not happy with the Pentaho Report Designer because of the way it was set up. There was a zone and, under it, another zone, and under that another one, and under that another one. There were a lot of levels and places inside the report, and it was a little bit complicated. You have to search all these different places using a mouse, clicking everywhere... each report is coded in a binary file... You cannot search with a text search tool..."
"I have been facing some difficulties when working with large datasets. It seems that when there is a large amount of data, I experience memory errors."
Collibra Catalog is ranked 5th in Metadata Management with 5 reviews while Pentaho Data Integration and Analytics is ranked 16th in Data Integration with 48 reviews. Collibra Catalog is rated 7.8, while Pentaho Data Integration and Analytics is rated 8.0. The top reviewer of Collibra Catalog writes "A user-friendly for those who are not technically inclined and useful for cataloging various reports". On the other hand, the top reviewer of Pentaho Data Integration and Analytics writes "It's flexible and can do almost anything I want it to do". Collibra Catalog is most compared with Informatica Enterprise Data Catalog, Ab Initio Co>Operating System, Talend Data Management Platform, Palantir Foundry and PoolParty Semantic Suite, whereas Pentaho Data Integration and Analytics is most compared with Azure Data Factory, SSIS, Talend Open Studio, Oracle Data Integrator (ODI) and IBM InfoSphere DataStage.
We monitor all Metadata Management reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.