We performed a comparison between Informatica Enterprise Data Catalog and Pentaho Data Integration and Analytics based on real PeerSpot user reviews.
Find out in this report how the two Metadata Management solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI."Multifeatured and easily scalable data catalog, with good data domain discovery and data profiling features."
"It can automatically connect or associate business terms with various options, providing flexibility beyond general capabilities."
"I rate the technical support a ten out of ten."
"The way that the solution scans is very useful."
"We can scan anything."
"The most valuable feature is its ability to extract metadata from various sources- be it an old SaaS application or the latest cloud application."
"The solution scales well."
"I like EDC's self-service capabilities. You can put the catalog on the intranet inside the organization, so users can search for something. People in the research world have specialized systems, and you might find data from various places that sound similar."
"It makes it pretty simple to do some fairly complicated things. Both I and some of our other BI developers have made stabs at using, for example, SQL Server Integration Services, and we found them a little bit frustrating compared to Data Integration. So, its ease of use is right up there."
"I can use Python, which is open-source, and I can run other scripts, including Linux scripts. It's user-friendly for running any object-based language. That's a very important feature because we live in a world of open-source."
"Pentaho Data Integration is quite simple to learn, and there is a lot of information available online."
"It has improved our data integration capabilities."
"The fact that it enables us to leverage metadata to automate data pipeline templates and reuse them is definitely one of the features that we like the best. The metadata injection is helpful because it reduces the need to create and maintain additional ETLs. If we didn't have that feature, we would have lots of duplicated ETLs that we would have to create and maintain. The data pipeline templates have definitely been helpful when looking at productivity and costs."
"The fact that it's a low-code solution is valuable. It's good for more junior people who may not be as experienced with programming."
"I can create faster instructions than writing with SQL or code. Also, I am able to do some background control of the data process with this tool. Therefore, I use it as an ELT tool. I have a station area where I can work with all the information that I have in my production databases, then I can work with the data that I created."
"We also haven't had to create any custom Java code. Almost everywhere it's SQL, so it's done in the pipeline and the configuration. That means you can offload the work to people who, while they are not less experienced, are less technical when it comes to logic."
"The UX and UI of the solution are areas with certain shortcomings where improvements can be made in the future."
"The model is somewhat flexible. There are certain aspects of the model that are not as flexible as we would like. It doesn't do certain things to a great level of depth. So, in situations where we want to drill in to do something specific, we have to essentially copy that data into our own structures in order to add that additional layer of flexibility."
"IEDC can improve the comparison of lineages."
"The solution is quite expensive."
"They have to improve their relationship discovery tool. They say that they have AI inside, but this AI did not automatically find relationships or suggested relationships between entities."
"It is more complicated to extract data using the product compared to Visio. The system could display the details on the screen."
"This solution is hard to set up and its interface is not user-friendly. It's also not as stable, and the technical support takes a lot of time to solve simple problems."
"Informatica Enterprise Data Catalog could improve by having a much better user interface. It is not user-friendly."
"I would like to see more improvements with AS400 DB2."
"If you're working with a larger data set, I'm not so sure it would be the best solution. The larger things got the slower it was."
"As far as I remember, not all connectors worked very well. They can add more connectors and more drivers to the process to integrate with more flows."
"I have been facing some difficulties when working with large datasets. It seems that when there is a large amount of data, I experience memory errors."
"Some of the scheduling features about Lumada drive me buggy. The one issue that always drives me up the wall is when Daylight Savings Time changes. It doesn't take that into account elegantly. Every time it changes, I have to do something. It's not a big deal, but it's annoying."
"Although it is a low-code solution with a graphical interface, often the error messages that you get are of the type that a developer would be happy with. You get a big stack of red text and Java errors displayed on the screen, and less technical people can get intimidated by that. It can be a bit intimidating to get a wall of red error messages displayed. Other graphical tools that are focused at the power user level provide a much more user-friendly experience in dealing with your exceptions and guiding the user into where they've made the mistake."
"It could be better integrated with programming languages, like Python and R. Right now, if I want to run a Python code on one of my ETLs, it is a bit difficult to do. It would be great if we have some modules where we could code directly in a Python language. We don't really have a way to run Python code natively."
"I would like to see support for some additional cloud sources. It doesn't support Azure, for example. I was trying to do a PoC with Azure the other day but it seems they don't support it."
More Informatica Enterprise Data Catalog Pricing and Cost Advice →
More Pentaho Data Integration and Analytics Pricing and Cost Advice →
Informatica Enterprise Data Catalog is ranked 1st in Metadata Management with 13 reviews while Pentaho Data Integration and Analytics is ranked 16th in Data Integration with 48 reviews. Informatica Enterprise Data Catalog is rated 7.6, while Pentaho Data Integration and Analytics is rated 8.0. The top reviewer of Informatica Enterprise Data Catalog writes "Great metadata management with more visibility and great technical support". On the other hand, the top reviewer of Pentaho Data Integration and Analytics writes "It's flexible and can do almost anything I want it to do". Informatica Enterprise Data Catalog is most compared with Alation Data Catalog, Collibra Catalog, AWS Glue, Informatica PowerCenter and Denodo, whereas Pentaho Data Integration and Analytics is most compared with Azure Data Factory, SSIS, Talend Open Studio, Oracle Data Integrator (ODI) and AWS Glue. See our Informatica Enterprise Data Catalog vs. Pentaho Data Integration and Analytics report.
We monitor all Metadata Management reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.