Pentaho Data Catalog vs SAP Data Hub comparison

Cancel
You must select at least 2 products to compare!
Hitachi Vantara Logo
258 views|107 comparisons
100% willing to recommend
SAP Logo
293 views|258 comparisons
66% willing to recommend
Comparison Buyer's Guide
Executive Summary

We performed a comparison between Pentaho Data Catalog and SAP Data Hub based on real PeerSpot user reviews.

Find out what your peers are saying about Informatica, Alation, SAP and others in Metadata Management.
To learn more, read our detailed Metadata Management Report (Updated: April 2024).
768,857 professionals have used our research since 2012.
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"The ability to easily and quickly ingest new data sources is the most valuable feature... I'm not an especially technical IT person, but my data governance lead and I are able to ingest the data, quickly profile it, and do data identification and tagging."

More Pentaho Data Catalog Pros →

"Its connection to on-premise products is the most valuable. We mostly use the on-premise connection, which is seamless. This is what we prefer in this solution over other solutions. We are using it the most for the orchestration where the data is coming from different categories. Its other features are very much similar to what they are giving us in open source. Their push-down approach is the most advantageous, where they push most of the processing on to the same data source. This means that they have a serverless kind of thing, and they don't process the data inside a product such as Data Hub. They process the data from where the data is coming out. If it is coming from HANA, to capture the data or process it for analytics, orchestration, or management, they go to the HANA database and give it out. They don't process it on Data Hub. This push-down approach increases the processing speed a little bit because the data is processed where it is sitting. That's the best part and an advantage. I have used another product where they used to capture the data first and then they used to process it and give it. In Data Hub, it is in reverse. They process it first and give it, and then they put their own manipulations. They lead in terms of business functions. No other solution has business functions already implemented to perform business analysis. They have a lot of prebuilt business functions for machine learning and orchestration, which we can use directly to get an analysis out from the existing data. Most of the data is sitting as enterprise data there. That's a major advantage that they have.""SAP is one of the most seamless ERPs that have integrated SAP archiving within Excel. I have not seen this with any other database.""The most valuable feature is the S/4HANA 1909 On-Premise"

More SAP Data Hub Pros →

Cons
"We've tagged a lot of fields that are related to specific processes... What would be helpful is a place, inside Lumada Data Catalog, where you can describe the tags that you're using. Otherwise, anybody coming into the system, or seeing the tag from the outside in one of the reports, is going to say, "What is that tag really referring to?" and has to know where my spreadsheet is."

More Pentaho Data Catalog Cons →

"The company has everything offshore.""Nowadays there are some inconsistencies in data bases, however, they upgrade and release the versions to market.""In 2018, connecting it to outside sources, such as IoT products or IoT-enabled big data Hadoop, was a little complex. It was not smooth at the beginning. It was unstable. It took a lot of time for the initial data load. Sometimes, the connection broke, and we had to restart the process, which was a major issue, but they might have improved it now. It is very smooth with SAP HANA on-premise system, SAP Cloud Platform, and SAP Analytics Cloud. It could be because these are their own products, and they know how to integrate them. With Hadoop, they might have used open-source technologies, and that's why it was breaking at that time. They are providing less embedded integration because they want us to use their other products. For example, they don't want to go and remove SAP Analytics Cloud and put everything in Data Hub. They want us to use SAP Analytics Cloud somewhere else and not inside the Data Hub. On the integration part, it lacks real-time analytics, and it is slow. They should embed the SAP Analytics Cloud inside Data Hub or support some kind of analysis. They do provide some analysis, but it is not extensive. They are moreover open source. So, we need a lot of developers or data scientists to go in and implement Python algorithms. It would be better if they can provide their own existing algorithms and give some connections and drop-down menus to go and just configure those. It will make things really quick by increasing the embedded integrations. It will also improve the process efficiency and processing power. Its performance needs improvement. It is a little slow. It is not the best in the market, and there are other products that are much better than this. In terms of technology and performance, it is a little slow as compared to Microsoft and other data orchestration products. I haven't used other products, but I have read about those products, their settings, and the milliseconds that they do. In Azure Purview, they say that they can copy, manage, or transform the data within milliseconds. They say that they can transform 100 gigabytes of data within three to five seconds, which is something SAP cannot do. It generally takes a lot of time to process that much amount of data. However, I have never tested out Azure."

More SAP Data Hub Cons →

Pricing and Cost Advice
  • "We can afford it. We got a three-year contract... If it were to go up and price a lot, I don't know if I would be able to keep it."
  • More Pentaho Data Catalog Pricing and Cost Advice →

  • "The Cloud is very expensive, but SAP HANA previous service is okay."
  • More SAP Data Hub Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which Metadata Management solutions are best for your needs.
    768,857 professionals have used our research since 2012.
    Questions from the Community
    Top Answer:The ability to easily and quickly ingest new data sources is the most valuable feature... I'm not an especially technical IT person, but my data governance lead and I are able to ingest the data… more »
    Top Answer:We can afford it. We got a three-year contract. I'm hoping that when our contract expires that it is still going to be reasonable enough for us to afford. One of the ways I pitched it is that I told… more »
    Top Answer:As I've said, we've tagged a lot of fields that are related to specific processes, like the driller's log or, for example, if you want to get a license to be a well driller. Now, what I'm having to do… more »
    Top Answer:SAP is one of the most seamless ERPs that have integrated SAP archiving within Excel. I have not seen this with any other database.
    Top Answer:We moved from Oracle. If you're aware of your monitoring system, the RPU market, and the managed system, you should move to HANA, which is an innovative database built by SAP itself. However, this… more »
    Top Answer:I technically handle the database, like cycle management projects. When transaction data comes in, we see it based on the retention periods. We have to move the data to some secure storage rather than… more »
    Ranking
    7th
    out of 27 in Metadata Management
    Views
    258
    Comparisons
    107
    Reviews
    1
    Average Words per Review
    2,400
    Rating
    9.0
    11th
    out of 27 in Metadata Management
    Views
    293
    Comparisons
    258
    Reviews
    1
    Average Words per Review
    469
    Rating
    7.0
    Comparisons
    Also Known As
    Hitachi Lumada DataOps - Data Catalog
    Learn More
    Hitachi Vantara
    Video Not Available
    Overview

    Data intelligence delivered across all structured and unstructured data. A data culture fostered by trusted & actionable data with observability, lineage, quality and reliability.

    The SAP® Data Hub solution enables sophisticated data operations management. It gives you the capability and flexibility to connect enterprise data and Big Data and gain a deep understanding of data and information processes across sources and systems throughout the distributed landscape. The unified solution provides visibility and control into data opportunities, integrating cloud and on-premise information and driving data agility and business value. Distributed processing power enables greater speed and efficiency.

    Sample Customers
    Information Not Available
    Kaeser Kompressoren, HARTMANN
    Top Industries
    No Data Available
    VISITORS READING REVIEWS
    Computer Software Company15%
    Manufacturing Company12%
    Financial Services Firm12%
    Government7%
    Company Size
    No Data Available
    VISITORS READING REVIEWS
    Small Business15%
    Midsize Enterprise12%
    Large Enterprise74%
    Buyer's Guide
    Metadata Management
    April 2024
    Find out what your peers are saying about Informatica, Alation, SAP and others in Metadata Management. Updated: April 2024.
    768,857 professionals have used our research since 2012.

    Pentaho Data Catalog is ranked 7th in Metadata Management with 1 review while SAP Data Hub is ranked 11th in Metadata Management with 3 reviews. Pentaho Data Catalog is rated 9.0, while SAP Data Hub is rated 7.6. The top reviewer of Pentaho Data Catalog writes "Helps make metadata available from our transactional databases, data warehouse, document management system, and GIS". On the other hand, the top reviewer of SAP Data Hub writes "The solution is seamless, but the database sometimes leads to confusion". Pentaho Data Catalog is most compared with Informatica Enterprise Data Catalog and IBM Watson Knowledge Catalog, whereas SAP Data Hub is most compared with Microsoft Purview, SAP Data Services, Alation Data Catalog, Azure Data Factory and Palantir Foundry.

    See our list of best Metadata Management vendors.

    We monitor all Metadata Management reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.