Try our new research platform with insights from 80,000+ expert users

Collibra Lineage vs SAP Data Hub comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Nov 3, 2024

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Collibra Lineage
Ranking in Data Governance
10th
Average Rating
8.0
Reviews Sentiment
6.7
Number of Reviews
10
Ranking in other categories
No ranking in other categories
SAP Data Hub
Ranking in Data Governance
30th
Average Rating
7.6
Reviews Sentiment
6.8
Number of Reviews
3
Ranking in other categories
Metadata Management (12th)
 

Mindshare comparison

As of September 2025, in the Data Governance category, the mindshare of Collibra Lineage is 2.7%, up from 2.7% compared to the previous year. The mindshare of SAP Data Hub is 1.1%, down from 1.1% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Data Governance Market Share Distribution
ProductMarket Share (%)
Collibra Lineage2.7%
SAP Data Hub1.1%
Other96.2%
Data Governance
 

Featured Reviews

Saikat Ghosh - PeerSpot reviewer
Provides an end-to-end perspective of data usage and assists in identifying sensitive data across the enterprise
I use the Collibra Lineage visualization features. The solution helps me understand data flows effectively. Collibra Lineage impacts my data analysis by helping to identify where data is used, making it significantly useful. The solution facilitates compliance by defining where sensitive data is…
VM
The solution is seamless, but the database sometimes leads to confusion
We used to have multiple different kinds of databases, which internally, had different compliance levels. Retention management is very different now. If the policy is live and the claim has been completed, I couldn't archive the claim. I needed to keep a reference integrity of that claim and understand which policy paid out the claim. With this solution, the policy came in six months ago and qualified for archiving. The claim had been paid and in every environment, the claim had been closed, including the reporting system, the claims system, etc. With the payment set gateway, I can just go and archive. But, we had a hard time during this process. I rate the overall solution a seven out of ten.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"One of the best features of Collibra Lineage is that it provides an end-to-end perspective of where data is used in the enterprise."
"It shows how the data was transformed on the way and how the solutions were derived. This is the essence of the tool and why it is so powerful."
"The most valuable features of Collibra Lineage are the multiple options for automation. Additionally, if you have a scenario where automation does not work, you have the option to manually create the lineage. The solution does not force you to always go through an automated way where you have broken lineage and you are stuck without a lineage."
"The diagrams are good for data visualization."
"The solution makes data more discoverable and transparent."
"The integration with other Collibra Lineage products is valuable in our data governance initiatives."
"The best features of Collibra Lineage include its ability to map data flows and provide clear visualizations."
"The solution has very good online assistance."
"SAP is one of the most seamless ERPs that have integrated SAP archiving within Excel. I have not seen this with any other database."
"The most valuable feature is the S/4HANA 1909 On-Premise"
"Its connection to on-premise products is the most valuable. We mostly use the on-premise connection, which is seamless. This is what we prefer in this solution over other solutions. We are using it the most for the orchestration where the data is coming from different categories. Its other features are very much similar to what they are giving us in open source. Their push-down approach is the most advantageous, where they push most of the processing on to the same data source. This means that they have a serverless kind of thing, and they don't process the data inside a product such as Data Hub. They process the data from where the data is coming out. If it is coming from HANA, to capture the data or process it for analytics, orchestration, or management, they go to the HANA database and give it out. They don't process it on Data Hub. This push-down approach increases the processing speed a little bit because the data is processed where it is sitting. That's the best part and an advantage. I have used another product where they used to capture the data first and then they used to process it and give it. In Data Hub, it is in reverse. They process it first and give it, and then they put their own manipulations. They lead in terms of business functions. No other solution has business functions already implemented to perform business analysis. They have a lot of prebuilt business functions for machine learning and orchestration, which we can use directly to get an analysis out from the existing data. Most of the data is sitting as enterprise data there. That's a major advantage that they have."
 

Cons

"Collibra doesn't provide support for integration for SAP Info Steward directly with Collibra."
"The product must be cheaper."
"Before you can use this solution, you have to make a lot of changes to how you measure your information in your organization, the data governance and policies, and the organizational structures that use them. Many companies believe that if they purchase the licenses for the tool that they have done the job, but this is far from the case with this tool."
"It would be better if features like the privacy module and AI-based Collibra DQ were included in one package instead of being separate and requiring additional costs. Currently, the basic package covers lineage and documentation, but the extra modules come at an extra cost."
"The workflow documentation must be improved."
"There is room for improvement in Collibra Lineage as it could incorporate automation features which would allow it to find data lineage across a data stack independently."
"Collibra Lineage could improve connectively. If they can offer more out-of-the-box connectors, which come as in a package compared to you having to go and buy a lot in the marketplace for certain specific connectors."
"If a developer implements ETL in Glue or PySpark, Collibra cannot capture lineage out of that."
"In 2018, connecting it to outside sources, such as IoT products or IoT-enabled big data Hadoop, was a little complex. It was not smooth at the beginning. It was unstable. It took a lot of time for the initial data load. Sometimes, the connection broke, and we had to restart the process, which was a major issue, but they might have improved it now. It is very smooth with SAP HANA on-premise system, SAP Cloud Platform, and SAP Analytics Cloud. It could be because these are their own products, and they know how to integrate them. With Hadoop, they might have used open-source technologies, and that's why it was breaking at that time. They are providing less embedded integration because they want us to use their other products. For example, they don't want to go and remove SAP Analytics Cloud and put everything in Data Hub. They want us to use SAP Analytics Cloud somewhere else and not inside the Data Hub. On the integration part, it lacks real-time analytics, and it is slow. They should embed the SAP Analytics Cloud inside Data Hub or support some kind of analysis. They do provide some analysis, but it is not extensive. They are moreover open source. So, we need a lot of developers or data scientists to go in and implement Python algorithms. It would be better if they can provide their own existing algorithms and give some connections and drop-down menus to go and just configure those. It will make things really quick by increasing the embedded integrations. It will also improve the process efficiency and processing power. Its performance needs improvement. It is a little slow. It is not the best in the market, and there are other products that are much better than this. In terms of technology and performance, it is a little slow as compared to Microsoft and other data orchestration products. I haven't used other products, but I have read about those products, their settings, and the milliseconds that they do. In Azure Purview, they say that they can copy, manage, or transform the data within milliseconds. They say that they can transform 100 gigabytes of data within three to five seconds, which is something SAP cannot do. It generally takes a lot of time to process that much amount of data. However, I have never tested out Azure."
"The company has everything offshore."
"Nowadays there are some inconsistencies in data bases, however, they upgrade and release the versions to market."
 

Pricing and Cost Advice

"It's expensive, especially for small companies."
"There are different licenses available for Collibra Lineage based on what kind of access you want. If you want, for example, a read-only license they have different ones to pick from."
"The Cloud is very expensive, but SAP HANA previous service is okay."
report
Use our free recommendation engine to learn which Data Governance solutions are best for your needs.
867,370 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
19%
Insurance Company
14%
Computer Software Company
9%
Manufacturing Company
7%
Manufacturing Company
16%
Financial Services Firm
13%
Computer Software Company
10%
Government
10%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
By reviewers
Company SizeCount
Small Business3
Midsize Enterprise1
Large Enterprise7
No data available
 

Questions from the Community

What do you like most about Collibra Lineage?
The solution makes data more discoverable and transparent.
What needs improvement with Collibra Lineage?
There is room for improvement in Collibra Lineage as it could incorporate automation features which would allow it to find data lineage across a data stack independently.
What do you like most about SAP Data Hub?
SAP is one of the most seamless ERPs that have integrated SAP archiving within Excel. I have not seen this with any other database.
What needs improvement with SAP Data Hub?
We moved from Oracle. If you're aware of your monitoring system, the RPU market, and the managed system, you should move to HANA, which is an innovative database built by SAP itself. However, this ...
What is your primary use case for SAP Data Hub?
I technically handle the database, like cycle management projects. When transaction data comes in, we see it based on the retention periods. We have to move the data to some secure storage rather t...
 

Overview

 

Sample Customers

AXA XL, DNB, Adobe, PMI, Holland America Line, UC Davis Health, Cox Automotive
Kaeser Kompressoren, HARTMANN
Find out what your peers are saying about Collibra Lineage vs. SAP Data Hub and other solutions. Updated: July 2025.
867,370 professionals have used our research since 2012.