Try our new research platform with insights from 80,000+ expert users

Azure Data Factory vs SAP Data Hub comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Azure Data Factory
Average Rating
8.0
Reviews Sentiment
6.9
Number of Reviews
92
Ranking in other categories
Data Integration (1st), Cloud Data Warehouse (2nd)
SAP Data Hub
Average Rating
7.6
Reviews Sentiment
6.8
Number of Reviews
3
Ranking in other categories
Data Governance (30th), Metadata Management (12th)
 

Mindshare comparison

Azure Data Factory and SAP Data Hub aren’t in the same category and serve different purposes. Azure Data Factory is designed for Data Integration and holds a mindshare of 5.6%, down 11.6% compared to last year.
SAP Data Hub, on the other hand, focuses on Data Governance, holds 1.1% mindshare, down 1.1% since last year.
Data Integration Market Share Distribution
ProductMarket Share (%)
Azure Data Factory5.6%
Informatica PowerCenter6.3%
SSIS5.9%
Other82.2%
Data Integration
Data Governance Market Share Distribution
ProductMarket Share (%)
SAP Data Hub1.1%
Microsoft Purview Data Governance21.6%
Varonis Platform10.8%
Other66.5%
Data Governance
 

Featured Reviews

KandaswamyMuthukrishnan - PeerSpot reviewer
Integrates diverse data sources and streamlines ETL processes effectively
Regarding potential areas of improvement for Azure Data Factory, there is a need for better data transformation, especially since many people are now depending on DataBricks more for connectivity and data integration. Azure Data Factory should consider how to enhance integration or filtering for more transformations, such as integrating with Spark clusters. I am satisfied with Azure Data Factory so far, but I suggest integrating some AI functionality to analyze data during the transition itself, providing insights such as null records, common records, and duplicates without running a separate pipeline or job. The monitoring tools in Azure Data Factory are helpful for optimizing data pipelines; while the current feature is adequate, they can improve by creating a live dashboard to see the online process, including how much percentage has been completed, which will be very helpful for people who are monitoring the pipeline.
VM
The solution is seamless, but the database sometimes leads to confusion
We used to have multiple different kinds of databases, which internally, had different compliance levels. Retention management is very different now. If the policy is live and the claim has been completed, I couldn't archive the claim. I needed to keep a reference integrity of that claim and understand which policy paid out the claim. With this solution, the policy came in six months ago and qualified for archiving. The claim had been paid and in every environment, the claim had been closed, including the reporting system, the claims system, etc. With the payment set gateway, I can just go and archive. But, we had a hard time during this process. I rate the overall solution a seven out of ten.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"I enjoy the ease of use for the backend JSON generator, the deployment solution, and the template management."
"The tool's most valuable features are its connectors. It has many out-of-the-box connectors. We use ADF for ETL processes. Our main use case involves integrating data from various databases, processing it, and loading it into the target database. ADF plays a crucial role in orchestrating these ETL workflows."
"The most valuable feature of Azure Data Factory is the core features that help you through the whole Azure pipeline or value chain."
"From what we have seen so far, the solution seems very stable."
"The most valuable feature of this solution would be ease of use."
"Data Factory's best features are connectivity with different tools and focusing data ingestion using pipeline copy data."
"From my experience so far, the best feature is the ability to copy data to any environment. We have 100 connects and we can connect them to the system and copy the data from its respective system to any environment. That is the best feature."
"I find that the solution integrates well with cloud technologies, which we are using for different clouds like Snowflake and AWS"
"SAP is one of the most seamless ERPs that have integrated SAP archiving within Excel. I have not seen this with any other database."
"The most valuable feature is the S/4HANA 1909 On-Premise"
"Its connection to on-premise products is the most valuable. We mostly use the on-premise connection, which is seamless. This is what we prefer in this solution over other solutions. We are using it the most for the orchestration where the data is coming from different categories. Its other features are very much similar to what they are giving us in open source. Their push-down approach is the most advantageous, where they push most of the processing on to the same data source. This means that they have a serverless kind of thing, and they don't process the data inside a product such as Data Hub. They process the data from where the data is coming out. If it is coming from HANA, to capture the data or process it for analytics, orchestration, or management, they go to the HANA database and give it out. They don't process it on Data Hub. This push-down approach increases the processing speed a little bit because the data is processed where it is sitting. That's the best part and an advantage. I have used another product where they used to capture the data first and then they used to process it and give it. In Data Hub, it is in reverse. They process it first and give it, and then they put their own manipulations. They lead in terms of business functions. No other solution has business functions already implemented to perform business analysis. They have a lot of prebuilt business functions for machine learning and orchestration, which we can use directly to get an analysis out from the existing data. Most of the data is sitting as enterprise data there. That's a major advantage that they have."
 

Cons

"There is always room to improve. There should be good examples of use that, of course, customers aren't always willing to share. It is Catch-22. It would help the user base if everybody had really good examples of deployments that worked, but when you ask people to put out their good deployments, which also includes me, you usually got, "No, I'm not going to do that." They don't have enough good examples. Microsoft probably just needs to pay one of their partners to build 20 or 30 examples of functional Data Factories and then share them as a user base."
"Areas for improvement in Azure Data Factory include connectivity and integration. When you use integration runtime, whenever there's a failure, the backup process in Azure Data Factory takes time, so this is another area for improvement."
"When working with AWS, we have noticed that the difference between ADF and AWS is that AWS is more customer-focused. They're more responsive compared to any other company. ADF is not as good as AWS, but it should be. If AWS is ten out of ten, ADF is around eight out of ten. I think AWS is easier to understand from the GUI perspective compared to ADF."
"There are limitations when processing more than one GD file."
"The tool’s workflow is not user-friendly. It should also improve its orchestration monitoring."
"Azure Data Factory's pricing in terms of utilization could be improved."
"If the user interface was more user friendly and there was better error feedback, it would be helpful."
"It's a good idea to take a Microsoft course. Because they are really helpful when you start from your journey with Data Factory."
"The company has everything offshore."
"In 2018, connecting it to outside sources, such as IoT products or IoT-enabled big data Hadoop, was a little complex. It was not smooth at the beginning. It was unstable. It took a lot of time for the initial data load. Sometimes, the connection broke, and we had to restart the process, which was a major issue, but they might have improved it now. It is very smooth with SAP HANA on-premise system, SAP Cloud Platform, and SAP Analytics Cloud. It could be because these are their own products, and they know how to integrate them. With Hadoop, they might have used open-source technologies, and that's why it was breaking at that time. They are providing less embedded integration because they want us to use their other products. For example, they don't want to go and remove SAP Analytics Cloud and put everything in Data Hub. They want us to use SAP Analytics Cloud somewhere else and not inside the Data Hub. On the integration part, it lacks real-time analytics, and it is slow. They should embed the SAP Analytics Cloud inside Data Hub or support some kind of analysis. They do provide some analysis, but it is not extensive. They are moreover open source. So, we need a lot of developers or data scientists to go in and implement Python algorithms. It would be better if they can provide their own existing algorithms and give some connections and drop-down menus to go and just configure those. It will make things really quick by increasing the embedded integrations. It will also improve the process efficiency and processing power. Its performance needs improvement. It is a little slow. It is not the best in the market, and there are other products that are much better than this. In terms of technology and performance, it is a little slow as compared to Microsoft and other data orchestration products. I haven't used other products, but I have read about those products, their settings, and the milliseconds that they do. In Azure Purview, they say that they can copy, manage, or transform the data within milliseconds. They say that they can transform 100 gigabytes of data within three to five seconds, which is something SAP cannot do. It generally takes a lot of time to process that much amount of data. However, I have never tested out Azure."
"Nowadays there are some inconsistencies in data bases, however, they upgrade and release the versions to market."
 

Pricing and Cost Advice

"The solution's fees are based on a pay-per-minute use plus the amount of data required to process."
"Azure products generally offer competitive pricing, suitable for diverse budget considerations."
"The pricing model is based on usage and is not cheap."
"The pricing is pay-as-you-go or reserve instance. Of the two options, reserve instance is much cheaper."
"The solution's pricing is competitive."
"The price you pay is determined by how much you use it."
"The licensing is a pay-as-you-go model, where you pay for what you consume."
"The licensing model for Azure Data Factory is good because you won't have to overpay. Pricing-wise, the solution is a five out of ten. It was not expensive, and it was not cheap."
"The Cloud is very expensive, but SAP HANA previous service is okay."
report
Use our free recommendation engine to learn which Data Integration solutions are best for your needs.
867,497 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
13%
Computer Software Company
12%
Manufacturing Company
9%
Government
7%
Manufacturing Company
16%
Financial Services Firm
13%
Computer Software Company
10%
Government
10%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
By reviewers
Company SizeCount
Small Business31
Midsize Enterprise19
Large Enterprise55
No data available
 

Questions from the Community

How do you select the right cloud ETL tool?
AWS Glue and Azure Data factory for ELT best performance cloud services.
How does Azure Data Factory compare with Informatica PowerCenter?
Azure Data Factory is flexible, modular, and works well. In terms of cost, it is not too pricey. It offers the stability and reliability I am looking for, good scalability, and is easy to set up an...
How does Azure Data Factory compare with Informatica Cloud Data Integration?
Azure Data Factory is a solid product offering many transformation functions; It has pre-load and post-load transformations, allowing users to apply transformations either in code by using Power Q...
What do you like most about SAP Data Hub?
SAP is one of the most seamless ERPs that have integrated SAP archiving within Excel. I have not seen this with any other database.
What needs improvement with SAP Data Hub?
We moved from Oracle. If you're aware of your monitoring system, the RPU market, and the managed system, you should move to HANA, which is an innovative database built by SAP itself. However, this ...
What is your primary use case for SAP Data Hub?
I technically handle the database, like cycle management projects. When transaction data comes in, we see it based on the retention periods. We have to move the data to some secure storage rather t...
 

Overview

 

Sample Customers

1. Adobe 2. BMW 3. Coca-Cola 4. General Electric 5. Johnson & Johnson 6. LinkedIn 7. Mastercard 8. Nestle 9. Pfizer 10. Samsung 11. Siemens 12. Toyota 13. Unilever 14. Verizon 15. Walmart 16. Accenture 17. American Express 18. AT&T 19. Bank of America 20. Cisco 21. Deloitte 22. ExxonMobil 23. Ford 24. General Motors 25. IBM 26. JPMorgan Chase 27. Microsoft (Azure Data Factory is developed by Microsoft) 28. Oracle 29. Procter & Gamble 30. Salesforce 31. Shell 32. Visa
Kaeser Kompressoren, HARTMANN
Find out what your peers are saying about Microsoft, Informatica, Talend and others in Data Integration. Updated: August 2025.
867,497 professionals have used our research since 2012.