Try our new research platform with insights from 80,000+ expert users

Azure Data Factory vs IBM Cloud Pak for Data comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Dec 19, 2024

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Azure Data Factory
Ranking in Data Integration
1st
Average Rating
8.0
Reviews Sentiment
6.9
Number of Reviews
92
Ranking in other categories
Cloud Data Warehouse (2nd)
IBM Cloud Pak for Data
Ranking in Data Integration
26th
Average Rating
7.8
Reviews Sentiment
6.5
Number of Reviews
13
Ranking in other categories
Data Virtualization (3rd)
 

Mindshare comparison

As of October 2025, in the Data Integration category, the mindshare of Azure Data Factory is 5.2%, down from 11.0% compared to the previous year. The mindshare of IBM Cloud Pak for Data is 1.8%, up from 1.7% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Data Integration Market Share Distribution
ProductMarket Share (%)
Azure Data Factory5.2%
IBM Cloud Pak for Data1.8%
Other93.0%
Data Integration
 

Featured Reviews

KandaswamyMuthukrishnan - PeerSpot reviewer
Integrates diverse data sources and streamlines ETL processes effectively
Regarding potential areas of improvement for Azure Data Factory, there is a need for better data transformation, especially since many people are now depending on DataBricks more for connectivity and data integration. Azure Data Factory should consider how to enhance integration or filtering for more transformations, such as integrating with Spark clusters. I am satisfied with Azure Data Factory so far, but I suggest integrating some AI functionality to analyze data during the transition itself, providing insights such as null records, common records, and duplicates without running a separate pipeline or job. The monitoring tools in Azure Data Factory are helpful for optimizing data pipelines; while the current feature is adequate, they can improve by creating a live dashboard to see the online process, including how much percentage has been completed, which will be very helpful for people who are monitoring the pipeline.
Michelle Leslie - PeerSpot reviewer
Starts strong with data management capabilities but needs a demo database
What I would love to see is an end-to-end, almost a training demo database of some sort, where one of the biggest problems with data management is demonstrated. There are so many components to data management, and more often than not, people understand one thing really well. They may understand DataStage and how to move data around, but they do not see the impact of moving data incorrectly. They also do not see the impact of everyone understanding a piece of data in the same way. I would love Cloud Pak to come with a demo database that illustrates the different components of data management in a logical way, so I can see the whole picture instead of just the area I'm specializing in. It would be great if Cloud Pak, from a data modeling point of view, allowed us to import our PDMs, for example. It would be ideal to import and create business terms in Cloud Pak. The PEA would be great to create the technical data. The association between the business and the technical metadata could then be automated by pulling it through from your ACE models. The data modeling component is available in Cloud Pak. Additionally, when it comes to Cloud Pak, even though it has the NextGen DataStage built into it, there is Cloud Pak for data integration as well. Currently, I do not think we have a full enough understanding of how CP4D and CP4I can enhance each other.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"Data Factory's best feature is the ease of setting up pipelines for data and cloud integrations."
"Azure Data Factory became more user-friendly when data-flows were introduced."
"The security of the agent that is installed on-premises is very good."
"One of the most valuable features of Azure Data Factory is the drag-and-drop interface. This helps with workflow management because we can just drag any tables or data sources we need. Because of how easy it is to drag and drop, we can deliver things very quickly. It's more customizable through visual effect."
"The most valuable feature I have found at Azure Data Factory is the data flow function."
"It's extremely consistent."
"The solution is okay."
"The data copy template is a valuable feature."
"I love the way that I can start at a very basic level with my data management journey by capturing my policies, justifying my data, and putting them into different categories to say this is data relating to individuals, for example, or data relating to geography."
"The most valuable features of IBM Cloud Pak for Data are the Watson Studio, where we can initiate more groups and write code. Additionally, Watson Machine Learning is available with many other services, such as APIs which you can plug the machine learning models."
"What I found most helpful in IBM Cloud Pak for Data is containerization, which means it's easy to shift and leave in terms of moving to other clouds. That's an advantage of IBM Cloud Pak for Data."
"Its data preparation capabilities are highly valuable."
"Cloud Pak's most valuable features are IBM MQ, IBM App Connect, IBM API Connect, and ISPF."
"The most valuable features are data virtualization and reporting."
"Cloud Pak is a very, very, very good system."
"You can model the data there, connect the data models with the business processes and create data lineage processes."
 

Cons

"The setup and configuration process could be simplified."
"The speed and performance need to be improved."
"On the UI side, they could make it a little more intuitive in terms of how to add the radius components. Somebody who has been working with tools like Informatica or DataStage gets very used to how the UI looks and feels."
"Currently, smaller businesses face a disadvantage in terms of pricing, and reducing costs could address this issue."
"Azure Data Factory can improve the transformation features. You have to do a lot of transformation activities. This is something that is just not fully covered. Additionally, the integration could improve for other tools, such as Azure Data Catalog."
"I do not have any notes for improvement."
"We require Azure Data Factory to be able to connect to Google Analytics."
"I would like to see this time travel feature in Snowflake added to Azure Data Factory."
"The interface could improve because sometimes it becomes slow. Sometimes there is a delay between clicks when using the software, which can make the development process slow. It can take a few seconds to complete one action, and then a few more seconds to do the next one."
"There is a solution that is part of IBM Cloud Pak for Data called Watson OpenScale. It is used to monitor the deployed models for the quality and fairness of the results. This is one area that needs a lot of improvement."
"The product must improve its performance."
"Cloud Pak would be improved with integration with cloud service providers like Cloudera."
"The solution's user experience is an area that has room for improvement."
"The technical support could be a little better."
"One challenge I'm facing with IBM Cloud Pak for Data is native features have been decommissioned, such as XML input and output. Too many changes have been made, and my company has around one hundred thousand mappings, so my team has been putting more effort into alternative ways to do things. Another area for improvement in IBM Cloud Pak for Data is that it's more complicated to shift from on-premise to the cloud. Other vendors provide secure agents that easily connect with your existing setup. Still, with IBM Cloud Pak for Data, you have to perform connection migration steps, upgrade to the latest version, etc., which makes it more complicated, especially as my company has XML-based mappings. Still, the XML input and output capabilities of IBM Cloud Pak for Data have been discontinued, so I'd like IBM to bring that back."
"One thing that bugs me is how much infrastructure Cloud Pak requires for the initial deployment. It doesn't allow you to start small. The smallest permitted deployment is too big. It's a huge problem that prevents us from implementing the solution in many scenarios."
 

Pricing and Cost Advice

"My company is on a monthly subscription for Azure Data Factory, but it's more of a pay-as-you-go model where your monthly invoice depends on how many resources you use. On a scale of one to five, pricing for Azure Data Factory is a four. It's just the usage fees my company pays monthly."
"While I can't specify the actual cost, I believe it is reasonably priced and comparable to similar products."
"Pricing is comparable, it's somewhere in the middle."
"For our use case, it is not expensive. We take into the picture everything: resources, learning curve, and maintenance."
"The solution's pricing is competitive."
"The licensing model for Azure Data Factory is good because you won't have to overpay. Pricing-wise, the solution is a five out of ten. It was not expensive, and it was not cheap."
"There's no licensing for Azure Data Factory, they have a consumption payment model. How often you are running the service and how long that service takes to run. The price can be approximately $500 to $1,000 per month but depends on the scaling."
"I would rate Data Factory's pricing nine out of ten."
"Cloud Pak's cost is a little high."
"I don't have the exact licensing cost for IBM Cloud Pak for Data, as my company is still finalizing requirements, including monthly, yearly, and three-year licensing fees. Still, on a scale of one to five, I'd rate it a three because, compared to other vendors, it's more complicated."
"The solution is expensive."
"The solution's pricing is competitive with that of other vendors."
"I think that this product is too expensive for smaller companies."
"It's quite expensive."
"IBM Cloud Pak for Data is expensive. If we include the training time and the machine learning, it's expensive. The cost of the execution is more reasonable."
"For the licensing of the solution, there is a yearly payment that needs to be made. Also, since it is expensive, cost-wise, I rate the solution an eight or nine out of ten."
report
Use our free recommendation engine to learn which Data Integration solutions are best for your needs.
871,358 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
13%
Computer Software Company
12%
Manufacturing Company
9%
Government
7%
Financial Services Firm
28%
Manufacturing Company
10%
Computer Software Company
9%
Government
6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
By reviewers
Company SizeCount
Small Business31
Midsize Enterprise19
Large Enterprise55
By reviewers
Company SizeCount
Small Business7
Large Enterprise8
 

Questions from the Community

How do you select the right cloud ETL tool?
AWS Glue and Azure Data factory for ELT best performance cloud services.
How does Azure Data Factory compare with Informatica PowerCenter?
Azure Data Factory is flexible, modular, and works well. In terms of cost, it is not too pricey. It offers the stability and reliability I am looking for, good scalability, and is easy to set up an...
How does Azure Data Factory compare with Informatica Cloud Data Integration?
Azure Data Factory is a solid product offering many transformation functions; It has pre-load and post-load transformations, allowing users to apply transformations either in code by using Power Q...
What is your experience regarding pricing and costs for IBM Cloud Pak for Data?
The setup cost is very expensive. The cost depends on the pieces of the solution I'm using, how much data I have, and whether it's on the cloud or on-prem.
What needs improvement with IBM Cloud Pak for Data?
What I would love to see is an end-to-end, almost a training demo database of some sort, where one of the biggest problems with data management is demonstrated. There are so many components to data...
What is your primary use case for IBM Cloud Pak for Data?
My primary use case for Cloud Pak is that I am the reference Data steward for the Africa regions in the banks where I work. My main objective is to capture the reference data in Caltech or Data and...
 

Also Known As

No data available
Cloud Pak for Data
 

Overview

 

Sample Customers

1. Adobe 2. BMW 3. Coca-Cola 4. General Electric 5. Johnson & Johnson 6. LinkedIn 7. Mastercard 8. Nestle 9. Pfizer 10. Samsung 11. Siemens 12. Toyota 13. Unilever 14. Verizon 15. Walmart 16. Accenture 17. American Express 18. AT&T 19. Bank of America 20. Cisco 21. Deloitte 22. ExxonMobil 23. Ford 24. General Motors 25. IBM 26. JPMorgan Chase 27. Microsoft (Azure Data Factory is developed by Microsoft) 28. Oracle 29. Procter & Gamble 30. Salesforce 31. Shell 32. Visa
Qatar Development Bank, GuideWell, Skanderborg Music Festival
Find out what your peers are saying about Azure Data Factory vs. IBM Cloud Pak for Data and other solutions. Updated: September 2025.
871,358 professionals have used our research since 2012.