No more typing reviews! Try our Samantha, our new voice AI agent.
Michelle Leslie - PeerSpot reviewer
Data asset management engineer at a tech services company with 1-10 employees
Real User
Top 5
Jan 26, 2025
Starts strong with data management capabilities but needs a demo database
Pros and Cons
  • "I love the way that I can start at a very basic level with my data management journey by capturing my policies, justifying my data, and putting them into different categories to say this is data relating to individuals, for example, or data relating to geography."
  • "Cloud Pak is a very, very, very good system."
  • "What I would love to see is an end-to-end, almost a training demo database of some sort, where one of the biggest problems with data management is demonstrated."
  • "The setup cost is very expensive. The cost depends on the pieces of the solution I'm using, how much data I have, and whether it's on the cloud or on-prem."

What is our primary use case?

My primary use case for Cloud Pak is that I am the reference Data steward for the Africa regions in the banks where I work. My main objective is to capture the reference data in Caltech or Data and ensure that people profile or QA their data. 

This is due to the fact that a large percentage of data is actually reference data, not by volume, but by the number of tables. The group-approved reference data is used to assure quality and ensure people know what they have; that's my primary use case for Cloud Pak.

What is most valuable?

There's a whole bunch of stuff I really like. I love the way that I can start at a very basic level with my data management journey by capturing my policies, justifying my data, and putting them into different categories to say this is data relating to individuals, for example, or data relating to geography. Those base-level data management components, together with the reference data, can then be reused whether I want to figure out where the data is coming from—using Nantucket, for example—or checking the quality of my data. 

Often, when I check the quality of my data, I might find an issue, but that data did not originate in the system where I found the issue. So, I need to use Nantucket to track back to where that data originally came from so I can fix it at the source. I love that component of Cloud Pak. 

I do not do much with the machine learning or AI pieces. It is probably because I can start at a basic level with data management: policies, rules, categories, reference data, and business terms. From there, I can work my way into a more granular level, applying all of that information on top of my actual data to understand what my data looks like, where it came from, and where it went wrong, managing it throughout the cycle.

What needs improvement?

What I would love to see is an end-to-end, almost a training demo database of some sort, where one of the biggest problems with data management is demonstrated. 

There are so many components to data management, and more often than not, people understand one thing really well. They may understand DataStage and how to move data around, but they do not see the impact of moving data incorrectly. 

They also do not see the impact of everyone understanding a piece of data in the same way. I would love Cloud Pak to come with a demo database that illustrates the different components of data management in a logical way, so I can see the whole picture instead of just the area I'm specializing in. 

It would be great if Cloud Pak, from a data modeling point of view, allowed us to import our PDMs, for example. It would be ideal to import and create business terms in Cloud Pak. The PEA would be great to create the technical data. The association between the business and the technical metadata could then be automated by pulling it through from your ACE models. The data modeling component is available in Cloud Pak. 

Additionally, when it comes to Cloud Pak, even though it has the NextGen DataStage built into it, there is Cloud Pak for data integration as well. Currently, I do not think we have a full enough understanding of how CP4D and CP4I can enhance each other.

For how long have I used the solution?

I have used the solution since the end of 2021.

Buyer's Guide
IBM Cloud Pak for Data
May 2026
Learn what your peers think about IBM Cloud Pak for Data. Get advice and tips from experienced pros sharing their opinions. Updated: May 2026.
896,387 professionals have used our research since 2012.

What do I think about the scalability of the solution?

Scalability is endless if I can pay for it. Obviously, it is just for containers, however, I have to pay more.

How are customer service and support?

The response time is quick, however, solving the problem is not always as fast. Cloud Pak is a complicated system, and it's often difficult to find the right resource in IBM to help with specific issues.

How was the initial setup?

The setup was very complete and very complex.

What about the implementation team?

We did the implementation with IBM.

What's my experience with pricing, setup cost, and licensing?

The setup cost is very expensive. The cost depends on the pieces of the solution I'm using, how much data I have, and whether it's on the cloud or on-prem.

Which other solutions did I evaluate?

I've looked at Talend, Calibra, Denodo, Purview, and AWS Glue. It depends on the client's maturity in data management. If the client is only looking to do data quality as a small piece of data management, Denodo would be an excellent choice. If they are looking for end-to-end data management and have the technical resources to get Cloud Pak running and enabled with all functionalities, then definitely Cloud Pak. The choice depends on the maturity of the company.

What other advice do I have?

Cloud Pak is a very, very, very good system. I'm super impressed with it. The learning curve is high, but I gain so much when I finally figure it out. 

Overall product rating: seven out of ten.

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

IBM
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Murali B - PeerSpot reviewer
Associate Manager at a consultancy with 10,001+ employees
Real User
Top 5Leaderboard
Jul 18, 2024
Provides IBM Watson Catalog and data pipelines, but catalog searching needs to be improved
Pros and Cons
  • "IBM Watson Catalog and data pipelines are the most valuable features of the solution."
  • "The solution's catalog searching or map search needs to be improved."

What is most valuable?

IBM Watson Catalog and data pipelines are the most valuable features of the solution.

What needs improvement?

Previously, we used to extract the information in the DSX and the XML formats. IBM Cloud Pak for Data exports information mostly on the ISX, which is an encrypted format. The only challenge with the tool is the metadata queries we try to understand.

We have to go with the lineage and other packages that come with IBM. Previously, we created our own reports depending on the existing command line export of the mappings. The solution's catalog searching or map search needs to be improved.

For how long have I used the solution?

I have been using IBM Cloud Pak for Data for two years.

What do I think about the scalability of the solution?

We usually recommend the solution for medium and large-scale organizations.

How are customer service and support?

My current organization is a Gold Partner with IBM. Whenever we reach out to the support team, the turnaround time is about 24 to 48 hours, which is pretty decent.

I rate the solution’s technical support an eight to nine out of ten.

How would you rate customer service and support?

Positive

How was the initial setup?

The solution’s initial setup is easy.

What's my experience with pricing, setup cost, and licensing?

The solution's pricing is competitive with that of other vendors. The pricing also depends on the number of users.

What other advice do I have?

If people are with the existing stuff, I would definitely suggest they go with IBM Cloud Pak for Data. I usually recommend the solution for the financial sector, where I worked for about ten years. I worked with IBM for almost eight years. Unless they want to migrate to a new product completely, I recommend IBM Cloud Pak for Data to explore current business. It is easy to integrate the tool with other solutions.

Except for metadata queries, metadata validations, and metadata integrations, I don't see any issues with the solution. I would recommend the solution to other users if it supports their existing infrastructure.

Some people don't want to put their data in the cloud because they are concerned about how the data is secured with encryption and decryption. For such cases, we have listed out all the pros and cons of the solution to suggest them to users.

Overall, I rate the solution a seven out of ten.

Disclosure: My company has a business relationship with this vendor other than being a customer. Partner
PeerSpot user
Buyer's Guide
IBM Cloud Pak for Data
May 2026
Learn what your peers think about IBM Cloud Pak for Data. Get advice and tips from experienced pros sharing their opinions. Updated: May 2026.
896,387 professionals have used our research since 2012.
Production In-Charge(Manager) at ING
Real User
Aug 25, 2023
A scalable data analytics and digital transformation tool that provides useful features and integrations
Pros and Cons
  • "DataStage allows me to connect to different data sources."
  • "The product must improve its performance."

What is our primary use case?

I am using the solution for spend analytics and contract analytics. I'm specifically using it to understand if there's any contract leakage. In spend analytics, I'm using it specifically for category management. I design category strategies using multiple dimensions from the spend analysis. We also use it for improvements in supplier relationship management, such as on-time payment.

What is most valuable?

The data sits in silos. DataStage allows me to connect to different data sources. It also allows me to do ETL or ELT. It has a good integration with data warehouses.

What needs improvement?

The product must improve its performance. We see typical cloud-related issues in the solution. IBM can still focus more on keeping the performance up and keeping it 100% available all the time.

For how long have I used the solution?

I have been using the solution for more than a year.

What do I think about the stability of the solution?

I rate the stability a seven out of ten.

What do I think about the scalability of the solution?

The tool is scalable. I rate the scalability an eight out of ten. More than 500 people are using the solution in our organization.

How are customer service and support?

The first line of support is strong enough. I do not have to put in additional effort.

What's my experience with pricing, setup cost, and licensing?

The solution is expensive. It must improve its costs.

What other advice do I have?

I was not fully part of the core team or implementation. The solution is suitable for data analytics and digital transformation. People considering the solution must decide their use case before exploring the tool’s functionalities or components. Typically, people use an IT-heavy approach, adopting functionalities first and then looking for use cases. I would advise them to do the exact opposite.

Whatever we do should be in line with the business purpose and the business vision. We must be mindful of the business model and the organization's maturity to consume the insights. These technologies are targeted towards delivering the right decision-making aids. We should not make it an IT agenda.

Overall, I rate the product a nine out of ten.

Disclosure: My company has a business relationship with this vendor other than being a customer. partner/customer
PeerSpot user
Project Manager at Blue Technology
Real User
Jul 6, 2023
Offers powerful data preparation capabilities for creating and providing reliable data for accurate predictions
Pros and Cons
  • "Its data preparation capabilities are highly valuable."
  • "The product is trying to be more maturity in terms of connectors. That, I believe, is an area where Cloud Pak can improve."

What is our primary use case?

The last project I worked on was for the World Bank. I was tasked with designing the architecture for an e-commerce system aimed at rural people in Mexico, enabling them to sell products. I handled all the requirements and design aspects. The main focus of the project was the implementation of artificial intelligence and effective data management. I proposed IBM Cloud Pak for Data and an update to the cloud platform. 

Unfortunately, the project was not successful in terms of booking the final customer, as the required products were quite expensive. However, I designed the entire structure and architecture. That was one of my most recent projects. Another ongoing project with Cloud Pak is working on sales proposals for various customers in Mexico. The adoption of Cloud Pak for Data in Mexico is relatively slow compared to Data Station and Information Server, which is faster.

The main focus is on projecting sales and managing product renewals in the stores. That's more or less what I'm currently involved with regarding Cloud Pak for Data.

What is most valuable?

Its data preparation capabilities are highly valuable. You need to ensure that the data is properly created and provided in the most reliable format, especially for accurate predictions. 

For instance, IBM offers numerous artificial intelligence algorithms that require the data to be in the correct form and with precise information. We need to ingest all the data into the artificial intelligence models. I believe this is one of the primary objectives of IBM Cloud Pak for Data.

What needs improvement?

There are several specific connectors that we need to use, such as the one for SAP or XML. These connectors are not fully integrated into Cloud Pak for Data. However, they are very useful in database and information services for many of the projects I've worked on. 

Right now, the product is trying to be more maturity in terms of connectors. That, I believe, is an area where Cloud Pak can improve. Obviously, they are constantly working on refining the product. We are currently on version 4.5, and I have good relationships with some people at IBM.

They are actively striving to iterate and release new versions of the product. They are also focusing on improving Information Server and Rapid Stage 5.

In future releases, it would be beneficial to have more advanced data curation features. Specifically, I'm referring to data analysis and the quality dimensions associated with it. In my experience, this aspect is not as mature in Cloud Pak for Data compared to Information Server or database information analyzer, as they have been working more extensively on these areas.

They have been more focused on developing and enhancing those specific aspects. However, based on my research and discussions with my peers at IBM, I believe these features will be included in Cloud Pak in the near future.

For how long have I used the solution?

I have been using IBM Cloud Pak for Data for three years. The last version I worked with was 4.5.

What do I think about the stability of the solution?

I would rate the stability an eight out of ten. The solution have lot of issues right now.

What do I think about the scalability of the solution?

I would rate the scalability of this solution a nine out of ten. 

How are customer service and support?

The customer service and support are good. It is one of the main part of the solution.

How would you rate customer service and support?

Positive

How was the initial setup?

It depends because the IT infrastructure for Cloud Pak for Data is in the cloud. The implementation process is different from traditional system deployments. So, I would say it depends. 

I come from many years of working with Data Station and Information Server, which follow a classic implementation approach. However, implementing Cloud Pak is somewhat different. The vision or the concept behind it needs to be approached differently.

What's my experience with pricing, setup cost, and licensing?

It's quite expensive. Small companies cannot afford such a product. It's mainly targeted towards larger companies.

The pricing is very high. 

What other advice do I have?

You need to have sufficient funds and experienced personnel because it is a highly technical solution. It's not something that can be easily implemented without proper knowledge and expertise. 

For management, I would suggest taking smaller steps and gradually adapting the product within the company. Starting with smaller projects rather than diving into a major implementation.

The solution has a lot of potential, so I would rate it a nine out of ten.

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Murali B - PeerSpot reviewer
Associate Manager at a consultancy with 10,001+ employees
Real User
Top 5Leaderboard
Nov 9, 2022
Stable, with a containerization feature, and a helpful community
Pros and Cons
  • "What I found most helpful in IBM Cloud Pak for Data is containerization, which means it's easy to shift and leave in terms of moving to other clouds. That's an advantage of IBM Cloud Pak for Data."
  • "What I found most helpful in IBM Cloud Pak for Data is containerization, which means it's easy to shift and leave in terms of moving to other clouds."
  • "One challenge I'm facing with IBM Cloud Pak for Data is native features have been decommissioned, such as XML input and output. Too many changes have been made, and my company has around one hundred thousand mappings, so my team has been putting more effort into alternative ways to do things. Another area for improvement in IBM Cloud Pak for Data is that it's more complicated to shift from on-premise to the cloud. Other vendors provide secure agents that easily connect with your existing setup. Still, with IBM Cloud Pak for Data, you have to perform connection migration steps, upgrade to the latest version, etc., which makes it more complicated, especially as my company has XML-based mappings. Still, the XML input and output capabilities of IBM Cloud Pak for Data have been discontinued, so I'd like IBM to bring that back."
  • "One challenge I'm facing with IBM Cloud Pak for Data is native features have been decommissioned, such as XML input and output."

What is most valuable?

What I found most helpful in IBM Cloud Pak for Data is containerization, which means it's easy to shift and leave in terms of moving to other clouds. That's an advantage of IBM Cloud Pak for Data.

What needs improvement?

One challenge I'm facing with IBM Cloud Pak for Data is native features have been decommissioned, such as XML input and output. Too many changes have been made, and my company has around one hundred thousand mappings, so my team has been putting more effort into alternative ways to do things.

Another area for improvement in IBM Cloud Pak for Data is that it's more complicated to shift from on-premise to the cloud. Other vendors provide secure agents that easily connect with your existing setup. Still, with IBM Cloud Pak for Data, you have to perform connection migration steps, upgrade to the latest version, etc., which makes it more complicated, especially as my company has XML-based mappings. Still, the XML input and output capabilities of IBM Cloud Pak for Data have been discontinued, so I'd like IBM to bring that back.

For how long have I used the solution?

I just started working on IBM Cloud Pak for Data, so my experience with it is hardly two weeks.

What do I think about the stability of the solution?

Currently, IBM Cloud Pak for Data is stable. My company is testing the enterprise version, which is stable.

What do I think about the scalability of the solution?

We haven't tried scaling IBM Cloud Pak for Data yet. We also haven't tested its configuration because we're still analyzing it. We have a vast configuration, and IBM Cloud Pak for Data is still in the POC phase.

We're running six nodes on a Hadoop cluster of around four hundred terabytes, and we're still trying to figure out IBM Cloud Pak for Data.

How are customer service and support?

I haven't contacted the technical support team for IBM Cloud Pak for Data.

How was the initial setup?

We're still learning IBM Cloud Pak for Data, and we're reaching out to the community where others have been helping us out, but in terms of initial setup, it's not as simple as other solutions, and it could be a bit tricky.

On a scale of one to five, my rating for the initial setup for IBM Cloud Pak for Data is a three.

What's my experience with pricing, setup cost, and licensing?

I don't have the exact licensing cost for IBM Cloud Pak for Data, as my company is still finalizing requirements, including monthly, yearly, and three-year licensing fees. Still, on a scale of one to five, I'd rate it a three because, compared to other vendors, it's more complicated.

What other advice do I have?

I'm an expert on IBM InfoSphere, but now, I'm working on IBM Cloud Pak for Data.

Approximately five thousand people use IBM Cloud Pak for Data within the company.

My advice to others looking into implementing IBM Cloud Pak for Data is that it's helpful if you want to do AI. IBM has a package, but it would also depend on the number of mappings you have for the transformations. Ninety percent can be easily migrated, and the remaining ten percent tends to be more complicated. Right now, my company has five hundred thousand mappings, so the remaining ten percent is a considerable number, fifty thousand. It depends on the number of mappings you have. If you have a minimal number of mappings, you can find an alternative solution versus IBM Cloud Pak for Data. If you have poor centralization in mappings or a lot of mappings, IBM Cloud Pak for Data is an excellent solution to try.

IBM Cloud Pak for Data is a good solution, so I'm rating it nine out of ten. It has more enhanced logging, cataloging, and other features, so as a solution, IBM Cloud Pak for Data is good.

My company is a customer of IBM Cloud Pak for Data.

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
reviewer2140437 - PeerSpot reviewer
Lead Architect at a financial services firm with 10,001+ employees
Real User
Apr 25, 2023
A highly scalable solution due to the containerization benefits it provides to its users
Pros and Cons
  • "Scalability-wise, I rate the solution a nine or ten out of ten."
  • "The tool depends on the control plane, an OpenShift container platform utilized as an orchestration layer...So, we have communicated this issue to IBM and asked if it is feasible to adapt the solution to work on a Kubernetes platform that we support."

What is most valuable?

Since my team and I are in the POC phase, we are interested in the solution's ETL layer and the data stage component. Also, we are yet to evaluate certain external aspects of the solution.

What needs improvement?

The tool depends on the control plane, an OpenShift container platform utilized as an orchestration layer. However, for our organization, it is not a standard Kubernetes orchestration layer that we are currently using. So, we have communicated this issue to IBM and asked if it is feasible to adapt the solution to work on a Kubernetes platform that we support.

For how long have I used the solution?

I am currently employed at a banking organization utilizing a range of IT products. One of our current solutions is the Informatica ETL server, which is implemented on-premises. As for the IBM Cloud Pak for Data, our team is currently in the POC phase. The goal of the POC is to ensure that the solution complies with our organization's IT standards. Based on the POC's success criteria, we will evaluate the solution further to determine if it aligns with our organization's model. We will integrate it into our banking services if deemed a suitable fit. My company has a partnership with IBM. Also, we are users of the solution.

What do I think about the stability of the solution?

I won't be able to comment now on the solution's stability. Considering the tools that IBM supports, that's the other tools and other stuff. I think it should be a more scalable product. If it is able to meet the needs of different organizations, then it can be a wonderful product.

What do I think about the scalability of the solution?

Scalability-wise, I rate the solution a nine or ten out of ten. It is a scalable solution considering its containerization benefits.

How was the initial setup?

I rate the solution's initial setup a seven out of ten.

In terms of deployment timeline, our organization is currently in the strategic planning phase before actual deployment. It will take us a quarter to finalize our plans. However, we aim to complete the POC within the quarter and identify the success criteria. This will help us determine if the solution is aligned with our organization's strategic standards and can be implemented accordingly within the bank.

What other advice do I have?

My suggestion for those considering using IBM Cloud Pak for Data is to evaluate it based on their own organization's policies and standards. While the product may be technically sound, its fit within the organization's system is the key factor to consider. In our case, the solution has the potential to be beneficial due to its scalability and use of different NFA's, which align with our organization's IT success. Overall, I rate the product a nine out of ten.

Disclosure: My company has a business relationship with this vendor other than being a customer.
PeerSpot user
Drew Collins - PeerSpot reviewer
Freelance Innovation & Delivery at DMC
Real User
Mar 23, 2023
While the solution offers good capabilities and is cost-efficient, it needs to improve its user experience
Pros and Cons
  • "It is a scalable solution, and we have had no issues with its scalability in our company. I rate the solution's scalability a nine out of ten."
  • "The solution's user experience is an area that has room for improvement."

What is our primary use case?

Though I cannot provide information on the use cases in detail, I can say that the solution is used for data governance and data quality.

What is most valuable?

In the solution, data quality and ETL tools are the two features I found to be the most valuable ones.

What needs improvement?

The solution's user experience is an area that has room for improvement.

For how long have I used the solution?

I have been using IBM Cloud Pak for Data for the past five years. Also, I'm using the latest version of the solution.

What do I think about the stability of the solution?

Stability-wise, I would say that the solution is fine. I rate the solution's stability an eight out of ten.

What do I think about the scalability of the solution?

It is a scalable solution, and we have had no issues with its scalability in our company. I rate the solution's scalability a nine out of ten. Also, currently, we do not have any plans in our company to increase the number of uses using the solution. However, the product is being used extensively in our company.

How are customer service and support?

We have contacted the solution's technical support team, and our organization is happy with them since they were helpful. I rate the solution's technical support as an eight out of ten.

How would you rate customer service and support?

Positive

Which solution did I use previously and why did I switch?

Previously, we were using a different solution. We then switched to IBM Cloud Pak for Data because of its capabilities and cost-efficiency.

How was the initial setup?

During the solution's initial setup, if you understand what you are doing, the process will be okay for you. However, the solution has got a quite complex implementation and configuration process. The deployment process took about three months. Also, the deployment model is cloud-based.

For the deployment process, our company sought the help of an integration partner.

We have around eight or nine people for the deployment and maintenance of the solution.

What about the implementation team?

The solution was implemented in-house with an integration partner's help.

What was our ROI?

With the solution, we could meet our ROI requirements in our company.

What's my experience with pricing, setup cost, and licensing?

For the licensing of the solution, there is a yearly payment that needs to be made. Also, since it is expensive, cost-wise, I rate the solution an eight or nine out of ten. There is no need to pay additional charges above the standard license and fees for the solution.

What other advice do I have?

I recommend IBM Cloud Pak for Data to others who want to use it. Additionally, I suggest that those who plan to use the solution should train their staff properly. The performance and integration of the solution are fine. Overall, I rate the solution probably a six or seven out of ten.

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Jacek Wróż - PeerSpot reviewer
Senior Data Architect at AZframe
Real User
Dec 26, 2022
A powerful solution helpful in creating data catalogues and modelling data
Pros and Cons
  • "You can model the data there, connect the data models with the business processes and create data lineage processes."
  • "The solution could have more connectors."

What is our primary use case?

The general use cases are the creation of a data catalogue. The module is called Watson Knowledge Catalog, and the tool is responsible for data profiling, rough data, and quality improvement by business people. You can model the data there, connect the data models with the business processes and create data lineage processes.

What is most valuable?

The general use cases are the creation of a data catalogue. The module is called Watson Knowledge Catalog, and the tool is responsible for data profiling, rough data, and quality improvement by business people. You can model the data there, connect the data models with the business processes and create data lineage processes.

What needs improvement?

The solution could have more connectors. Sometimes the customers request additional things that are not implemented, like Data Catalog.

For how long have I used the solution?

We have been using this solution for about six years, and it was previously called IIS. We deploy on cloud, but there is an on-premises version. We haven't used it because it is a complicated system.

What do I think about the stability of the solution?

It is a stable solution 

What do I think about the scalability of the solution?

It is a scalable solution, and I rate the scalability a ten out of ten. You can add processors and scale for additional people and users. Three people in our organization use this solution, and we plan to increase usage.

How are customer service and support?

I rate the technical support a nine out of ten.

How would you rate customer service and support?

Positive

What was our ROI?

The return on investment depends on how you use it. If you improve the data quality, all the processes of artificial intelligence and machine learning are correct. If you have processes based on dirty data, data with errors or incomplete data, such processes are incorrect. So using such platforms in prediction and transactional systems also improves all the processes. You don't get financial returns per se, but the return on investment is huge if you improve your data.

What's my experience with pricing, setup cost, and licensing?

It's a subscription license, so you must buy a yearly one. In comparison to the competition, I think the price is fair. It is not low, but it's lower than Informatica and Precisely company platforms. So I think the price is relatively low for such functionality.

What other advice do I have?

I rate the solution a nine out of ten and recommend it to others.

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Buyer's Guide
Download our free IBM Cloud Pak for Data Report and get advice and tips from experienced pros sharing their opinions.
Updated: May 2026
Buyer's Guide
Download our free IBM Cloud Pak for Data Report and get advice and tips from experienced pros sharing their opinions.