No more typing reviews! Try our Samantha, our new voice AI agent.

IBM Cloud Pak for Data vs StreamSets comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Dec 19, 2024

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

ROI

Sentiment score
5.1
IBM Cloud Pak for Data benefits larger companies by enhancing AI, saving time, reducing costs, and improving data-driven decisions.
Sentiment score
8.1
StreamSets speeds up data processing, boosts efficiency and revenue, simplifies tasks, enhances security, and reduces costs significantly.
We have been able to drive responsible, transparent, and explainable AI workflow to operationalize AI and mitigate risk and regulatory compliance easily.
Senior Data Analyst at Wipro Limited
It is easy to collect, organize, and analyze data no matter where it is, hence being able to make data-driven decisions.
Engineer at Turner Construction
 

Customer Service

Sentiment score
7.3
IBM Cloud Pak for Data support is effective, rated positively by users, though some desire faster initial responses and localization.
Sentiment score
6.7
StreamSets support is responsive and knowledgeable, offering effective solutions, though response times and technical handling could improve.
I rate the technical support from IBM a nine out of ten because the support has been very top-notch, unparalleled, and also very professional.
Manager at Teshama Group
Cloud Pak is a complicated system, and it's often difficult to find the right resource in IBM to help with specific issues.
Data asset management engineer at a tech services company with 1-10 employees
The customer support for IBM Cloud Pak for Data is great and responsive.
Engineer at Turner Construction
IBM technical support sometimes transfers tickets between different teams due to shift changes, which can be frustrating.
Enterprise Solutions Architect at a energy/utilities company with 1,001-5,000 employees
 

Scalability Issues

Sentiment score
6.7
IBM Cloud Pak for Data impresses users with its scalable infrastructure, efficiently handling large data and maintaining performance without downtime.
Sentiment score
7.6
StreamSets is scalable and flexible, favored for cloud use but could improve auto-scaling for large data migrations.
I have not noticed any downtime or lagging, especially when dealing with large data, so it is relatively very scalable.
Senior Data Analyst at Wipro Limited
IBM Cloud Pak for Data's scalability is very good; it can be used by any size of organization.
Engineer at Turner Construction
For scalability, I rate it a nine out of ten because it is a very scalable solution that has been able to handle my organization's growth efficiently.
Manager at Teshama Group
 

Stability Issues

Sentiment score
7.8
IBM Cloud Pak for Data is stable with good performance, though opinions vary on consistency due to its complexity.
Sentiment score
7.8
StreamSets is praised for stability and reliability, despite minor memory issues, with high user ratings and market competitiveness.
The overall performance of IBM Cloud Pak for Data, particularly with IBM DataStage for ETL processes, is very good.
Sales Director at Jordan Business Systems
 

Room For Improvement

IBM Cloud Pak for Data struggles with infrastructure demands, deployment size, integration, and requires technical support and simplified experiences.
StreamSets struggles with integration, real-time processing, clarity in UI, memory issues, security, documentation, and cloud storage performance.
Setting up the hybrid and multi-cloud environments is a long job and it takes time.
Senior Data Analyst at Wipro Limited
IBM Cloud Pak for Data can be improved because processing speeds are sometimes slow.
Engineer at Turner Construction
To improve IBM Cloud Pak for Data, I suggest more out-of-the-box integration.
Senior Project Manager at EY
It would be beneficial if StreamSets addressed any potential memory leak issues to prevent unnecessary upgrades.
Enterprise Solutions Architect at a energy/utilities company with 1,001-5,000 employees
 

Setup Cost

IBM Cloud Pak for Data is costly for small businesses but affordable and competitive for large enterprises with negotiable pricing.
StreamSets provides flexible pricing models, with varied user satisfaction, favoring larger enterprises over smaller companies due to cost.
The setup cost is very expensive.
Data asset management engineer at a tech services company with 1-10 employees
Regarding my experience with pricing, setup cost, and licensing, for a small organization, the price might be relatively high, but for huge enterprises such as ours, the price is relatively affordable.
Senior Data Analyst at Wipro Limited
The list price is high, but the flexibility in pricing is adequate.
Solution Manager at Intalion
 

Valuable Features

IBM Cloud Pak for Data offers robust governance, integration, AI lifecycle, and cloud flexibility with praised data management features.
StreamSets offers intuitive interface, extensive connectors, and features accessible to non-technical users for seamless data integration and manipulation.
From there, I can work my way into a more granular level, applying all of that information on top of my actual data to understand what my data looks like, where it came from, and where it went wrong, managing it throughout the cycle.
Data asset management engineer at a tech services company with 1-10 employees
The benefits of choosing IBM Cognos, in addition to saving on cost, include having institutional knowledge about maintaining this infrastructure and enough people who have developed on Cognos in the past, which creates comfort in its use.
EDW Manager at a university with 1,001-5,000 employees
We have been able to save approximately 80 percent of our time. We are not doing data analysis manually, so this relieves our data department of dealing with data.
Senior Data Analyst at Wipro Limited
It allows a hybrid installation approach, rather than being completely cloud-based or on-premises.
Enterprise Solutions Architect at a energy/utilities company with 1,001-5,000 employees
 

Categories and Ranking

IBM Cloud Pak for Data
Ranking in Data Integration
19th
Average Rating
8.2
Reviews Sentiment
6.1
Number of Reviews
20
Ranking in other categories
Data Virtualization (3rd)
StreamSets
Ranking in Data Integration
23rd
Average Rating
8.4
Reviews Sentiment
7.0
Number of Reviews
21
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of May 2026, in the Data Integration category, the mindshare of IBM Cloud Pak for Data is 1.2%, down from 1.8% compared to the previous year. The mindshare of StreamSets is 1.2%, down from 1.6% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Data Integration Mindshare Distribution
ProductMindshare (%)
IBM Cloud Pak for Data1.2%
StreamSets1.2%
Other97.6%
Data Integration
 

Featured Reviews

ArchanaSingh - PeerSpot reviewer
Senior Data Analyst at Wipro Limited
Collaborative data platform has transformed analytics and now drives faster decisions
The best features IBM Cloud Pak for Data offers include robust data visualization, centralized data analytics, data reliability, and compatibility with hybrid and multi-cloud environments. The compatibility with hybrid and multi-cloud environments has helped our organization as data visualization is very simple. Migrations, reading, analysis, and data management from other sources are performed without problems of requirements. We have a team of experts in IBM Cloud Pak for Data to maintain security and correct data management easily. I find this cloud excellent for visualizing and managing data across networks and also fulfilling fastest data storage, making it less complex and completely improving productivity in my organization. Everything is managed in multiple environments without any problem. IBM Cloud Pak for Data has positively impacted my organization, and I have noticed some improvement since we started using this tool. Configuration with hybrid and multi-cloud environments has been very seamless and easy. It is a robust platform capable of working with multiple data sources where we gain insights to make data-driven decisions easily. It automates data analysis for quick and better performance. We have seen improvements in analysis and data correction from multiple sources. Our productivity in the company is growing, thanks to the data analysis team. We have also seen a robust hybrid and multi-cloud access system working seamlessly. I can share specific outcomes that show how productivity has grown and how performance has improved since the data is automated, and the analysis is done much faster, saving us a lot of time. We have been able to save approximately 80 percent of our time. We are not doing data analysis manually, so this relieves our data department of dealing with data. We have been relieved of a lot of duties, and now we are able to focus on other strategic tasks. Our productivity has greatly increased since we are able to make concrete and data-driven decisions easily.
SS
Enterprise Solutions Architect at a energy/utilities company with 1,001-5,000 employees
Enables effective batch loading with visual interface and enterprise support
One issue I observed with StreamSets is that the memory runs out quickly when processing large volumes of data. Because of this memory issue, we have to upgrade our EC2 boxes in the Amazon AWS infrastructure. I had to switch to a new EC2 box, even though the processor was not fully utilized. It would be beneficial if StreamSets addressed any potential memory leak issues to prevent unnecessary upgrades. Additionally, it would be a great enhancement if StreamSets could produce a lineage graph to visualize how the data has passed through the system.
report
Use our free recommendation engine to learn which Data Integration solutions are best for your needs.
894,738 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
20%
Manufacturing Company
10%
Computer Software Company
7%
University
5%
Financial Services Firm
11%
Insurance Company
8%
Manufacturing Company
7%
Real Estate/Law Firm
6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
By reviewers
Company SizeCount
Small Business9
Large Enterprise17
By reviewers
Company SizeCount
Small Business9
Midsize Enterprise2
Large Enterprise11
 

Questions from the Community

What is your experience regarding pricing and costs for IBM Cloud Pak for Data?
Regarding the price, I know IBM is traditionally relatively expensive in the Hungarian market, but we work together with the local IBM sales team, and on a project basis they manage to negotiate th...
What needs improvement with IBM Cloud Pak for Data?
I see room for improvement in IBM Cloud Pak for Data, as it lacked the lake house. However, IBM issued the new product which is Watsonx.data. This is a new product for IBM and it provides all the m...
What is your primary use case for IBM Cloud Pak for Data?
I believe IBM Cloud Pak for Data is suitable for mid-size to bigger companies. It is not tailored for smaller customers. My customers use IBM DataStage for ETL processes. One client has implemented...
What needs improvement with StreamSets?
One issue I observed with StreamSets is that the memory runs out quickly when processing large volumes of data. Because of this memory issue, we have to upgrade our EC2 boxes in the Amazon AWS infr...
What is your primary use case for StreamSets?
We are using StreamSets for batch loading.
What advice do you have for others considering StreamSets?
If asked, I definitely recommend StreamSets to other users. My overall rating for the solution is nine.
 

Also Known As

Cloud Pak for Data
No data available
 

Overview

 

Sample Customers

Qatar Development Bank, GuideWell, Skanderborg Music Festival
Availity, BT Group, Humana, Deluxe, GSK, RingCentral, IBM, Shell, SamTrans, State of Ohio, TalentFulfilled, TechBridge
Find out what your peers are saying about IBM Cloud Pak for Data vs. StreamSets and other solutions. Updated: April 2026.
894,738 professionals have used our research since 2012.