

StreamSets and IBM Cloud Pak for Data are prominent competitors in the data integration and management category. StreamSets has an edge in user-friendliness and flexibility, whereas IBM Cloud Pak excels in data governance and AI integration.
Features: StreamSets offers a user-friendly interface, supporting both batch and streaming processes, with tools like Data Collector and Control Hub that simplify data integration and are accessible to users without deep technical skills. IBM Cloud Pak for Data is distinguished by its data governance capabilities, integration with AI using tools like Watson Studio, and comprehensive data preparation features that are vital for regulatory compliance.
Room for Improvement: StreamSets could enhance integration beyond Java, improve logging, and bolster security features, along with user interface enhancements and SAP HANA connectivity. IBM Cloud Pak for Data would benefit from reduced infrastructure demands, smoother feature transitions, better performance, and improved cloud service integrations.
Ease of Deployment and Customer Service: StreamSets allows flexible deployment across various environments, though users prefer community support over costly direct services. IBM Cloud Pak for Data also offers flexible deployment but relies heavily on documentation and community support, with users often facing prolonged support response times.
Pricing and ROI: StreamSets provides an open-source option but can be costly for advanced features in small businesses, yet users report significant ROI due to reduced workload. IBM Cloud Pak for Data is pricey, especially for smaller firms, but justifies the cost with robust features benefiting larger enterprises.
We have been able to drive responsible, transparent, and explainable AI workflow to operationalize AI and mitigate risk and regulatory compliance easily.
It is easy to collect, organize, and analyze data no matter where it is, hence being able to make data-driven decisions.
I rate the technical support from IBM a nine out of ten because the support has been very top-notch, unparalleled, and also very professional.
Cloud Pak is a complicated system, and it's often difficult to find the right resource in IBM to help with specific issues.
The customer support for IBM Cloud Pak for Data is great and responsive.
IBM technical support sometimes transfers tickets between different teams due to shift changes, which can be frustrating.
I have not noticed any downtime or lagging, especially when dealing with large data, so it is relatively very scalable.
IBM Cloud Pak for Data's scalability is very good; it can be used by any size of organization.
For scalability, I rate it a nine out of ten because it is a very scalable solution that has been able to handle my organization's growth efficiently.
The overall performance of IBM Cloud Pak for Data, particularly with IBM DataStage for ETL processes, is very good.
Setting up the hybrid and multi-cloud environments is a long job and it takes time.
IBM Cloud Pak for Data can be improved because processing speeds are sometimes slow.
To improve IBM Cloud Pak for Data, I suggest more out-of-the-box integration.
It would be beneficial if StreamSets addressed any potential memory leak issues to prevent unnecessary upgrades.
The setup cost is very expensive.
Regarding my experience with pricing, setup cost, and licensing, for a small organization, the price might be relatively high, but for huge enterprises such as ours, the price is relatively affordable.
The list price is high, but the flexibility in pricing is adequate.
From there, I can work my way into a more granular level, applying all of that information on top of my actual data to understand what my data looks like, where it came from, and where it went wrong, managing it throughout the cycle.
The benefits of choosing IBM Cognos, in addition to saving on cost, include having institutional knowledge about maintaining this infrastructure and enough people who have developed on Cognos in the past, which creates comfort in its use.
We have been able to save approximately 80 percent of our time. We are not doing data analysis manually, so this relieves our data department of dealing with data.
It allows a hybrid installation approach, rather than being completely cloud-based or on-premises.
| Product | Mindshare (%) |
|---|---|
| IBM Cloud Pak for Data | 1.2% |
| StreamSets | 1.2% |
| Other | 97.6% |

| Company Size | Count |
|---|---|
| Small Business | 9 |
| Large Enterprise | 17 |
| Company Size | Count |
|---|---|
| Small Business | 9 |
| Midsize Enterprise | 2 |
| Large Enterprise | 11 |
IBM Cloud Pak for Data is a comprehensive platform integrating data management, AI, and machine learning capabilities tailored for hybrid environments. It's renowned for enhancing productivity through efficient data analytics and management.
This platform offers data virtualization, robust analytics, and AI-driven processes. Its integration capabilities, including IBM MQ and App Connect, facilitate seamless data connections. Users benefit from containerization, data governance, and compatibility with hybrid systems, improving decision-making and management productivity. However, the requirement of extensive infrastructure and performance challenges can impact scalability for small businesses.
What are the key features of IBM Cloud Pak for Data?In the financial and banking sectors, IBM Cloud Pak for Data is utilized for data management tasks like spend analytics and contract leakage analysis. It's used for data integration, machine learning, and AI-driven analytics to transform data into valuable insights in industries such as FinTech and consultancy.
StreamSets streamlines data pipeline creation, connecting data from multiple sources to destinations like cloud platforms with minimal coding. Its centralized platform and intuitive design enhance ETL and data migration processes.
StreamSets integrates seamlessly with analytics platforms, offering tools such as Data Collector and Control Hub to facilitate data ingestion, transformation, and machine learning integrations. Its user-friendly interface and ready connectors aid in configuring complex data pipelines. With built-in data drift resilience and scheduling options, users experience efficient, scalable data management, despite challenges like latency in cloud storage and interface enhancement needs. Users often employ StreamSets for batch loading, real-time data processing, and smart data pipeline management, offering comprehensive data integration solutions.
What are the key features of StreamSets?In industries like finance and technology, StreamSets supports data migration, machine learning integrations, and analytics by simplifying data transformation and enhancing decision-making capabilities through its robust pipeline management.
We monitor all Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.