StreamSets Reviews

Vendor: IBM

4.2 out of 5

21 reviews
100% willing to recommend

What is StreamSets?

StreamSets is a data integration platform that enables organizations to efficiently move and process data across various systems. It offers a user-friendly interface for designing, deploying, and managing data pipelines, allowing users to easily connect to various data sources and destinations. StreamSets also provides real-time monitoring and alerting capabilities, ensuring that data is flowing smoothly and any issues are quickly addressed.

Get the StreamSets Buyer's Guide and find out what your peers are saying about StreamSets, Informatica Intelligent Data Management Cloud (IDMC), Azure Data Factory and more!

StreamSets is the #22 ranked solution in top Data Integration Tools. PeerSpot users give StreamSets an average rating of 8.4 out of 10. StreamSets is most commonly compared to Informatica Intelligent Data Management Cloud (IDMC): StreamSets vs Informatica Intelligent Data Management Cloud (IDMC). StreamSets is popular among the large enterprise segment, accounting for 69% of users researching this solution on PeerSpot. The top industry researching this solution are professionals from a financial services firm, accounting for 10% of all views.

Helped 865,649 peers since 2012

Featured StreamSets reviews

Ved Prakash Yadav

Senior Data Platform Manager at a manufacturing company with 10,001+ employees

We use various tools and alerting systems to notify us of pipeline errors or failures. StreamSets supports data governance and compliance by allowing us to encrypt incoming data based on specified rules. We can easily encrypt columns by providing the column name and hash key. If you're considering using StreamSets for the first time, I would advise first understanding why you want to use it and how it will benefit you. If you're dealing with change tracking or handling large amounts of data, it could be cost-effective compared to services like Amazon. It's easy to schedule and manage tasks with the tool, and you can enhance your skills as an ETL developer. You can easily migrate traditional pipelines built on platforms like Informatica or Talend to StreamSets. I rate the overall solution an eight out of ten.

Read full review

Nantabo Jackie

Sales Manager at Soft Hostings Limited

The design experience when implementing batch streaming or ECL pipelines is very easy and straightforward. When we initially attempted to integrate StreamSets with Kafka, it was somewhat challenging until we consulted the documentation, after which it became straightforward. We use StreamSets to move data into modern analytics platforms. Moving the data into modern analytics platforms is still complex. It requires a lot of understanding of logic. StreamSets enables us to build data pipelines without knowing how to code. StreamSets' ability to build data pipelines without requiring us to know complex programming is very important, as it allows us to focus on our projects without spending time writing code. StreamSets' Transformer for Snowflake is simple to use for designing both simple and complex transformation logic. StreamSets' Transformer for Snowflake is extremely important to me as it helps me to connect external data sources and keep my internal workflow organized. Transformer for Snowflake's functionality is a perfect ten out of ten. It is important and cost-effective that Transformer for Snowflake is a serverless engine embedded within the platform, as without this feature, it would be very expensive. This feature helps us to sell at lower budget costs, which would otherwise be at a high cost with other servers. StreamSets has helped improve our organization. StreamSets simplified pipelines for our organization. It is easier to complete a project when we know where and how to start, and working with the team remotely makes it more efficient. This helps us to save time and be more organized when creating data pipelines. Being a structured company that produces reliable resources for our application benefits both our clients and contacts. StreamSets' built-in data drift resilience plays a part in our ETL operations. With prior knowledge, the built-in data drift resilience is very effective, but it can be challenging to implement without the preexisting knowledge. The built-in data drift resilience reduced the time it takes us to fix data drift breakages by 45 percent. StreamSets helped us break down data silos within our organization. The use of StreamSets to break down data silos enabled us to be confident in the services and products we provide, as well as the real-time streaming we offer. This has had a positive impact on our business, as it allowed us to accurately determine the analytics we need to present to stakeholders, clients, and our sources while ensuring that the process is secure and transparent. StreamSets saved us time because anyone can use StreamSets not just developers. We can save around 40 percent of our time. StreamSets' reusable assets helped us reduce workload by around 25 percent. StreamSets saved us money by not having to hire developers with specialized skills. We saved around $2,000 US. StreamSets helped us scale our data operations. Since StreamSets makes it easy to scale our data operations, it enabled us to know exactly where to start at any time. We are aware of the timeline for completing the project, and depending on our familiarity with the software, we can come up with a solution quickly.

Read full review

Reyansh Kumar

Technical Specialist at Accenture

The things I like about StreamSets are its * overall user interface * efficiency * product features, which are all good. Also, the scheduling within the data engineering pipeline is very much appreciated, and it has a wide range of connectors for connecting to any data sources like SQL Server, AWS, Azure, etc. We have used it with Kafka, Hadoop, and Azure Data Factory Datasets. Connecting to these systems with StreamSets is very easy. You just need to configure the data sources, the paths and their configurations, and you are ready to go. It is very efficient and very easy to use for ETL pipelines. It is a GUI-based interface in which you can easily create or design your own data pipelines with just a few clicks. As for moving data into modern analytics systems, we are using it with Microsoft Power BI, AWS, and some on-premises solutions, and it is very easy to get data from StreamSets into them. No hardcore coding or special technical expertise is required. It is also a no-code platform in which you can configure your data sources and data output for easy configuration of your data pipeline. This is a very important aspect because if a tool requires code development, we need to hire software developers to get the task done. By using StreamSets, it can be done with a few clicks.

Read full review

StreamSets mindshare

As of August 2025, the mindshare of StreamSets in the Data Integration category stands at 1.6%, up from 1.5% compared to the previous year, according to calculations based on PeerSpot user engagement data.

Data Integration

PeerResearch reports based on StreamSets reviews

Type	Title	Date
Category	Data Integration	Aug 24, 2025	Download
Product	Reviews, tips, and advice from real users	Aug 24, 2025	Download
Comparison	StreamSets vs Azure Data Factory	Aug 24, 2025	Download
Comparison	StreamSets vs SSIS	Aug 24, 2025	Download
Comparison	StreamSets vs Informatica PowerCenter	Aug 24, 2025	Download

Title	Rating	Mindshare	Recommending
Informatica Intelligent Data Management Cloud (IDMC)	4.0	4.3%	93%	185 interviews Add to research
Azure Data Factory	4.0	7.4%	92%	91 interviews Add to research

Valuable Features

StreamSets offers easy configuration and a user-friendly graphical interface. It supports multiple connectors and stages for data transformation, including data masking and schema generation. Integration with platforms like Hadoop, Snowflake, Azure, and Kafka is seamless. The tool excels in ETL capabilities and data drift resilience, providing automatic alerts. Its Control Hub and Data Collector features facilitate centralized management, minimal coding, and rapid deployment, greatly enhancing efficiency and reducing the need for technical expertise.

"StreamSets is the leader in the market."
"The best thing about StreamSets is its plugins, which are very useful and work well with almost every data source. It's also easy to use, especially if you're comfortable with SQL. You can customize it to do what you need. Many other tools have started to use features similar to those introduced by StreamSets, like automated workflows that are easy to set up."
"I really appreciate the numerous ready connectors available on both the source and target sides, the support for various media file formats, and the ease of configuring and managing pipelines centrally."

Room for Improvement

StreamSets users want enhanced integration beyond Java, improved real-time capabilities, and streamlined logging and visualization. The GUI needs updates, and security features could be expanded to meet stricter organizational requirements. SAP HANA support is limited, and pipeline organization could be more intuitive. Documentation requires improvements, and user expertise is crucial due to a steep learning curve. Memory issues and data drift resilience need addressing, and latency problems are noted in cloud storage operations. Users seek a more user-friendly interface overall.

"One issue I observed with StreamSets is that the memory runs out quickly when processing large volumes of data. Because of this memory issue, we have to upgrade our EC2 boxes in the Amazon AWS infrastructure."
"We often faced problems, especially with SAP ERP. We struggled because many columns weren't integers or primary keys, which StreamSets couldn't handle. We had to restructure our data tables, which was painful. Also, pipeline failures were common, and data drifting wasn't addressed, which made things worse. Licensing was another issue we encountered."
"StreamSets should provide a mechanism to be able to perform data quality assessment when the data is being moved from one source to the target."

ROI

StreamSets significantly reduces data processing time, enhances efficiency by 20-70%, and boosts revenue by up to 50%. It streamlines complex data operations, saving money otherwise spent on custom platforms and extensive training. Resource needs decrease by 30%, eliminating manual error and enabling faster data-driven decisions. StreamSets offers a secure, trustworthy platform that augments organizational productivity, resulting in substantial cost savings and a positive financial impact.

Pricing

StreamSets offers both free and paid versions, including open-source and commercial licenses. Enterprises find the CPU core-based licensing effective, though opinions on pricing vary. Some users appreciate its moderate cost, while others find it expensive, especially for smaller enterprises. The solution's flexibility is suitable for larger organizations, but there's criticism over fixed pricing and additional costs for updates. Overall, StreamSets is valued for its comprehensive offerings, but pricing perceptions differ across businesses.

"The licensing is expensive, and there are other costs involved too. I know from using the software that you have to buy new features whenever there are new updates, which I don't really like. But initially, it was very good."
"Its pricing is pretty much up to the mark. For smaller enterprises, it could be a big price to pay at the initial stage of operations, but the moment you have the Seed B or Seed C funding and you want to scale up your operations and aren't much worried about the funds, at that point in time, you would need a solution that could be scaled."
"The pricing is affordable for any business."

Popular Use Cases

StreamSets is utilized for extracting data from Kafka topics, transporting Oracle datasets to Azure, building data pipelines for cloud migrations, and integrating datasets from multiple sources. Companies design data pipelines for analytics, engaging in real-time streaming to Azure Event Hubs. It aids in data transformation, ingests data for DataOps, and supports batch loading. IT departments configure StreamSets for stable, secure solutions, while leading initiatives in healthcare data processing and migration to the cloud.

Service and Support

Many find StreamSets' customer service knowledgeable and responsive, with support often conducted via email or dedicated portals. Users appreciate the team's prompt assistance, although some express a need for quicker resolutions. Support is particularly valued for its technical handling and detailed analysis, though it might take time to engage the right specialist. StreamSets provides patches and upgrades to address specific issues, enhancing user experiences, despite some challenges in immediate engagements.

Deployment

StreamSets initial setup is generally straightforward but varies in complexity depending on the organization's setup. Some found it simple, especially when guided by documentation or support. Others experienced challenges due to technical skills or multiple data sources, requiring more time and assistance. Deployments are often done in phases with various environments, and maintenance needs are minimal, typically managed by small teams. With cloud-based deployment, ongoing management is handled efficiently.

Scalability

Many users agree that StreamSets is highly scalable, with support for increased load handling and multiple platforms. While scalability is affected by installation environment, users report successful use across various teams and locations, including cloud-based setups. Some suggest room for improvement in auto-scaling for data migration projects. StreamSets is being adopted by numerous companies, fueling interest and curiosity. It effectively handles millions of records, making it suitable for small to medium enterprises.

Stability

StreamSets demonstrates solid stability, functioning without major interruptions for months. Users have noted occasional memory issues and latency with cloud-based deployment, but patches and upgrades often address these. Confidence in its operations is high, even during intense usage. Some users would appreciate quicker issue resolution and high-availability implementations. Despite previous bugs, recent updates have enhanced reliability, earning ratings around eight to ten. StreamSets ranks among the top in its category for stability.

These insights are based on the in-depth reviews provided by peers to help you make a better buying decision.

Download our StreamSets Buyer's Guide for additional reliable information.