No more typing reviews! Try our Samantha, our new voice AI agent.

StreamSets vs Upsolver comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

StreamSets
Ranking in Data Integration
25th
Average Rating
8.4
Reviews Sentiment
7.0
Number of Reviews
21
Ranking in other categories
No ranking in other categories
Upsolver
Ranking in Data Integration
39th
Average Rating
8.4
Reviews Sentiment
7.4
Number of Reviews
3
Ranking in other categories
Streaming Analytics (21st)
 

Mindshare comparison

As of March 2026, in the Data Integration category, the mindshare of StreamSets is 1.2%, down from 1.6% compared to the previous year. The mindshare of Upsolver is 0.7%, up from 0.1% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Data Integration Mindshare Distribution
ProductMindshare (%)
StreamSets1.2%
Upsolver0.7%
Other98.1%
Data Integration
 

Featured Reviews

SS
Enterprise Solutions Architect at a energy/utilities company with 1,001-5,000 employees
Enables effective batch loading with visual interface and enterprise support
One issue I observed with StreamSets is that the memory runs out quickly when processing large volumes of data. Because of this memory issue, we have to upgrade our EC2 boxes in the Amazon AWS infrastructure. I had to switch to a new EC2 box, even though the processor was not fully utilized. It would be beneficial if StreamSets addressed any potential memory leak issues to prevent unnecessary upgrades. Additionally, it would be a great enhancement if StreamSets could produce a lineage graph to visualize how the data has passed through the system.
reviewer2784462 - PeerSpot reviewer
Software Engineer at a tech vendor with 10,001+ employees
Streaming pipelines have become simpler and onboarding new data sources is now much faster
One of the best features Upsolver offers is the automatic schema evolution. Another good feature is SQL-based streaming transformations. Complex streaming transformations such as cleansing, deduplication, and enrichment were implemented using SQL and drastically reduced the need for custom Spark code. My experience with the SQL-based streaming transformations in Upsolver is that it had a significant positive impact on the overall data engineering workflow. By replacing custom Spark streaming jobs with declarative SQL logic, I simplified development, review, and deployment processes. Data transformations such as parsing, filtering, enrichment, and deduplication could be implemented and modified quickly without rebuilding or redeploying complex code-based pipelines. Upsolver has impacted my organization positively because it brings many benefits. The first one is faster onboarding of new data sources. Another one is more reliable streaming pipelines. Another one is near-real-time data availability, which is very important for us. It also reduced operational effort for data engineering teams. A specific outcome that highlights these benefits is that the time to onboard new sources is reduced from weeks to days. Custom Spark code reduction reached 50 to 40 percent. Pipeline failures are reduced by 70 to 80 percent. Data latency is improved from hours to minutes.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"One of the things I like is the data pipelines. They have a very good design. Implementing pipelines is very straightforward. It doesn't require any technical skill."
"This product was a lot easier to use than the one we had before it, and it took us half an hour and we were set up and running it the first time."
"StreamSets has definitely helped us in getting the information into our data lake very quickly, in terms of ingestion, and the most important thing is it has helped us from a resourcing point of view because you can easily upskill a BI or ETL resource without any programming knowledge to work with this, which has drastically reduced the time that we are spending on workloads by 60% to 70% as well as reducing the time spent on ingestion by 30%."
"I have used Data Collector, Transformer, and Control Hub products from StreamSets. What I really like about these products is that they're very user-friendly. People who are not from a technological or core development background find it easy to get started and build data pipelines and connect to the databases. They would be comfortable like any technical person within a couple of weeks."
"StreamSets is the leader in the market."
"It is really easy to set up and the interface is easy to use."
"StreamSets Transformer is a good feature because it helps you when you are developing applications and when you don't want to write a lot of code. That is the best feature overall."
"The ETL capabilities are very useful for us. We extract and transform data from multiple data sources, into a single, consistent data store, and then we put it in our systems. We typically use it to connect our Apache Kafka with data lakes. That process is smooth and saves us a lot of time in our production systems."
"It was easy to use and set up, with a nearly no-code interface that relied mostly on drag-and-drop functionality."
"The most prominent feature of Upsolver is its function as an ETL tool, allowing data to be moved across platforms and different data technologies."
"A specific outcome that highlights these benefits is that the time to onboard new sources is reduced from weeks to days, custom Spark code reduction reached 50 to 40 percent, pipeline failures are reduced by 70 to 80 percent, and data latency is improved from hours to minutes."
"Customer service is excellent, and I would rate it between eight point five to nine out of ten."
 

Cons

"Currently, we can only use the query to read data from SAP HANA. What we would like to see, as soon as possible, is the ability to read from multiple tables from SAP HANA."
"One issue I observed with StreamSets is that the memory runs out quickly when processing large volumes of data. Because of this memory issue, we have to upgrade our EC2 boxes in the Amazon AWS infrastructure."
"The monitoring visualization is not that user-friendly. It should include other features to visualize things, like how many records were streamed from a source to a destination on a particular date."
"There aren't enough hands-on labs, and debugging is also an issue because it takes a lot of time. Logs are not that clear when you are debugging, and you can only select a single source for a pipeline."
"We often faced problems, especially with SAP ERP. We struggled because many columns weren't integers or primary keys, which StreamSets couldn't handle. We had to restructure our data tables, which was painful. Also, pipeline failures were common, and data drifting wasn't addressed, which made things worse. Licensing was another issue we encountered."
"We've seen a couple of cases where it appears to have a memory leak or a similar problem."
"The software is very good overall. Areas for improvement are the error logging and the version history. I would like to see better, more detailed error logging information."
"If the data processing in StreamSets takes a long time as compared to the previous solution, then we will reconsider why we use StreamSets."
"Upsolver excels in ETL and data aggregation, while ThoughtSpot is strong in natural language processing for querying datasets. Combining these tools can be very effective: Upsolver handles aggregation and ETL, and ThoughtSpot allows for natural language queries. There’s potential for highlighting these integrations in the future."
"I think that Upsolver can be improved in orchestration because it is not a full orchestration tool."
"There is room for improvement in query tuning."
"On the stability side, I would rate it seven out of ten. Using multiple cloud providers and data engineering technologies creates complexity, and managing different plugins is not always easy, but they are working on it."
 

Pricing and Cost Advice

"The overall cost is very flexible so it is not a burden for our organization... However, the cost should be improved. For small and mid-size organizations it might be a challenge."
"There are two editions, Professional and Enterprise, and there is a free trial. We're using the Professional edition and it is competitively priced."
"We are running the community version right now, which can be used free of charge."
"StreamSets is an expensive solution."
"StreamSets Data Collector is open source. One can utilize the StreamSets Data Collector, but the Control Hub is the main repository where all the jobs are present. Everything happens in Control Hub."
"Its pricing is pretty much up to the mark. For smaller enterprises, it could be a big price to pay at the initial stage of operations, but the moment you have the Seed B or Seed C funding and you want to scale up your operations and aren't much worried about the funds, at that point in time, you would need a solution that could be scaled."
"The licensing is expensive, and there are other costs involved too. I know from using the software that you have to buy new features whenever there are new updates, which I don't really like. But initially, it was very good."
"I believe the pricing is not equitable."
"Upsolver is affordable at approximately $225 per terabyte per year."
report
Use our free recommendation engine to learn which Data Integration solutions are best for your needs.
885,311 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
9%
Manufacturing Company
8%
Insurance Company
8%
Real Estate/Law Firm
6%
No data available
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
By reviewers
Company SizeCount
Small Business9
Midsize Enterprise2
Large Enterprise11
No data available
 

Questions from the Community

What do you like most about StreamSets?
The best thing about StreamSets is its plugins, which are very useful and work well with almost every data source. It's also easy to use, especially if you're comfortable with SQL. You can customiz...
What needs improvement with StreamSets?
One issue I observed with StreamSets is that the memory runs out quickly when processing large volumes of data. Because of this memory issue, we have to upgrade our EC2 boxes in the Amazon AWS infr...
What is your primary use case for StreamSets?
We are using StreamSets for batch loading.
What is your experience regarding pricing and costs for Upsolver?
My experience with pricing, setup cost, and licensing was a very good experience, but it is not a direct experience because it was not my responsibility. It was in charge of the customer. However, ...
What needs improvement with Upsolver?
I think that Upsolver can be improved in orchestration because it is not a full orchestration tool. I believe it could be better in this regard. The cost needs attention at a very large scale. I th...
What is your primary use case for Upsolver?
My main use case for Upsolver is during an IT consulting project for a large enterprise running a cloud-native data platform on AWS. I used Upsolver to ingest and process high-volume stream data fr...
 

Comparisons

 

Overview

 

Sample Customers

Availity, BT Group, Humana, Deluxe, GSK, RingCentral, IBM, Shell, SamTrans, State of Ohio, TalentFulfilled, TechBridge
Information Not Available
Find out what your peers are saying about StreamSets vs. Upsolver and other solutions. Updated: March 2026.
885,311 professionals have used our research since 2012.