SAS Data Integration Server vs StreamSets comparison

 

Comparison Buyer's Guide

Executive Summary
Updated on Dec 19, 2024

Categories and Ranking

SAS Data Integration Server
Ranking in Data Integration: 38th
Average Rating: 7.2
Reviews Sentiment: 6.5
Number of Reviews: 4
Ranking in other categories: None

StreamSets
Ranking in Data Integration: 22nd
Average Rating: 8.4
Reviews Sentiment: 7.0
Number of Reviews: 21
Ranking in other categories: None
 

Mindshare comparison

As of January 2026, in the Data Integration category, the mindshare of SAS Data Integration Server is 0.7%, up from 0.5% the previous year. The mindshare of StreamSets is 1.2%, down from 1.6% the previous year. Mindshare is calculated from PeerSpot user engagement data.
Data Integration Market Share Distribution

Product                        Market Share (%)
StreamSets                     1.2%
SAS Data Integration Server    0.7%
Other                          98.1%
 

Featured Reviews

NN
Works at a financial services firm with 5,001-10,000 employees
Offloads processes on the server side but needs better installation syntax
One area for improvement is the installation process. Another point could be the syntax, as it sometimes involves using syntax names that are not intuitive. For example, to calculate the difference between two dates, the general syntax in SAS is called the data difference or data net function. However, another name is used, such as NF and INK. Without knowledge of SAS programming, it becomes unclear what these functions mean. It is not good to define function names this way.
SS
Enterprise Solutions Architect at an energy/utilities company with 1,001-5,000 employees
Enables effective batch loading with visual interface and enterprise support
One issue I observed with StreamSets is that the memory runs out quickly when processing large volumes of data. Because of this memory issue, we have to upgrade our EC2 boxes in the Amazon AWS infrastructure. I had to switch to a new EC2 box, even though the processor was not fully utilized. It would be beneficial if StreamSets addressed any potential memory leak issues to prevent unnecessary upgrades. Additionally, it would be a great enhancement if StreamSets could produce a lineage graph to visualize how the data has passed through the system.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The most valuable feature of the solution is its amazing capabilities in regard to data handling."
"A key feature allows us to enhance job performance by offloading processing to the server side, rather than processing on the server itself."
"The solution is very stable."
"The solution offers very good data manipulation and loading."
"StreamSets data drift feature gives us an alert upfront so we know that the data can be ingested. Whatever the schema or data type changes, it lands automatically into the data lake without any intervention from us, but then that information is crucial to fix for downstream pipelines, which process the data into models, like Tableau and Power BI models. This is actually very useful for us. We are already seeing benefits. Our pipelines used to break when there were data drift changes, then we needed to spend about a week fixing it. Right now, we are saving one to two weeks. Though, it depends on the complexity of the pipeline, we are definitely seeing a lot of time being saved."
"StreamSets’ data drift resilience has reduced the time it takes us to fix data drift breakages. For example, in our previous Hadoop scenario, when we were creating the Sqoop-based processes to move data from source to destinations, we were getting the job done. That took approximately an hour to an hour and a half when we did it with Hadoop. However, with the StreamSets, since it works on a data collector-based mechanism, it completes the same process in 15 minutes of time. Therefore, it has saved us around 45 minutes per data pipeline or table that we migrate. Thus, it reduced the data transfer, including the drift part, by 45 minutes."
"The ETL capabilities are very useful for us. We extract and transform data from multiple data sources, into a single, consistent data store, and then we put it in our systems. We typically use it to connect our Apache Kafka with data lakes. That process is smooth and saves us a lot of time in our production systems."
"The scheduling within the data engineering pipeline is very much appreciated, and it has a wide range of connectors for connecting to any data sources like SQL Server, AWS, Azure, etc. We have used it with Kafka, Hadoop, and Azure Data Factory Datasets. Connecting to these systems with StreamSets is very easy."
"The best thing about StreamSets is its plugins, which are very useful and work well with almost every data source. It's also easy to use, especially if you're comfortable with SQL. You can customize it to do what you need. Many other tools have started to use features similar to those introduced by StreamSets, like automated workflows that are easy to set up."
"It is really easy to set up and the interface is easy to use."
"What I love the most is that StreamSets is very light. It's a containerized application. It's easy to use with Docker. If you are a large organization, it's very easy to use Kubernetes."
"The most valuable features are the option of integration with a variety of protocols, languages, and origins."
 

Cons

"The transform tool has limited access. They should make it more flexible."
"The initial setup of SAS Data Integration Server was complex."
"One area for improvement is the installation process."
"The initial setup had issues, and even after using it for about one year, it was still not fixed."
"So I would like to see improved integration with other software."
"One thing that I would like to add is the ability to manually enter data. The way the solution currently works is we don't have the option to manually change the data at any point in time. Being able to do that will allow us to do everything that we want to do with our data. Sometimes, we need to manually manipulate the data to make it more accurate in case our prior bifurcation filters are not good. If we have the option to manually enter the data or make the exact iterations on the data set, that would be a good thing."
"One area for improvement could be the cloud storage server speed, as we have faced some latency issues here and there."
"The logging mechanism could be improved. If I am working on a pipeline, then create a job out of it and it is running, it will generate constant logs. So, the logging mechanism could be simplified. Now, it is a bit difficult to understand and filter the logs. It takes some time."
"The software is very good overall. Areas for improvement are the error logging and the version history. I would like to see better, more detailed error logging information."
"The documentation is inadequate and has room for improvement because the technical support does not regularly update their documentation or the knowledge base."
"Sometimes, it is not clear at first how to set up nodes. A site with an explanation of how each node works would be very helpful."
"If you use JDBC Lookup, for example, it generally takes a long time to process data."
"The execution engine could be improved. When I was at their session, they were using some obscure platform to run. There is a controller, which controls what happens on that, but you should be able to easily do this at any of the cloud services, such as Google Cloud. You shouldn't have any issues in terms of how to run it with their online development platform or design platform, basically their execution engine. There are issues with that."
 

Pricing and Cost Advice

"It is an expensive program."
"The overall cost is very flexible so it is not a burden for our organization... However, the cost should be improved. For small and mid-size organizations it might be a challenge."
"There are different versions of the product. One is the corporate license version, and the other one is the open-source or free version. I have been using the corporate license version, but they have recently launched a new open-source version so that anybody can create an account and use it. The licensing cost varies from customer to customer. I don't have a lot of input on that. It is taken care of by PMO, and they seem fine with its pricing model. It is being used enterprise-wide. They seem to have got a good deal for StreamSets."
"StreamSets is an expensive solution."
"The pricing is affordable for any business."
"Its pricing is pretty much up to the mark. For smaller enterprises, it could be a big price to pay at the initial stage of operations, but the moment you have the Seed B or Seed C funding and you want to scale up your operations and aren't much worried about the funds, at that point in time, you would need a solution that could be scaled."
"It's not so favorable for small companies."
"The pricing is too fixed. It should be based on how much data you need to process. Some businesses are not so big that they process a lot of data."
"I believe the pricing is not equitable."
 

Top Industries

By visitors reading reviews

SAS Data Integration Server
Financial Services Firm: 26%
Computer Software Company: 8%
Insurance Company: 7%
Healthcare Company: 7%

StreamSets
Insurance Company: 8%
Financial Services Firm: 8%
Manufacturing Company: 8%
Computer Software Company: 8%
 

Company Size

By reviewers

SAS Data Integration Server
No data available

StreamSets
Company Size          Count
Small Business        9
Midsize Enterprise    2
Large Enterprise      11
 

Questions from the Community

What needs improvement with SAS Data Integration Server?
One area for improvement is the installation process. Another point could be the syntax, as it sometimes involves using syntax names that are not intuitive. For example, to calculate the difference...
What is your primary use case for SAS Data Integration Server?
I am involved in the ETL job. My role is focused on executing the ETL job.
What advice do you have for others considering SAS Data Integration Server?
I use it without further details. For example, if I use SAS to connect to a Netezza database or purchase a shared asset to ODBC, I can connect to any database with ODBC connection support. The over...
What do you like most about StreamSets?
The best thing about StreamSets is its plugins, which are very useful and work well with almost every data source. It's also easy to use, especially if you're comfortable with SQL. You can customiz...
What needs improvement with StreamSets?
One issue I observed with StreamSets is that the memory runs out quickly when processing large volumes of data. Because of this memory issue, we have to upgrade our EC2 boxes in the Amazon AWS infr...
What is your primary use case for StreamSets?
We are using StreamSets for batch loading.
 

Also Known As

SAS Data Integration Server: SAS Enterprise Data Integration Server, Enterprise Data Integration Server
StreamSets: No data available
 

Sample Customers

SAS Data Integration Server: Credit Guarantee Corporation, Crédito y Caución, Delaware State Police, Deutsche Lufthansa, Directorate of Economics and Statistics, DSM, Livzon Pharmaceutical Group, Los Angeles County, Miami Herald Media Company, Netherlands Enterprise Agency, New Zealand Ministry of Health, Nippon Paper, West Midlands Police, XS Inc., Zenith Insurance
StreamSets: Availity, BT Group, Humana, Deluxe, GSK, RingCentral, IBM, Shell, SamTrans, State of Ohio, TalentFulfilled, TechBridge
Find out what your peers are saying about SAS Data Integration Server vs. StreamSets and other solutions. Updated: January 2026.
881,114 professionals have used our research since 2012.