AWS Glue vs StreamSets comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

ROI

Sentiment score: 6.9
AWS Glue delivers cost-effective efficiency, reducing setup costs and time while providing favorable ROI for limited pipeline needs.
Sentiment score: 8.1
StreamSets speeds up data processing, boosts efficiency and revenue, simplifies tasks, enhances security, and reduces costs significantly.
I advocate using Glue in such cases.
 

Customer Service

Sentiment score: 6.8
AWS Glue users appreciate technical support for its reliability and helpfulness, but desire improvements in response consistency and fees.
Sentiment score: 6.7
StreamSets support is responsive and knowledgeable, offering effective solutions, though response times and technical handling could improve.
AWS's documentation is reliable, and consulting it carefully often resolves details missed during upgrades.
IBM technical support sometimes transfers tickets between different teams due to shift changes, which can be frustrating.
 

Scalability Issues

Sentiment score: 8.0
AWS Glue offers seamless, scalable data handling with serverless architecture, though some see potential for resource management improvements.
Sentiment score: 7.6
StreamSets is scalable and flexible, favored for cloud use but could improve auto-scaling for large data migrations.
It can easily handle data from one terabyte to 100 terabytes or more, scaling nicely with larger datasets.
For jobs that require more RAM, we increase the number of workers accordingly.
 

Stability Issues

Sentiment score: 8.0
AWS Glue is highly reliable, seamlessly integrates with AWS, and benefits from serverless architecture, managed support, and easy troubleshooting.
Sentiment score: 7.8
StreamSets is praised for stability and reliability, despite minor memory issues, with high user ratings and market competitiveness.
AWS Glue is highly stable, and I would rate its stability as nine.
 

Room For Improvement

AWS Glue users seek quicker start-up, better UI, Java support, multi-cloud compatibility, cost reduction, and improved documentation and support.
StreamSets struggles with integration, real-time processing, clarity in UI, memory issues, security, documentation, and cloud storage performance.
A more user-friendly and simpler process would help speed up the deployment process.
With AWS, I gather data from multiple sources, clean it up, normalize it, de-duplicate it, and make it presentable.
Migrating jobs from version 3.0 to 4.0 can present compatibility issues.
It would be beneficial if StreamSets addressed any potential memory leak issues to prevent unnecessary upgrades.
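The gather, clean, normalize, and de-duplicate workflow that one reviewer describes above can be sketched in plain Python. This is a minimal illustration under stated assumptions, not AWS Glue or StreamSets code: the record fields (`email`, `name`) and the function names are hypothetical.

```python
from typing import Iterable


def normalize(record: dict) -> dict:
    """Trim whitespace and lowercase the email so equivalent records compare equal."""
    return {
        "email": (record.get("email") or "").strip().lower(),
        "name": " ".join((record.get("name") or "").split()),
    }


def clean_and_dedupe(*sources: Iterable[dict]) -> list[dict]:
    """Merge several sources, drop records without an email, keep the first copy of each."""
    seen: set[str] = set()
    result: list[dict] = []
    for source in sources:
        for raw in source:
            record = normalize(raw)
            if not record["email"]:
                continue  # cleaning step: skip records missing the key field
            if record["email"] in seen:
                continue  # de-duplication step: keep only the first occurrence
            seen.add(record["email"])
            result.append(record)
    return result
```

In a managed ETL tool the same steps would typically be expressed as pipeline stages rather than hand-written loops; the sketch only shows the order of operations.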
 

Setup Cost

AWS Glue offers flexible pay-as-you-go pricing that is generally affordable but can be costly for smaller firms, with extra costs for technical support.
StreamSets provides flexible pricing models, with varied user satisfaction, favoring larger enterprises over smaller companies due to cost.
The smallest cost for a project is around €700, while the largest can reach up to €7,000 based on the scale of the usage.
AWS charges based on runtime, which can be quite pricey.
Costing depends on resource usage, and cost optimization may involve redesigning jobs for flexibility.
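Runtime-based pricing, as the reviewers describe it, comes down to simple arithmetic: capacity used, multiplied by hours run, multiplied by a unit rate. The sketch below assumes billing per DPU-hour (data processing unit); the rate and job sizes are illustrative placeholders rather than quoted prices, and `estimate_job_cost` is a hypothetical helper.

```python
def estimate_job_cost(dpus: int, runtime_minutes: float, rate_per_dpu_hour: float) -> float:
    """Estimate the cost of one job run billed on capacity and runtime."""
    if dpus <= 0 or runtime_minutes < 0 or rate_per_dpu_hour < 0:
        raise ValueError("dpus must be positive; runtime and rate must be non-negative")
    hours = runtime_minutes / 60
    return round(dpus * hours * rate_per_dpu_hour, 2)


# Assumed rate for illustration only; check the current price list for your region.
ASSUMED_RATE_PER_DPU_HOUR = 0.44

# Compare the same 30-minute job at two capacity settings.
print(estimate_job_cost(10, 30, ASSUMED_RATE_PER_DPU_HOUR))
print(estimate_job_cost(5, 30, ASSUMED_RATE_PER_DPU_HOUR))
```

Under this model, halving either the capacity or the runtime halves the estimate, which is consistent with the reviewers' point that redesigning jobs can reduce cost.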
 

Valuable Features

AWS Glue features a user-friendly, scalable interface, ensuring seamless integration, effective ETL tasks, and cost-effective data transformation.
StreamSets offers an intuitive interface, extensive connectors, and features accessible to non-technical users for seamless data integration and manipulation.
AWS Glue's most valuable features include its transformation capabilities, which provide data quality and shape for processing in ML or AI models.
For ETL, I feel the performance is excellent. If I create jobs in a standard way, the performance is great, and maintenance is also seamless.
I think if I'm working with big data, common languages like Python work quite nicely, which is advantageous.
It allows a hybrid installation approach, rather than being completely cloud-based or on-premises.
 

Categories and Ranking

AWS Glue
Average Rating: 7.8
Reviews Sentiment: 7.0
Number of Reviews: 49
Ranking in other categories: Cloud Data Integration (1st)
StreamSets
Average Rating: 8.4
Reviews Sentiment: 7.0
Number of Reviews: 21
Ranking in other categories: Data Integration (16th)
 

Featured Reviews

Saurabh Jaiswal - PeerSpot reviewer
Enables seamless integration and data preparation with robust transformation capabilities
AWS Glue's most valuable features include its transformation capabilities, which provide data quality and shape for processing in ML or AI models. It offers transformation options on canvas or through ETL pipelines, notebooks, and code. Additionally, it supports data preparation, cleaning, and filtering seamlessly. AWS Glue also enhances job scheduling and orchestration capabilities, integrating with AWS Glue Studio for comprehensive data workflow management.
Karthik Rajamani - PeerSpot reviewer
Integrates with different enterprise systems and enables us to easily build data pipelines without knowing how to code
There are a few things that can be better. We create pipelines or jobs in StreamSets Control Hub. It is a great feature, but if there is a way to have a folder structure or organize the pipelines and jobs in Control Hub, it would be great. I submitted a ticket for this some time back. There are certain features that are only available at certain stages. For example, HTTP Client has some great features when it is used as a processor, but those features are not available in HTTP Client as a destination. There could be some improvements on the group side. Currently, if I want to know which users are a part of certain groups, it is not straightforward to see. You have to go to each and every user and check the groups he or she is a part of. They could improve it in that direction. Currently, we have to put in a manual effort. In case something goes wrong, we have to go to each and every user account to check whether he or she is a part of a certain group or not.
850,900 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm 21%; Computer Software Company 13%; Manufacturing Company 8%; Government 6%
Financial Services Firm 13%; Computer Software Company 11%; Manufacturing Company 10%; Insurance Company 9%
 

Company Size

By reviewers: Large Enterprise, Midsize Enterprise, Small Business
 

Questions from the Community

How do you select the right cloud ETL tool?
AWS Glue and Azure Data Factory are the best-performing cloud services for ELT.
How does Talend Open Studio compare with AWS Glue?
We reviewed AWS Glue before choosing Talend Open Studio. AWS Glue is the managed ETL (extract, transform, and load) from Amazon Web Services. AWS Glue enables AWS users to create and manage jobs in...
What are the most common use cases for AWS Glue?
AWS Glue's main use case is for allowing users to discover, prepare, move, and integrate data from multiple sources. The product lets you use this data for analytics, application development, or ma...
What do you like most about StreamSets?
The best thing about StreamSets is its plugins, which are very useful and work well with almost every data source. It's also easy to use, especially if you're comfortable with SQL. You can customiz...
What needs improvement with StreamSets?
One issue I observed with StreamSets is that the memory runs out quickly when processing large volumes of data. Because of this memory issue, we have to upgrade our EC2 boxes in the Amazon AWS infr...
What is your primary use case for StreamSets?
We are using StreamSets for batch loading.
 

Sample Customers

bp, Cerner, Expedia, FINRA, HESS, Intuit, Kellogg's, Philips, TIME, Workday
Availity, BT Group, Humana, Deluxe, GSK, RingCentral, IBM, Shell, SamTrans, State of Ohio, TalentFulfilled, TechBridge
Find out what your peers are saying about AWS Glue vs. StreamSets and other solutions. Updated: April 2025.