
StreamSets vs Tray.io comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

StreamSets
Average Rating: 8.4
Reviews Sentiment: 7.0
Number of Reviews: 21
Ranking in other categories: Data Integration (25th)

Tray.io
Average Rating: 7.4
Reviews Sentiment: 5.9
Number of Reviews: 3
Ranking in other categories: Process Automation (55th), Cloud Data Integration (30th), Low-Code Development Platforms (74th), Integration Platform as a Service (iPaaS) (24th)
 

Featured Reviews

Enterprise Solutions Architect at an energy/utilities company with 1,001-5,000 employees
Enables effective batch loading with visual interface and enterprise support
One issue I observed with StreamSets is that the memory runs out quickly when processing large volumes of data. Because of this memory issue, we have to upgrade our EC2 boxes in the Amazon AWS infrastructure. I had to switch to a new EC2 box, even though the processor was not fully utilized. It would be beneficial if StreamSets addressed any potential memory leak issues to prevent unnecessary upgrades. Additionally, it would be a great enhancement if StreamSets could produce a lineage graph to visualize how the data has passed through the system.
Principal AI and Data Science Engineer at a manufacturing company with 10,001+ employees
Marketing A/B tests have gained deeper insights from user behavior and unified global reporting
The best features Tray.io offers include excellent visualization capabilities and a dashboard, which stand out to me the most. I appreciate that it is very easy to convert the data we receive from Tray.io into dashboards from Power BI, which is extremely useful. I would also appreciate if in the future Tray.io provides a way to natively convert the data to Tableau. Tray.io has positively impacted my organization as it provides a trusted way to organize data results and share them throughout the company at once. As a multinational and very large company, it is definitely beneficial that those of us in the UK can use the same format that colleagues use in India, and the entire data architecture is framed within a trusted system from an established organization. As far as I know, Tray.io has been operating for the last 12 years, making it a very reliable system.

Quotes from Members

 

Pros

"What I love the most is that StreamSets is very light. It's a containerized application. It's easy to use with Docker. If you are a large organization, it's very easy to use Kubernetes."
"The ETL capabilities are very useful for us. We extract and transform data from multiple data sources, into a single, consistent data store, and then we put it in our systems. We typically use it to connect our Apache Kafka with data lakes. That process is smooth and saves us a lot of time in our production systems."
"StreamSets is the leader in the market."
"The scheduling within the data engineering pipeline is very much appreciated, and it has a wide range of connectors for connecting to any data sources like SQL Server, AWS, Azure, etc. We have used it with Kafka, Hadoop, and Azure Data Factory Datasets. Connecting to these systems with StreamSets is very easy."
"This product was a lot easier to use than the one we had before it, and it took us half an hour and we were set up and running it the first time."
"I really appreciate the numerous ready connectors available on both the source and target sides, the support for various media file formats, and the ease of configuring and managing pipelines centrally."
"The ability to have a good bifurcation rate and fewer mistakes is valuable."
"For me, the most valuable features in StreamSets have to be the Data Collector and Control Hub, but especially the Data Collector. That feature is very elegant and seamlessly works with numerous source systems."
"Tray.io has positively impacted my organization by reducing the amount of redundant tasks that our team performs by approximately 80%, and the numbers are quite significant with the workflows alone, as we are working towards creating and utilizing AI within these workflows as well."
"Tray.io has positively impacted my organization by helping us keep our internal database and this third-party service in sync, and it has really helped us automate a lot of that work because it is fairly straightforward to maintain and develop."
"Tray.io has positively impacted my organization as it provides a trusted way to organize data results and share them throughout the company at once."
 

Cons

"One area for improvement could be the cloud storage server speed, as we have faced some latency issues here and there."
"The execution engine could be improved. When I was at their session, they were using some obscure platform to run. There is a controller, which controls what happens on that, but you should be able to easily do this at any of the cloud services, such as Google Cloud. You shouldn't have any issues in terms of how to run it with their online development platform or design platform, basically their execution engine. There are issues with that."
"StreamSets should provide a mechanism to be able to perform data quality assessment when the data is being moved from one source to the target."
"One thing that I would like to add is the ability to manually enter data. The way the solution currently works is we don't have the option to manually change the data at any point in time. Being able to do that will allow us to do everything that we want to do with our data. Sometimes, we need to manually manipulate the data to make it more accurate in case our prior bifurcation filters are not good. If we have the option to manually enter the data or make the exact iterations on the data set, that would be a good thing."
"One issue I observed with StreamSets is that the memory runs out quickly when processing large volumes of data. Because of this memory issue, we have to upgrade our EC2 boxes in the Amazon AWS infrastructure."
"The design experience is the bane of our existence because their documentation is not the best. Even when they update their software, they don't publish the best information on how to update and change your pipeline configuration to make it conform to current best practices. We don't pay for the added support. We use the "freeware version." The user community, as well as the documentation they provide for the standard user, are difficult, at best."
"There aren't enough hands-on labs, and debugging is also an issue because it takes a lot of time. Logs are not that clear when you are debugging, and you can only select a single source for a pipeline."
"We've seen a couple of cases where it appears to have a memory leak or a similar problem."
"One way Tray.io could be improved, especially for people coming in with no real coding experience, is with more comprehensive error messages."
"I have found that the error management in my main use case with Tray.io is not as effective as we would prefer."
"As our product got more complex, we needed to add more and more complexity to Tray.io in terms of our setup, and that is when the benefits of it being no-code or low-code started to pale in comparison to the cost of making everything slightly more complicated."
 

Pricing and Cost Advice

"It's not so favorable for small companies."
"We use the free version. It's great for a public, free release. Our stance is that the paid support model is too expensive to get into. They should honestly reevaluate that."
"We are running the community version right now, which can be used free of charge."
"There are different versions of the product. One is the corporate license version, and the other one is the open-source or free version. I have been using the corporate license version, but they have recently launched a new open-source version so that anybody can create an account and use it. The licensing cost varies from customer to customer. I don't have a lot of input on that. It is taken care of by PMO, and they seem fine with its pricing model. It is being used enterprise-wide. They seem to have got a good deal for StreamSets."
"StreamSets is an expensive solution."
"The overall cost is very flexible so it is not a burden for our organization... However, the cost should be improved. For small and mid-size organizations it might be a challenge."
"There are two editions, Professional and Enterprise, and there is a free trial. We're using the Professional edition and it is competitively priced."
"It has a CPU core-based licensing, which works for us and is quite good."
No pricing information is available for Tray.io.
885,311 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm: 9%
Manufacturing Company: 8%
Insurance Company: 8%
Real Estate/Law Firm: 6%

Computer Software Company: 13%
Construction Company: 12%
Outsourcing Company: 10%
Wholesaler/Distributor: 8%
 

Company Size

By reviewers
Company Size: Count
Small Business: 9
Midsize Enterprise: 2
Large Enterprise: 11
No data available
 

Questions from the Community

What do you like most about StreamSets?
The best thing about StreamSets is its plugins, which are very useful and work well with almost every data source. It's also easy to use, especially if you're comfortable with SQL. You can customiz...
What needs improvement with StreamSets?
One issue I observed with StreamSets is that the memory runs out quickly when processing large volumes of data. Because of this memory issue, we have to upgrade our EC2 boxes in the Amazon AWS infr...
What is your primary use case for StreamSets?
We are using StreamSets for batch loading.
What needs improvement with Tray.io?
It was not always easy to test our changes in Tray.io. In a software engineering context, you might imagine there being different branches and different environments to make changes where you can m...
What is your primary use case for Tray.io?
We run automation workflows with Tray.io to take data from our internal databases and update a third-party software that we use. Specifically, my company offers digital trade credit, and we use tha...
What advice do you have for others considering Tray.io?
I think in general it has been a positive experience with Tray.io, and while there are areas where it can be improved, overall it is a good product. I rate Tray.io a seven out of ten.
 


Sample Customers

StreamSets: Availity, BT Group, Humana, Deluxe, GSK, RingCentral, IBM, Shell, SamTrans, State of Ohio, TalentFulfilled, TechBridge
Tray.io: Copper, DigitalOcean, Udemy, AdRoll, FICO, Outreach
Updated: March 2026.