Fivetran vs StreamSets comparison

 

Comparison Buyer's Guide

Categories and Ranking

Fivetran
Ranking in Data Integration: 13th
Average Rating: 8.0
Number of Reviews: 21
Ranking in other categories: Data Replication (3rd), Cloud Data Integration (6th)

StreamSets
Ranking in Data Integration: 8th
Average Rating: 8.4
Number of Reviews: 24
Ranking in other categories: None
 

Mindshare comparison

As of July 2024, in the Data Integration category, Fivetran's mindshare is 2.7%, up from 1.7% a year earlier. StreamSets' mindshare is 1.9%, up from 1.4% a year earlier. Mindshare is calculated from PeerSpot user engagement data.
Unique categories:
Fivetran: Data Replication (1.1%), Cloud Data Integration (5.6%)
StreamSets: No other categories found
 

Featured Reviews

KK
Nov 29, 2022
Easy integration for onboarding new sources of data, but the logging is complicated
One of the traditional issues with the platform has been logging. The logging, while they have it, is not particularly verbose, so when there are issues it becomes hard to do. They also have internal logs versus customer-facing logs. We've asked Fivetran to provide more exposure on that or to be able to subscribe to it via an API or Datadog or something like that to pull from their system. Another thing is mainly their breadth of being able to pull from different systems. They have some of this already, but they're pushing to do some integrations with Excel online. Some of the pain points we're looking at are trying to integrate some of the items in the Microsoft stack, so SharePoint and Excel, and then some of the newer Azure services.
MI
Mar 17, 2023
It's lightweight and well-integrated, and it saves a lot of money and time
There are so many things that need to be improved. For the StreamSets cloud user interface, there aren't enough use cases and examples for the main problems. In addition, the hybrid data sets cannot be joined in a data connector, which is a significant limitation. There aren't enough hands-on labs, and debugging is also an issue because it takes a lot of time. Logs are not that clear when you are debugging, and you can only select a single source for a pipeline. It isn't helpful when you need to apply the same logic for multiple sources. It becomes difficult because you need to create more pipelines and then add coordination between them. Initially, it's hard to find out or master the logic behind it. It can be hard if you aren't technical enough. There is scope for improvement because it's not straightforward. You need to go through the documentation and make sure that you understand every step. For me, it was a challenging model.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The most important feature of the solution is its ability to build data pipelines in less time."
"Fivetran offers native connectivity with Sybase."
"Fivetran can perform data migration incredibly fast, depending on the source and target."
"The product has some seamless connectors, which are readily available."
"You can manage all of your connectors individually, which gives you a very good ability to trace which one of your ETL processes is running and when."
"Its arrays are powerful enough to handle migrations even when the replication is happening in the background, without causing any trouble with the ongoing traffic."
"Making the decision to implement Fivetran was supported by the fact that they have better connectors than other competitors."
"The solution is stable. We've never faced any stability issues."
"The entire user interface is very simple and the simplicity of creating pipelines is something that I like very much about it. The design experience is very smooth."
"The scheduling within the data engineering pipeline is very much appreciated, and it has a wide range of connectors for connecting to any data sources like SQL Server, AWS, Azure, etc. We have used it with Kafka, Hadoop, and Azure Data Factory Datasets. Connecting to these systems with StreamSets is very easy."
"StreamSets’ data drift resilience has reduced the time it takes us to fix data drift breakages. For example, in our previous Hadoop scenario, when we were creating the Sqoop-based processes to move data from source to destinations, we were getting the job done. That took approximately an hour to an hour and a half when we did it with Hadoop. However, with the StreamSets, since it works on a data collector-based mechanism, it completes the same process in 15 minutes of time. Therefore, it has saved us around 45 minutes per data pipeline or table that we migrate. Thus, it reduced the data transfer, including the drift part, by 45 minutes."
"Important features include that it comprises lots of functionality to connect data from various sources through connector availability, scheduling pipelines at any time, and integration with third-party and security solutions for encryption."
"For me, the most valuable features in StreamSets have to be the Data Collector and Control Hub, but especially the Data Collector. That feature is very elegant and seamlessly works with numerous source systems."
"Also, the intuitive canvas for designing all the streams in the pipeline, along with the simplicity of the entire product are very big pluses for me. The software is very simple and straightforward. That is something that is needed right now."
"The UI is user-friendly, it doesn't require any technical know-how and we can navigate to social media or use it more easily."
"The ease of configuration for pipes is amazing. It has a lot of connectors. Mainly, we can do everything with the data in the pipe. I really like the graphical interface too."
 

Cons

"Fivetran is very expensive for data sources with a lot of rows, such as email data. I would like to see cheaper pricing for data sources like that."
"I would like Fivetran to implement additional resource monitoring and restriction policies."
"We use a separate tool for 'reverse ETL', which is the opposite of what Fivetran does; it pushes data from your data warehouse back out to business applications. If Fivetran pulls data from those same applications, they should also enable users to push it back. I would love to do both ETL and reverse ETL in the same tool."
"Some of the pain points we're looking at are trying to integrate some of the items in the Microsoft stack, so SharePoint and Excel, and then some of the newer Azure services."
"The documentation can be laid out better to make it easier to find things, and I really wish there was built-in support for changing passwords. Some features don't work as advertised for the platform/repository database, and HVR is not always the fastest at getting results."
"The interface needs to be more user-friendly."
"The connections with SAP must be improved."
"The environment must be more development-friendly."
"The design experience is the bane of our existence because their documentation is not the best. Even when they update their software, they don't publish the best information on how to update and change your pipeline configuration to make it conform to current best practices. We don't pay for the added support. We use the 'freeware version.' The user community, as well as the documentation they provide for the standard user, are difficult, at best."
"The monitoring visualization is not that user-friendly. It should include other features to visualize things, like how many records were streamed from a source to a destination on a particular date."
"StreamSets should provide a mechanism to be able to perform data quality assessment when the data is being moved from one source to the target."
"The logging mechanism could be improved. If I am working on a pipeline, then create a job out of it and it is running, it will generate constant logs. So, the logging mechanism could be simplified. Now, it is a bit difficult to understand and filter the logs. It takes some time."
"The software is very good overall. Areas for improvement are the error logging and the version history. I would like to see better, more detailed error logging information."
"The execution engine could be improved. When I was at their session, they were using some obscure platform to run. There is a controller, which controls what happens on that, but you should be able to easily do this at any of the cloud services, such as Google Cloud. You shouldn't have any issues in terms of how to run it with their online development platform or design platform, basically their execution engine. There are issues with that."
"One area for improvement could be the cloud storage server speed, as we have faced some latency issues here and there."
"There aren't enough hands-on labs, and debugging is also an issue because it takes a lot of time. Logs are not that clear when you are debugging, and you can only select a single source for a pipeline."
 

Pricing and Cost Advice

"In the first year, we were given a very good discount. It was approximately 20,000 Euros per year. In the third year, we purchased credit for two years and the price was 33,000 Euros per year."
"When you have a lot of workflows and complex use cases, pricing goes down as you use it more."
"The solution is affordable."
"I've heard that the license for HVR is a bit costly compared to its competitors, but since it's reliable and efficient, I think the customer shouldn't be bothered about the cost."
"I don't have the exact information, but I know it is high, and it is on a yearly basis. There is no additional cost for what we're doing. We're always open to doing things cheaper, so we might potentially implement a different solution."
"I rate the pricing a six out of ten."
"The pricing model is okay and mid to large companies will not have an issue with it."
"Fivetran is very expensive, and its database-driven pricing model is outdated."
"There are different versions of the product. One is the corporate license version, and the other one is the open-source or free version. I have been using the corporate license version, but they have recently launched a new open-source version so that anybody can create an account and use it. The licensing cost varies from customer to customer. I don't have a lot of input on that. It is taken care of by PMO, and they seem fine with its pricing model. It is being used enterprise-wide. They seem to have got a good deal for StreamSets."
"The pricing is affordable for any business."
"StreamSets is an expensive solution."
"Its pricing is pretty much up to the mark. For smaller enterprises, it could be a big price to pay at the initial stage of operations, but the moment you have the Seed B or Seed C funding and you want to scale up your operations and aren't much worried about the funds, at that point in time, you would need a solution that could be scaled."
"It's not expensive because you pay per month, and the tasks you can perform with it are huge. It's reliable and cost-effective."
"The overall cost is very flexible so it is not a burden for our organization... However, the cost should be improved. For small and mid-size organizations it might be a challenge."
"We are running the community version right now, which can be used free of charge."
"StreamSets Data Collector is open source. One can utilize the StreamSets Data Collector, but the Control Hub is the main repository where all the jobs are present. Everything happens in Control Hub."
793,295 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews

Fivetran
Educational Organization: 25%
Computer Software Company: 12%
Financial Services Firm: 11%
Manufacturing Company: 9%

StreamSets
Financial Services Firm: 17%
Computer Software Company: 13%
Manufacturing Company: 8%
Government: 7%
 

Company Size

By reviewers: Large Enterprise, Midsize Enterprise, Small Business
 

Questions from the Community

What's the deal with the HVR software acquisition?
As a user of HVR Software I followed this deal closely. Fivetran is apparently trying to establish more in its sector and by buying an already established data replication software, they become som...
Does HVR Software provide reliable insights?
I honestly can't think of another data replication software that can give you better statistics and insight than HVR Software. There's the feature for topology and statistics and both of them can ...
How much traffic can HVR Software handle?
As someone who works at a company where a high volume of information is replicated and who has tried several data replication tools, I can tell you that you're looking at the right one. HVR Softwar...
What do you like most about StreamSets?
The best thing about StreamSets is its plugins, which are very useful and work well with almost every data source. It's also easy to use, especially if you're comfortable with SQL. You can customiz...
What needs improvement with StreamSets?
We often faced problems, especially with SAP ERP. We struggled because many columns weren't integers or primary keys, which StreamSets couldn't handle. We had to restructure our data tables, which ...
What is your primary use case for StreamSets?
StreamSets is used for data transformation rather than ETL processes. It focuses on transforming data directly from sources without handling the extraction part of the process. The transformed data...
 

Sample Customers

Fivetran: DocuSign, Oldcastle Infrastructure, Crossmedia, Talkdesk, Chubbies, Brandwatch
StreamSets: Availity, BT Group, Humana, Deluxe, GSK, RingCentral, IBM, Shell, SamTrans, State of Ohio, TalentFulfilled, TechBridge
Find out what your peers are saying about Fivetran vs. StreamSets and other solutions. Updated: July 2024.