StreamSets Pros

Karthik Rajamani - PeerSpot reviewer
Principal Engineer at Tata Consultancy Services
I have used Data Collector, Transformer, and Control Hub products from StreamSets. What I really like about these products is that they're very user-friendly. People who are not from a technological or core development background find it easy to get started and build data pipelines and connect to the databases. They would be comfortable like any technical person within a couple of weeks.
View full review »
AbhishekKatara - PeerSpot reviewer
Technical Lead at Sopra Steria
StreamSets’ data drift resilience has reduced the time it takes us to fix data drift breakages. For example, in our previous Hadoop scenario, when we were creating the Sqoop-based processes to move data from source to destinations, we were getting the job done. That took approximately an hour to an hour and a half when we did it with Hadoop. However, with the StreamSets, since it works on a data collector-based mechanism, it completes the same process in 15 minutes of time. Therefore, it has saved us around 45 minutes per data pipeline or table that we migrate. Thus, it reduced the data transfer, including the drift part, by 45 minutes.
View full review »
SS
Senior Data Engineer at a energy/utilities company with 1,001-5,000 employees
StreamSets data drift feature gives us an alert upfront so we know that the data can be ingested. Whatever the schema or data type changes, it lands automatically into the data lake without any intervention from us, but then that information is crucial to fix for downstream pipelines, which process the data into models, like Tableau and Power BI models. This is actually very useful for us. We are already seeing benefits. Our pipelines used to break when there were data drift changes, then we needed to spend about a week fixing it. Right now, we are saving one to two weeks. Though, it depends on the complexity of the pipeline, we are definitely seeing a lot of time being saved.
View full review »
Buyer's Guide
StreamSets
November 2022
Learn what your peers think about StreamSets. Get advice and tips from experienced pros sharing their opinions. Updated: November 2022.
656,474 professionals have used our research since 2012.
BR
Data Engineer at a consultancy with 11-50 employees
In StreamSets, everything is in one place.
View full review »
Prateek Agarwal - PeerSpot reviewer
Manager at NISG
It is a very powerful, modern data analytics solution, in which you can integrate a large volume of data from different sources. It integrates all of the data and you can design, create, and monitor pipelines according to your requirements. It is an all-in-one day data ops solution.
View full review »

StreamSets Cons

Karthik Rajamani - PeerSpot reviewer
Principal Engineer at Tata Consultancy Services
We create pipelines or jobs in StreamSets Control Hub. It is a great feature, but if there is a way to have a folder structure or organize the pipelines and jobs in Control Hub, it would be great. I submitted a ticket for this some time back.
View full review »
AbhishekKatara - PeerSpot reviewer
Technical Lead at Sopra Steria
The logging mechanism could be improved. If I am working on a pipeline, then create a job out of it and it is running, it will generate constant logs. So, the logging mechanism could be simplified. Now, it is a bit difficult to understand and filter the logs. It takes some time.
View full review »
SS
Senior Data Engineer at a energy/utilities company with 1,001-5,000 employees
Currently, we can only use the query to read data from SAP HANA. What we would like to see, as soon as possible, is the ability to read from multiple tables from SAP HANA. That would be a really good thing that we could use immediately. For example, if you have 100 tables in SQL Server or Oracle, then you could just point it to the schema or the 100 tables and ingestion information. However, you can't do that in SAP HANA since StreamSets currently is lacking in this. They do not have a multi-table feature for SAP HANA. Therefore, a multi-table origin for SAP HANA would be helpful.
View full review »
Buyer's Guide
StreamSets
November 2022
Learn what your peers think about StreamSets. Get advice and tips from experienced pros sharing their opinions. Updated: November 2022.
656,474 professionals have used our research since 2012.
BR
Data Engineer at a consultancy with 11-50 employees
If you use JDBC Lookup, for example, it generally takes a long time to process data.
View full review »
Prateek Agarwal - PeerSpot reviewer
Manager at NISG
Sometimes, when we have large amounts of data that is very efficiently stored in Hadoop or Kafka, it is not very efficient to run it through StreamSets, due to the lack of efficiency or the resources that StreamSets is using.
View full review »
Buyer's Guide
StreamSets
November 2022
Learn what your peers think about StreamSets. Get advice and tips from experienced pros sharing their opinions. Updated: November 2022.
656,474 professionals have used our research since 2012.