Miriam Tover - PeerSpot reviewer
Service Delivery Manager at PeerSpot (formerly IT Central Station)

What needs improvement with StreamSets?

Please share with the community what you think needs improvement with StreamSets.

What are its weaknesses? What would you like to see changed in a future version?

2 Answers
Data Engineer at an energy/utilities company with 10,001+ employees
Real User
Top 10
19 November 20

We've seen a couple of cases where it appears to have a memory leak or a similar problem. Memory usage grows for a while, and then we have to restart the container, maybe once a month, when usage gets high.

Senior Technical Manager at a financial services firm with 501-1,000 employees
Real User
08 August 18

I would like to see it integrate with platforms other than Java. We're going to have a lot of applications using .NET and other languages or frameworks. StreamSets is very helpful for the old Java platform, but it's hard to integrate with other platforms and frameworks. StreamSets works great for batch processing, but we are looking for something that is more real-time. We need sub-millisecond latency.

Learn what your peers think about StreamSets. Get advice and tips from experienced pros sharing their opinions. Updated: September 2022.
633,572 professionals have used our research since 2012.
Related Questions
Miriam Tover - PeerSpot reviewer
Service Delivery Manager at PeerSpot (formerly IT Central Station)
Aug 21, 2022
If you were talking to someone whose organization is considering StreamSets, what would you say? How would you rate it and why? Any other tips or advice?
2 out of 4 answers
AbhishekKatara - PeerSpot reviewer
Technical Lead at Sopra Steria
15 May 22
For people who are starting out, the simple advice is to first try out the cloud login of StreamSets. It is freely available for everyone these days. StreamSets has released its online practice platform to design and create pipelines. Someone simply needs to go to StreamSets' official website. It is there that people who are starting out can log into StreamSets cloud and spin up their StreamSets Data Collector machines. Then, they can choose their execution mode. It is all in a Docker-containerized fashion. You don't need to do anything. You simply need to have your laptop ready, and step-by-step instructions are given. You simply spin up your Data Collector, choose the execution mode, and then you are ready with the canvas. You can design your pipeline, practice, and test there. So, if you want to evaluate StreamSets in basic mode, you can take a look online. This is the easiest way to evaluate StreamSets. It is a drag-and-drop, UI-based approach with a canvas, where you design the pipeline. It is pretty easy to follow. Once your team feels confident, they can purchase the StreamSets add-ons, which will provide them with end-to-end solutions and vendor support. The best way is to log into their cloud practice platform and create some pipelines. In my current project, there is a requirement to integrate with Snowflake, but I don't have Snowflake experience. I have not integrated Snowflake with StreamSets yet. I personally love working on StreamSets. It is part of my day-to-day activities. I do a lot of work on StreamSets, so I would rate it highly, at nine out of 10.
Senior Data Engineer at an energy/utilities company with 1,001-5,000 employees
09 June 22
Every tool in the market at the moment has some major gaps, especially for large enterprises. It could be the way that the data or pipeline is secured. At present, StreamSets looks like the market leader and is trying to fill that gap. For anyone going through a proof of concept for various tools, StreamSets is almost at the top. I don't think that they need to look any further. We are working only with API, a relational database management system, and our enterprise warehouses at the moment. We are not using any streaming sort of ingestion at the moment. We are not using Snowflake Transformer yet. It just got released. We are using a traditional Snowflake destination stage because our enterprise is huge. We have our own Snowflake architecture. We load the security in the data into our own databases using the destination stage, not Transformer yet. I would rate the solution as 7.5 out of 10.
Nurit Sherman - PeerSpot reviewer
Content Operations Manager at PeerSpot (formerly IT Central Station)
May 07, 2018
One of the most popular comparisons on IT Central Station is Informatica PowerCenter vs SSIS. In your opinion, which is better and why? Thanks! --Rhea
2 out of 22 answers
PeerSpot user
ETL Solution Architect at a tech services company
11 April 18
Hands down, Informatica. I've used both. Informatica is system- and DB-agnostic and can run on any platform. If it is set up correctly, as I do in my architecture, moving from the platform (Windows) to Linux or AIX would take less than a day. Also, Informatica is highly parameterizable and easier to develop in. The shared objects ability is a huge time-saving advantage. Informatica also has built-in version control.
it_user850776 - PeerSpot reviewer
User at a tech company with 51-200 employees
11 April 18
Please find the attachment, which should help answer the question. Please let me know if anything else is required.