Coming October 25: PeerSpot Awards will be announced! Learn more

Azure Data Factory vs StreamSets comparison

Cancel
You must select at least 2 products to compare!
Microsoft Logo
34,601 views|28,151 comparisons
StreamSets Logo
5,633 views|3,613 comparisons
Comparison Buyer's Guide
Executive Summary

We performed a comparison between Azure Data Factory and StreamSets based on real PeerSpot user reviews.

Find out in this report how the two Data Integration Tools solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI.

To learn more, read our detailed Azure Data Factory vs. StreamSets report (Updated: August 2022).
636,406 professionals have used our research since 2012.
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"The most valuable feature of Azure Data Factory is that it has a good combination of flexibility, fine-tuning, automation, and good monitoring.""The initial setup is very quick and easy.""Allows more data between on-premises and cloud solutions""The solution can scale very easily.""An excellent tool for pipeline orchestration.""When it comes to our business requirements, this solution has worked well for us. However, we have not stretched it to the limit.""I like that it's a monolithic data platform. This is why we propose these solutions.""Data Factory's best features are connectivity with different tools and focusing data ingestion using pipeline copy data."

More Azure Data Factory Pros →

"StreamSets’ data drift resilience has reduced the time it takes us to fix data drift breakages. For example, in our previous Hadoop scenario, when we were creating the Sqoop-based processes to move data from source to destinations, we were getting the job done. That took approximately an hour to an hour and a half when we did it with Hadoop. However, with the StreamSets, since it works on a data collector-based mechanism, it completes the same process in 15 minutes of time. Therefore, it has saved us around 45 minutes per data pipeline or table that we migrate. Thus, it reduced the data transfer, including the drift part, by 45 minutes.""I have used Data Collector, Transformer, and Control Hub products from StreamSets. What I really like about these products is that they're very user-friendly. People who are not from a technological or core development background find it easy to get started and build data pipelines and connect to the databases. They would be comfortable like any technical person within a couple of weeks.""StreamSets data drift feature gives us an alert upfront so we know that the data can be ingested. Whatever the schema or data type changes, it lands automatically into the data lake without any intervention from us, but then that information is crucial to fix for downstream pipelines, which process the data into models, like Tableau and Power BI models. This is actually very useful for us. We are already seeing benefits. Our pipelines used to break when there were data drift changes, then we needed to spend about a week fixing it. Right now, we are saving one to two weeks. Though, it depends on the complexity of the pipeline, we are definitely seeing a lot of time being saved.""It is a very powerful, modern data analytics solution, in which you can integrate a large volume of data from different sources. It integrates all of the data and you can design, create, and monitor pipelines according to your requirements. It is an all-in-one day data ops solution.""In StreamSets, everything is in one place.""It is really easy to set up and the interface is easy to use."

More StreamSets Pros →

Cons
"One area for improvement is documentation. At present, there isn't enough documentation on how to use Azure Data Factory in certain conditions. It would be good to have documentation on the various use cases.""It does not appear to be as rich as other ETL tools. It has very limited capabilities.""Some of the optimization techniques are not scalable.""For some of the data, there were some issues with data mapping. Some of the error messages were a little bit foggy. There could be more of a quick start guide or some inline examples. The documentation could be better.""The number of standard adaptors could be extended further.""The need to work more on developing out-of-the-box connectors for other products like Oracle, AWS, and others.""The deployment should be easier.""There should be a way that it can do switches, so if at any point in time I want to do some hybrid mode of making any data collections or ingestions, I can just click on a button."

More Azure Data Factory Cons →

"We create pipelines or jobs in StreamSets Control Hub. It is a great feature, but if there is a way to have a folder structure or organize the pipelines and jobs in Control Hub, it would be great. I submitted a ticket for this some time back.""The logging mechanism could be improved. If I am working on a pipeline, then create a job out of it and it is running, it will generate constant logs. So, the logging mechanism could be simplified. Now, it is a bit difficult to understand and filter the logs. It takes some time.""Currently, we can only use the query to read data from SAP HANA. What we would like to see, as soon as possible, is the ability to read from multiple tables from SAP HANA. That would be a really good thing that we could use immediately. For example, if you have 100 tables in SQL Server or Oracle, then you could just point it to the schema or the 100 tables and ingestion information. However, you can't do that in SAP HANA since StreamSets currently is lacking in this. They do not have a multi-table feature for SAP HANA. Therefore, a multi-table origin for SAP HANA would be helpful.""We've seen a couple of cases where it appears to have a memory leak or a similar problem.""Sometimes, when we have large amounts of data that is very efficiently stored in Hadoop or Kafka, it is not very efficient to run it through StreamSets, due to the lack of efficiency or the resources that StreamSets is using.""If you use JDBC Lookup, for example, it generally takes a long time to process data."

More StreamSets Cons →

Pricing and Cost Advice
  • "The price you pay is determined by how much you use it."
  • "Understanding the pricing model for Data Factory is quite complex."
  • "I would not say that this product is overly expensive."
  • "The licensing is a pay-as-you-go model, where you pay for what you consume."
  • "Our licensing fees are approximately 15,000 ($150 USD) per month."
  • "The licensing cost is included in the Synapse."
  • "It's not particularly expensive."
  • "Product is priced at the market standard."
  • More Azure Data Factory Pricing and Cost Advice →

  • "We are running the community version right now, which can be used free of charge."
  • "StreamSets Data Collector is open source. One can utilize the StreamSets Data Collector, but the Control Hub is the main repository where all the jobs are present. Everything happens in Control Hub."
  • "It has a CPU core-based licensing, which works for us and is quite good."
  • "There are different versions of the product. One is the corporate license version, and the other one is the open-source or free version. I have been using the corporate license version, but they have recently launched a new open-source version so that anybody can create an account and use it. The licensing cost varies from customer to customer. I don't have a lot of input on that. It is taken care of by PMO, and they seem fine with its pricing model. It is being used enterprise-wide. They seem to have got a good deal for StreamSets."
  • "The pricing is good, but not the best. They have some customized plans you can opt for."
  • More StreamSets Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which Data Integration Tools solutions are best for your needs.
    636,406 professionals have used our research since 2012.
    Questions from the Community
    Top Answer:AWS Glue and Azure Data factory for ELT best performance cloud services.
    Top Answer:Azure Data Factory is flexible, modular, and works well. In terms of cost, it is not too pricey. It offers the stability and reliability I am looking for, good scalability, and is easy to set up and… more »
    Top Answer:Azure Data Factory is a solid product offering many transformation functions; It has pre-load and post-load transformations, allowing users to apply transformations either in code by using Power… more »
    Top Answer:It is really easy to set up and the interface is easy to use.
    Top Answer:We've seen a couple of cases where it appears to have a memory leak or a similar problem. It grows for a bit and then we'd have to restart the container, maybe once a month when it gets high.
    Top Answer:We typically use it to transport our Oracle raw datasets up to Microsoft Azure, and then into SQL databases there.
    Ranking
    1st
    Views
    34,601
    Comparisons
    28,151
    Reviews
    35
    Average Words per Review
    501
    Rating
    7.9
    11th
    Views
    5,633
    Comparisons
    3,613
    Reviews
    6
    Average Words per Review
    1,851
    Rating
    8.3
    Comparisons
    Learn More
    StreamSets
    Video Not Available
    Overview

    Create, schedule, and manage your data integration at scale with Azure Data Factory - a hybrid data integration (ETL) service. Work with data wherever it lives, in the cloud or on-premises, with enterprise-grade security.

    StreamSets offers an end-to-end data integration platform to build, run, monitor and manage smart data pipelines that deliver continuous data for DataOps, and power the modern data ecosystem and hybrid integration.

    Only StreamSets provides a single design experience for all design patterns for 10x greater developer productivity; smart data pipelines that are resilient to change for 80% less breakages; and a single pane of glass for managing and monitoring all pipelines across hybrid and cloud architectures to eliminate blind spots and control gaps.

    With StreamSets, you can deliver the continuous data that drives the connected enterprise.

    Offer
    Learn more about Azure Data Factory
    Learn more about StreamSets
    Sample Customers
    Milliman, Pier 1 Imports, Rockwell Automation, Ziosk, Real Madrid
    Availity, BT Group, Humana, Deluxe, GSK, RingCentral, IBM, Shell, SamTrans, State of Ohio, TalentFulfilled, TechBridge
    Top Industries
    REVIEWERS
    Computer Software Company34%
    Non Profit10%
    Insurance Company7%
    Manufacturing Company7%
    VISITORS READING REVIEWS
    Computer Software Company20%
    Financial Services Firm11%
    Comms Service Provider9%
    Energy/Utilities Company7%
    VISITORS READING REVIEWS
    Financial Services Firm16%
    Computer Software Company15%
    Insurance Company8%
    Government7%
    Company Size
    REVIEWERS
    Small Business23%
    Midsize Enterprise23%
    Large Enterprise53%
    VISITORS READING REVIEWS
    Small Business16%
    Midsize Enterprise13%
    Large Enterprise71%
    REVIEWERS
    Small Business14%
    Midsize Enterprise29%
    Large Enterprise57%
    VISITORS READING REVIEWS
    Small Business15%
    Midsize Enterprise11%
    Large Enterprise74%
    Buyer's Guide
    Azure Data Factory vs. StreamSets
    August 2022
    Find out what your peers are saying about Azure Data Factory vs. StreamSets and other solutions. Updated: August 2022.
    636,406 professionals have used our research since 2012.

    Azure Data Factory is ranked 1st in Data Integration Tools with 37 reviews while StreamSets is ranked 11th in Data Integration Tools with 6 reviews. Azure Data Factory is rated 8.0, while StreamSets is rated 8.4. The top reviewer of Azure Data Factory writes "There's the good, the bad and the ugly....unfortunately lots of ugly". On the other hand, the top reviewer of StreamSets writes "Integrates with different enterprise systems and enables us to easily build data pipelines without knowing how to code". Azure Data Factory is most compared with Informatica PowerCenter, Informatica Cloud Data Integration, Microsoft Azure Synapse Analytics, Alteryx Designer and Talend Open Studio, whereas StreamSets is most compared with Informatica PowerCenter, SSIS, Spring Cloud Data Flow, Oracle GoldenGate and Talend Open Studio. See our Azure Data Factory vs. StreamSets report.

    See our list of best Data Integration Tools vendors.

    We monitor all Data Integration Tools reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.