"StreamSets’ data drift resilience has reduced the time it takes us to fix data drift breakages. For example, in our previous Hadoop scenario, when we were creating the Sqoop-based processes to move data from source to destinations, we were getting the job done. That took approximately an hour to an hour and a half when we did it with Hadoop. However, with the StreamSets, since it works on a data collector-based mechanism, it completes the same process in 15 minutes of time. Therefore, it has saved us around 45 minutes per data pipeline or table that we migrate. Thus, it reduced the data transfer, including the drift part, by 45 minutes."
"In StreamSets, everything is in one place."
"It is really easy to set up and the interface is easy to use."
"StreamSets data drift feature gives us an alert upfront so we know that the data can be ingested. Whatever the schema or data type changes, it lands automatically into the data lake without any intervention from us, but then that information is crucial to fix for downstream pipelines, which process the data into models, like Tableau and Power BI models. This is actually very useful for us. We are already seeing benefits. Our pipelines used to break when there were data drift changes, then we needed to spend about a week fixing it. Right now, we are saving one to two weeks. Though, it depends on the complexity of the pipeline, we are definitely seeing a lot of time being saved."
"It has built-in connectors for more than 100 sources and onboarding data from many different sources to the cloud environment."
"Its integrability with the rest of the activities on Azure is most valuable."
"The security of the agent that is installed on-premises is very good."
"I enjoy the ease of use for the backend JSON generator, the deployment solution, and the template management."
"I think it makes it very easy to understand what data flow is and so on. You can leverage the user interface to do the different data flows, and it's great. I like it a lot."
"The most important feature is that it can help you do the multi-threading concepts."
"Allows more data between on-premises and cloud solutions"
"When it comes to our business requirements, this solution has worked well for us. However, we have not stretched it to the limit."
"The most valuable feature is the ETL functionality."
"Data Services' table comparison mechanism is very powerful. It's pretty hard to find a similar feature in other solutions."
"The basic functionality is quite good as is the basic logic and data information."
"I appreciate having access to the SAP data."
"We can extract data at a table level, extract data at the ETL level, and we can extract data at an ODP level."
"Its integration capabilities and the data migration capabilities are the most valuable. It is very good for SAP and non-SAP tools. It has very good integration with SAP, but it also has the capabilities to connect to other systems. We find it very helpful and stable."
"You can always manipulate a lot of things as long as you have the skill level."
"The BA reporting tools, such as Data Services, and ETL tool in SAP Data Services are the most valuable. When we had in-memory requirements, we used HANA. HANA is most preferably for most the customers for in-memory. SAP is the first company that created the in-memory concept."
"If you use JDBC Lookup, for example, it generally takes a long time to process data."
"The logging mechanism could be improved. If I am working on a pipeline, then create a job out of it and it is running, it will generate constant logs. So, the logging mechanism could be simplified. Now, it is a bit difficult to understand and filter the logs. It takes some time."
"Currently, we can only use the query to read data from SAP HANA. What we would like to see, as soon as possible, is the ability to read from multiple tables from SAP HANA. That would be a really good thing that we could use immediately. For example, if you have 100 tables in SQL Server or Oracle, then you could just point it to the schema or the 100 tables and ingestion information. However, you can't do that in SAP HANA since StreamSets currently is lacking in this. They do not have a multi-table feature for SAP HANA. Therefore, a multi-table origin for SAP HANA would be helpful."
"We've seen a couple of cases where it appears to have a memory leak or a similar problem."
"The need to work more on developing out-of-the-box connectors for other products like Oracle, AWS, and others."
"Azure Data Factory can improve by having support in the drivers for change data capture."
"User-friendliness and user effectiveness are unquestionably important, and it may be a good option here to improve the user experience. However, I believe that more and more sophisticated monitoring would be beneficial."
"Occasionally, there are problems within Microsoft itself that impacts the Data Factory and causes it to fail."
"I would like to be informed about the changes ahead of time, so we are aware of what's coming."
"We have experienced some issues with the integration. This is an area that needs improvement."
"The performance could be better. It would be better if Azure Data Factory could handle a higher load. I have heard that it can get overloaded, and it can't handle it."
"They require more detailed error reporting, data normalization tools, easier connectivity to other services, more data services, and greater compatibility with other commonly used schemas."
"At the integration level, there could be certain set of improvement to connect to various other systems."
"The skillset of data engineers matters a bit to use all the functionality of this solution. Otherwise, the delivery speed won't be faster."
"They could make it easier to work with web services."
"Data Services SAP is lacking in sources and target databases compared to Informatica. SAP Data Services should have more connectivity."
"It's an ETL that is very good with relational databases but not as good with files and semi-structured files."
"It would be nice if this solution was a bit easier to move from development to production."
"The migration of the solution between different environments is quite complex."
"I want some more business intelligence applications. People need to know more and more about data, including the transformation rules, etc. Informatica is a better product for data cataloging. SAP should update the data catalog."
StreamSets offers an end-to-end data integration platform to build, run, monitor and manage smart data pipelines that deliver continuous data for DataOps, and power the modern data ecosystem and hybrid integration.
Only StreamSets provides a single design experience for all design patterns for 10x greater developer productivity; smart data pipelines that are resilient to change for 80% less breakages; and a single pane of glass for managing and monitoring all pipelines across hybrid and cloud architectures to eliminate blind spots and control gaps.
With StreamSets, you can deliver the continuous data that drives the connected enterprise.
Create, schedule, and manage your data integration at scale with Azure Data Factory - a hybrid data integration (ETL) service. Work with data wherever it lives, in the cloud or on-premises, with enterprise-grade security.
You cannot afford to run your business on questionable data. With SAP® Data Services software, you can access, transform, and connect data to fuel your critical business processes. Together, these enterprise-class solutions enable data integration and data quality, providing the right level of insight across your business so you can make better decisions and operate more effectively.
Azure Data Factory is ranked 2nd in Data Integration Tools with 32 reviews while SAP Data Services is ranked 7th in Data Integration Tools with 9 reviews. Azure Data Factory is rated 7.8, while SAP Data Services is rated 8.0. The top reviewer of Azure Data Factory writes "Easy to bring in outside capabilities, flexible, and works well". On the other hand, the top reviewer of SAP Data Services writes "It's a powerful tool that does a lot, but it has its annoyances". Azure Data Factory is most compared with Informatica PowerCenter, Informatica Cloud Data Integration, Talend Open Studio, Alteryx Designer and Denodo, whereas SAP Data Services is most compared with Informatica PowerCenter, SSIS, SAP Process Orchestration, Palantir Foundry and Oracle Data Integrator (ODI).
We monitor all Data Integration Tools reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.