We performed a comparison between Hitachi Lumada Data Integration and SSIS based on real PeerSpot user reviews.Find out in this report how the two Data Integration Tools solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI.
"StreamSets data drift feature gives us an alert upfront so we know that the data can be ingested. Whatever the schema or data type changes, it lands automatically into the data lake without any intervention from us, but then that information is crucial to fix for downstream pipelines, which process the data into models, like Tableau and Power BI models. This is actually very useful for us. We are already seeing benefits. Our pipelines used to break when there were data drift changes, then we needed to spend about a week fixing it. Right now, we are saving one to two weeks. Though, it depends on the complexity of the pipeline, we are definitely seeing a lot of time being saved."
"StreamSets’ data drift resilience has reduced the time it takes us to fix data drift breakages. For example, in our previous Hadoop scenario, when we were creating the Sqoop-based processes to move data from source to destinations, we were getting the job done. That took approximately an hour to an hour and a half when we did it with Hadoop. However, with the StreamSets, since it works on a data collector-based mechanism, it completes the same process in 15 minutes of time. Therefore, it has saved us around 45 minutes per data pipeline or table that we migrate. Thus, it reduced the data transfer, including the drift part, by 45 minutes."
"It is a very powerful, modern data analytics solution, in which you can integrate a large volume of data from different sources. It integrates all of the data and you can design, create, and monitor pipelines according to your requirements. It is an all-in-one day data ops solution."
"In StreamSets, everything is in one place."
"I have used Data Collector, Transformer, and Control Hub products from StreamSets. What I really like about these products is that they're very user-friendly. People who are not from a technological or core development background find it easy to get started and build data pipelines and connect to the databases. They would be comfortable like any technical person within a couple of weeks."
"The fact that it's a low-code solution is valuable. It's good for more junior people who may not be as experienced with programming."
"It's my understanding that the product can scale."
"I can use Python, which is open-source, and I can run other scripts, including Linux scripts. It's user-friendly for running any object-based language. That's a very important feature because we live in a world of open-source."
"We're using the PDI and the repository function, and they give us the ability to easily generate reporting and output, and to access data. We also like the ability to schedule."
"This solution allows us to create pipelines using a minimal amount of custom coding."
"The amount of data that it loads and processes is good."
"We can schedule job execution in the BA Server, which is the front-end product we're using right now. That scheduling interface is nice."
"It makes it pretty simple to do some fairly complicated things. Both I and some of our other BI developers have made stabs at using, for example, SQL Server Integration Services, and we found them a little bit frustrating compared to Data Integration. So, its ease of use is right up there."
"The most valuable feature of SSIS is that you can take data from other servers which are not MS SQL Server or Oracle."
"The technical support is very good."
"It is also easy to learn and user-friendly. Microsoft is also good in terms of technical support. They have built a large community all over the world."
"It's saved time using visualization descriptions."
"We like that this solution includes a developer edition, free of charge, to allow for training."
"The debugging capabilities are great, particularly during data flow execution. You can look into the data and see what's going on in the pipeline."
"SSIS provides you with lookup and transformation functions, and you have the flexibility to write your own custom code."
"The performance and stability are good."
"The logging mechanism could be improved. If I am working on a pipeline, then create a job out of it and it is running, it will generate constant logs. So, the logging mechanism could be simplified. Now, it is a bit difficult to understand and filter the logs. It takes some time."
"Sometimes, when we have large amounts of data that is very efficiently stored in Hadoop or Kafka, it is not very efficient to run it through StreamSets, due to the lack of efficiency or the resources that StreamSets is using."
"If you use JDBC Lookup, for example, it generally takes a long time to process data."
"Currently, we can only use the query to read data from SAP HANA. What we would like to see, as soon as possible, is the ability to read from multiple tables from SAP HANA. That would be a really good thing that we could use immediately. For example, if you have 100 tables in SQL Server or Oracle, then you could just point it to the schema or the 100 tables and ingestion information. However, you can't do that in SAP HANA since StreamSets currently is lacking in this. They do not have a multi-table feature for SAP HANA. Therefore, a multi-table origin for SAP HANA would be helpful."
"We create pipelines or jobs in StreamSets Control Hub. It is a great feature, but if there is a way to have a folder structure or organize the pipelines and jobs in Control Hub, it would be great. I submitted a ticket for this some time back."
"In terms of the flexibility to deploy in any environment, such as on-premise or in the cloud, we can do the cloud deployment only through virtual machines. We might also be able to work on different environments through Docker or Kubernetes, but we don't have an Azure app or an AWS app for easy deployment to the cloud. We can only do it through virtual machines, which is a problem, but we can manage it. We also work with Databricks because it works with Spark. We can work with clustered servers, and we can easily do the deployment in the cloud. With a right-click, we can deploy Databricks through the app on AWS or Azure cloud."
"Since Hitachi took over, I don't feel that the documentation is as good within the solution. It used to have very good help built right in."
"I have been facing some difficulties when working with large datasets. It seems that when there is a large amount of data, I experience memory errors."
"I would like to see support for some additional cloud sources. It doesn't support Azure, for example. I was trying to do a PoC with Azure the other day but it seems they don't support it."
"The product needs more plugins."
"As far as I remember, not all connectors worked very well. They can add more connectors and more drivers to the process to integrate with more flows."
"If you develop it on MacBook, it'll be quite a hassle."
"The testing and quality could really improve. Every time that there is a major release, we are very nervous about what is going to get broken. We have had a lot of experience with that, as even the latest one was broken. Some basic things get broken. That doesn't look good for Hitachi at all. If there is one place I would advise them to spend some money and do some effort, it is with the quality. It is not that hard to start putting in some unit tests so basic things don't get broken when they do a new release. That just looks horrible, especially for an organization like Hitachi."
"It's difficult to refactor SSIS. It gets cumbersome to reuse the solution."
"The security could be improved, as it is more important in our context."
"Involving a data lake or data engineering aspects would be useful. While it is there, we need more features included."
"The solution could improve by having quicker release updates."
"Sometimes we need to connect to AWS to get additional data sources, so we have to install some external LAN and not a regular RDBMS. We need external tools to connect. It would be great if SSIS included these tools. I'd also like some additional features for row indexing and data conversion."
"I have a tool called ZappySys. I need that tool to cut down on the complexity of SSIS. That tool really helps with a quick turnaround. I can do things quickly, and I can do things accurately. I can get better reporting on errors."
"There is connectivity with other databases, however, this is the most significant issue that has to be addressed."
"We purchase an add on called task factory primarily to allow bulk delete, update and upsert capability. I'd like to see this be part of the standard package."
StreamSets offers an end-to-end data integration platform to build, run, monitor and manage smart data pipelines that deliver continuous data for DataOps, and power the modern data ecosystem and hybrid integration.
Only StreamSets provides a single design experience for all design patterns for 10x greater developer productivity; smart data pipelines that are resilient to change for 80% less breakages; and a single pane of glass for managing and monitoring all pipelines across hybrid and cloud architectures to eliminate blind spots and control gaps.
With StreamSets, you can deliver the continuous data that drives the connected enterprise.
Hitachi Lumada Data Integration is a top-raking data integration tool that aims to deliver accurate data from various sources to end users. This is a complete data integration platform that utilizes visual tools in the delivery of analytics-ready data. The product eliminates coding and complexity to ensure equal accessibility of its services to IT users as well as businesses that do not specialize in the field.
The solution offers powerful data integration, which is achieved through:
Users of Hitachi Lumada Data Integration can collaborate to build, deploy, and monitor dataflows in order to streamline data delivery. The visual tools of the product reduce the time of operation and lower complexity, allowing even beginners to operate the platform seamlessly. The onboarding process is initiated through broad connectivity to a wide variety of data sources and applications.
A drag-and-drop interface allows users to easily create data pipelines and ready-made templates to execute edge to cloud. The product provides users with the opportunity to blend data on premises or using cloud services, including Azure, Amazon Web Services (AWS), and Google Cloud Platform (GCP). The tool allows for a seamless switch between the native engine and Apache Spark, and operationalizes Python, Scala, and Weka machine-learning models.
The tool offers features for extensive business analytics through:
Hitachi Lumada Data Integration offers its clients modern data architectures for data analytics. Through interactive visualizations and easy integration, users are able to increase data integrity for their organizations. The product offers a web-based drag-and-drop dashboard for a flexible experience, collaboration with other applications, and advanced multi tenancy. There is special enterprise reporting which consists of operational self-serving reporting, security with content permissions, and additional high-level protection, achieved through locking, and expirations.
Hitachi Lumada Data Integration Features
The tool offers its clients various features which can be used to achieve efficient data integration and further analysis. These features include:
Hitachi Lumada Data Integration Benefits
The tool offers increased work productivity through efficient data integration. A number of the benefits include:
Reviews from Real Users
Philip R., a senior engineer at a comms service provider, says this product "Saves time and makes it easy for our mixed-skilled team to support the product".
Ryan F., a senior data engineer at Burgiss, appreciates Hitachi Lumada Data Integration because low-code makes development faster than with Python.
SQL Server Integration Services (SSIS) is a Microsoft platform designed to build enterprise-level data integration and data transformation solutions. Users now have the ability to solve intricate business queries by downloading and copying files, cleansing and mining data, loading data warehouses, and managing SQL Server objects and data through Integration Services. SSIS is a component of the Microsoft SQL Server data software used for many data migration tasks.
SSIS loads data into one or more destinations by extracting and transforming data from a wide variety of sources such as XML data files, flat files, and relational data sources.
Integration Services include:
Create packages programmatically and code custom tasks through programming the extensive Integration Services. With graphical Integration Services, you can create solutions without writing any code.
Benefits of SSIS
There are many benefits of SSIS, such as:
Reviews from Real Users
The SSIS platform stands out among its competitors for a variety of reasons. Two major ones are its debugging capabilities during data flow execution and its easy connectivity with other Microsoft tools.
Muhammad J., a senior manager software developer at Techlogix, notes, "The debugging capabilities are great, particularly during data flow execution. You can look into the data and see what's going on in the pipeline."
Ismail L., a data engineer at a tech service company, writes, "The most valuable thing is that it is easy to connect with Microsoft tools. In Europe, particularly in France, a lot of companies use Excel, SQL Server, and other Microsoft tools, and it is easier to connect SSIS with Microsoft tools than other products."
Hitachi Lumada Data Integration is ranked 6th in Data Integration Tools with 26 reviews while SSIS is ranked 3rd in Data Integration Tools with 30 reviews. Hitachi Lumada Data Integration is rated 7.8, while SSIS is rated 7.8. The top reviewer of Hitachi Lumada Data Integration writes "Saves time and makes it easy for our mixed-skilled team to support the product, but more guidance and better error messages are required in the UI". On the other hand, the top reviewer of SSIS writes "SSIS 2016 - The good, the bad, and the ugly". Hitachi Lumada Data Integration is most compared with Talend Open Studio, Informatica Enterprise Data Catalog, Oracle Data Integrator (ODI), Azure Data Factory and Spring Cloud Data Flow, whereas SSIS is most compared with Informatica PowerCenter, Talend Open Studio, Oracle Data Integrator (ODI), AWS Glue and IBM InfoSphere DataStage. See our Hitachi Lumada Data Integration vs. SSIS report.
We monitor all Data Integration Tools reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.