Hitachi Lumada Data Integration vs StreamSets comparison

Comparison Buyer's Guide
Executive Summary

We performed a comparison between Hitachi Lumada Data Integration and StreamSets based on real PeerSpot user reviews.

Find out in this report how the two Data Integration Tools solutions compare in terms of features, pricing, service and support, ease of deployment, and ROI.
To learn more, read our detailed Hitachi Lumada Data Integration vs. StreamSets Report (Updated: November 2022).
656,862 professionals have used our research since 2012.
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
  • "It makes it pretty simple to do some fairly complicated things. Both I and some of our other BI developers have made stabs at using, for example, SQL Server Integration Services, and we found them a little bit frustrating compared to Data Integration. So, its ease of use is right up there."
  • "Lumada has allowed us to interact with our employees more effectively and compensate them properly. One of the cool things is that we use it to generate commissions for our salespeople and bonuses for our warehouse people. It allows us to get information out to them in a timely fashion. We can also see where they're at and how they're doing."
  • "We also haven't had to create any custom Java code. Almost everywhere it's SQL, so it's done in the pipeline and the configuration. That means you can offload the work to people who, while they are not less experienced, are less technical when it comes to logic."
  • "The amount of data that it loads and processes is good."
  • "Its drag-and-drop interface lets me and my team implement all the solutions that we need in our company very quickly. It's a very good tool for that."
  • "Pentaho Data Integration is quite simple to learn, and there is a lot of information available online."
  • "It has a really friendly user interface, which is its main feature. The process of automating or combining SQL code with some databases and doing the automation is great and really convenient."
  • "This solution allows us to create pipelines using a minimal amount of custom coding."

More Hitachi Lumada Data Integration Pros →

  • "In StreamSets, everything is in one place."
  • "StreamSets data drift feature gives us an alert upfront so we know that the data can be ingested. Whatever the schema or data type changes, it lands automatically into the data lake without any intervention from us, but then that information is crucial to fix for downstream pipelines, which process the data into models, like Tableau and Power BI models. This is actually very useful for us. We are already seeing benefits. Our pipelines used to break when there were data drift changes, then we needed to spend about a week fixing it. Right now, we are saving one to two weeks. Though, it depends on the complexity of the pipeline, we are definitely seeing a lot of time being saved."
  • "It is a very powerful, modern data analytics solution, in which you can integrate a large volume of data from different sources. It integrates all of the data and you can design, create, and monitor pipelines according to your requirements. It is an all-in-one day data ops solution."
  • "StreamSets’ data drift resilience has reduced the time it takes us to fix data drift breakages. For example, in our previous Hadoop scenario, when we were creating the Sqoop-based processes to move data from source to destinations, we were getting the job done. That took approximately an hour to an hour and a half when we did it with Hadoop. However, with the StreamSets, since it works on a data collector-based mechanism, it completes the same process in 15 minutes of time. Therefore, it has saved us around 45 minutes per data pipeline or table that we migrate. Thus, it reduced the data transfer, including the drift part, by 45 minutes."
  • "I have used Data Collector, Transformer, and Control Hub products from StreamSets. What I really like about these products is that they're very user-friendly. People who are not from a technological or core development background find it easy to get started and build data pipelines and connect to the databases. They would be comfortable like any technical person within a couple of weeks."

More StreamSets Pros →

Cons
  • "I'm still in the very recent stage concerning Pentaho Data Integration, but it can't really handle what I describe as "extreme data processing" i.e. when there is a huge amount of data to process. That is one area where Pentaho is still lacking."
  • "I would like to see support for some additional cloud sources. It doesn't support Azure, for example. I was trying to do a PoC with Azure the other day but it seems they don't support it."
  • "In the Community edition, it would be nice to have more modules that allow you to code directly within the application. It could have R or Python completely integrated into it, but this could also be because I'm using an older version."
  • "Since Hitachi took over, I don't feel that the documentation is as good within the solution. It used to have very good help built right in."
  • "I work with different databases. I would like to work with more connectors to new databases, e.g., DynamoDB and MariaDB, and new cloud solutions, e.g., AWS, Azure, and GCP. If they had these connectors, that would be great. They could improve by building new connectors. If you have native connections to different databases, then you can make instructions more efficient and in a more natural way. You don't have to write any scripts to use that connector."
  • "The support for the Enterprise Edition is okay, but what they have done in the last three or four years is move more and more things to that edition. The result is that they are breaking the Community Edition. That's what our impression is."
  • "I have been facing some difficulties when working with large datasets. It seems that when there is a large amount of data, I experience memory errors."
  • "It could be better integrated with programming languages, like Python and R. Right now, if I want to run a Python code on one of my ETLs, it is a bit difficult to do. It would be great if we have some modules where we could code directly in a Python language. We don't really have a way to run Python code natively."

More Hitachi Lumada Data Integration Cons →

  • "Currently, we can only use the query to read data from SAP HANA. What we would like to see, as soon as possible, is the ability to read from multiple tables from SAP HANA. That would be a really good thing that we could use immediately. For example, if you have 100 tables in SQL Server or Oracle, then you could just point it to the schema or the 100 tables and ingestion information. However, you can't do that in SAP HANA since StreamSets currently is lacking in this. They do not have a multi-table feature for SAP HANA. Therefore, a multi-table origin for SAP HANA would be helpful."
  • "If you use JDBC Lookup, for example, it generally takes a long time to process data."
  • "Sometimes, when we have large amounts of data that is very efficiently stored in Hadoop or Kafka, it is not very efficient to run it through StreamSets, due to the lack of efficiency or the resources that StreamSets is using."
  • "We create pipelines or jobs in StreamSets Control Hub. It is a great feature, but if there is a way to have a folder structure or organize the pipelines and jobs in Control Hub, it would be great. I submitted a ticket for this some time back."
  • "The logging mechanism could be improved. If I am working on a pipeline, then create a job out of it and it is running, it will generate constant logs. So, the logging mechanism could be simplified. Now, it is a bit difficult to understand and filter the logs. It takes some time."

More StreamSets Cons →

Pricing and Cost Advice
  • "The price of the regular version is not reasonable and it should be lower."
  • "Sometimes we provide the licenses or the customer can procure their own licenses. Previously, we had an enterprise license. Currently, we are on a community license as this is adequate for our needs."
  • "It does seem a bit expensive compared to the serverless product offering. Tools, such as Server Integration Services, are "almost" free with a database engine. It is comparable to products like Alteryx, which is also very expensive."
  • "I think Lumada's price is fair compared to some of the others, like BusinessObjects, which was the other thing that I used at my previous job. BusinessObject's price was more reasonable before SAP acquired it. They jacked the price up significantly. Oracle's OBIEE tool was also prohibitively expensive."
  • "When we first started with it, it was much cheaper. It has gone up drastically, especially since Hitachi bought out Pentaho."
  • "The cost of these types of solutions are expensive. So, we really appreciate what we get for our money. Though, we don't think of the solution as a top-of-the-line solution or anything like that."
  • "The pricing has been pretty good. I'm used to using everything open-source or freeware-based. I understand that organizations need to make sure that the solutions are secure, and that's basically where I hit a roadblock in my current organization. They needed to ensure that we had a license and we had a secure way of accessing it so that no outside parties could get access to our data, but in terms of pricing, considering how much other teams are spending on cloud solutions or even their existing solutions, its price point is pretty good. At this time, there are no additional costs. We just have the licensing fees."
  • "There was a cost analysis done and Pentaho did favorably in terms of cost."
  • More Hitachi Lumada Data Integration Pricing and Cost Advice →

  • "StreamSets Data Collector is open source. One can utilize the StreamSets Data Collector, but the Control Hub is the main repository where all the jobs are present. Everything happens in Control Hub."
  • "It has a CPU core-based licensing, which works for us and is quite good."
  • "There are different versions of the product. One is the corporate license version, and the other one is the open-source or free version. I have been using the corporate license version, but they have recently launched a new open-source version so that anybody can create an account and use it. The licensing cost varies from customer to customer. I don't have a lot of input on that. It is taken care of by PMO, and they seem fine with its pricing model. It is being used enterprise-wide. They seem to have got a good deal for StreamSets."
  • "The pricing is good, but not the best. They have some customized plans you can opt for."
  • More StreamSets Pricing and Cost Advice →

    Questions from the Community
    Top Answer: Hi Rajneesh, yes here is the feature comparison between the community and enterprise edition: …
    Top Answer: In my opinion, the reporting side of this tool needs serious improvements. In my previous company, we worked with Hitachi Lumada Data Integration and while it does a good job for what it’s worth, it…
    Top Answer: My company has used this product to transform data from databases, CSV files, and flat files. It really does a good job. We were most satisfied with the results in terms of how many people could use…
    Top Answer: It is really easy to set up and the interface is easy to use.
    Top Answer: We've seen a couple of cases where it appears to have a memory leak or a similar problem. It grows for a bit and then we'd have to restart the container, maybe once a month when it gets high.
    Top Answer: We typically use it to transport our Oracle raw datasets up to Microsoft Azure, and then into SQL databases there.
    Metric                     Hitachi Lumada Data Integration   StreamSets
    Ranking                    6th                               11th
    Views                      6,337                             5,913
    Comparisons                3,976                             3,583
    Reviews                    25                                5
    Average Words per Review   1,283                             2,142
    Rating                     7.8                               8.4
    Also Known As
    Kettle, Pentaho Data Integration
    Overview

    Hitachi Lumada Data Integration is a top-ranking data integration tool that aims to deliver accurate data from various sources to end users. It is a complete data integration platform that uses visual tools to deliver analytics-ready data. The product eliminates coding and complexity so that its services are equally accessible to IT users and to businesses that do not specialize in the field.

    The solution offers powerful data integration, which is achieved through:

    • Accelerated data onboarding
    • Flexible data self-service
    • Robust data flow orchestration

    Users of Hitachi Lumada Data Integration can collaborate to build, deploy, and monitor dataflows in order to streamline data delivery. The visual tools of the product reduce the time of operation and lower complexity, allowing even beginners to operate the platform seamlessly. The onboarding process is initiated through broad connectivity to a wide variety of data sources and applications.

    A drag-and-drop interface allows users to easily create data pipelines and use ready-made templates that run from edge to cloud. Users can blend data on premises or in cloud services, including Azure, Amazon Web Services (AWS), and Google Cloud Platform (GCP). The tool allows for a seamless switch between the native engine and Apache Spark, and operationalizes Python, Scala, and Weka machine-learning models.

    The tool offers features for extensive business analytics through:

    • Ad-hoc analysis
    • Flexible interface
    • Enterprise reporting

    Hitachi Lumada Data Integration offers its clients modern data architectures for data analytics. Through interactive visualizations and easy integration, users are able to increase data integrity for their organizations. The product offers a web-based drag-and-drop dashboard for a flexible experience, collaboration with other applications, and advanced multi-tenancy. Its enterprise reporting consists of operational self-service reporting, security with content permissions, and additional high-level protection achieved through locking and expirations.

    Hitachi Lumada Data Integration Features

    The tool offers its clients various features which can be used to achieve efficient data integration and further analysis. These features include:

    • Data access: The tool allows users to access data sources at the edge, core, and cloud. This reduces the time and complexity of the process while blending sources to deliver data in a format ready to be analyzed.

    • Machine learning: The solution offers a feature to orchestrate machine learning. R, Python, Scala, and Weka models are supported.

    • Enterprise reporting: Hitachi Lumada Data Integration provides its clients with detailed visualized reporting. This feature is highly secured, which provides additional protection for clients' data.

    • Connect and move: This feature offers users the option to connect to sources on premises or in the cloud and move data of any size and format.

    • Flexibility: The product allows users broad connectivity and flexibility with no vendor lock-in to on-premise or cloud services.

    • Cluster to container: The tool offers the option to create scalable pipelines with Kubernetes clusters. This is possible across multiple clouds.

    • Dataflow studio: This feature allows users to build and manage data pipelines, view run metrics, analyze activities, and resume paused ones.

    • Authoring: The tool has an editor feature, which allows for the transformation of activities while the dataflow is in progress.

    Hitachi Lumada Data Integration Benefits

    The tool offers increased work productivity through efficient data integration. A number of the benefits include:

    • Ability to increase productivity in the work process due to effective automation.

    • Production deployment time can be sped up while saving costs for the company.

    • The no-code functionality improves pipeline quality in comparison to hand-coding data.

    • The tool offers high-quality reports which reduce implementation time.

    • Employees can save time and resources by manually embedding reporting and applications through this solution.

    • The utilization of this tool can increase business user adoption by improving data accuracy.

    Reviews from Real Users

    Philip R., a senior engineer at a comms service provider, says this product "Saves time and makes it easy for our mixed-skilled team to support the product".

    Ryan F., a senior data engineer at Burgiss, appreciates Hitachi Lumada Data Integration because low-code makes development faster than with Python.

    StreamSets offers an end-to-end data integration platform to build, run, monitor and manage smart data pipelines that deliver continuous data for DataOps, and power the modern data ecosystem and hybrid integration.

    Only StreamSets provides a single design experience for all design patterns for 10x greater developer productivity; smart data pipelines that are resilient to change for 80% fewer breakages; and a single pane of glass for managing and monitoring all pipelines across hybrid and cloud architectures to eliminate blind spots and control gaps.

    With StreamSets, you can deliver the continuous data that drives the connected enterprise.

    Sample Customers
    Hitachi Lumada Data Integration: 66Controls, Providential Revenue Agency of Río Negro, NOAA Information Systems, Swiss Real Estate Institute
    StreamSets: Availity, BT Group, Humana, Deluxe, GSK, RingCentral, IBM, Shell, SamTrans, State of Ohio, TalentFulfilled, TechBridge
    Top Industries
    Hitachi Lumada Data Integration
    REVIEWERS
    Financial Services Firm: 19%
    Healthcare Company: 15%
    Comms Service Provider: 12%
    Manufacturing Company: 12%
    VISITORS READING REVIEWS
    Computer Software Company: 18%
    Comms Service Provider: 16%
    Financial Services Firm: 12%
    Government: 7%
    StreamSets
    VISITORS READING REVIEWS
    Financial Services Firm: 17%
    Computer Software Company: 14%
    Manufacturing Company: 7%
    Insurance Company: 7%
    Company Size
    Hitachi Lumada Data Integration
    REVIEWERS
    Small Business: 27%
    Midsize Enterprise: 31%
    Large Enterprise: 41%
    VISITORS READING REVIEWS
    Small Business: 23%
    Midsize Enterprise: 13%
    Large Enterprise: 65%
    StreamSets
    REVIEWERS
    Small Business: 22%
    Midsize Enterprise: 33%
    Large Enterprise: 44%
    VISITORS READING REVIEWS
    Small Business: 15%
    Midsize Enterprise: 11%
    Large Enterprise: 74%

    Hitachi Lumada Data Integration is ranked 6th in Data Integration Tools with 26 reviews while StreamSets is ranked 11th in Data Integration Tools with 5 reviews. Hitachi Lumada Data Integration is rated 7.8, while StreamSets is rated 8.4. The top reviewer of Hitachi Lumada Data Integration writes "Saves time and makes it easy for our mixed-skilled team to support the product, but more guidance and better error messages are required in the UI". On the other hand, the top reviewer of StreamSets writes "Integrates with different enterprise systems and enables us to easily build data pipelines without knowing how to code". Hitachi Lumada Data Integration is most compared with SSIS, Talend Open Studio, Informatica Enterprise Data Catalog, Oracle Data Integrator (ODI) and Spring Cloud Data Flow, whereas StreamSets is most compared with Informatica PowerCenter, SSIS, Oracle GoldenGate, Spring Cloud Data Flow and Azure Data Factory. See our Hitachi Lumada Data Integration vs. StreamSets report.

    See our list of best Data Integration Tools vendors and best Cloud Data Integration vendors.

    We monitor all Data Integration Tools reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.