Try our new research platform with insights from 80,000+ expert users

Informatica Intelligent Data Management Cloud (IDMC) vs StreamSets comparison

 

Comparison Buyer's Guide

Executive Summary
 

Categories and Ranking

Informatica Intelligent Dat...
Ranking in Data Integration
3rd
Average Rating
8.0
Number of Reviews
181
Ranking in other categories
Data Quality (1st), Business Process Management (BPM) (7th), Business-to-Business Middleware (3rd), API Management (8th), Cloud Data Integration (3rd), Data Governance (2nd), Test Data Management (3rd), Cloud Master Data Management (MDM) Solutions (1st), Data Management Platforms (DMP) (2nd), Data Masking (2nd), Metadata Management (1st), Test Data Management Services (3rd), Product Information Management (PIM) (1st), Data Observability (2nd)
StreamSets
Ranking in Data Integration
9th
Average Rating
8.4
Number of Reviews
24
Ranking in other categories
No ranking in other categories
 

Featured Reviews

Raj Sethupathi - PeerSpot reviewer
Jun 13, 2024
Offers profiling and address standardization but can be complicated
Informatica Data Quality has its data warehouse, primarily using Oracle and some SQL databases. You need a database to host the data. The cleansed version of the data is stored in the data warehouse. It integrates with PowerCenter and other Informatica tools. The integration details can be complex, but a regional setup is involved in this process. Profiling smaller datasets, such as 10,000-50,000 records, worked fine. However, unexpected issues could arise with larger datasets, such as thousands of records or more, especially with tables containing many columns. Handling tables with fifty or more columns can be challenging, even in Excel. A mismatch in data types could cause the entire system to crash. Continual enhancements are being made to address these issues, which can be unique to specific industries like finance and healthcare.
Reyansh Kumar - PeerSpot reviewer
Mar 10, 2023
We no longer need to hire highly skilled data engineers to create and monitor data pipelines
The things I like about StreamSets are its * overall user interface * efficiency * product features, which are all good. Also, the scheduling within the data engineering pipeline is very much appreciated, and it has a wide range of connectors for connecting to any data sources like SQL Server, AWS, Azure, etc. We have used it with Kafka, Hadoop, and Azure Data Factory Datasets. Connecting to these systems with StreamSets is very easy. You just need to configure the data sources, the paths and their configurations, and you are ready to go. It is very efficient and very easy to use for ETL pipelines. It is a GUI-based interface in which you can easily create or design your own data pipelines with just a few clicks. As for moving data into modern analytics systems, we are using it with Microsoft Power BI, AWS, and some on-premises solutions, and it is very easy to get data from StreamSets into them. No hardcore coding or special technical expertise is required. It is also a no-code platform in which you can configure your data sources and data output for easy configuration of your data pipeline. This is a very important aspect because if a tool requires code development, we need to hire software developers to get the task done. By using StreamSets, it can be done with a few clicks.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"Some of Axon's valuable features include creating your business glossaries, importing DQ rules, and creating change management."
"This is where I think MDM shines - with its strong fuzzy matching algorithm. This is the essence of Informatica MDM. Based on these results, I can write our match conditions and then perform the corresponding data management activities."
"The way that the solution scans is very useful."
"The most valuable features of Informatica MDM are its reliability, match functions, and integration capabilities. The out-of-box functionality of deduplication and built-in data models ensure faster implementation."
"The user interface which is very easy to use if we have any problems to solve."
"I rate the technical support a ten out of ten."
"Informatica MDM has a defined data model we can customize with user and developer options."
"Whether we need data cleansing or data mastering, we get it all in one platform."
"The UI is user-friendly, it doesn't require any technical know-how and we can navigate to social media or use it more easily."
"The most valuable features are the option of integration with a variety of protocols, languages, and origins."
"I really appreciate the numerous ready connectors available on both the source and target sides, the support for various media file formats, and the ease of configuring and managing pipelines centrally."
"The most valuable would be the GUI platform that I saw. I first saw it at a special session that StreamSets provided towards the end of the summer. I saw the way you set it up and how you have different processes going on with your data. The design experience seemed to be pretty straightforward to me in terms of how you drag and drop these nodes and connect them with arrows."
"I have used Data Collector, Transformer, and Control Hub products from StreamSets. What I really like about these products is that they're very user-friendly. People who are not from a technological or core development background find it easy to get started and build data pipelines and connect to the databases. They would be comfortable like any technical person within a couple of weeks."
"The best feature that I really like is the integration."
"The best thing about StreamSets is its plugins, which are very useful and work well with almost every data source. It's also easy to use, especially if you're comfortable with SQL. You can customize it to do what you need. Many other tools have started to use features similar to those introduced by StreamSets, like automated workflows that are easy to set up."
"What I love the most is that StreamSets is very light. It's a containerized application. It's easy to use with Docker. If you are a large organization, it's very easy to use Kubernetes."
 

Cons

"There could be a lot more application integration."
"The main issue probably has nothing to do with end users, but installation can definitely be simplified."
"I would like to have the solution in one product and technical support needs to be better."
"We would like to have accessibility to the repository."
"The integration process is not easy."
"The error information provided is not informative, as compared to Power Center."
"Informatica MDM's UI is not intuitive enough."
"The cloud version of the Informatica, it's a very substandard product. They might say it's enterprise-ready but it's not at all ready. They need to add more features, such as improved data replication features. If you look at other tools, such as Matillion they are now cloud-native and flexible. Additionally, Informatica Cloud Data Integration should have a good migration strategy from Informatica PowerCenter to Informatica Cloud Data Integration."
"The software is very good overall. Areas for improvement are the error logging and the version history. I would like to see better, more detailed error logging information."
"There aren't enough hands-on labs, and debugging is also an issue because it takes a lot of time. Logs are not that clear when you are debugging, and you can only select a single source for a pipeline."
"I would like to see it integrate with other kinds of platforms, other than Java. We're going to have a lot of applications using .NET and other languages or frameworks. StreamSets is very helpful for the old Java platform but it's hard to integrate with the other platforms and frameworks."
"I would like to see further improvement in the UI. In addition, upgrades are not automatic and they should be automated. Currently, we have to manually upgrade versions."
"The data collector in StreamSets has to be designed properly. For example, a simple database configuration with MySQL DB requires the MySQL Connector to be installed."
"The monitoring visualization is not that user-friendly. It should include other features to visualize things, like how many records were streamed from a source to a destination on a particular date."
"We create pipelines or jobs in StreamSets Control Hub. It is a great feature, but if there is a way to have a folder structure or organize the pipelines and jobs in Control Hub, it would be great. I submitted a ticket for this some time back."
"One thing that I would like to add is the ability to manually enter data. The way the solution currently works is we don't have the option to manually change the data at any point in time. Being able to do that will allow us to do everything that we want to do with our data. Sometimes, we need to manually manipulate the data to make it more accurate in case our prior bifurcation filters are not good. If we have the option to manually enter the data or make the exact iterations on the data set, that would be a good thing."
 

Pricing and Cost Advice

"I rate the product's price a seven on a scale of one to ten, where one is the cheapest and ten is the most expensive. The product is a bit expensive."
"Pricing is determined by the number of licensed users as well as the number of Core CPUs."
"Informatica Axon is a costly solution. I rate Informatica Axon a four out of ten for its pricing."
"You pay for this solution based on IPUs, Informatica Processing Units. This depends on how much data you process and how much memory you consume from the cloud provider, and you pay as you go."
"The licensing price of the product depends on the organization's requirements."
"I rate the licensing cost of Informatica MDM a five out of ten."
"It's an expensive solution."
"We saw an ROI. We have been able to get data from various sources and consolidate it into a data lake, which is helping us in data analytics."
"It's not so favorable for small companies."
"There are different versions of the product. One is the corporate license version, and the other one is the open-source or free version. I have been using the corporate license version, but they have recently launched a new open-source version so that anybody can create an account and use it. The licensing cost varies from customer to customer. I don't have a lot of input on that. It is taken care of by PMO, and they seem fine with its pricing model. It is being used enterprise-wide. They seem to have got a good deal for StreamSets."
"StreamSets Data Collector is open source. One can utilize the StreamSets Data Collector, but the Control Hub is the main repository where all the jobs are present. Everything happens in Control Hub."
"It's not expensive because you pay per month, and the tasks you can perform with it are huge. It's reliable and cost-effective."
"The licensing is expensive, and there are other costs involved too. I know from using the software that you have to buy new features whenever there are new updates, which I don't really like. But initially, it was very good."
"It has a CPU core-based licensing, which works for us and is quite good."
"There are two editions, Professional and Enterprise, and there is a free trial. We're using the Professional edition and it is competitively priced."
"Its pricing is pretty much up to the mark. For smaller enterprises, it could be a big price to pay at the initial stage of operations, but the moment you have the Seed B or Seed C funding and you want to scale up your operations and aren't much worried about the funds, at that point in time, you would need a solution that could be scaled."
report
Use our free recommendation engine to learn which Data Integration solutions are best for your needs.
813,418 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
17%
Computer Software Company
13%
Manufacturing Company
10%
Government
6%
Financial Services Firm
18%
Computer Software Company
13%
Manufacturing Company
10%
Government
7%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

How does Azure Data Factory compare with Informatica Cloud Data Integration?
Azure Data Factory is a solid product offering many transformation functions; It has pre-load and post-load transformations, allowing users to apply transformations either in code by using Power Q...
Which Informatica product would you choose - PowerCenter or Cloud Data Integration?
Complex transformations can easily be achieved using PowerCenter, which has all the features and tools to establish a real data governance strategy. Additionally, PowerCenter is able to manage huge...
What are the biggest benefits of using Informatica Cloud Data Integration?
When it comes to cloud data integration, this solution can provide you with multiple benefits, including: Overhead reduction by integrating data on any cloud in various ways Effective integration ...
What do you like most about StreamSets?
The best thing about StreamSets is its plugins, which are very useful and work well with almost every data source. It's also easy to use, especially if you're comfortable with SQL. You can customiz...
What needs improvement with StreamSets?
We often faced problems, especially with SAP ERP. We struggled because many columns weren't integers or primary keys, which StreamSets couldn't handle. We had to restructure our data tables, which ...
What is your primary use case for StreamSets?
StreamSets is used for data transformation rather than ETL processes. It focuses on transforming data directly from sources without handling the extraction part of the process. The transformed data...
 

Also Known As

ActiveVOS, Active Endpoints, BPM, Address Verification, Persistent Data Masking, Cloud Test Data Management, PIM, , Enterprise Data Catalog, Data Integration Hub, Cloud Data Integration, Data Quality, Cloud API and App Integration
No data available
 

Learn More

Video not available
 

Overview

 

Sample Customers

The Travel Company, Carbonite
Availity, BT Group, Humana, Deluxe, GSK, RingCentral, IBM, Shell, SamTrans, State of Ohio, TalentFulfilled, TechBridge
Find out what your peers are saying about Informatica Intelligent Data Management Cloud (IDMC) vs. StreamSets and other solutions. Updated: October 2024.
813,418 professionals have used our research since 2012.