IT Central Station is now PeerSpot: Here's why

Azure Data Factory vs Informatica Enterprise Data Catalog comparison

Cancel
You must select at least 2 products to compare!
Featured Review
Buyer's Guide
Data Integration Tools
July 2022
Find out what your peers are saying about Informatica, Microsoft, Talend and others in Data Integration Tools. Updated: July 2022.
621,327 professionals have used our research since 2012.
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"StreamSets data drift feature gives us an alert upfront so we know that the data can be ingested. Whatever the schema or data type changes, it lands automatically into the data lake without any intervention from us, but then that information is crucial to fix for downstream pipelines, which process the data into models, like Tableau and Power BI models. This is actually very useful for us. We are already seeing benefits. Our pipelines used to break when there were data drift changes, then we needed to spend about a week fixing it. Right now, we are saving one to two weeks. Though, it depends on the complexity of the pipeline, we are definitely seeing a lot of time being saved.""In StreamSets, everything is in one place.""StreamSets’ data drift resilience has reduced the time it takes us to fix data drift breakages. For example, in our previous Hadoop scenario, when we were creating the Sqoop-based processes to move data from source to destinations, we were getting the job done. That took approximately an hour to an hour and a half when we did it with Hadoop. However, with the StreamSets, since it works on a data collector-based mechanism, it completes the same process in 15 minutes of time. Therefore, it has saved us around 45 minutes per data pipeline or table that we migrate. Thus, it reduced the data transfer, including the drift part, by 45 minutes.""It is really easy to set up and the interface is easy to use.""I have used Data Collector, Transformer, and Control Hub products from StreamSets. What I really like about these products is that they're very user-friendly. People who are not from a technological or core development background find it easy to get started and build data pipelines and connect to the databases. They would be comfortable like any technical person within a couple of weeks."

More StreamSets Pros →

"When it comes to our business requirements, this solution has worked well for us. However, we have not stretched it to the limit.""It is easy to integrate.""The data copy template is a valuable feature.""Data Factory's most valuable feature is Copy Activity.""The most important feature is that it can help you do the multi-threading concepts.""Azure Data Factory's most valuable features are the packages and the data transformation that it allows us to do, which is more drag and drop, or a visual interface. So, that eases the entire process.""I think it makes it very easy to understand what data flow is and so on. You can leverage the user interface to do the different data flows, and it's great. I like it a lot.""Microsoft supported us when we planned to provision Azure Data Factory over a private link. As a result, we received excellent support from Microsoft."

More Azure Data Factory Pros →

"I like EDC's self-service capabilities. You can put the catalog on the intranet inside the organization, so users can search for something. People in the research world have specialized systems, and you might find data from various places that sound similar.""The metadata management of Informatica is great.""Multifeatured and easily scalable data catalog, with good data domain discovery and data profiling features."

More Informatica Enterprise Data Catalog Pros →

Cons
"We create pipelines or jobs in StreamSets Control Hub. It is a great feature, but if there is a way to have a folder structure or organize the pipelines and jobs in Control Hub, it would be great. I submitted a ticket for this some time back.""If you use JDBC Lookup, for example, it generally takes a long time to process data.""We've seen a couple of cases where it appears to have a memory leak or a similar problem.""Currently, we can only use the query to read data from SAP HANA. What we would like to see, as soon as possible, is the ability to read from multiple tables from SAP HANA. That would be a really good thing that we could use immediately. For example, if you have 100 tables in SQL Server or Oracle, then you could just point it to the schema or the 100 tables and ingestion information. However, you can't do that in SAP HANA since StreamSets currently is lacking in this. They do not have a multi-table feature for SAP HANA. Therefore, a multi-table origin for SAP HANA would be helpful.""The logging mechanism could be improved. If I am working on a pipeline, then create a job out of it and it is running, it will generate constant logs. So, the logging mechanism could be simplified. Now, it is a bit difficult to understand and filter the logs. It takes some time."

More StreamSets Cons →

"The pricing scheme is very complex and difficult to understand.""I would like to be informed about the changes ahead of time, so we are aware of what's coming.""The speed and performance need to be improved.""Some of the optimization techniques are not scalable.""One area for improvement is documentation. At present, there isn't enough documentation on how to use Azure Data Factory in certain conditions. It would be good to have documentation on the various use cases.""User-friendliness and user effectiveness are unquestionably important, and it may be a good option here to improve the user experience. However, I believe that more and more sophisticated monitoring would be beneficial.""We have experienced some issues with the integration. This is an area that needs improvement.""Data Factory has so many features that it can be a little difficult or confusing to find some settings and configurations. I'm sure there's a way to make it a little easier to navigate."

More Azure Data Factory Cons →

"It is not easy to set up and configure the tool.""This solution is hard to set up and its interface is not user-friendly. It's also not as stable, and the technical support takes a lot of time to solve simple problems.""Interoperability is one area where EDC has room for improvement. It was challenging when the faculty took over the data world and had specific vendors they wanted to use, and some were not particularly open platforms."

More Informatica Enterprise Data Catalog Cons →

Pricing and Cost Advice
  • "We are running the community version right now, which can be used free of charge."
  • "StreamSets Data Collector is open source. One can utilize the StreamSets Data Collector, but the Control Hub is the main repository where all the jobs are present. Everything happens in Control Hub."
  • "It has a CPU core-based licensing, which works for us and is quite good."
  • "There are different versions of the product. One is the corporate license version, and the other one is the open-source or free version. I have been using the corporate license version, but they have recently launched a new open-source version so that anybody can create an account and use it. The licensing cost varies from customer to customer. I don't have a lot of input on that. It is taken care of by PMO, and they seem fine with its pricing model. It is being used enterprise-wide. They seem to have got a good deal for StreamSets."
  • More StreamSets Pricing and Cost Advice →

  • "The price you pay is determined by how much you use it."
  • "Understanding the pricing model for Data Factory is quite complex."
  • "I would not say that this product is overly expensive."
  • "The licensing is a pay-as-you-go model, where you pay for what you consume."
  • "Our licensing fees are approximately 15,000 ($150 USD) per month."
  • "The licensing cost is included in the Synapse."
  • "It's not particularly expensive."
  • "Product is priced at the market standard."
  • More Azure Data Factory Pricing and Cost Advice →

    Information Not Available
    report
    Use our free recommendation engine to learn which Data Integration Tools solutions are best for your needs.
    621,327 professionals have used our research since 2012.
    Questions from the Community
    Top Answer:It is really easy to set up and the interface is easy to use.
    Top Answer:We've seen a couple of cases where it appears to have a memory leak or a similar problem. It grows for a bit and then… more »
    Top Answer:We typically use it to transport our Oracle raw datasets up to Microsoft Azure, and then into SQL databases there.
    Top Answer:AWS Glue and Azure Data factory for ELT best performance cloud services.
    Top Answer:Azure Data Factory is flexible, modular, and works well. In terms of cost, it is not too pricey. It offers the stability… more »
    Top Answer:Azure Data Factory is a solid product offering many transformation functions; It has pre-load and post-load… more »
    Top Answer:The metadata management of Informatica is great.
    Top Answer:Additional metadata harvesters from different major tools can be an added advantage. Informatica provides some smart… more »
    Top Answer:I use the solution for enterprise-wide data discovery and traceability, providing business context to data and… more »
    Comparisons
    Also Known As
    Informatica EDC, Informatica Enterprise Information Catalog, Enterprise Information Catalog
    Learn More
    Overview

    StreamSets offers an end-to-end data integration platform to build, run, monitor and manage smart data pipelines that deliver continuous data for DataOps, and power the modern data ecosystem and hybrid integration.

    Only StreamSets provides a single design experience for all design patterns for 10x greater developer productivity; smart data pipelines that are resilient to change for 80% less breakages; and a single pane of glass for managing and monitoring all pipelines across hybrid and cloud architectures to eliminate blind spots and control gaps.

    With StreamSets, you can deliver the continuous data that drives the connected enterprise.

    Create, schedule, and manage your data integration at scale with Azure Data Factory - a hybrid data integration (ETL) service. Work with data wherever it lives, in the cloud or on-premises, with enterprise-grade security.

    Informatica Enterprise Information Catalog provides a machine-learning-based discovery engine to collect data assets across the enterprise while increasing the understanding of those data assets through a graph-based enterprise information catalog. Powered by Informatica’s unique metadata services engine, Enterprise Information Catalog enables business analysts and data stewards to find all types of data across the enterprise; discover relationships among them; enrich data with business glossary terms and crowdsourced annotations; and understand the provenance, quality, and usage of their data.

    Offer
    Learn more about StreamSets
    Learn more about Azure Data Factory
    Learn more about Informatica Enterprise Data Catalog
    Sample Customers
    Availity, BT Group, Humana, Deluxe, GSK, RingCentral, IBM, Shell, SamTrans, State of Ohio, TalentFulfilled, TechBridge
    Milliman, Pier 1 Imports, Rockwell Automation, Ziosk, Real Madrid
    AIA Singapore, Mattel
    Top Industries
    VISITORS READING REVIEWS
    Computer Software Company18%
    Comms Service Provider14%
    Financial Services Firm13%
    Insurance Company8%
    REVIEWERS
    Computer Software Company33%
    Non Profit11%
    Insurance Company7%
    Manufacturing Company7%
    VISITORS READING REVIEWS
    Computer Software Company25%
    Comms Service Provider13%
    Financial Services Firm9%
    Energy/Utilities Company7%
    VISITORS READING REVIEWS
    Computer Software Company27%
    Comms Service Provider11%
    Financial Services Firm10%
    Government7%
    Company Size
    VISITORS READING REVIEWS
    Small Business15%
    Midsize Enterprise13%
    Large Enterprise72%
    REVIEWERS
    Small Business24%
    Midsize Enterprise22%
    Large Enterprise55%
    VISITORS READING REVIEWS
    Small Business16%
    Midsize Enterprise14%
    Large Enterprise70%
    VISITORS READING REVIEWS
    Small Business14%
    Midsize Enterprise11%
    Large Enterprise75%
    Buyer's Guide
    Data Integration Tools
    July 2022
    Find out what your peers are saying about Informatica, Microsoft, Talend and others in Data Integration Tools. Updated: July 2022.
    621,327 professionals have used our research since 2012.

    Azure Data Factory is ranked 2nd in Data Integration Tools with 34 reviews while Informatica Enterprise Data Catalog is ranked 3rd in Metadata Management with 3 reviews. Azure Data Factory is rated 7.8, while Informatica Enterprise Data Catalog is rated 8.4. The top reviewer of Azure Data Factory writes "There's the good, the bad and the ugly....unfortunately lots of ugly". On the other hand, the top reviewer of Informatica Enterprise Data Catalog writes "They listen to their customers, so if something is missing or not working, they will put it on their roadmap". Azure Data Factory is most compared with Informatica PowerCenter, Informatica Cloud Data Integration, Alteryx Designer, Talend Open Studio and Microsoft Azure Synapse Analytics, whereas Informatica Enterprise Data Catalog is most compared with AWS Glue, Collibra Catalog, Denodo, Alation Data Catalog and Talend Data Management Platform.

    We monitor all Data Integration Tools reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.