Coming October 25: PeerSpot Awards will be announced! Learn more

AWS Glue vs Informatica Enterprise Data Catalog comparison

Cancel
You must select at least 2 products to compare!
Comparison Buyer's Guide
Executive Summary

We performed a comparison between AWS Glue and Informatica Enterprise Data Catalog based on real PeerSpot user reviews.

Find out in this report how the two Cloud Data Integration solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI.

To learn more, read our detailed AWS Glue vs. Informatica Enterprise Data Catalog report (Updated: August 2022).
632,611 professionals have used our research since 2012.
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"Its user interface is quite good. You just need to choose some options to create a job in AWS Glue. The code-generation feature is also useful. If you don't want to customize it and simply want to read a file and store the data in the database, it can generate the code for you.""It is AWS-integrated. There is end-to-end integration with the other AWS services. It is also user-friendly.""The solution is stable and reliable.""I like that it's flexible, powerful, and allows you to write your own queries and scripts to get the needed transformations.""Glue is a NoSQL-based data ETL tool that has some advantages over IIS and ISAs.""The key role for Glue is that it hosts our metadata before rolling out our actual data. This is the major advantage of using this solution and our clients client have been very satisfied with it.""AWS Glue is a good solution for developers, they have the ability to write code in different languages and other software.""AWS Glue's most valuable features are the data catalog, including crawlers and tables, and Glue Studio, which means you don't have to use custom code."

More AWS Glue Pros →

"The metadata management of Informatica is great.""I like EDC's self-service capabilities. You can put the catalog on the intranet inside the organization, so users can search for something. People in the research world have specialized systems, and you might find data from various places that sound similar.""The capability of the tool to scan and capture the metadata from a variety of sources is one of the capabilities that I find most useful. The central repository into which it is going to put that captured metadata is the best.""Multifeatured and easily scalable data catalog, with good data domain discovery and data profiling features."

More Informatica Enterprise Data Catalog Pros →

Cons
"It would be better if it were more user-friendly. The interesting thing we found is that it was a little strange at the beginning. The way Glue works is not very straightforward. After trying different things, for example, we used just the console to create jobs. Then we realized that things were not working as expected. After researching and learning more, we realized that even though the console creates the script for the ETL processes, you need to modify or write your own script in Spark to do everything you want it to do. For example, we are pulling data from our source database and our application database, which is in Aurora. From there, we are doing the ETL to transform the data and write the results into Redshift. But what was surprising is that it's almost like whatever you want to do, you can do it with Glue because you have the option to put together your own script. Even though there are many functionalities and many connections, you have the opportunity to write your own queries to do whatever transformations you need to do. It's a little deceiving that some options are supposed to work in a certain way when you set them up in the console, but then they are not exactly working the right way or not as expected. It would be better if they provided more examples and more documentation on options.""The crucial problem with AWS Glue is that it only works with AWS. It is not an agnostic tool like Pentaho. In PowerCenter, we can install the forms from Google and other vendors, but in the case of AWS Glue, we can only use AWS.""The interface for AWS Glue could improve, they do not put a lot of details. You can write the code, in PySpark or in Scala, which is a big advantage, it is only easy to use for a developer. It will be difficult for new users to enter the cloud environment.""There is a learning curve to this tool.""The monitoring is not that good.""Currently, it supports only two languages in the background: Python and Scala. From our customization point of view, it would be helpful if it can also support Java in the background.""If there's a cluster-related configuration, we have to make worker notes, which is quite a headache when processing a large amount of data.""There should be more connectors for different databases."

More AWS Glue Cons →

"The model is somewhat flexible. There are certain aspects of the model that are not as flexible as we would like. It doesn't do certain things to a great level of depth. So, in situations where we want to drill in to do something specific, we have to essentially copy that data into our own structures in order to add that additional layer of flexibility.""This solution is hard to set up and its interface is not user-friendly. It's also not as stable, and the technical support takes a lot of time to solve simple problems.""It is not easy to set up and configure the tool.""Interoperability is one area where EDC has room for improvement. It was challenging when the faculty took over the data world and had specific vendors they wanted to use, and some were not particularly open platforms."

More Informatica Enterprise Data Catalog Cons →

Pricing and Cost Advice
  • "It is not expensive. AWS Glue works on the serverless architecture. We get charged for the time the server is up. For our use case, we have to use it once in a day, and it is not expensive for us."
  • "Its price is good. We pay as we go or based on the usage, which is a good thing for us because it is simple to forecast for the tool. It is good in terms of the financial planning of the company, and it is a good way to estimate the cost. It is also simple for our clients. In my opinion, it is one of the best tools in the market for ETL processes because of the fact that you pay as you use, which separates it from other big tools such as PowerCenter, Pentaho Data Integration, and Talend."
  • "Technical support is a paid service, and which subscription you have is dependent on that. You must pay one of them, and it ranges from $15,000 to $25,000 per year."
  • "This solution is affordable and there is an option to pay for the solution based on your usage."
  • "AWS Glue is quite costly, especially for small organizations."
  • "AWS Glue uses a pay-as-you-go approach which is helpful. The price of the overall solution is low and is a great advantage."
  • More AWS Glue Pricing and Cost Advice →

  • "I have no idea what the price actually is. It is probably not going to be the cheapest, but it is a pretty stable and robust platform from the backend standpoint."
  • More Informatica Enterprise Data Catalog Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which Cloud Data Integration solutions are best for your needs.
    632,611 professionals have used our research since 2012.
    Questions from the Community
    Top Answer:AWS Glue and Azure Data factory for ELT best performance cloud services.
    Top Answer:We reviewed AWS Glue before choosing Talend Open Studio. AWS Glue is the managed ETL (extract, transform, and load) from Amazon Web Services. AWS Glue enables AWS users to create and manage jobs in… more »
    Top Answer:The facility to integrate with S3 and the possibility to use Jupyter Notebook inside the pipeline are the most valuable features.
    Top Answer:The capability of the tool to scan and capture the metadata from a variety of sources is one of the capabilities that I find most useful. The central repository into which it is going to put that… more »
    Top Answer:The model is somewhat flexible. There are certain aspects of the model that are not as flexible as we would like. It doesn't do certain things to a great level of depth. So, in situations where we… more »
    Top Answer:We are using it to understand the assets that we have from their technical metadata perspective, but we're also using it to align our business glossaries with the actual physical data location where… more »
    Ranking
    2nd
    Views
    19,150
    Comparisons
    15,466
    Reviews
    10
    Average Words per Review
    472
    Rating
    8.1
    3rd
    out of 24 in Metadata Management
    Views
    7,062
    Comparisons
    5,161
    Reviews
    4
    Average Words per Review
    789
    Rating
    8.3
    Comparisons
    Also Known As
    Informatica EDC, Informatica Enterprise Information Catalog, Enterprise Information Catalog
    Learn More
    Overview

    AWS Glue is a fully managed extract, transform, and load (ETL) service that makes it easy for customers to prepare and load their data for analytics. You can create and run an ETL job with a few clicks in the AWS Management Console. You simply point AWS Glue to your data stored on AWS, and AWS Glue discovers your data and stores the associated metadata (e.g. table definition and schema) in the AWS Glue Data Catalog. Once cataloged, your data is immediately searchable, queryable, and available for ETL.

    Informatica Enterprise Information Catalog provides a machine-learning-based discovery engine to collect data assets across the enterprise while increasing the understanding of those data assets through a graph-based enterprise information catalog. Powered by Informatica’s unique metadata services engine, Enterprise Information Catalog enables business analysts and data stewards to find all types of data across the enterprise; discover relationships among them; enrich data with business glossary terms and crowdsourced annotations; and understand the provenance, quality, and usage of their data.

    Offer
    Learn more about AWS Glue
    Learn more about Informatica Enterprise Data Catalog
    Sample Customers
    bp, Cerner, Expedia, Finra, HESS, intuit, Kellog's, Philips, TIME, workday
    AIA Singapore, Mattel
    Top Industries
    VISITORS READING REVIEWS
    Computer Software Company18%
    Financial Services Firm13%
    Insurance Company8%
    Media Company8%
    VISITORS READING REVIEWS
    Computer Software Company20%
    Financial Services Firm13%
    Government8%
    Comms Service Provider8%
    Company Size
    REVIEWERS
    Small Business17%
    Midsize Enterprise25%
    Large Enterprise58%
    VISITORS READING REVIEWS
    Small Business14%
    Midsize Enterprise11%
    Large Enterprise75%
    VISITORS READING REVIEWS
    Small Business15%
    Midsize Enterprise10%
    Large Enterprise75%
    Buyer's Guide
    AWS Glue vs. Informatica Enterprise Data Catalog
    August 2022
    Find out what your peers are saying about AWS Glue vs. Informatica Enterprise Data Catalog and other solutions. Updated: August 2022.
    632,611 professionals have used our research since 2012.

    AWS Glue is ranked 2nd in Cloud Data Integration with 10 reviews while Informatica Enterprise Data Catalog is ranked 3rd in Metadata Management with 4 reviews. AWS Glue is rated 8.2, while Informatica Enterprise Data Catalog is rated 8.2. The top reviewer of AWS Glue writes "Easy to perform ETL on multiple data sources, and easy to use after you learn it". On the other hand, the top reviewer of Informatica Enterprise Data Catalog writes "An enterprise-scale solution with a pretty robust set of tools for scanning a variety of information sources". AWS Glue is most compared with AWS Database Migration Service, Informatica PowerCenter, Talend Open Studio, Informatica Cloud Data Integration and Matillion ETL, whereas Informatica Enterprise Data Catalog is most compared with Collibra Catalog, Denodo, Azure Data Factory, Alation Data Catalog and Informatica PowerCenter. See our AWS Glue vs. Informatica Enterprise Data Catalog report.

    We monitor all Cloud Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.