AWS Glue vs Informatica PowerCenter comparison

AWS Glue: 12,012 views | 8,420 comparisons | 92% willing to recommend
Informatica PowerCenter: 19,928 views | 16,697 comparisons | 90% willing to recommend
Comparison Buyer's Guide
Executive Summary
Updated on Aug 31, 2022

We performed a comparison between AWS Glue and Informatica PowerCenter based on our users’ reviews in four categories. After reading all of the collected data, you can find our conclusion below.

  • Ease of Deployment: AWS Glue users say deployment is straightforward. Users of Informatica PowerCenter share mixed reviews regarding deployment.
  • Features: Users of both products are happy with their stability and scalability.

    AWS Glue users say the solution is flexible, easy to integrate with other AWS services, and has a good interface. Reviewers mention there is a learning curve and say that it would be helpful if it supported Java.

    Informatica PowerCenter users like the solution’s data integration capabilities and its robustness. Users mention that the solution is dated and would prefer that it handle more modern formats, such as JSON.
  • Pricing: AWS Glue users share mixed reviews on the solution’s pricing. Informatica PowerCenter users say it is expensive.
  • Service and Support: Users of both solutions are satisfied with the support.

Comparison Results: Of the two solutions, users prefer Informatica PowerCenter because AWS Glue only integrates with AWS services.

To learn more, read our detailed AWS Glue vs. Informatica PowerCenter Report (Updated: March 2024).
767,667 professionals have used our research since 2012.
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
AWS Glue:
  • "I also like that you can add custom libraries like JAR files and use them. So, the ability to use a fast processing engine and embed basic jobs easily are significant advantages."
  • "The solution is serverless so it allows us to transform data while optimizing the cost and performance of Spark jobs."
  • "The key role for Glue is that it hosts our metadata before rolling out our actual data. This is the major advantage of using this solution and our clients have been very satisfied with it."
  • "AWS Glue is a good solution for developers, they have the ability to write code in different languages and other software."
  • "It is a stable and scalable solution."
  • "Its user interface is quite good. You just need to choose some options to create a job in AWS Glue. The code-generation feature is also useful. If you don't want to customize it and simply want to read a file and store the data in the database, it can generate the code for you."
  • "AWS Glue is fast and managed by AWS. Hence, you don't have to worry about capacity and the performance of Glue jobs. It has integrations with other data stores of AWS. The product offers metadata management, logging, and ETL processing capabilities. It comes with a powerful feature, Glue Studio, which helps to do queries interactively within the community. It is a managed service and very secure. Another popular and mature service is S3."
  • "I like the fact that AWS Glue works with Python scripts."

Informatica PowerCenter:
  • "It reduces a lot of legacy coding."
  • "What I like the most is that we have to deal with less while writing the queries."
  • "The greatest feature is that it is very easy to have someone come in and jump right in. It is one of the nicest tools in terms of getting a person acquainted quickly."
  • "Can manage a huge quantity of data and provide reliability."
  • "The most valuable features are the metadata repository and the data warehouse application console."
  • "It has helped us monetize."
  • "Has a good visual tool for data mapping."
  • "We have found the PowerCenter and B2B data transformation most valuable."

Cons
AWS Glue:
  • "The solution's visual ETL tool is of no use for actual implementation."
  • "AWS Glue is more costly compared to other tools like Airflow."
  • "If there's a cluster-related configuration, we have to make worker notes, which is quite a headache when processing a large amount of data."
  • "The technical support for this solution could be improved. In future, we would like to connect more services like Athena or Kinesis to help control more loads of data."
  • "The start-up time is really high right now. For instance, when you start up a new job, you have to wait for five or eight minutes before it starts. If the start-up time is reduced to one or two minutes, it will be great. It will be better to have a direct linkage to Redshift in AWS. If we can use data catalogs from Redshift, it will be so easy to create some data catalogs. Currently, we can only use data catalogs from S3."
  • "In terms of improvement, the performance of AWS Glue could be faster."
  • "Glue could perform better. It sometimes takes too long to test a Glue job. Google Cloud Platform offers more Python scripts than AWS."
  • "While working on AWS Glue, I could not find any training material for it."

Informatica PowerCenter:
  • "Its interface can be modernized. It is an old product. I have been working with it for 14 years, and it still looks the same. It hasn't been modernized much. It also needs to handle more modern formats, such as JSON files. It works with the old text files and databases, but it does not always work with the newer, modern stuff. You need to make your own programs to support that kind of stuff. Support is also a kind of difficult with Informatica. They don't do direct support and rely on using their distributors around the globe for support, which means that you kind of have to go through this layer of different companies before you get help."
  • "The performance of Informatica PowerCenter could improve."
  • "I would like to see an improvement in the digital adoption."
  • "Informatica PowerCenter could improve on the documentation for the implementation. The documents provided are not very good for a new user."
  • "While Informatica is great for data-integration, it does not have any analytics features. Thus, organizations have to always look for another product for their BI needs."
  • "There is some room for improvement in terms of pricing."
  • "Support could be better."
  • "The UI is outdated and old-fashioned, at least in our current version. Also, we have experienced some stability issues with the Workflow Monitor application."

Pricing and Cost Advice
AWS Glue:
  • "The pricing is a bit higher than other solutions like Athena and EC2. If the pricing becomes more scaled or flexible, it will be good because you have to pay 44 cents just for one DPU for an hour. If you increase DPUs to 5 or 10, the pricing gets multiplied. There are also some time limits like 0 to 10 minutes or 10 to 20 minutes. If the pricing is according to the minutes, it would be better because you have to limit your job to 10 minutes or 20 minutes."
  • "It is not expensive. AWS Glue works on the serverless architecture. We get charged for the time the server is up. For our use case, we have to use it once in a day, and it is not expensive for us."
  • "Its price is good. We pay as we go or based on the usage, which is a good thing for us because it is simple to forecast for the tool. It is good in terms of the financial planning of the company, and it is a good way to estimate the cost. It is also simple for our clients. In my opinion, it is one of the best tools in the market for ETL processes because of the fact that you pay as you use, which separates it from other big tools such as PowerCenter, Pentaho Data Integration, and Talend."
  • "Technical support is a paid service, and which subscription you have is dependent on that. You must pay one of them, and it ranges from $15,000 to $25,000 per year."
  • "This solution is affordable and there is an option to pay for the solution based on your usage."
  • "AWS Glue is quite costly, especially for small organizations."
  • "AWS Glue uses a pay-as-you-go approach which is helpful. The price of the overall solution is low and is a great advantage."
  • "The overall cost of AWS Glue could be better. It cost approximately $1,000 a month. There is paid support available from AWS Glue."

Informatica PowerCenter:
  • "We have found the pricing very cost-effective. The licensing is CPU and data source-based."
  • "Cost could be improved."
  • "Licensing is a one time cost. But maintenance costs depend on what you want, how long you need it. Maintenance is a kind of insurance. With health insurance, you don't know whether you will get sick or need to go to hospital or not but you have to have insurance. It's the same thing with support. If you have that expertise in resolving issues, if you have enough experience in your IT department, I would say you don't need the support. But in practice, they recommend you go with the support. If you want support you have to pay for it."
  • "Price-wise, it's more expensive than SSIS, but it's a better tool, so it has more features. Licensing is on a yearly basis."
  • "Its maintenance is expensive."
  • "It's much more expensive, almost three times more expensive than most other solutions."
  • "We are satisfied with the pricing."
  • "I consider this to be an expensive product."
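
The DPU-hour arithmetic described in the first AWS Glue pricing quote above can be sketched as follows. This is a rough illustration, not an official AWS pricing calculator: the $0.44 per DPU-hour rate and the 10-minute billing blocks come from the reviewer's description and may not match current AWS pricing, and the function name `glue_job_cost` and its parameters are hypothetical.

```python
import math


def glue_job_cost(dpus, runtime_minutes, rate_per_dpu_hour=0.44,
                  billing_block_minutes=10):
    """Estimate the cost in dollars of one job run.

    Assumptions (taken from the reviewer's quote, not from AWS documentation):
    - each DPU costs `rate_per_dpu_hour` dollars per hour
    - runtime is rounded up to the next `billing_block_minutes` block
    """
    blocks = math.ceil(runtime_minutes / billing_block_minutes)
    billed_minutes = blocks * billing_block_minutes
    return round(dpus * (billed_minutes / 60) * rate_per_dpu_hour, 2)


# One DPU for a full hour, then ten DPUs for the same hour.
print(glue_job_cost(1, 60))
print(glue_job_cost(10, 60))
# A 12-minute job on 5 DPUs is billed as 20 minutes under the block assumption.
print(glue_job_cost(5, 12))
```

Under these assumptions the cost scales linearly with the number of DPUs, which is the "pricing gets multiplied" effect the reviewer describes, and a job that slightly overruns a block pays for the whole next block.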

    Comparison Review
    Anonymous User
    Technology has made it easier for businesses to organize and manipulate data to get a clearer picture of what’s going on with their business. Notably, ETL tools have made managing huge amounts of data significantly easier and faster, boosting many organizations’ business intelligence operations.

    There are many third-party vendors offering ETL solutions, but two of the most popular are PowerCenter Informatica and Microsoft SSIS (SQL Server Integration Services). Each technology has its advantages, but there are also similarities in how they carry out the extract-transform-load processes, and in places they differ only in terminology. If you’re in the process of choosing ETL tools and PowerCenter Informatica and Microsoft SSIS made it to your shortlist, here is a short comparative discussion detailing the differences between the two, as well as their benefits.

    Package Configuration

    Most enterprise data integration projects require the capacity to develop a solution in one platform and test and deploy it in a separate environment without having to manually change the established workflow. To achieve this seamless movement between two environments, your ETL technology should allow the dynamic update of the project’s properties using the content of a parameter file or configuration. Both Informatica and SSIS support this functionality using different methodologies. In Informatica, every session can have more than one source and one or more destination connections. There are…
    Questions from the Community
    Top Answer: AWS Glue and Azure Data factory for ELT best performance cloud services.
    Top Answer: We reviewed AWS Glue before choosing Talend Open Studio. AWS Glue is the managed ETL (extract, transform, and load) from Amazon Web Services. AWS Glue enables AWS users to create and manage jobs in…
    Top Answer: AWS Glue's main use case is for allowing users to discover, prepare, move, and integrate data from multiple sources. The product lets you use this data for analytics, application development, or…
    Top Answer: Azure Data Factory is flexible, modular, and works well. In terms of cost, it is not too pricey. It offers the stability and reliability I am looking for, good scalability, and is easy to set up and…
    Top Answer: SSIS PowerPack is a group of drag and drop connectors for Microsoft SQL Server Integration Services, commonly called SSIS. The collection helps organizations boost productivity with code-free…
    Top Answer: Complex transformations can easily be achieved using PowerCenter, which has all the features and tools to establish a real data governance strategy. Additionally, PowerCenter is able to manage huge…
    Ranking
                                AWS Glue    Informatica PowerCenter
    Rank                        1st         3rd out of 100 in Data Integration
    Views                       12,012      19,928
    Comparisons                 8,420       16,697
    Reviews                     32          29
    Average Words per Review    419         471
    Rating                      7.8         7.6

    Also Known As (Informatica PowerCenter): PowerCenter
    Overview

    AWS Glue is a serverless cloud data integration tool that facilitates the discovery, preparation, movement, and integration of data from multiple sources for machine learning (ML), analytics, and application development. The solution includes additional productivity and data ops tooling for authoring and running jobs and implementing business workflows.

    AWS Glue allows users to connect to more than 70 diverse data sources and manage data in a centralized data catalog. The solution facilitates visual creation, running, and monitoring of extract, transform, and load (ETL) pipelines to load data into users' data lakes. This Amazon product seamlessly integrates with other native applications of the brand and allows users to search and query cataloged data using Amazon EMR, Amazon Athena, and Amazon Redshift Spectrum.

    The solution also utilizes application programming interface (API) operations to transform users' data, create runtime logs, store job logic, and create notifications for monitoring job runs. The console of AWS Glue connects all of these services into a managed application, facilitating the monitoring and operational processes. The solution also performs provisioning and management of the resources required to run users' workloads in order to minimize manual work time for organizations.

    AWS Glue Features

    AWS Glue groups its features into four categories - discover, prepare, integrate, and transform. Within those groups are the following features:

    • Automatic schema discovery: AWS Glue crawlers connect to the organization's source or target data source through a prioritized list of classifiers to determine the schema for users' data. This feature creates metadata in companies' AWS Glue Data Catalog.

    • Schemas for data stream management: The AWS Glue Schema Registry enables users to validate and control the evolution of streaming data through registered Apache Avro schemas for no additional charge.

    • Automatic scaling based on workload: This feature dynamically scales resources up and down based on workload. It adds and removes job resources depending on how much the workload can be split up.

    • FindMatches: This feature is for machine learning-based data deduplication and cleansing, and works by finding records that are imperfect matches of each other to remove useless data copies.

    • Edit, debug, and test ETL code: This feature helps users who have chosen to interactively develop their ETL code by providing development endpoints for editing, debugging, and testing the code it generates for them.

    • AWS Glue DataBrew: An interactive, point-and-click visual interface for specialists to clean and normalize data without the need to write any code.

    • AWS Glue Interactive Sessions: This feature simplifies the development of data integration jobs by enabling data engineers to interactively prepare and explore data.

    • AWS Glue Studio Job Notebooks: This AWS Glue feature provides serverless notebooks with minimal setup, allowing developers to start working in a timely manner.

    • Complex ETL pipeline building: This feature allows the product to be invoked on a schedule, on demand, or based on an event, allowing users to start multiple jobs in parallel or specify dependencies to build complex ETL pipelines.

    • AWS Glue Studio: This AWS Glue feature allows users to visually transform data through a drag-and-drop interface. The product automatically generates the code for ETL processes for users' data.
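
The automatic schema discovery feature above describes a crawler that works through a prioritized list of classifiers until one recognizes the data. The plain-Python sketch below illustrates only that "first classifier that matches wins" idea. It does not use the AWS Glue API; the classifier functions, `infer_schema`, and the type-guessing rules are hypothetical.

```python
import csv
import io
import json


def classify_json(text):
    """Return a {field: type} schema if the first line is a JSON object, else None."""
    try:
        record = json.loads(text.splitlines()[0])
    except (ValueError, IndexError):
        return None
    if not isinstance(record, dict):
        return None
    return {key: type(value).__name__ for key, value in record.items()}


def classify_csv(text):
    """Return a schema for comma-separated text with a header row, else None."""
    rows = list(csv.reader(io.StringIO(text)))
    if len(rows) < 2 or len(rows[0]) < 2 or len(rows[0]) != len(rows[1]):
        return None

    def guess(value):
        # Hypothetical rule: anything that parses as an integer is "int".
        try:
            int(value)
            return "int"
        except ValueError:
            return "str"

    return {name: guess(value) for name, value in zip(rows[0], rows[1])}


# Priority order matters: the first classifier that returns a schema wins.
CLASSIFIERS = [("json", classify_json), ("csv", classify_csv)]


def infer_schema(text):
    """Try each classifier in priority order and return (format, schema)."""
    for name, classifier in CLASSIFIERS:
        schema = classifier(text)
        if schema is not None:
            return name, schema
    return "unknown", {}


print(infer_schema('{"id": 1, "name": "a"}'))
print(infer_schema("id,name\n1,a\n"))
print(infer_schema("hello"))
```

In this sketch the resulting schema is simply returned to the caller; in the feature described above, the equivalent metadata is written to the AWS Glue Data Catalog.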

    AWS Glue Benefits

    AWS Glue offers a wide range of benefits for its users. These benefits include:

    • Users of other AWS products can easily onboard with AWS Glue, as it is integrated across a wide range of the company's services.

    • The solution is serverless, which allows for a lower total cost of ownership.

    • AWS Glue offers more power for users, as it automates much of the effort in building, maintaining, and running ETL jobs.

    • The product allows customers to easily discover and search across all their AWS datasets through AWS Glue Data Catalog.

    • AWS Glue does not require additional payment for managing and enforcing schemas for data streams.

    • The solution facilitates the authoring of scalable ETL jobs for beginners and non-coding experts through a drag-and-drop interface.

    Reviews from Real Users

    Mustapha A., a cloud data engineer at Jems Groupe, likes AWS Glue because it is a product that is great for serverless data transformations.

    Liana I., CEO at Quark Technologies SRL, describes AWS Glue as highly scalable and reliable, with a beneficial pay-as-you-go pricing model.

    Informatica PowerCenter is a data integration and data visualization tool. The solution works as an enterprise data integration platform that helps organizations access, transform, and integrate data from various systems. The product is designed to support companies in the full cycle of a project, from its initial rollout to critical deployments. Informatica PowerCenter allows developers and analysts to collaborate while accelerating the work process to deploy projects within days instead of months.

    The Advanced edition of the product provides an additional real-time engine, which allows companies to have always-on enterprise data integration. This edition supports seamless collaboration and increases the visibility of data lineage and impact analysis.

    The Premium edition of the solution offers an early warning system that detects unexpected behaviors or incorrect utilization of resources in the workflows and alerts companies when these occur. This version of the product also offers automatic data validation, which ensures data accuracy and reduces testing time and expenditure of resources by up to 90%.

    Informatica PowerCenter Features

    The product provides users with various features which allow them to execute data integration initiatives such as analytics, data warehousing, data governance, consolidation, and application migration. The features of the solution include:

    • Collaboration: Informatica PowerCenter offers role-based tools and processes which enable business self-service while benefiting from high-quality IT resources.

    • Automation: Through various automations and easy-to-use software, users can utilize graphical and codeless tools and initiate effective data integration without additional knowledge.

    • Scalability: The tool provides high scalability to users, which ensures seamless performance and minimum downtime. PowerCenter also has adaptive load balancing, pushdown optimization, and dynamic partitioning.

    • Monitoring: Through the extensive monitoring feature, the operations and governance of the solution are easily overseen by users. The tool also provides alerts that can prevent damage to the system.

    • Real-time data: Through real-time data, users can monitor applications and analytics, ensuring their efficient operation.

    • Prototyping: Informatica lets its users collaborate with information technology to prototype, profile, and validate results in a timely manner.

    • Connectivity: Users can access and integrate data from different types of sources through high-performance connectors.

    • Automated data validation testing: The product offers script-free automated and repeatable audit and validation of data.

    • Data transformation: This feature provides comprehensive parsing of non-relational data, including JSON, PDF, XML, Microsoft Office, and Internet of Things (IoT) formats.

    • Cloud applications connectivity: The product allows for seamless connection to cloud application sources and targets.
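
The automated data validation testing feature listed above concerns checking data before and after a load. As a rough illustration of the general technique only, a source-to-target check can compare row counts and per-row hashes. This is not PowerCenter's implementation or API; `row_fingerprint`, `validate_load`, and the sample rows are hypothetical.

```python
import hashlib


def row_fingerprint(row):
    """Hash a row (a tuple of values) so rows can be compared cheaply."""
    joined = "\x1f".join(str(value) for value in row)
    return hashlib.sha256(joined.encode("utf-8")).hexdigest()


def validate_load(source_rows, target_rows):
    """Report count mismatches and rows missing from or unexpected in the target.

    Note: identical duplicate rows collapse to one fingerprint in this sketch,
    so duplicates are only caught by the row-count comparison.
    """
    source = {row_fingerprint(r): r for r in source_rows}
    target = {row_fingerprint(r): r for r in target_rows}
    return {
        "counts_match": len(source_rows) == len(target_rows),
        "missing_in_target": [source[k] for k in source if k not in target],
        "unexpected_in_target": [target[k] for k in target if k not in source],
    }


source = [(1, "alice"), (2, "bob"), (3, "carol")]
target = [(1, "alice"), (3, "carol"), (4, "dave")]
print(validate_load(source, target))
```

The example deliberately uses matching row counts with different contents, to show why a count check alone is not a sufficient validation.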

    Informatica PowerCenter Benefits

    The benefits of using Informatica PowerCenter include:

    • The tool can work over a wide range of systems and platforms and also allows for lean integration.

    • It enhances the quality and speed of performance and optimizes the cost of the process for your organization.

    • PowerCenter supports multiple database load utilities, including Teradata TPump, Parallel Transporter, FastLoad, and MultiLoad (MLoad).

    • The tool is very easy to monitor and maintain, which simplifies the data integration process for companies.

    • The centralized error logging system allows users to locate errors in a timely manner and correct them.

    • The tool can convert data from an application to another format, as it serves as one of the most powerful data transformation solutions.

    • PowerCenter can also serve as middleware between two applications.

    • The solution offers both parallel processing and load balancing.

    • PowerCenter is a tool with a high level of security, which also minimizes essential administration activities.

    • The solution ensures the quality of information, as it does not allow invalid or unwanted data to be loaded into the target.

    Reviews from Real Users

    Yahya T., a developer and architect at L'Oreal, says the product is stable, provides good support, and integrates with other systems very quickly.

    Mohamed E., a senior manager for data management and data governance at a tech company, says PowerCenter is stable and mature, offers flexibility in building pipelines, has a drag-and-drop mode because it is GUI-based, and has brilliant technical support.

    Sample Customers
    AWS Glue: bp, Cerner, Expedia, FINRA, Hess, Intuit, Kellogg's, Philips, TIME, Workday
    Informatica PowerCenter: University of Texas MD Anderson Cancer Center, LexisNexis, Rabobank
    Top Industries
    AWS Glue
      Reviewers: Computer Software Company 47%, Financial Services Firm 18%, Pharma/Biotech Company 12%, Consumer Goods Company 6%
      Visitors reading reviews: Financial Services Firm 20%, Computer Software Company 14%, Manufacturing Company 7%, Insurance Company 7%
    Informatica PowerCenter
      Reviewers: Computer Software Company 22%, Financial Services Firm 20%, Insurance Company 7%, Retailer 7%
      Visitors reading reviews: Financial Services Firm 17%, Computer Software Company 12%, Manufacturing Company 8%, Insurance Company 8%
    Company Size
    AWS Glue
      Reviewers: Small Business 29%, Midsize Enterprise 13%, Large Enterprise 58%
      Visitors reading reviews: Small Business 15%, Midsize Enterprise 12%, Large Enterprise 72%
    Informatica PowerCenter
      Reviewers: Small Business 16%, Midsize Enterprise 11%, Large Enterprise 73%
      Visitors reading reviews: Small Business 15%, Midsize Enterprise 11%, Large Enterprise 74%

    AWS Glue is ranked 1st in Cloud Data Integration with 37 reviews while Informatica PowerCenter is ranked 3rd in Data Integration with 78 reviews. AWS Glue is rated 7.8, while Informatica PowerCenter is rated 8.0. The top reviewer of AWS Glue writes "Provides serverless mechanism, easy data transformation and automated infrastructure management". On the other hand, the top reviewer of Informatica PowerCenter writes "Stable, provides good support, and integrating it with other systems is very fast, but its pricing is expensive". AWS Glue is most compared with AWS Database Migration Service, SSIS, Informatica Cloud Data Integration, Talend Open Studio and Confluent, whereas Informatica PowerCenter is most compared with Informatica Cloud Data Integration, Azure Data Factory, SSIS, Databricks and Informatica PowerExchange. See our AWS Glue vs. Informatica PowerCenter report.


    We monitor all Cloud Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.