AWS Glue vs Informatica Cloud Data Integration comparison

Cancel
You must select at least 2 products to compare!
Amazon Web Services (AWS) Logo
12,012 views|8,420 comparisons
92% willing to recommend
Informatica Logo
3,563 views|2,909 comparisons
88% willing to recommend
Comparison Buyer's Guide
Executive Summary
Updated on Sep 6, 2022

We performed a comparison between AWS Glue and Informatica Cloud Data Integration based on our users’ reviews in four categories. After reading all of the collected data, you can find our conclusion below.

  • Ease of Deployment: For the most part, users of both solutions feel they are easy and straightforward to deploy.
  • Features: AWS Glue can easily sync data from the source to the solution phase and users say it provides excellent intuitive automation. They find it is very robust and flexible, enabling them to write their own queries to achieve the desired transformations quickly. However, they say AWS is not very user friendly and only works with other AWS tools and solutions.

    Informatica Cloud Data Integration offers mass ingestion functionality, and users say it is very flexible, elastic, and is for enterprise organizations. The solution makes it easy to create integrations and they provide many connectors. Many users feel the solution has a steep learning curve and performance limitations.
  • Pricing: AWS Glue users tell us the solution is affordable and offers a pay-as-you-use option. Informatica Cloud Data Integration users feel the pricing could be improved.
  • Service and Support: Overall, users are satisfied with the service and support.

Comparison Results: For users vested in the AWS ecosystem, AWS is hands down the best choice. Informatica Cloud Data Integration is flexible and allows users to decide how to distribute their IPUs in their own networks. Data residency laws make it challenging to choose this solution, as their regions are currently very limited.

To learn more, read our detailed AWS Glue vs. Informatica Cloud Data Integration Report (Updated: March 2024).
768,740 professionals have used our research since 2012.
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"I also like that you can add custom libraries like JAR files and use them. So, the ability to use a fast processing engine and embed basic jobs easily are significant advantages.""The solution integrates well with other AWS products or services.""It's fairly straightforward as a product; it's not very complicated.""AWS Glue is fast and managed by AWS. Hence, you don't have to worry about capacity and the performance of Glue jobs. It has integrations with other data stores of AWS. The product offers metadata management, logging, and ETL processing capabilities. It comes with a powerful feature, Glue Studio, which helps to do queries interactively within the community. It is a managed service and very secure. Another popular and mature service is S3.""We have found it beneficial when moving data from one source to another.""I like its integration and ability to handle all data-related tasks.""The most valuable feature for me is the visual interface of AWS Glue.""AWS Glue is a good solution for developers, they have the ability to write code in different languages and other software."

More AWS Glue Pros →

"It has all the advantages of the Cloud in that you can use it without worrying about infrastructure, upkeep, or upgrades.""The most valuable feature is the building of mockups and tasks.""The most valuable features of Informatica Cloud Data Integration for our clients are the AI capabilities within Informatica Intelligent Cloud Services.""It is quite easy to use and flexible.""We have a lot of integrations, and it's very easy to create integrations. They have a lot of connectors.""It is a scalable solution. Scalability-wise, I rate the solution a nine out of ten.""The support is very good.""The user interface which is very easy to use if we have any problems to solve."

More Informatica Cloud Data Integration Pros →

Cons
"The setup and installation is a bit complex without advanced knowledge or training.""I haven't looked into Glue in terms of seeking out flaws. I've not come across missing features.""In terms of improvement, the performance of AWS Glue could be faster.""Only people who can code, either in Java or Python, can use the product freely. Those who don't know Java or Python might find using AWS Glue difficult.""One area that could be improved is the ETL view. The drag-and-drop interface is not as user-friendly as some other ETL tools.""The mapping area and the use of the data catalog from Glue could be better.""While working on AWS Glue, I could not find any training material for it.""There should be more connectors for different databases."

More AWS Glue Cons →

"I would like to see more functionality added so that it is a bit closer to how much you can do with Informatica PowerCenter.""With the solution, we had some issues, and we have every day, and we used to open a ticket. Sometimes, there are data issues and transformation issues.""Cost-wise, it could be better.""Error reporting and debugging need improvement.""One area where Informatica Cloud Data Integration could improve is in providing more accelerators for certain functionalities.""One area that needs to improve is the user experience because it is very complex. The trial version is very complex so it's not easy to start using the program immediately. You must study the rules first.""Certain shortcomings in the product's UI make it an area where improvements are required.""The cloud version of the Informatica, it's a very substandard product. They might say it's enterprise-ready but it's not at all ready. They need to add more features, such as improved data replication features. If you look at other tools, such as Matillion they are now cloud-native and flexible. Additionally, Informatica Cloud Data Integration should have a good migration strategy from Informatica PowerCenter to Informatica Cloud Data Integration."

More Informatica Cloud Data Integration Cons →

Pricing and Cost Advice
  • "The pricing is a bit higher than other solutions like Athena and EC2. If the pricing becomes more scaled or flexible, it will be good because you have to pay 44 cents just for one DPU for an hour. If you increase DPUs to 5 or 10, the pricing gets multiplied. There are also some time limits like 0 to 10 minutes or 10 to 20 minutes. If the pricing is according to the minutes, it would be better because you have to limit your job to 10 minutes or 20 minutes."
  • "It is not expensive. AWS Glue works on the serverless architecture. We get charged for the time the server is up. For our use case, we have to use it once in a day, and it is not expensive for us."
  • "Its price is good. We pay as we go or based on the usage, which is a good thing for us because it is simple to forecast for the tool. It is good in terms of the financial planning of the company, and it is a good way to estimate the cost. It is also simple for our clients. In my opinion, it is one of the best tools in the market for ETL processes because of the fact that you pay as you use, which separates it from other big tools such as PowerCenter, Pentaho Data Integration, and Talend."
  • "Technical support is a paid service, and which subscription you have is dependent on that. You must pay one of them, and it ranges from $15,000 to $25,000 per year."
  • "This solution is affordable and there is an option to pay for the solution based on your usage."
  • "AWS Glue is quite costly, especially for small organizations."
  • "AWS Glue uses a pay-as-you-go approach which is helpful. The price of the overall solution is low and is a great advantage."
  • "The overall cost of AWS Glue could be better. It cost approximately $1,000 a month. There is paid support available from AWS Glue."
  • More AWS Glue Pricing and Cost Advice →

  • "It is cost effective and an easily accessible tool."
  • "The pricing structure is good, but having to pay for extra drivers to be used in an ICS environment makes me a little nervous."
  • "Licensing is difficult to understand, but the team is always available to explain anything. They are very helpful."
  • "My understanding is that Informatica is quite expensive compare to other tools that are available in the market."
  • "Our customers sometimes are able to negotiate a much better price for Informatica Cloud Data Integration based on their relationship with the vendor."
  • "Its pricing model can be improved."
  • "I'm not sure about the most recent pricing trends, but I don't believe it's significantly different from PowerCenter. I believe it is nearly the same."
  • "The price of Informatica Cloud Data Integration could be reduced."
  • More Informatica Cloud Data Integration Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which Cloud Data Integration solutions are best for your needs.
    768,740 professionals have used our research since 2012.
    Questions from the Community
    Top Answer:AWS Glue and Azure Data factory for ELT best performance cloud services.
    Top Answer:We reviewed AWS Glue before choosing Talend Open Studio. AWS Glue is the managed ETL (extract, transform, and load) from Amazon Web Services. AWS Glue enables AWS users to create and manage jobs in… more »
    Top Answer:AWS Glue's main use case is for allowing users to discover, prepare, move, and integrate data from multiple sources. The product lets you use this data for analytics, application development, or… more »
    Top Answer:Azure Data Factory is a solid product offering many transformation functions; It has pre-load and post-load transformations, allowing users to apply transformations either in code by using Power… more »
    Top Answer:Complex transformations can easily be achieved using PowerCenter, which has all the features and tools to establish a real data governance strategy. Additionally, PowerCenter is able to manage huge… more »
    Top Answer:When it comes to cloud data integration, this solution can provide you with multiple benefits, including Overhead reduction by integrating data on any cloud in various ways Effective integration of… more »
    Ranking
    1st
    Views
    12,012
    Comparisons
    8,420
    Reviews
    32
    Average Words per Review
    419
    Rating
    7.8
    5th
    Views
    3,563
    Comparisons
    2,909
    Reviews
    17
    Average Words per Review
    467
    Rating
    7.8
    Comparisons
    Learn More
    Overview

    AWS Glue is a serverless cloud data integration tool that facilitates the discovery, preparation, movement, and integration of data from multiple sources for machine learning (ML), analytics, and application development. The solution includes additional productivity and data ops tooling for running jobs, implementing business workflows, and authoring.

    AWS Glue allows users to connect to more than 70 diverse data sources and manage data in a centralized data catalog. The solution facilitates visual creation, running, and monitoring of extract, transform, and load (ETL) pipelines to load data into users' data lakes. This Amazon product seamlessly integrates with other native applications of the brand and allows users to search and query cataloged data using Amazon EMR, Amazon Athena, and Amazon Redshift Spectrum.

    The solution also utilizes application programming interface (API) operations to transform users' data, create runtime logs, store job logic, and create notifications for monitoring job runs. The console of AWS Glue connects all of these services into a managed application, facilitating the monitoring and operational processes. The solution also performs provisioning and management of the resources required to run users' workloads in order to minimize manual work time for organizations.

    AWS Glue Features

    AWS Glue groups its features into four categories - discover, prepare, integrate, and transform. Within those groups are the following features:

    • Automatic schema discovery: AWS Glue crawlers connect to the organization's source or target data source through a prioritized list of classifiers to determine the schema for users' data. This feature creates metadata in companies' AWS Glue Data Catalog.

    • Schemas for data stream management: The AWS Glue Schema Registry enables users to validate and control the evolution of streaming data through registered Apache Avro schemas for no additional charge.

    • Automatic scaling based on workload: This feature dynamically scales resources up and down based on workload. The feature controls job resources, removing them depending on how much the workload can be split up.

    • FindMatches: This feature is for machine learning-based data deduplication and cleansing, and works by finding records that are imperfect matches of each other to remove useless data copies.

    • Edit, debug, and test ETL code: This feature helps users who have chosen to interactively develop their ETL code by providing development endpoints for editing, debugging, and testing the code it generates for them.

    • AWS Glue DataBrew: An interactive, point-and-click visual interface for specialists to clean and normalize data without the need to write any code.

    • AWS Glue Interactive Sessions: This feature simplifies the development of data integration jobs by enabling data engineers to interactively prepare and explore data.

    • AWS Glue Studio Job Notebooks: This AWS Glue feature provides serverless notebooks with minimal setup, allowing developers to start working in a timely manner.

    • Complex ETL pipeline building: This feature allows the product to be invoked on a schedule, on demand, or based on an event, allowing users to start multiple jobs in parallel or specify dependencies to build complex ETL pipelines.

    • AWS Glue Studio: This AWS Glue feature allows users to visually transform data through a drag-and-drop interface. The product automatically generates the code for ETL processes for users' data.

    AWS Glue Benefits

    AWS Glue offers a wide range of benefits for its users. These benefits include:

    • Users of other AWS products can easily onboard with AWS Glue, as it is integrated across a wide range of the company's services.

    • The solution is serverless, which allows for a lower total cost of ownership.

    • AWS Glue offers more power for users, as it automates much of the effort in building, maintaining, and running ETL jobs.

    • The product allows customers to easily discover and search across all their AWS datasets through AWS Glue Data Catalog.

    • AWS Glue does not require additional payment for managing and enforcing schemas for data streams.

    • The solution facilitates the authority of scalable ETL jobs for beginners and non-coding experts through a drag-and-drop interface.

    Reviews from Real Users

    Mustapha A., a cloud data engineer at Jems Groupe, likes AWS Glue because it is a product that is great for serverless data transformations.

    Liana I., CEO at Quark Technologies SRL, describes AWS Glue as a highly scalable, reliable, and beneficial pay-as-you-go pricing model.

    Informatica Cloud Data Integration is a cloud-native cloud data integration solution that enables users to connect a large number of applications and data sources across on-premises and integrate the data sources at scale on the cloud. The product is built on microservices-driven management and integration platform as a service (iPaaS) and assists organizations to govern costs, increase productivity and collaboration, and simplify their experience. Informatica Cloud Data Integration allows companies to deliver data and analytics to lines of business in a timely manner, build data warehouses on Amazon Redshift, Google Cloud BigQuery, Snowflake, and Microsoft Azure Synapse Analytics, and utilize the required data integration patterns, including elastic processing, extract, load, and transform (ELT), and extract, transform, and load (ETL).

    The solution allows users to to build enterprise-scale integration workloads within hours while it improves the productivity of development teams by providing them a codeless, drag-and-drop user interface. Companies can benefit from integration features built for data warehousing and optimized connectors for bulk loads of billions of records. Informatica Cloud Data Integration offers organizations the option of going serverless at scale by allowing them to process data integration jobs from cloud-hosted as well as managed environments. The Spark-based engine allows the solution to handle high-volume data demands and complex data integration tasks.

    Informatica Cloud Data Integration Features

    Informatica Cloud Data Integration provides its users with various features and tools. Among the key capacities of the product are:

    • Advanced Pushdown Optimization: Informatica Cloud Data Integration offers a feature that provides users with the benefits of ELT while maintaining their data flow definitions at a logical or abstract level. This feature allows users to choose a runtime option that complies with the workload as well as send their data processing work to cloud ecosystem pushdown, cloud data warehouse pushdown, Spark serverless processing, or traditional ETL.

    • Connectors for all major data sources: This feature provides out-of-the-box connectivity to a large number of cloud and on-premise systems, data stores, analytics and BI tools, and enterprise and middleware applications.

    • Data transformation capabilities: This feature allows users to process data transformation in real time or batch by using a variety of transformation types, such as cleansing, masking, aggregation, fileting, parsing, and ranking.

    • Spark-based complex data integration: Informatica Cloud Data Integration Elastic allows specialists to use elastic clusters to process their data transformation.

    • Codeless integration: This feature facilitates the creation of simple-to-sophisticated data integration projects with a visual mapping designer that speeds up pre-build transformations for development through a variety of endpoints across cloud and on-premises.

    • Serverless data integration: Users can achieve cloud data integration in a mode called Advanced Serverless, where they can benefit from a fully managed environment with no software, no cloud administration, and no servers or clusters to manage.

    • Taskflow orchestration: This feature allows users to combine batch and real-time integration through a taskflow designer in order to create simple-to-sophisticated orchestrations.

    • Intelligent structure discovery: This feature uses the CLAIRE engine to automatically understand the parsing model for complicated files based on their structure.

    • Change data capture: Utilizing the prebuilt task wizards and Change Data Capture tool, users can automatically pull only the updated or incremental data from source systems to the targets on a frequent basis.

    • Security: The product offers various features which ensure the highest level of data and workload security and comply with various policies.

    Informatica Cloud Data Integration Benefits

    Informatica Cloud Data Integration brings multiple benefits to its users. These include:

    • The product offers optimized connectivity to various systems through custom build-connectors.

    • Users can benefit from improved elasticity and performance by utilizing Spark clusters and auto-tuning.

    • The tool allows developers to focus on business logic by facilitating infrastructure management through serverless deployment features.

    • Informatica Data Cloud Integration provides user flexibility by connecting to any database, cloud data lake, on-premise apps, and data warehouses.

    • Through a zero-coding environment and role-appropriate user experience, the solution is suitable for all types of users.

    • The solution offers consistent experience and unified metadata across all cloud services.

    • Users can leverage enterprise-level performance for integration design with no coding required.

    • Informatica Data Cloud Integration scales as a business grows, providing a high level of adaptability.

    Reviews from Real Users

    Divya R., a senior consultant at Deloitte, rates Informatica Cloud Data Integration highly because it is a UI-based tool with great scripting.

    A data architect at a retailer likes Informatica Cloud Data Integration because of its flexible licensing, good connectors, and timely upgrades and patches.

    Sample Customers
    bp, Cerner, Expedia, Finra, HESS, intuit, Kellog's, Philips, TIME, workday
    Chicago Cubs, Telegraph Media Group
    Top Industries
    REVIEWERS
    Computer Software Company47%
    Financial Services Firm18%
    Pharma/Biotech Company12%
    Consumer Goods Company6%
    VISITORS READING REVIEWS
    Financial Services Firm19%
    Computer Software Company14%
    Manufacturing Company7%
    Insurance Company7%
    REVIEWERS
    Computer Software Company37%
    Pharma/Biotech Company21%
    Manufacturing Company11%
    Individual & Family Service5%
    VISITORS READING REVIEWS
    Financial Services Firm16%
    Computer Software Company14%
    Manufacturing Company9%
    Insurance Company8%
    Company Size
    REVIEWERS
    Small Business29%
    Midsize Enterprise13%
    Large Enterprise58%
    VISITORS READING REVIEWS
    Small Business15%
    Midsize Enterprise12%
    Large Enterprise73%
    REVIEWERS
    Small Business21%
    Midsize Enterprise21%
    Large Enterprise57%
    VISITORS READING REVIEWS
    Small Business15%
    Midsize Enterprise11%
    Large Enterprise74%
    Buyer's Guide
    AWS Glue vs. Informatica Cloud Data Integration
    March 2024
    Find out what your peers are saying about AWS Glue vs. Informatica Cloud Data Integration and other solutions. Updated: March 2024.
    768,740 professionals have used our research since 2012.

    AWS Glue is ranked 1st in Cloud Data Integration with 37 reviews while Informatica Cloud Data Integration is ranked 5th in Cloud Data Integration with 40 reviews. AWS Glue is rated 7.8, while Informatica Cloud Data Integration is rated 7.8. The top reviewer of AWS Glue writes "Provides serverless mechanism, easy data transformation and automated infrastructure management". On the other hand, the top reviewer of Informatica Cloud Data Integration writes "A stable, scalable, and user-friendly solution". AWS Glue is most compared with AWS Database Migration Service, Informatica PowerCenter, SSIS, Talend Open Studio and Confluent, whereas Informatica Cloud Data Integration is most compared with Informatica PowerCenter, Azure Data Factory, Fivetran, Mule Anypoint Platform and Matillion ETL. See our AWS Glue vs. Informatica Cloud Data Integration report.

    See our list of best Cloud Data Integration vendors.

    We monitor all Cloud Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.