Informatica PowerCenter and AWS Glue are both prominent in the data integration tools category. AWS Glue seems to have the upper hand with its feature set, deployment flexibility, and cost-effectiveness.
Features: Informatica PowerCenter provides comprehensive ETL capabilities, supports complex transformations, and ensures strong security. It allows extensive integration across different data sources, including NoSQL databases, and offers robust debugging methods. AWS Glue, on the other hand, benefits from complete integration with AWS services, features for code generation, and a flexible pricing model, making it appealing for businesses requiring agility and scalability.
Room for Improvement: Informatica PowerCenter could improve its pricing strategy, interface modernity, and cloud capabilities while simplifying setup for new users. AWS Glue needs enhancements in error handling, documentation, language support, and integration flexibility beyond AWS environments.
Ease of Deployment and Customer Service: Informatica PowerCenter generally favors on-premises or private cloud setups, offering strong support but facing critiques on complexity and response time. AWS Glue is optimized for cloud environments, providing seamless integration with AWS products and flexible deployment but could improve in support accessibility and user-friendly pipeline setups.
Pricing and ROI: Informatica PowerCenter is more expensive due to its comprehensive feature set, valuable for large enterprises needing robust integration solutions, though costly for small to medium businesses. AWS Glue, with its pay-as-you-go model, presents a flexible and cost-effective solution, particularly advantageous for businesses seeking scalable options without high initial costs. Both solutions offer value, but AWS Glue's pricing appeals to a broader range of companies.
I advocate using Glue in such cases.
Upgrades occur every four months, and new developments coincide with version updates.
I like the technical support provided by Informatica.
For jobs requiring multiple RAM usage, we increase the number of workers accordingly.
It can easily handle data from one terabyte to 100 terabytes or more, scaling nicely with larger datasets.
For scalability, I would rate Informatica PowerCenter between eight to nine.
As a managed service, it reduces management burdens.
Informatica PowerCenter is stable and can scale well.
A more user-friendly and simpler process would help speed up the deployment process.
With AWS, I gather data from multiple sources, clean it up, normalize it, de-duplicate it, and make it presentable.
Learning the latest functionalities is crucial, and while challenging, it is a vital part of staying current and ensuring an efficient ETL process.
With Informatica PowerCenter, I am looking for an AI interface that looks at the underlying data model of the databases and the metadata of the tables, allowing the developer to provide instructions on what data sources to connect to and how to apply or create Transformations.
Utilizing more stored procedures from Oracle databases in an easy way would significantly boost performance.
The smallest cost for a project is around €700, while the largest can reach up to €7,000 based on the scale of the usage.
AWS charges based on runtime, which can be quite pricey.
Costing depends on resource usage, and cost optimization may involve redesigning jobs for flexibility.
The price of Informatica PowerCenter is high, especially for small and medium-sized businesses.
I find that the pricing and licensing for Informatica PowerCenter align with its quality.
For ETL, I feel the performance is excellent. If I create jobs in a standard way, the performance is great, and maintenance is also seamless.
AWS Glue's most valuable features include its transformation capabilities, which provide data quality and shape for processing in ML or AI models.
I think if I'm working with big data, common languages like Python work quite nicely, which is advantageous.
The system supports real-time integration, which is essential for many of my tasks.
The functions in Informatica PowerCenter that I have found most valuable are the way it manages the volume of data, the push down optimization, and the performance aspects of it, mostly related to parallelism techniques.
AWS Glue is a serverless cloud data integration tool that facilitates the discovery, preparation, movement, and integration of data from multiple sources for machine learning (ML), analytics, and application development. The solution includes additional productivity and data ops tooling for running jobs, implementing business workflows, and authoring.
AWS Glue allows users to connect to more than 70 diverse data sources and manage data in a centralized data catalog. The solution facilitates visual creation, running, and monitoring of extract, transform, and load (ETL) pipelines to load data into users' data lakes. This Amazon product seamlessly integrates with other native applications of the brand and allows users to search and query cataloged data using Amazon EMR, Amazon Athena, and Amazon Redshift Spectrum.
The solution also utilizes application programming interface (API) operations to transform users' data, create runtime logs, store job logic, and create notifications for monitoring job runs. The console of AWS Glue connects all of these services into a managed application, facilitating the monitoring and operational processes. The solution also performs provisioning and management of the resources required to run users' workloads in order to minimize manual work time for organizations.
AWS Glue Features
AWS Glue groups its features into four categories - discover, prepare, integrate, and transform. Within those groups are the following features:
AWS Glue Benefits
AWS Glue offers a wide range of benefits for its users. These benefits include:
Reviews from Real Users
Mustapha A., a cloud data engineer at Jems Groupe, likes AWS Glue because it is a product that is great for serverless data transformations.
Liana I., CEO at Quark Technologies SRL, describes AWS Glue as a highly scalable, reliable, and beneficial pay-as-you-go pricing model.
Informatica PowerCenter is a data integration and data visualization tool. The solution works as an enterprise data integration platform that helps organizations access, transform, and integrate data from various systems. The product is designed to support companies in the full cycle of a project, from its initial rollout to critical deployments. Informatica PowerCenter allows developers and analysts to collaborate while accelerating the work process to deploy projects within days instead of months.
The Advanced edition of the product provides an additional real-time engine which allows companies to have always-on enterprise data integration. This ensures seamless collaboration and increment of data lineage visibility and impacts analysis.
The Premium edition of the solution offers an early warning system that detects unexpected behaviors or incorrect utilization of resources in the workflows and alerts companies in the case that these occur. This version of the product also offers automatic data validation, which ensures data accuracy and reduces testing time and expenditure of resources for by up to 90%.
Informatica PowerCenter Features
The product provides users with various features which allow them to execute data integration initiatives such as analytics, data warehousing, data governance, consolidation, and application migration. The features of the solution include:
Informatica PowerCenter Benefits
The benefits of using Informatica PowerCenter include:
Reviews from Real Users
Yahya T., a developer and architect at L'Oreal, says the product is stable, provides good support, and integrating it with other systems is very fast.
Mohamed E., a senior manager for Data management and data governance at a tech company, says PowerCenter is stable, mature, and offers flexibility in building the pipeline and has a drag-and-drop mode because it's GUI-based; technical support is brilliant.
We monitor all Cloud Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.