

Pentaho Data Integration and Informatica PowerCenter are competitors in the data integration and analytics space. Pentaho Data Integration appears to have the edge due to its versatility and cost-effectiveness, despite room for enhancement regarding cloud integration.
Features: Pentaho Data Integration offers a highly intuitive interface facilitating quick development with minimal coding, alongside extensive plugin support for versatile data transformations. Its open-source nature provides robust connectivity options. Informatica PowerCenter is acclaimed for its strong ETL capabilities and comprehensive transformation libraries, ideally suited for large data volumes and maintaining advanced data governance.
Room for Improvement: Pentaho Data Integration needs better native cloud integrations and real-time processing abilities; it sometimes requires workarounds due to its open-source inconsistencies. Informatica PowerCenter, while providing premium features, faces criticism for its high licensing costs, needing enhanced cloud integration and a simpler user interface experience.
Ease of Deployment and Customer Service: Pentaho Data Integration is flexible across various environments, appreciated for ease of access in its Community Edition, although responses to support requests may be delayed. Informatica PowerCenter is commendable for stability in on-premises setups, offering reliable technical support but at a significant cost, with noted integration challenges in cloud environments.
Pricing and ROI: Pentaho Data Integration is notably cost-effective, particularly with its free Community Edition, ensuring quick ROI and scalability without heavy licensing fees. Informatica PowerCenter serves as a high-end product delivering advanced enterprise features but at a premium price, potentially restricting for smaller organizations.
It also plays a vital role in revenue calculations, net asset valuations, and other key factors that support customer data and investment data pipelines.
The investment we have made is tremendous; it has saved a lot of time and effort, and fewer people are needed.
The return on investment is very good, as I previously mentioned, because the development team has been reduced to half, and it has saved us around one hour per day since we switched to Informatica PowerCenter.
I have seen a return on investment; my team was able to stay extremely small even though we had a lot of data integrations with many companies.
I can testify to the return on investment with metrics regarding time saved; we have increased our efficiency by about 20 to 30 percent due to the swift migration processes facilitated by the tool.
I have noticed a return on investment with Pentaho Data Integration and Analytics in terms of time savings and staff reduction.
The documentation is thorough, and anyone with minimal knowledge of ETL can easily understand it and work through errors.
I like the technical support provided by Informatica.
I have occasionally needed to communicate with the technical support of Informatica PowerCenter, especially when raising cases for complex mappings and performance optimization to identify bottlenecks in transformations.
24/7 assistance is available for the Enterprise Edition.
take the time to understand our business requirements, offering appropriate recommendations.
Communication with the vendor is challenging
In the cloud, scaling up and down becomes easy when working with cloud providers.
The scalability of Informatica PowerCenter is tremendous because we can install it on any of our employees' systems, and it handles each and every task very swiftly.
We can easily scale the memory and also the workflows.
It can be scaled well until you reach a point where you need to perform a lot of operations, and the issue arises when it runs out of memory to handle some data.
Its ability to scale horizontally in cloud-native architectures or for massive real-time processing is limited.
Pentaho Data Integration handles larger datasets better.
We are getting 100% uptime every day.
Informatica PowerCenter is stable and can scale well.
The product is very stable with very few issues encountered in production.
Performance issues arise due to reliance on a flowchart-based mechanism instead of scripts, which can lead to longer execution times.
I find that version 3.1 is the most stable version I have ever used.
It's pretty stable, however, it struggles when dealing with smaller amounts of data.
With Informatica PowerCenter, I am looking for an AI interface that looks at the underlying data model of the databases and the metadata of the tables, allowing the developer to provide instructions on what data sources to connect to and how to apply or create Transformations.
Utilizing more stored procedures from Oracle databases in an easy way would significantly boost performance.
Informatica Cloud and its support becomes quite expensive for the organization compared to peers such as SnapLogic or Netezza, which offer lower pricing.
We should also explore more effective partitioning for parallel processing and fine-tuning database connections to reduce load times and improve ETL speed.
Pentaho Data Integration and Analytics can be improved by working with different environments, specifically the possibility to change the variables, meaning I write my variables only once and can change them for different environments such as production or development.
Pentaho Data Integration and Analytics could have real-time processing and automatic alerting, having alerts or automatic notifications when a job fails or when certain data doesn't meet certain rules.
I find that the pricing and licensing for Informatica PowerCenter align with its quality.
The price of Informatica PowerCenter is high, especially for small and medium-sized businesses.
We haven't paid for it; our client had paid for this tool.
I use the community version of Pentaho Data Integration and Analytics, and I do not need additional costs.
The setup cost was minimal, and the pricing experience was pretty good.
The company covered it and they had no problem paying for it because they saw that it was cost-effective in terms of performance afterwards.
The system supports real-time integration, which is essential for many of my tasks.
Informatica monitors can be used to monitor the jobs that we run, and if there is any kind of failure, we can diagnose it right away.
Another valuable feature is the use of Mapplets; if we have one mapping created that we want to use again and again for other workflows, we can create a Mapplet and save it so that we can reuse the mapping, reducing our workload.
Pentaho Data Integration and Analytics has positively impacted my organization because it meant we didn't have to write a lot of custom API back-end processing logic; it did the majority of that heavy lifting for us.
It automates the data workflow, including extraction, cleansing, and loading into warehouses for BI reporting purposes, while also removing duplicates, validating data, and standardizing formats, enabling real-time decision-making.
Pentaho Data Integration and Analytics has positively impacted my organization because it is easier to use, and my knowledge about this work facilitates the translation from the source to my final system.
| Product | Mindshare (%) |
|---|---|
| Informatica PowerCenter | 3.4% |
| Pentaho Data Integration and Analytics | 1.7% |
| Other | 94.9% |


| Company Size | Count |
|---|---|
| Small Business | 15 |
| Midsize Enterprise | 11 |
| Large Enterprise | 75 |
| Company Size | Count |
|---|---|
| Small Business | 18 |
| Midsize Enterprise | 17 |
| Large Enterprise | 32 |
Informatica PowerCenter is known for its robust data integration, scalability, and user-friendly interfaces. It simplifies data processing with real-time capabilities, handling large datasets efficiently. Its adaptability with diverse sources makes it suitable for complex data environments.
Informatica PowerCenter offers extensive transformation options with features like flow designer, mapping, and error handling, enhancing development efficiency. Its GUI interface allows seamless integration across different platforms, making it suitable for managing extensive datasets. Traceability and support cater to evolving data requirements, while adaptability with multiple sources aids in driving strategic data outputs. Some areas for improvement include a more robust cloud strategy, better documentation, and improved API integrations. Enhanced automation and setup processes could further refine the experience.
What are the key features of Informatica PowerCenter?Informatica PowerCenter plays a vital role in data integration and ETL processes for building data warehouses. Industries like banking, insurance, and healthcare utilize it for extracting, transforming, and loading data into target systems, supporting analytics, reporting, and compliance. Companies often transition to cloud environments for enhanced scalability and efficiency.
Pentaho Data Integration and Analytics offers an intuitive platform for data workflows, enabling users to easily manage ETL processes across diverse data formats, ensuring seamless automation and development.
With its drag-and-drop interface, Pentaho allows for efficient ETL workflows without extensive coding. It supports a multitude of data formats and sources such as SQL, NoSQL, Hadoop, CSV, and JSON. Advanced features like metadata injection and API integration enable seamless automation. However, improvements in big data performance, better cloud service integration, and enhanced real-time processing capabilities can enhance user experience. Additional connectors and improved documentation are sought after by many. Providing support for more programming languages and optimizing memory usage also presents opportunities for enhancement.
What are the key features of Pentaho Data Integration and Analytics?Pentaho is employed across finance, healthcare, and retail industries for ETL processes. It's instrumental in integrating data from ERP, SAP systems, Excel, and APIs to develop comprehensive reports and data models. Companies rely on its capabilities for both on-premises and cloud deployments, improving data transparency and management.
We monitor all Data Integration reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.