Try our new research platform with insights from 80,000+ expert users

Amazon EMR vs Pentaho Business Analytics comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

ROI

Sentiment score
5.4
Amazon EMR delivers significant cost savings and efficiency, with some users achieving up to 20% savings and positive ROI.
Sentiment score
5.4
Pentaho Business Analytics mixed ROI perceptions highlight efficiency gains but unclear returns compared to competitors like QlikView and Tableau.
 

Customer Service

Sentiment score
7.7
Amazon EMR support is generally proactive and responsive, though user experiences vary, especially during open-source integrations.
Sentiment score
6.5
Pentaho Business Analytics receives mixed reviews for customer support, with users relying heavily on forums and community assistance.
We get all call support, screen sharing support, and immediate support, so there are no problems.
They help with billing, cost determination, IAM properties, security compliance, and deployment and migration activities.
 

Scalability Issues

Sentiment score
7.4
Amazon EMR is scalable and versatile, though some face resource speed issues and performance differences between environments.
Sentiment score
7.0
Pentaho Business Analytics is scalable with good performance but occasionally needs professional help for complex data handling.
Scalability can be provisioned using the auto-scaling feature, EC2 instances, on-demand instances, and storage locations like block storage, S3, or file storage.
 

Stability Issues

Sentiment score
7.7
Amazon EMR is stable and reliable, with high availability, but could improve slightly to address occasional concerns.
Sentiment score
6.5
Pentaho Business Analytics is stable but may face Java caching issues, impacting performance and requiring careful cache management.
Regular updates, patch installations, monitoring, logging, alerting, and disaster recovery activities are crucial for maintaining stability.
It can handle large datasets.
 

Room For Improvement

Amazon EMR requires improved user-friendliness, stability, monitoring, integration, and pricing adjustments to enhance performance, scalability, and compatibility.
Pentaho Business Analytics lacks an intuitive interface, robust integration, self-service features, and requires technical expertise, limiting usability.
The cost factor differs significantly. When you run Spark application on EKS, you run at the pod level, so you can control the compute cost. But in Amazon EMR, when you have to run one application, you have to launch the entire EC2.
There is room for improvement with respect to retries, handling the volume of data on S3 buckets, cluster provisioning, scaling, termination, security, and integration between services like S3, Glue, Lake Formation, and DynamoDB.
Pentaho Business Analytics is hard to learn and not suited for initial users as it requires knowledge of operating systems, Java, and other technical skills.
 

Setup Cost

Amazon EMR pricing is usage-based, perceived higher, but optimizable through instance management and auto-scaling for Big Data tasks.
Enterprise buyers find the free Pentaho Community Edition cost-effective, while the Enterprise Edition offers value with support and features.
Cost optimization can be achieved through instance usage, cluster sharing, and auto-scaling.
Pentaho Business Analytics is priced similarly to other competitors such as QlikView and Tableau.
 

Valuable Features

Amazon EMR offers scalable, cost-effective data processing with easy integration, advanced features, and robust management on a cloud-based infrastructure.
Pentaho Business Analytics provides easy data integration, customizable dashboards, extensive connectivity, and supports efficient data transformation and delivery.
Amazon EMR helps in scalability, real-time and batch processing of data, handling efficient data sources, and managing data lakes, data stores, and data marts on file systems and in S3 buckets.
Amazon EMR provides out-of-the-box solutions with Spark and Hive.
It is a stable product, and it can handle large datasets.
 

Categories and Ranking

Amazon EMR
Average Rating
7.8
Reviews Sentiment
7.0
Number of Reviews
24
Ranking in other categories
Hadoop (3rd), Cloud Data Warehouse (13th)
Pentaho Business Analytics
Average Rating
8.0
Reviews Sentiment
6.7
Number of Reviews
45
Ranking in other categories
BI (Business Intelligence) Tools (19th), Cloud Operations Analytics (2nd), Reporting (12th)
 

Mindshare comparison

Amazon EMR and Pentaho Business Analytics aren’t in the same category and serve different purposes. Amazon EMR is designed for Hadoop and holds a mindshare of 12.8%, down 14.3% compared to last year.
Pentaho Business Analytics, on the other hand, focuses on BI (Business Intelligence) Tools, holds 0.5% mindshare, down 0.6% since last year.
Hadoop Market Share Distribution
ProductMarket Share (%)
Amazon EMR12.8%
Cloudera Distribution for Hadoop21.9%
Apache Spark19.0%
Other46.3%
Hadoop
BI (Business Intelligence) Tools Market Share Distribution
ProductMarket Share (%)
Pentaho Business Analytics0.5%
Microsoft Power BI14.1%
Tableau Enterprise10.3%
Other75.1%
BI (Business Intelligence) Tools
 

Featured Reviews

Prashant  Singh - PeerSpot reviewer
Seamless data integration enhances reporting efficiency and an easy setup
Amazon EMR has multiple connectors that can connect to various data sources. The service charges are based on processing only, depending on the resources used, which can help save money. It is easy to integrate with other services for storage, allowing data to be shifted to cheaper storage based on usage.
Mir Gulzar Ahmed - PeerSpot reviewer
Excels in handling unstructured data, helping organizations navigate through different storage systems
Pentaho can help organizations by providing them an insight of their unstructured data using one platform(Pentaho Business Analytics). The features are almost identical to other BIS platforms but to me, customers can benefit as it has a community version with most of its Enterprise features. It also has a free limited-period trial version. The other feature that I would like to share here is, that users have access to a complete spectrum of data from different sources with the system’s adaptive big data layer, which takes the source of the data into account. The software is built on an open architecture and can be integrated with multiple systems. However, Pentaho Data Integration and Analytics has been acquired by HDS which offers an Enterprise edition for organizations that also need to meet product compliance.
report
Use our free recommendation engine to learn which Hadoop solutions are best for your needs.
871,358 professionals have used our research since 2012.
 

Comparison Review

it_user6978 - PeerSpot reviewer
Jun 10, 2013
Jaspersoft vs. Pentaho – Which one to use & is there any need to purchase the commercial edition
Any company (be it technology, manfucaturing, human resource, ecommerce, SME etc) always has the need for Business Intelligence to some or the other extent. If cost is one of the consideration factor, then the 2 BI tools which are at the forefront are Pentaho and Jaspersoft. But, often the same…
 

Top Industries

By visitors reading reviews
Financial Services Firm
23%
Computer Software Company
12%
Educational Organization
12%
Healthcare Company
7%
Financial Services Firm
13%
Computer Software Company
9%
Educational Organization
8%
Manufacturing Company
7%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
By reviewers
Company SizeCount
Small Business6
Midsize Enterprise5
Large Enterprise11
By reviewers
Company SizeCount
Small Business22
Midsize Enterprise7
Large Enterprise15
 

Questions from the Community

What do you like most about Amazon EMR?
Amazon EMR is a good solution that can be used to manage big data.
What is your experience regarding pricing and costs for Amazon EMR?
Compared to others, Amazon seems efficient and is considered good for Big Data workloads. Costs are involved based on cluster resources, data volumes, EC2 ( /products/amazon-ec2-reviews ) instances...
What needs improvement with Amazon EMR?
There is room for improvement with respect to retries, handling the volume of data on S3 ( /products/amazon-s3-reviews ) buckets, cluster provisioning, scaling, termination, security, and integrati...
Seeking lightweight open source BI software
There are many...It would rather depend what System BI architecture or Enterprise legacy you have at your end...I would recommend as follows: 1) If you have legacies of SAP, Oracle - look for SAP...
What is your experience regarding pricing and costs for Pentaho Business Analytics?
Pentaho Business Analytics offers the best value for money. While improvements can be made in some areas, particularly with more cloud-based solutions, it is not in their domain because they do not...
What needs improvement with Pentaho Business Analytics?
From an integration perspective, Pentaho Business Analytics is not the best tool on the market. There are things done by Apache that are better, though I am not the one implementing them, so this i...
 

Also Known As

Amazon Elastic MapReduce
Pentaho, Kettle, Hitachi Pentaho Business Analytics
 

Overview

 

Sample Customers

Yelp
Cargo 2000 Lufthansa, Marketo, ModCloth, Cardiac Science, Telefonica, ExactTarget, Active Broadband Networks, and Brussels Airport.
Find out what your peers are saying about Cloudera, Apache, Amazon Web Services (AWS) and others in Hadoop. Updated: September 2025.
871,358 professionals have used our research since 2012.