Amazon EMR vs Pentaho Business Analytics comparison

 

Comparison Buyer's Guide

Executive Summary
 

Categories and Ranking

Amazon EMR
Average Rating
7.8
Number of Reviews
20
Ranking in other categories
Hadoop (3rd), Cloud Data Warehouse (8th)
Pentaho Business Analytics
Average Rating
8.0
Number of Reviews
42
Ranking in other categories
BI (Business Intelligence) Tools (19th), Cloud Operations Analytics (4th), Reporting (17th)
 

Market share comparison

As of June 2024, in the Hadoop category, the market share of Amazon EMR is 14.9% and it decreased by 17.1% compared to the previous year. The market share of Pentaho Business Analytics is 1.3% and it increased by 126.0% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Hadoop
Unique Categories:
Cloud Data Warehouse
4.4%
BI (Business Intelligence) Tools
0.6%
Cloud Operations Analytics
50.0%
 

Featured Reviews

CG
Aug 1, 2023
Suitable for online deployments with a serverless architecture
EMR Serverless is useful for online deployments that require a serverless architecture. We were previously using a server-based architecture, but we have since switched to serverless The product is very well-designed. It is easy to use, and it is very cost-effective.   The solution eliminates…
Mir Gulzar Ahmed - PeerSpot reviewer
Nov 6, 2023
Excels in handling unstructured data, helping organizations navigate through different storage systems
Pentaho can help organizations by providing them an insight of their unstructured data using one platform(Pentaho Business Analytics). The features are almost identical to other BIS platforms but to me, customers can benefit as it has a community version with most of its Enterprise features. It also has a free limited-period trial version. The other feature that I would like to share here is, that users have access to a complete spectrum of data from different sources with the system’s adaptive big data layer, which takes the source of the data into account. The software is built on an open architecture and can be integrated with multiple systems. However, Pentaho Data Integration and Analytics has been acquired by HDS which offers an Enterprise edition for organizations that also need to meet product compliance.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"When we grade big jobs from on-prem to the cloud, we do it in EMR with Spark."
"Amazon EMR is a good solution that can be used to manage big data."
"The initial setup is straightforward."
"We are using applications, such as Splunk, Livy, Hadoop, and Spark. We are using all of these applications in Amazon EMR and they're helping us a lot."
"The initial setup is pretty straightforward."
"The solution is pretty simple to set up."
"One of the valuable features about this solution is that it's managed services, so it's pretty stable, and scalable as much as you wish. It has all the necessary distributions. With some additional work, it's also possible to change to a Spark version with the latest version of EMR. It also has Hudi, so we are leveraging Apache Hudi on EMR for change data capture, so then it comes out-of-the-box in EMR."
"The solution is scalable."
"Easy to use components to create the job."
"The initial setup is pretty straightforward."
"We were able to install it without any assistance from tech support."
"I use the BI Server, CDE Dashboards, Saiku, and Kettle, because these tools are very good and highly experienced."
"Pentaho is an analytics platform that can be used when an organization has a lot of big data storage systems already installed and needs to manage and analyze that data. It has a specific use case for unstructured data, such as documents, and needs to be able to search and analyze it."
"The most valuable feature of Pentaho is the Tableau report."
"Pentaho Business Analytics' best features include the ease of developing data flows and the wide range of options to connect to databases, including those on the cloud."
 

Cons

"The initial setup was time-consuming."
"As people are shifting from legacy solutions to other technologies, Amazon EMR needs to add more features that give more flexibility in managing user data."
"The legacy versions of the solution are not supported in the new versions."
"There were times where they would release new versions and it seemed to end up breaking old versions, which is very strange."
"The dashboard management could be better. Right now, it's lacking a bit."
"Amazon EMR can improve by adding some features, such as megastore services and HiveServer2. Additionally, the user interface could be better, similar to what Apache service provides, cross-platform services."
"There is no need to pay extra for third-party software."
"The problem for us is it starts very slow."
"The repository should be improved."
"Another concern is that Pentaho is not customizable or interactive."
"Pentaho Business Analytics' user interface is outdated."
"Version control would be a good addition."
"Pentaho, at the general level, should greatly improve the easy construction of its dashboards and easy integration of information from different sources without technical user intervention."
"Deployment is not simple. It is not simple because we are dealing with a lot of data; we are dealing with a lot of storage. So, it's not a simple process."
"We did not achieve the ROI. The work delivered to users had lesser value than the subscription cost."
"Logging capability is needed."
 

Pricing and Cost Advice

"Amazon EMR's price is reasonable."
"Amazon EMR is not very expensive."
"You don't need to pay for licensing on a yearly or monthly basis, you only pay for what you use, in terms of underlying instances."
"The price of the solution is expensive."
"There is a small fee for the EMR system, but major cost components are the underlying infrastructure resources which we actually use."
"The product is not cheap, but it is not expensive."
"The cost of Amazon EMR is very high."
"There is no need to pay extra for third-party software."
"Free and commercial versions are available."
"We were lucky enough to find a Pentaho OEM partner who offered a data warehouse model and the ETL software for about 60K SGD per year."
"Pentaho is expensive ."
report
Use our free recommendation engine to learn which Hadoop solutions are best for your needs.
787,061 professionals have used our research since 2012.
 

Comparison Review

it_user6978 - PeerSpot reviewer
Jun 10, 2013
Jaspersoft vs. Pentaho – Which one to use & is there any need to purchase the commercial edition
Any company (be it technology, manfucaturing, human resource, ecommerce, SME etc) always has the need for Business Intelligence to some or the other extent. If cost is one of the consideration factor, then the 2 BI tools which are at the forefront are Pentaho and Jaspersoft. But, often the same…
 

Top Industries

By visitors reading reviews
Financial Services Firm
23%
Computer Software Company
13%
Manufacturing Company
8%
Educational Organization
6%
Financial Services Firm
23%
Computer Software Company
12%
Government
10%
Educational Organization
9%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What do you like most about Amazon EMR?
Amazon EMR is a good solution that can be used to manage big data.
What needs improvement with Amazon EMR?
As people are shifting from legacy solutions to other technologies, Amazon EMR needs to add more features that give more flexibility in managing user data.
Seeking lightweight open source BI software
There are many...It would rather depend what System BI architecture or Enterprise legacy you have at your end...I would recommend as follows: 1) If you have legacies of SAP, Oracle - look for SAP...
What is your experience regarding pricing and costs for Pentaho Business Analytics?
The organization has both options based on their needs and budget constraints. The Enterprise Edition is expensive with references to an added number of features.
What needs improvement with Pentaho Business Analytics?
The product to me is not as user-friendly as other players in the market. It also still needs improvement in the reporting module. You will need to search for deployment examples or need to have a ...
 

Also Known As

Amazon Elastic MapReduce
Pentaho, Kettle, Hitachi Pentaho Business Analytics
 

Overview

 

Sample Customers

Yelp
Cargo 2000 Lufthansa, Marketo, ModCloth, Cardiac Science, Telefonica, ExactTarget, Active Broadband Networks, and Brussels Airport.
Find out what your peers are saying about Apache, Cloudera, Amazon Web Services (AWS) and others in Hadoop. Updated: May 2024.
787,061 professionals have used our research since 2012.