
Apache Spark vs Pentaho Business Analytics comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

ROI

Sentiment score: 7.3
Apache Spark reduces operational costs by up to 50%, offering high ROI and efficient performance despite infrastructure expenses.
Sentiment score: 5.9
Pentaho Business Analytics enhanced efficiency and output but faced ROI calculation challenges compared to alternatives like QlikView and Tableau.
 

Customer Service

Sentiment score: 6.1
Apache Spark support ranges from vibrant community help to paid vendor plans, with experiences varying based on user needs.
Sentiment score: 6.8
Pentaho receives mixed feedback on support, with strong community engagement but varying customer service and documentation quality.
 

Scalability Issues

Sentiment score: 7.7
Apache Spark is scalable, efficiently manages large workloads, and is praised for stability, adaptability, and expansive capabilities.
Sentiment score: 7.2
Pentaho scales well and fits organizational needs but may require admin expertise for complex environments and large data.
 

Stability Issues

Sentiment score: 7.5
Apache Spark is stable and reliable, with improved versions addressing issues, widely used by major tech companies.
Sentiment score: 6.6
Pentaho Business Analytics is generally stable but has occasional Java caching, RAM, and subprocess issues with large databases.
 

Room For Improvement

Pentaho needs user interface, reporting, and visualization enhancements, along with better debugging, metadata management, and cost-effective licensing.
Pentaho Business Analytics is hard to learn and not suited for initial users as it requires knowledge of operating systems, Java, and other technical skills.
 

Setup Cost

Pentaho's Community Edition is cost-free, while the Enterprise Edition, offering enhanced features, can cost up to $250,000 annually.
Pentaho Business Analytics is priced similarly to other competitors such as QlikView and Tableau.
 

Valuable Features

Pentaho Business Analytics offers powerful ETL, user-friendly interface, customizable dashboards, and cost-saving, open-source analytics and reporting features.
It is a stable product, and it can handle large datasets.
 

Categories and Ranking

Apache Spark
Average Rating: 8.4
Reviews Sentiment: 7.4
Number of Reviews: 66
Ranking in other categories: Hadoop (1st), Compute Service (4th), Java Frameworks (2nd)
Pentaho Business Analytics
Average Rating: 8.0
Reviews Sentiment: 6.8
Number of Reviews: 44
Ranking in other categories: BI (Business Intelligence) Tools (20th), Cloud Operations Analytics (5th), Reporting (13th)
 

Mindshare comparison

Apache Spark and Pentaho Business Analytics aren’t in the same category and serve different purposes. Apache Spark is ranked in the Hadoop category and holds a mindshare of 17.7%, down from 21.1% last year.
Pentaho Business Analytics, on the other hand, is ranked in BI (Business Intelligence) Tools and holds a mindshare of 0.5%, down from 0.6% last year.
 

Featured Reviews

Dunstan Matekenya - PeerSpot reviewer
Open-source solution for data processing with portability
Apache Spark is known for its ease of use. Compared to other available data processing frameworks, it is user-friendly. While many choices now exist, Spark remains easy to use, particularly with Python. You can utilize familiar programming styles similar to Pandas in Python, including object-oriented programming. Another advantage is its portability. I can prototype and perform some initial tasks on my laptop using Spark without needing to be on Databricks or any cloud platform. I can transfer it to Databricks or other platforms, such as AWS. This flexibility allows me to improve processing even on my laptop. For instance, if I'm processing large amounts of data and find my laptop becoming slow, I can quickly switch to Spark. It handles small and large datasets efficiently, making it a versatile tool for various data processing needs.
Mir Gulzar Ahmed - PeerSpot reviewer
Excels in handling unstructured data, helping organizations navigate through different storage systems
Pentaho can help organizations by giving them insight into their unstructured data on a single platform (Pentaho Business Analytics). The features are almost identical to those of other BI platforms, but in my view customers benefit because it has a community version with most of the Enterprise features. It also has a free, limited-period trial version. Another feature worth mentioning is that users have access to a complete spectrum of data from different sources through the system’s adaptive big data layer, which takes the source of the data into account. The software is built on an open architecture and can be integrated with multiple systems. However, Pentaho Data Integration and Analytics has been acquired by HDS, which offers an Enterprise edition for organizations that also need to meet product compliance.
856,873 professionals have used our research since 2012.
 

Comparison Review

it_user6978 - PeerSpot reviewer
Jun 10, 2013
Jaspersoft vs. Pentaho – Which one to use & is there any need to purchase the commercial edition
Any company (be it technology, manufacturing, human resources, e-commerce, SME, etc.) needs Business Intelligence to some extent. If cost is one of the deciding factors, then the two BI tools at the forefront are Pentaho and Jaspersoft. But, often the same…
 

Top Industries

By visitors reading reviews
Apache Spark
Financial Services Firm: 27%
Computer Software Company: 13%
Manufacturing Company: 7%
Comms Service Provider: 6%
Pentaho Business Analytics
Financial Services Firm: 20%
Computer Software Company: 14%
Educational Organization: 8%
Real Estate/Law Firm: 7%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What do you like most about Apache Spark?
We use Spark to process data from different data sources.
What is your experience regarding pricing and costs for Apache Spark?
Apache Spark is open-source, so it doesn't incur any charges.
What needs improvement with Apache Spark?
There is complexity when it comes to understanding the whole ecosystem, especially for beginners. I find it quite complex to understand how a Spark job is initiated, the roles of driver nodes, work...
Seeking lightweight open source BI software
There are many...It would rather depend what System BI architecture or Enterprise legacy you have at your end...I would recommend as follows: 1) If you have legacies of SAP, Oracle - look for SAP...
What is your experience regarding pricing and costs for Pentaho Business Analytics?
Pentaho Business Analytics is priced similarly to other competitors such as QlikView and Tableau. I usually use the community edition.
What needs improvement with Pentaho Business Analytics?
Pentaho Business Analytics is hard to learn and not suited for initial users as it requires knowledge of operating systems, Java, and other technical ...
 

Also Known As

Apache Spark: No data available
Pentaho Business Analytics: Pentaho, Kettle, Hitachi Pentaho Business Analytics
 

Overview

 

Sample Customers

Apache Spark: NASA JPL, UC Berkeley AMPLab, Amazon, eBay, Yahoo!, UC Santa Cruz, TripAdvisor, Taboola, Agile Lab, Art.com, Baidu, Alibaba Taobao, EURECOM, Hitachi Solutions
Pentaho Business Analytics: Cargo 2000 Lufthansa, Marketo, ModCloth, Cardiac Science, Telefonica, ExactTarget, Active Broadband Networks, and Brussels Airport
Find out what your peers are saying about Apache, Cloudera, Amazon Web Services (AWS) and others in Hadoop. Updated: June 2025.