Cloudera Distribution for Hadoop vs Pentaho Business Analytics comparison

Cancel
You must select at least 2 products to compare!
Comparison Buyer's Guide
Executive Summary

We performed a comparison between Cloudera Distribution for Hadoop and Pentaho Business Analytics based on real PeerSpot user reviews.

Find out what your peers are saying about Apache, Cloudera, Amazon Web Services (AWS) and others in Hadoop.
To learn more, read our detailed Hadoop Report (Updated: April 2024).
767,995 professionals have used our research since 2012.
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"I don't see any performance issues.""We're now able to store large volumes of data through Cloudera Distribution for Hadoop. We're able to push large volumes of data to the platform, and that used to be a challenge, especially when storing a terabyte of information. This is the area where Cloudera Distribution for Hadoop improved the organization.""The scalability of Cloudera Distribution for Hadoop is excellent.""In terms of scalability, if you have enough hardware you can scale out. Scalability doesn't have any issues.""The solution's most valuable feature is the enterprise data platform.""The tool can be deployed using different container technologies, which makes it very scalable.""Cloudera is a very manageable solution with good support.""It is helpful to gather and process data."

More Cloudera Distribution for Hadoop Pros →

"We were able to install it without any assistance from tech support.""Pentaho Business Analytics' best features include the ease of developing data flows and the wide range of options to connect to databases, including those on the cloud.""The initial setup is pretty straightforward.""I use the BI Server, CDE Dashboards, Saiku, and Kettle, because these tools are very good and highly experienced.""Easy to use components to create the job.""The most valuable feature of Pentaho is the Tableau report.""Pentaho is an analytics platform that can be used when an organization has a lot of big data storage systems already installed and needs to manage and analyze that data. It has a specific use case for unstructured data, such as documents, and needs to be able to search and analyze it."

More Pentaho Business Analytics Pros →

Cons
"It could be faster and more user-friendly.""The pricing needs to improve.""They should focus on upgrading their technical capabilities in the market.""The security of this solution could be improved. There should also be a way to basically have a blockchain enabled storage with the HDFS.""The user infrastructure and user interface needs to be improved, as well as the performance. The GUI needs to be better.""Without the big data environment, we cannot store all of this data live. We have billions of records and terabytes of storage to be used. It's not an option actually for us to have a big data environment.""The price of this solution could be lowered.""The solution does not support multiple languages very well and this means users need to create work-arounds to implement some solutions."

More Cloudera Distribution for Hadoop Cons →

"Another concern is that Pentaho is not customizable or interactive.""Pentaho, at the general level, should greatly improve the easy construction of its dashboards and easy integration of information from different sources without technical user intervention.""Logging capability is needed.""Deployment is not simple. It is not simple because we are dealing with a lot of data; we are dealing with a lot of storage. So, it's not a simple process.""The repository should be improved.""Version control would be a good addition.""We did not achieve the ROI. The work delivered to users had lesser value than the subscription cost.""Pentaho Business Analytics' user interface is outdated."

More Pentaho Business Analytics Cons →

Pricing and Cost Advice
  • "When comparing with Oracle Sybase and SQL, it's cheaper. It's not expensive."
  • "The price could be better for the product."
  • "I haven't bought a license for this solution. I'm only using the Apache license version."
  • "Cloudera requires a license to use."
  • "Cloudera Distribution for Hadoop is expensive, with support costs involved."
  • "I wouldn't recommend CDH to others because of its high cost."
  • "The price is very high. The solution is expensive."
  • "The solution is expensive."
  • More Cloudera Distribution for Hadoop Pricing and Cost Advice →

  • "Free and commercial versions are available."
  • "Pentaho is expensive ."
  • "We were lucky enough to find a Pentaho OEM partner who offered a data warehouse model and the ETL software for about 60K SGD per year."
  • More Pentaho Business Analytics Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which Hadoop solutions are best for your needs.
    767,995 professionals have used our research since 2012.
    Comparison Review
    Anonymous User
    Any company (be it technology, manfucaturing, human resource, ecommerce, SME etc) always has the need for Business Intelligence to some or the other extent. If cost is one of the consideration factor, then the 2 BI tools which are at the forefront are Pentaho and Jaspersoft. But, often the same companies are caught up in an imbrogilo as to which tool to use, what are the technology/and end business user wise differences/ do i actually need to purchase commercial edition, is there any work around etc. Differences :- In the below mentioned points, I have tried to cover functionality wise the differences a. Reports :- Jaspersoft is known for its picture pixel perfect reporting. Jasper uses ireport for designing the reports. Hence, for having reports, Jaspersoft is the most ideal candidate. Pentaho uses Pentaho Report Designer. b. Dashboards :- Pentaho provides much more capabililties, interactivity in terms of dashboards. Dashboards designed in Pentaho are far more superior in functionality, aesthetically as compared to Jaspersoft. Pentaho CE uses CDE/CDF, Pentaho EE uses PDD . Dashboard functionality is present only in the Enterprise edition of Jaspersoft. c. Pentaho is having an intermediate layer known as Xactions & hence providing much more flexibility in terms of plugin designing, integration with applications, having out of box experience etc. Xactions supports scripting and scheduling of scripts execution. Jaspersoft dosent provide that much of flexibility in terms of… Read more →
    Questions from the Community
    Top Answer:The tool can be deployed using different container technologies, which makes it very scalable.
    Top Answer:The tool is expensive. Overall, it's not a cheap software tool, and that is why only large enterprises who are mature enough and have an architecture that is complex enough opt for Cloudera, as its… more »
    Top Answer:The tool's ability to be deployed on a cloud model is an area of concern where improvements are required. The tool works very well when deployed on an on-premises model. The deployment on a cloud… more »
    Top Answer:There are many...It would rather depend what System BI architecture or Enterprise legacy you have at your end...I would recommend as follows:  1) If you have legacies of SAP, Oracle  - look for SAP… more »
    Top Answer:The organization has both options based on their needs and budget constraints. The Enterprise Edition is expensive with references to an added number of features.
    Top Answer:The product to me is not as user-friendly as other players in the market. It also still needs improvement in the reporting module. You will need to search for deployment examples or need to have a… more »
    Ranking
    2nd
    out of 22 in Hadoop
    Views
    2,959
    Comparisons
    2,278
    Reviews
    14
    Average Words per Review
    409
    Rating
    8.1
    Views
    1,175
    Comparisons
    854
    Reviews
    4
    Average Words per Review
    526
    Rating
    7.0
    Comparisons
    Also Known As
    Pentaho, Kettle, Hitachi Pentaho Business Analytics
    Learn More
    Overview
    Cloudera Distribution for Hadoop is the world's most complete, tested, and popular distribution of Apache Hadoop and related projects. CDH is 100% Apache-licensed open source and is the only Hadoop solution to offer unified batch processing, interactive SQL, and interactive search, and role-based access controls. More enterprises have downloaded CDH than all other such distributions combined.

    Pentaho is an open source business intelligence company that provides a wide range of tools to help their customers better manage their businesses. These tools include data integration software, mining tools, dashboard applications, online analytical processing options, and more.

    Pentaho has two product categories: There is the standard enterprise version. This is the product that comes directly from Pentaho itself with all of the benefits, features, and programs that come along with a paid application such us analysis services, dashboard design, and interactive reporting.

    The alternative is an open source version, which the public is permitted to add to and tweak the product. This solution has its advantages, aside from the fact that it is free, in that there are many more people working on the project to improve its quality and breadth of functionality.

    Sample Customers
    37signals, Adconion,adgooroo, Aggregate Knowledge, AMD, Apollo Group, Blackberry, Box, BT, CSC
    Cargo 2000 Lufthansa, Marketo, ModCloth, Cardiac Science, Telefonica, ExactTarget, Active Broadband Networks, and Brussels Airport.
    Top Industries
    REVIEWERS
    Financial Services Firm25%
    Computer Software Company21%
    Insurance Company14%
    Comms Service Provider11%
    VISITORS READING REVIEWS
    Financial Services Firm22%
    Computer Software Company16%
    Educational Organization8%
    Manufacturing Company8%
    REVIEWERS
    Computer Software Company19%
    University13%
    Financial Services Firm13%
    Educational Organization6%
    VISITORS READING REVIEWS
    Financial Services Firm21%
    Government12%
    Computer Software Company11%
    Educational Organization9%
    Company Size
    REVIEWERS
    Small Business28%
    Midsize Enterprise17%
    Large Enterprise55%
    VISITORS READING REVIEWS
    Small Business17%
    Midsize Enterprise9%
    Large Enterprise74%
    REVIEWERS
    Small Business50%
    Midsize Enterprise17%
    Large Enterprise33%
    VISITORS READING REVIEWS
    Small Business27%
    Midsize Enterprise13%
    Large Enterprise60%
    Buyer's Guide
    Hadoop
    April 2024
    Find out what your peers are saying about Apache, Cloudera, Amazon Web Services (AWS) and others in Hadoop. Updated: April 2024.
    767,995 professionals have used our research since 2012.

    Cloudera Distribution for Hadoop is ranked 2nd in Hadoop with 47 reviews while Pentaho Business Analytics is ranked 21st in BI (Business Intelligence) Tools with 42 reviews. Cloudera Distribution for Hadoop is rated 8.0, while Pentaho Business Analytics is rated 8.0. The top reviewer of Cloudera Distribution for Hadoop writes "Good end-to-end security features and we like that it's cloud independent". On the other hand, the top reviewer of Pentaho Business Analytics writes "Flexible, easy to understand, and simple to set up". Cloudera Distribution for Hadoop is most compared with Amazon EMR, HPE Ezmeral Data Fabric, Apache Spark and MongoDB, whereas Pentaho Business Analytics is most compared with Microsoft Power BI, Databricks, Microsoft SQL Server Reporting Services, SAP Crystal Reports and TIBCO Jaspersoft.

    We monitor all Hadoop reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.