Try our new research platform with insights from 80,000+ expert users

Apache HBase vs Cloudera Distribution for Hadoop comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Jan 7, 2025

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Apache HBase
Ranking in NoSQL Databases
10th
Average Rating
6.0
Reviews Sentiment
6.3
Number of Reviews
2
Ranking in other categories
No ranking in other categories
Cloudera Distribution for H...
Ranking in NoSQL Databases
8th
Average Rating
8.0
Reviews Sentiment
6.4
Number of Reviews
50
Ranking in other categories
Hadoop (2nd)
 

Mindshare comparison

As of August 2025, in the NoSQL Databases category, the mindshare of Apache HBase is 5.6%, up from 4.9% compared to the previous year. The mindshare of Cloudera Distribution for Hadoop is 2.4%, down from 2.5% compared to the previous year. It is calculated based on PeerSpot user engagement data.
NoSQL Databases
 

Featured Reviews

Sekhar Reddy B - PeerSpot reviewer
Offers real-time aggregations and easy for a beginner to learn to use this
We use it for real-time data grouping The most valuable part is the column family structure. We mainly use it for real-time aggregations. That's why we prefer it as a NoSQL database. We've seen performance issues when we have more regions. The product needs improvement in that area. So we…
Rok Dolinsek - PeerSpot reviewer
Enables on-premise implementation with powerful data processing capabilities
This is the only solution that is possible to install on-premise. Cloudera provides a hybrid solution that combines compute on cloud or on-premises. It includes all machine learning algorithms in the Spark machine learning library. All functionalities needed for a big data platform and ETL are on the platform, eliminating the need for other tools. It is scalable, ready for vertical scaling, and very powerful, offering numerous functionalities and configurations for generative AI.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The most valuable part is the column family structure."
"Apache HBase is a database used for data storage."
"We also really like the Cloudera community. You can have any question and will have your answer within a few hours."
"The tool's most interesting features are the distributed file system and unstructured data processing capability. Because we have a lot of unstructured data, like XML and social media logs, these features make it more valuable than the usual data warehousing solutions."
"The product provides better data processing features than other tools."
"We experienced many issues when we started working with Hadoop 3.0 in the Cloudera 6.0 version, so there are a lot of things that need to improve. I believe they are working on that."
"The data science aspect of the solution is valuable."
"With a cluster available, you can manage the security layer using the shared SDX - it provides flexibility."
"In terms of scalability, if you have enough hardware you can scale out. Scalability doesn't have any issues."
"CDH has a wide variety of proprietary tools that we use, like Impala. So from that perspective, it's quite useful as opposed to something open-source. We get a lot of value from Cloudera's proprietary tools."
 

Cons

"We've seen performance issues."
"I don't like using Apache HBase to store huge amounts of data because of many performance issues."
"The procedure for operations could be simplified."
"The user infrastructure and user interface needs to be improved, as well as the performance. The GUI needs to be better."
"Without the big data environment, we cannot store all of this data live. We have billions of records and terabytes of storage to be used. It's not an option actually for us to have a big data environment."
"While the deployed product is generally functional, there are instances where it presents difficulties."
"It is quite complicated to configure and install."
"The security of this solution could be improved. There should also be a way to basically have a blockchain enabled storage with the HDFS."
"Cloudera's support is extremely bad and cannot be relied on."
"They should focus on upgrading their technical capabilities in the market."
 

Pricing and Cost Advice

Information not available
"I wouldn't recommend CDH to others because of its high cost."
"I haven't bought a license for this solution. I'm only using the Apache license version."
"The tool is expensive...For the SMB market or customers whose environments are not that complex and do not have multiple systems running, Cloudera might not be a good option."
"Cloudera Distribution for Hadoop is expensive, with support costs involved."
"The price is very high. The solution is expensive."
"The price could be better for the product."
"I believe we pay for a three-year license."
"Cloudera requires a license to use."
report
Use our free recommendation engine to learn which NoSQL Databases solutions are best for your needs.
864,155 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
22%
Computer Software Company
10%
Educational Organization
8%
Manufacturing Company
7%
Financial Services Firm
20%
Educational Organization
16%
Computer Software Company
12%
Energy/Utilities Company
6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
 

Questions from the Community

What do you like most about Apache HBase?
Apache HBase is a database used for data storage.
What needs improvement with Apache HBase?
We've seen performance issues when we have more regions. The product needs improvement in that area. So we experience performance issues sometimes when the load increases.
What advice do you have for others considering Apache HBase?
It's better to use AWS DynamoDB or Cassandra. I would rate it an eight out of ten. It is easy for a beginner to learn.
What do you like most about Cloudera Distribution for Hadoop?
The tool can be deployed using different container technologies, which makes it very scalable.
What is your experience regarding pricing and costs for Cloudera Distribution for Hadoop?
The price for Cloudera is average, yet it is very good compared to other solutions. It can be deployed on-premises, unlike competitors' cloud-only solutions.
What needs improvement with Cloudera Distribution for Hadoop?
It is quite complicated to configure and install. Integrating the platform into an information system is always a challenge, especially when starting with on-premise implementation. Integrating wit...
 

Also Known As

HBase
No data available
 

Overview

 

Sample Customers

Bloomberg, Wells Fargo, Apple, Capital One, NVIDIA
37signals, Adconion,adgooroo, Aggregate Knowledge, AMD, Apollo Group, Blackberry, Box, BT, CSC
Find out what your peers are saying about Apache HBase vs. Cloudera Distribution for Hadoop and other solutions. Updated: July 2025.
864,155 professionals have used our research since 2012.