
Cassandra vs Cloudera Distribution for Hadoop comparison

 

Comparison Buyer's Guide

Executive Summary
Updated on Jan 7, 2025

 

Categories and Ranking

Cassandra
Ranking in NoSQL Databases: 6th
Average Rating: 8.0
Reviews Sentiment: 6.0
Number of Reviews: 25
Ranking in other categories: Vector Databases (12th)

Cloudera Distribution for H...
Ranking in NoSQL Databases: 10th
Average Rating: 8.0
Reviews Sentiment: 6.3
Number of Reviews: 51
Ranking in other categories: Hadoop (2nd)
 

Mindshare comparison

As of June 2026, Cassandra's mindshare in the NoSQL Databases category is 8.3%, down from 10.6% the previous year. The mindshare of Cloudera Distribution for Hadoop is 5.5%, up from 2.2% the previous year. Mindshare is calculated from PeerSpot user engagement data.
NoSQL Databases Mindshare Distribution

| Product | Mindshare (%) |
| --- | --- |
| Cassandra | 8.3% |
| Cloudera Distribution for Hadoop | 5.5% |
| Other | 86.2% |
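The figures above can be cross-checked with a little arithmetic. The following is a minimal sketch, not PeerSpot's actual calculation: it assumes the listed shares and "Other" sum to 100%, and the helper names `other_share` and `point_change` are hypothetical, introduced only for illustration.

```python
from decimal import Decimal

# Mindshare figures quoted above, in percent.
# Strings are used so the decimal arithmetic stays exact.
current = {
    "Cassandra": Decimal("8.3"),
    "Cloudera Distribution for Hadoop": Decimal("5.5"),
}
previous = {
    "Cassandra": Decimal("10.6"),
    "Cloudera Distribution for Hadoop": Decimal("2.2"),
}


def other_share(shares):
    """Share left for all unlisted products, assuming the total is 100%."""
    return Decimal("100") - sum(shares.values(), Decimal("0"))


def point_change(now, before):
    """Year-over-year change in percentage points for each product."""
    return {name: now[name] - before[name] for name in now}


if __name__ == "__main__":
    print("Other:", other_share(current))
    for name, delta in point_change(current, previous).items():
        print(name, delta)
```

Under that assumption, the remainder matches the 86.2% reported for "Other", and the year-over-year changes work out to -2.3 points for Cassandra and +3.3 points for Cloudera Distribution for Hadoop.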
 

Featured Reviews

Monirul Islam Khan - PeerSpot reviewer
Head, Data Integration & Management at a non-profit with 10,001+ employees
Has maintained secure document storage and efficient data distribution with peer-to-peer architecture
The features in Cassandra that I have found most valuable come from it being a distributed system similar to Mongo. It compares well enough with a SQL database, and it is smooth and organized for a distributed database system. The peer-to-peer architecture in Cassandra is helpful for network decentralization, and I have already introduced that feature. Cassandra offers peer-to-peer architecture as well as monitoring, so it is good enough for our service. The tunable consistency level in Cassandra is good, and we are already using that feature. In terms of built-in caching and lightweight transactions, the transaction level is good and optimized, so we have no further issues with the database. Based on my experience, Cassandra is good for a document management system as well as a distributed database system, and it has an automatic recovery process. Additionally, its database monitoring and auditing are comparable with other database systems, so we are happy to be using Cassandra.
SA
Head of Advanced Analytics & Intelligence; AGM at Alinma Bank
Integration of multiple features supports data analytics and processing
Cloudera Distribution for Hadoop provides numerous features and capabilities combined into one platform. The solution offers power processing and supports different file systems and query engines. It provides parallel processing for handling many requests. The platform includes role-based access control, which secures the data itself and gives users different roles and privileges.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The most valuable features of Cassandra are the NoSQL database, high performance, and zero-copy streaming."
"I'd rate the solution ten out of ten."
"Its retrieval is similar to an RDBMS, so our team finds it easy to adapt."
"Ability to achieve write speeds of 10k TPS: compared to our existing solution, that is 300 percent higher."
"The most valuable features of this solution are its speed and distributed nature."
"A consistent solution."
"If you need availability and consistency, you can go with Cassandra."
"Based on my experience, Cassandra is good for document management system, as well as distributed database system, and the automatic recovery process is there."
"For the clusters using CM, we are able to more tightly control and manage the configuration of all nodes in the clusters."
"Cloudera is one of the best solutions for on-prem."
"The file system is a valuable feature."
"It has been helpful in allowing data storage in one centralized location with data lakes and all of the surrounding applications."
"We were able to utilize data which was untapped previously."
"We also really like the Cloudera community. You can have any question and will have your answer within a few hours."
"Very good end-to-end security features."
"CDH has a wide variety of proprietary tools that we use, like Impala, and from that perspective, it's quite useful as opposed to something open-source, as we get a lot of value from Cloudera's proprietary tools."
 

Cons

"The secondary index in Cassandra was a bit problematic and could be improved."
"The solution doesn't have joins between tables so you need other tools for that."
"The initial setup of Cassandra can be difficult to configure, and you might need assistance. The implementation process can take six months for connecting to certain databases."
"If you have a requirement for aggregation and joins, Cassandra doesn't offer a solution that provides aggregation."
"Cassandra could be more user-friendly like MongoDB."
"The solution is limited to a linear performance."
"The solution is not easy to use because it is a big database and you have to learn the interface. This is the case though in most of these solutions."
"We found some issues with the batch inserts when the data volume is large."
"On the same ground, I didn't see much training material from Cloudera."
"There are better solutions out there that have more features than this one."
"The tool doesn't support reporting, and relational databases are still the major source of reporting data. Apache Iceberg will be launched soon within the Cloudera cluster for analytical purposes. The Cloudera Machine Learning aspect could be tuned and enhanced to enable us to host some predictive analytics machine learning and AI use cases."
"The security of this solution could be improved. There should also be a way to have blockchain-enabled storage with HDFS."
"I would like to see an improvement in how the solution helps me to handle the whole cluster."
"The performance can be improved. We have experienced some performance issues."
"The security of this solution could be improved."
"The tool's ability to be deployed on a cloud model is an area of concern where improvements are required."
 

Pricing and Cost Advice

"Cassandra is a free open source solution, but there is a commercial version available called DataStax Enterprise."
"I don't have the specific numbers on pricing, but it was fairly priced."
"We pay for a license."
"I use the tool's open-source version."
"We are using the open-source version of Cassandra, the solution is free."
"There are licensing fees that must be paid, but I'm not sure if they are paid monthly or yearly."
"The price is very high. The solution is expensive."
"Cloudera requires a license to use."
"The tool is expensive... For the SMB market or customers whose environments are not that complex and do not have multiple systems running, Cloudera might not be a good option."
"It is an expensive product."
"I wouldn't recommend CDH to others because of its high cost."
"I believe we pay for a three-year license."
"The tool is not expensive."
"Compared with Oracle, Sybase, and SQL, it's cheaper. It's not expensive."
Use our free recommendation engine to learn which NoSQL Databases solutions are best for your needs.
899,052 professionals have used our research since 2012.
 

Top Industries

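By visitors reading reviews

Cassandra

| Industry | Visitors (%) |
| --- | --- |
| Financial Services Firm | 16% |
| Comms Service Provider | 8% |
| Computer Software Company | 6% |
| Construction Company | 6% |

Cloudera Distribution for Hadoop

| Industry | Visitors (%) |
| --- | --- |
| Financial Services Firm | 22% |
| Construction Company | 10% |
| Marketing Services Firm | 8% |
| Manufacturing Company | 6% |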
 

Company Size

By reviewers

Cassandra

| Company Size | Count |
| --- | --- |
| Small Business | 9 |
| Midsize Enterprise | 2 |
| Large Enterprise | 14 |

Cloudera Distribution for Hadoop

| Company Size | Count |
| --- | --- |
| Small Business | 16 |
| Midsize Enterprise | 9 |
| Large Enterprise | 32 |
 

Questions from the Community

What is your experience regarding pricing and costs for Cassandra?
The pricing for Cassandra is a little bit high, so it would be better for our community services if they consider community pricing for any non-profit organization like an NGO or other things. It w...
What needs improvement with Cassandra?
Regarding areas of improvement for Cassandra, currently, we are not facing significant issues. Some issues arise from our vendors like Apache slowness and distribution or load balancing from HAProx...
What is your primary use case for Cassandra?
My use case for Cassandra is for a document and other unstructured data management system as well as structured data for ultra-poor member community edition, community members' PII information, so ...
What is your experience regarding pricing and costs for Cloudera Distribution for Hadoop?
The price for Cloudera is average, yet it is very good compared to other solutions. It can be deployed on-premises, unlike competitors' cloud-only solutions.
What needs improvement with Cloudera Distribution for Hadoop?
If they could support modifying the data more easily than the current implementation, it would be beneficial.
What is your primary use case for Cloudera Distribution for Hadoop?
We use Cloudera Distribution for Hadoop for many use cases including analytics, storing huge data sets, and various data processing tasks.
 

Overview

 

Sample Customers

Cassandra: Apple, Netflix, Facebook, Instagram, Twitter, eBay, Spotify, Uber, Airbnb, Adobe, Cisco, IBM, Microsoft, Yahoo, Reddit, Pinterest, Salesforce, LinkedIn, Hulu, Walmart, Target, Sony, Intel, HP, Oracle, SAP, GE, Siemens, Volkswagen, Toyota
Cloudera Distribution for Hadoop: 37signals, Adconion, adgooroo, Aggregate Knowledge, AMD, Apollo Group, Blackberry, Box, BT, CSC
Find out what your peers are saying about Cassandra vs. Cloudera Distribution for Hadoop and other solutions. Updated: April 2026.