
Cassandra vs Cloudera Distribution for Hadoop vs Couchbase Enterprise comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Mindshare comparison

As of August 2025, in the NoSQL Databases category, the mindshare of Cassandra is 9.7%, down from 13.4% the previous year. The mindshare of Cloudera Distribution for Hadoop is 2.4%, down from 2.5% the previous year. The mindshare of Couchbase Enterprise is 8.7%, down from 12.2% the previous year. Mindshare is calculated from PeerSpot user engagement data.
 

Featured Reviews

Himanshu Amodwala - PeerSpot reviewer
Well-equipped to handle a massive influx of data and billions of requests
The use of Cassandra in real-time data analytics has been pivotal for our e-commerce platform. As our platform operates 24/7, providing services to sellers and customers alike, the need for real-time updates is paramount. For instance, when a customer leaves comments or feedback on an image, they anticipate an immediate reflection of these changes on the portal. Similarly, sellers altering product attributes or updating images expect instant visibility of these modifications. Handling large data volumes with Cassandra has been an excellent experience. Despite challenges related to the influx, these were not attributed to Cassandra itself but rather to middle-layer issues. Generally, it demonstrated scalability with workloads, thanks to its horizontal scaling capabilities. We could easily add new nodes to the system as needed, ensuring the platform coped well with increasing loads. The tool's most beneficial feature for scalability is its entire architecture. The absence of a single point of failure or a leader within the ecosystem contributes to its robust scalability. This key aspect influenced our decision to opt for the Cassandra ecosystem. In terms of performance, it demonstrated the ability to handle approximately 1.6 billion requests per day. This was achieved on AWS using EC2 instances, and it was during a period about four to five years ago.
Rok Dolinsek - PeerSpot reviewer
Enables on-premise implementation with powerful data processing capabilities
This is the only solution that is possible to install on-premise. Cloudera provides a hybrid solution that combines compute on cloud or on-premises. It includes all machine learning algorithms in the Spark machine learning library. All functionalities needed for a big data platform and ETL are on the platform, eliminating the need for other tools. It is scalable, ready for vertical scaling, and very powerful, offering numerous functionalities and configurations for generative AI.
Ravi_Singh - PeerSpot reviewer
Supports multiple data models and offers AI capabilities
With some of the operations, we used to face some challenges with scalability. Although it worked pretty well, in some scenarios, we noticed issues where the replications and the sharding were not happening correctly. In recent versions, we also faced some issues in terms of enabling advanced operations like FTS and vectors. Although it works pretty well, in some places, we do face challenges, especially on a heavy scale. I think all issues are being addressed in the latest version of Couchbase. The resources are not that good for Couchbase. The tool's documentation is pretty extensive, but if you go for any kind of courses or tutorials, there are very limited resources available. It also becomes a little bit challenging for new people to get onboard into it. MongoDB and other such open-source database tools perform really well as they're really widely adopted, and they have resources available to get you onboarded pretty quickly. I think that we do face some challenges with Couchbase, but luckily, we have the tool's enterprise version solution, so we get all the support from the product team.

Quotes from Members

 

Pros

"A consistent solution."
"The solution's database capabilities are very good."
"I'd rate the solution ten out of ten."
"Since I haven't had years of experience with it, it's still new to me. One valuable feature is its distribution, so I can run it partly in the cloud and part on-prem. That's a feature I'd like to use but haven't yet because we're trying to move to Azure. I don't know if or when that will happen. Ideally, we'd have it distributed over the cloud and on-prem simultaneously, so if something happens to our on-prem, we can keep going in the cloud, like a pay-as-you-go model with Azure."
"The most valuable features of Cassandra are its scaling capabilities and its non-SQL nature capabilities."
"I am satisfied with the performance."
"Some of the valued features of this solution are it has good performance and failover."
"The use of Cassandra in real-time data analytics has been pivotal for our e-commerce platform. As our platform operates 24/7, providing services to sellers and customers alike, the need for real-time updates is paramount."
"Cloudera, as a whole, is designed to provide organizations with solutions for big data."
"This is the only solution that is possible to install on-premise."
"Customer service and support were able to fix whatever the issue was."
"The product as a whole is good."
"It has the best proxy, security, and support features compared to open-source products."
"The search function is the most valuable aspect of the solution."
"The product is completely secure."
"With a cluster available, you can manage the security layer using the shared SDX - it provides flexibility."
"Couchbase was a stable solution for us."
"The valuable features of Couchbase are the many documents and index types, and they made a lot of features available enabling us to use it as a complete solution for our needs."
"Couchbase has not given any performance problems as of now."
"The main advantages were associated with it being a no SQL database. It helped us send out metrics or rewards to multiple players in our game at a very low latency."
"The best thing about Couchbase is its versatility in handling data."
"The most valuable features are the ease of application and the merging of data."
"I rate Couchbase a nine out of ten."
"The most valuable feature of Couchbase is document indexing. It is better than MongoDB. Additionally, the solution is easy to use."
 

Cons

"Interface is not user friendly."
"While Cassandra can handle NoSQL, I think there should be more flexibility for whole schema design when data is stored in wide columns. Additionally, I believe that eventual consistency should be enhanced."
"The solution is limited to a linear performance."
"Cassandra is very complex to manage. Sometimes, I need to involve a senior DevOps engineer if we encounter a problem."
"We experience configuration issues when accommodating the volumes we require, which often necessitates consultation with the Cassandra development team."
"There were challenges with the query language and the development interface. The query language, in particular, could be improved for better optimization. These challenges were encountered while using the Java SDK."
"The secondary index in Cassandra was a bit problematic and could be improved."
"Depending upon our schema, we can't make ORDER BY or GROUP BY clauses in the product."
"If they could support modifying the data more easily than the current implementation, it would be beneficial."
"There are better solutions out there that have more features than this one."
"I would like to see an improvement in how the solution helps me to handle the whole cluster."
"They should focus on upgrading their technical capabilities in the market."
"The security of this solution could be improved. There should also be a way to basically have a blockchain enabled storage with the HDFS."
"Currently, we are using many other tools such as Spark and Blade Job to improve the performance."
"Without the big data environment, we cannot store all of this data live. We have billions of records and terabytes of storage to be used. It's not an option actually for us to have a big data environment."
"While the deployed product is generally functional, there are instances where it presents difficulties."
"The main problem has been with integration with the services."
"Couchbase needs to improve the consistent reliability of the replication feature. Sometimes, the replications would be delayed."
"I would like Couchbase to provide more functionality via the UI, as some operations, such as time-based scaling, currently require using the API."
"It's easy to deploy. Where the challenge comes in is when you start putting data in, doing the indexes, and doing the integration with systems. Integration is one of their weakest points. Natively, there should be a wide range of integration options to be able to get data in."
"I would like Couchbase to provide more functionality via the UI, as some operations, such as time-based scaling, currently require using the API."
"It is very difficult to load the backup of the older version to the newer version."
"There are some limitations to the database. The SQL database cannot handle real-time processing for critical IoT scenarios. What we have to do is store our data into the database then code it out, this wastes a lot of time."
"We would like to have a better management of Kubernetes with the free, open source version of Couchbase. We don't have any major complaints other than that."
 

Pricing and Cost Advice

"I use the tool's open-source version."
"Cassandra is a free open source solution, but there is a commercial version available called DataStax Enterprise."
"We pay for a license."
"I don't have the specific numbers on pricing, but it was fairly priced."
"There are licensing fees that must be paid, but I'm not sure if they are paid monthly or yearly."
"We are using the open-source version of Cassandra, the solution is free."
"The price is very high. The solution is expensive."
"I believe we pay for a three-year license."
"When comparing with Oracle Sybase and SQL, it's cheaper. It's not expensive."
"The solution is expensive."
"It is an expensive product."
"The product’s price depends from project to project."
"I wouldn't recommend CDH to others because of its high cost."
"I haven't bought a license for this solution. I'm only using the Apache license version."
"I would rate this solution a nine out of ten for pricing as it is affordable."
"It seems very reasonable. It's a lot cheaper than Redis, but we've got an enterprise license. So, it's about normal. It's not outrageous in price as far as we've seen. From Couchbase, there's no additional fee as far as I'm aware, but when you're integrating, there's an additional fee because a lot of times, they don't have an integration stack."
"I wouldn't say Couchbase offers good value for money."
"It can range between 25,000 to 40,000 Euros per year depending on company requirements."
"The licensing cost of Couchbase is quite expensive compared to other databases."
"The price of this solution is better than some of the other competitors."
"We estimate that it's not very expensive, however, the pricing that you can get from the account managers, e.g. the public pricing, could be a bit expensive."
865,164 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Cassandra: Financial Services Firm 18%, Computer Software Company 12%, Comms Service Provider 7%, Retailer 6%
Cloudera Distribution for Hadoop: Financial Services Firm 19%, Educational Organization 17%, Computer Software Company 12%, Energy/Utilities Company 6%
Couchbase Enterprise: Financial Services Firm 15%, Computer Software Company 13%, Retailer 8%, Manufacturing Company 8%
 

Company Size

By reviewers: Large Enterprise, Midsize Enterprise, Small Business
 

Questions from the Community

What do you like most about Cassandra?
The use of Cassandra in real-time data analytics has been pivotal for our e-commerce platform. As our platform operat...
What needs improvement with Cassandra?
While Cassandra can handle NoSQL, I think there should be more flexibility for whole schema design when data is store...
What do you like most about Cloudera Distribution for Hadoop?
The tool can be deployed using different container technologies, which makes it very scalable.
What is your experience regarding pricing and costs for Cloudera Distribution for Hadoop?
The price for Cloudera is average, yet it is very good compared to other solutions. It can be deployed on-premises, u...
What needs improvement with Cloudera Distribution for Hadoop?
It is quite complicated to configure and install. Integrating the platform into an information system is always a cha...
What needs improvement with Couchbase?
The main issue we keep facing from the past couple of years, observing other teams using Couchbase, is that whenever ...
What is your primary use case for Couchbase?
Basically we have clusters, Couchbase clusters, databases, and that is how we use Couchbase with XDCR. All the cluste...
What advice do you have for others considering Couchbase?
It is a good solution, but as every product needs improvement, this also needs some enhancement. It is a good product...
 

Overview

 

Sample Customers

Cassandra: Apple, Netflix, Facebook, Instagram, Twitter, eBay, Spotify, Uber, Airbnb, Adobe, Cisco, IBM, Microsoft, Yahoo, Reddit, Pinterest, Salesforce, LinkedIn, Hulu, Walmart, Target, Sony, Intel, HP, Oracle, SAP, GE, Siemens, Volkswagen, Toyota
Cloudera Distribution for Hadoop: 37signals, Adconion, adgooroo, Aggregate Knowledge, AMD, Apollo Group, Blackberry, Box, BT, CSC
Couchbase Enterprise: Amadeus, Cisco, Comcast, LinkedIn, GE
Updated: August 2025.