No more typing reviews! Try our Samantha, our new voice AI agent.

Cassandra vs Cloudera Distribution for Hadoop vs InfluxDB comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Mindshare comparison

As of April 2026, in the NoSQL Databases category, the mindshare of Cassandra is 7.8%, down from 11.5% compared to the previous year. The mindshare of Cloudera Distribution for Hadoop is 3.6%, up from 1.9% compared to the previous year. The mindshare of InfluxDB is 5.3%, down from 10.7% compared to the previous year. It is calculated based on PeerSpot user engagement data.
NoSQL Databases Mindshare Distribution
ProductMindshare (%)
InfluxDB5.3%
Cassandra7.8%
Cloudera Distribution for Hadoop3.6%
Other83.3%
NoSQL Databases
 

Featured Reviews

Monirul Islam Khan - PeerSpot reviewer
Head, Data Integration & Management at a non-profit with 10,001+ employees
Has maintained secure document storage and efficient data distribution with peer-to-peer architecture
The functions or features in Cassandra that I have found most valuable are that it is a distributed system similar to Mongo. It's good enough for comparison with another SQL database, so it's smooth and organized for distributed database system. The peer-to-peer architecture in Cassandra is helpful for network decentralization, and I have already introduced that feature. Cassandra features in peer-to-peer as well as another monitoring, so basically, it's good enough for our service. The tunable consistency level in Cassandra is good, and we are using that feature already. In terms of built-in caching and lightweight transactions in Cassandra, the transaction level is good, and it's optimized, so there are no more issues in that database. Based on my experience, Cassandra is good for document management system, as well as distributed database system, and the automatic recovery process is there. Additionally, the database monitoring system or auditing system is well-comparable with other database systems, so we are actually happy to be using this Cassandra database.
SA
Head of Advaced Analytics & Intelligence; AGM at Alinma Bank
Integration of multiple features supports data analytics and processing
Cloudera Distribution for Hadoop provides numerous features and capabilities combined into one platform.The solution offers power processing and supports different file systems and query engines. It provides parallel processing for handling many requests. The platform includes role-based access control in Cloudera Distribution for Hadoop. It secures the data itself and provides users with different roles and privileges.
Mugeesh Husain - PeerSpot reviewer
Team Lead, Software at Energybox
Time series data has been managed efficiently for IoT sensors but reporting still needs improvement
How InfluxDB can be improved is relevant since for Energy Box, we face certain issues. We have customers worldwide, including the United States, United Kingdom, and Europe, but when we expanded to China two years ago, they indicated that they do not support the cloud version there. Our application is built on the cloud, which required us to create a separate application for Azure China, which was painful for us. The second issue involves frequent version changes. For example, we started with version one, transitioned to version two, and I heard they are considering InfluxDB version three, reverting to earlier practices. InfluxDB should improve without completely changing its approach. Now we have to redo our work for InfluxDB version three. Regarding needed improvements, the documentation is sufficient, but pricing presents a challenge. InfluxDB has standard pricing, which is acceptable for large companies. However, for startups in our position, they should provide special discounts so everyone can utilize it. The pricing should adapt as companies grow, which is a reasonable expectation.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"Its retrieval is similar to an RDBMS, so our team finds it easy to adapt."
"A consistent solution."
"The most valuable features are the counter features and the NoSQL schema. It also has good scalability. You can scale Cassandra to any finite level."
"The use of Cassandra in real-time data analytics has been pivotal for our e-commerce platform. As our platform operates 24/7, providing services to sellers and customers alike, the need for real-time updates is paramount."
"We can add almost one million columns to the solution."
"The most valuable features of this solution are its speed and distributed nature."
"Since I haven't had years of experience with it, it's still new to me. One valuable feature is its distribution, so I can run it partly in the cloud and part on-prem. That's a feature I'd like to use but haven't yet because we're trying to move to Azure. I don't know if or when that will happen. Ideally, we'd have it distributed over the cloud and on-prem simultaneously, so if something happens to our on-prem, we can keep going in the cloud, like a pay-as-you-go model with Azure."
"Some of the valued features of this solution are it has good performance and failover."
"The most valuable feature is that I can use CDH for almost all use cases across all industries, including the financial sector, public sector, private retailers, and so on."
"Professional support enabled us to provide great customer service and our clients are able to perform proactive maintenance in an efficient manner."
"Cloudera is a very manageable solution with good support."
"Cloudera is a great product and, overall, there are many features."
"Cloudera Distribution for Hadoop provides numerous features and capabilities combined into one platform, offers power processing, supports different file systems and query engines, and provides parallel processing for handling many requests."
"I am very comfortable with this product."
"I like the combination of all the tools that allow me to provide solutions and enable me to solve the use cases I'm working on."
"The solution is stable."
"My advice for others looking into using InfluxDB is to use it the same way I did, because it is really stable, easy and friendly to use, and it is a great product overall."
"The most valuable features of InfluxDB are the documentation and performance, and the good plugins metrics in the ecosystem."
"The most valuable feature of the solution is we can use InfluxDB to integrate with and plug into any other tools."
"The most valuable features of InfluxDB are the documentation and performance, and the good plugins metrics in the ecosystem."
"The user interface is well-designed and easy to use. It provides a clear overview of the data, making it simple to understand the information at hand."
"Overall, InfluxDB delivered excellent performance, stability, and simplicity for telemetry-driven use cases."
"As a time series database, it is very powerful and lightweight, and it can deal with heavy workloads very easily."
"InfluxDB's best feature is that it's a cloud offering. Other good features include its time-series DB, fast time-bulk queries, and window operations."
 

Cons

"One of the issues with the solution is that you cannot drop write like you're able to in MongoDB and MySQL, where you can join tables."
"Some issues arise from our vendors like Apache slowness and distribution or load balancing from HAProxy, which should better handle consumption for high-level concurrency."
"Fine-tuning was a bit of a challenge."
"Cassandra could be more user-friendly like MongoDB."
"Row-level locking is not available; might be very helpful in update use cases."
"The solution is not easy to use because it is a big database and you have to learn the interface."
"The initial setup of Cassandra can be difficult in the configuration. There might be a need to have assistance. The implementation process can six months for connecting to certain databases."
"Cassandra can improve by adding more built-in tools. For example, if you want to do some maintenance activities in the cluster, we have to depend on third-party tools. Having these tools build-in would be e benefit."
"Cloudera CDH5.5.x does not support SparkR."
"The procedure for operations could be simplified."
"It could be faster and more user-friendly."
"The only thing that needs improvement is the cost, it's a very expensive solution and one of the main reasons companies are not attracted to the product."
"They should focus on upgrading their technical capabilities in the market."
"We're currently trying to perform a failed installation and it's little bit difficult. It should restart the installation where it left off."
"The stability is problematic. We did encounter quite a lot of issues with the cluster going down quite frequently."
"The licensing was by node."
"If it gets a little bit more into the metric side, then it would really be great, similar to Prometheus."
"I've tried both on-premises and cloud-based deployments, and each has its limitations."
"The error logging capability can be improved because the logs are not very informative."
"In terms of features that I would like to see or have, in the community version, some features are not available. I would like to have clustering and authentication in the community version."
"It is challenging to get long-running backups while running InfluxDB in a Microsoft Azure Kubernetes cluster."
"In terms of features that I would like to see or have, in the community version, some features are not available. I would like to have clustering and authentication in the community version."
"InfluxDB cannot be used for high-cardinality data. It's also difficult and time-consuming to write queries, and there are some issues with bulk API."
"It is challenging to get long-running backups while running InfluxDB in a Microsoft Azure Kubernetes cluster."
 

Pricing and Cost Advice

"I use the tool's open-source version."
"We are using the open-source version of Cassandra, the solution is free."
"Cassandra is a free open source solution, but there is a commercial version available called DataStax Enterprise."
"There are licensing fees that must be paid, but I'm not sure if they are paid monthly or yearly."
"We pay for a license."
"I don't have the specific numbers on pricing, but it was fairly priced."
"I haven't bought a license for this solution. I'm only using the Apache license version."
"The solution is expensive."
"Cloudera Distribution for Hadoop is expensive, with support costs involved."
"The product’s price depends from project to project."
"The price could be better for the product."
"The tool is expensive...For the SMB market or customers whose environments are not that complex and do not have multiple systems running, Cloudera might not be a good option."
"The tool is not expensive."
"The pricing must be improved."
"We are using the open-source version of InfluxDB."
"The tool is an open-source product."
"InfluxDB is open-source, but there are additional costs for scaling."
"InfluxDB recently increased its price. It is very expensive now."
report
Use our free recommendation engine to learn which NoSQL Databases solutions are best for your needs.
885,837 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
16%
Computer Software Company
7%
Retailer
6%
Comms Service Provider
6%
Financial Services Firm
23%
Marketing Services Firm
9%
Comms Service Provider
6%
Construction Company
6%
Financial Services Firm
10%
Manufacturing Company
10%
Comms Service Provider
9%
University
8%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
By reviewers
Company SizeCount
Small Business8
Midsize Enterprise2
Large Enterprise14
By reviewers
Company SizeCount
Small Business16
Midsize Enterprise9
Large Enterprise31
By reviewers
Company SizeCount
Small Business8
Midsize Enterprise4
Large Enterprise8
 

Questions from the Community

What is your experience regarding pricing and costs for Cassandra?
The pricing for Cassandra is a little bit high, so it would be better for our community services if they consider com...
What needs improvement with Cassandra?
Regarding areas of improvement for Cassandra, currently, we are not facing significant issues. Some issues arise from...
What is your primary use case for Cassandra?
My use case for Cassandra is for a document and other unstructured data management system as well as structured data ...
What do you like most about Cloudera Distribution for Hadoop?
The tool can be deployed using different container technologies, which makes it very scalable.
What is your experience regarding pricing and costs for Cloudera Distribution for Hadoop?
The price for Cloudera is average, yet it is very good compared to other solutions. It can be deployed on-premises, u...
What needs improvement with Cloudera Distribution for Hadoop?
If they could support modifying the data more easily than the current implementation, it would be beneficial.
What needs improvement with InfluxDB?
One thing I appreciate about InfluxDB is its balance between performance and ease of use, especially with Flux making...
What is your primary use case for InfluxDB?
My main use case for InfluxDB has been mostly for monitoring and analyzing the time-series data related to system met...
What advice do you have for others considering InfluxDB?
My advice for others looking into using InfluxDB would be to clearly define their time-series data use cases upfront ...
 

Overview

 

Sample Customers

1. Apple 2. Netflix 3. Facebook 4. Instagram 5. Twitter 6. eBay 7. Spotify 8. Uber 9. Airbnb 10. Adobe 11. Cisco 12. IBM 13. Microsoft 14. Yahoo 15. Reddit 16. Pinterest 17. Salesforce 18. LinkedIn 19. Hulu 20. Airbnb 21. Walmart 22. Target 23. Sony 24. Intel 25. Cisco 26. HP 27. Oracle 28. SAP 29. GE 30. Siemens 31. Volkswagen 32. Toyota
37signals, Adconion,adgooroo, Aggregate Knowledge, AMD, Apollo Group, Blackberry, Box, BT, CSC
ebay, AXA, Mozilla, DiDi, LeTV, Siminars, Cognito, ProcessOut, Recommend, CATS, Smarsh, Row 44, Clustree, Bleemeo
Find out what your peers are saying about MongoDB, Microsoft, Couchbase and others in NoSQL Databases. Updated: March 2026.
885,837 professionals have used our research since 2012.