Try our new research platform with insights from 80,000+ expert users

Cloudera Distribution for Hadoop vs InfluxDB vs Neo4j Graph Database comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Mindshare comparison

As of February 2026, in the NoSQL Databases category, the mindshare of Cloudera Distribution for Hadoop is 3.3%, up from 1.9% compared to the previous year. The mindshare of InfluxDB is 5.6%, down from 11.3% compared to the previous year. The mindshare of Neo4j Graph Database is 6.1%, up from 4.1% compared to the previous year. It is calculated based on PeerSpot user engagement data.
NoSQL Databases Market Share Distribution
ProductMarket Share (%)
InfluxDB5.6%
Neo4j Graph Database6.1%
Cloudera Distribution for Hadoop3.3%
Other85.0%
NoSQL Databases
 

Featured Reviews

Rok Dolinsek - PeerSpot reviewer
Manager, Bussines Development & Co Owner at Troia d.o.o.
Enables on-premise implementation with powerful data processing capabilities
This is the only solution that is possible to install on-premise. Cloudera provides a hybrid solution that combines compute on cloud or on-premises. It includes all machine learning algorithms in the Spark machine learning library. All functionalities needed for a big data platform and ETL are on the platform, eliminating the need for other tools. It is scalable, ready for vertical scaling, and very powerful, offering numerous functionalities and configurations for generative AI.
Mugeesh Husain - PeerSpot reviewer
Team Lead, Software at Energybox
Time series data has been managed efficiently for IoT sensors but reporting still needs improvement
How InfluxDB can be improved is relevant since for Energy Box, we face certain issues. We have customers worldwide, including the United States, United Kingdom, and Europe, but when we expanded to China two years ago, they indicated that they do not support the cloud version there. Our application is built on the cloud, which required us to create a separate application for Azure China, which was painful for us. The second issue involves frequent version changes. For example, we started with version one, transitioned to version two, and I heard they are considering InfluxDB version three, reverting to earlier practices. InfluxDB should improve without completely changing its approach. Now we have to redo our work for InfluxDB version three. Regarding needed improvements, the documentation is sufficient, but pricing presents a challenge. InfluxDB has standard pricing, which is acceptable for large companies. However, for startups in our position, they should provide special discounts so everyone can utilize it. The pricing should adapt as companies grow, which is a reasonable expectation.
RT
VP odfTechnology at Enterpi Software Solutions Private Limited
Delivers superior search and data aggregation capabilities
Neo4j helps with advanced search needs, providing good search results and aggregates compared to MongoDB. Aggregating with MongoDB can be difficult; however, with Neo4j, it's easier. Aggregating data, backing up, and creating new clusters are user-friendly from the back end. In DevOps web deployment, we noticed no database issues. We created Docker instances and set them up efficiently, managing databases up to 50 gigabytes.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"Cloudera is a very manageable solution with good support."
"The solution is stable."
"The most valuable feature is Kubernetes."
"It has the best proxy, security, and support features compared to open-source products."
"I don't see any performance issues."
"We're now able to store large volumes of data through Cloudera Distribution for Hadoop. We're able to push large volumes of data to the platform, and that used to be a challenge, especially when storing a terabyte of information. This is the area where Cloudera Distribution for Hadoop improved the organization."
"Cloudera provides a hybrid solution that combines compute on cloud or on-premises."
"With a cluster available, you can manage the security layer using the shared SDX - it provides flexibility."
"InfluxDB works as expected with excellent scalability and stability, which is critical for our application."
"The solution is very powerful."
"While I would rate InfluxDB a ten on a scale of one to ten, users should be thoughtful about matching the engine to their specific needs."
"In our case, it started with a necessity to fill the gap that we had in monitoring. We had very reactive monitoring without trend analysis and without some advanced features. We were able to implement them by using a time series database. We are able to have all the data from applications, logs, and systems, and we can use a simple query language to correlate all the data and make things happen, especially with monitoring. We could more proactively monitor our systems and our players' trends."
"Based on InfluxDB, we have great analytics produced by our SRE team, and with that, we have an alerting and monitoring system in place."
"Overall, InfluxDB delivered excellent performance, stability, and simplicity for telemetry-driven use cases."
"The most valuable features of InfluxDB are the documentation and performance, and the good plugins metrics in the ecosystem."
"InfluxDB is a database where you can insert data. However, it would be best if you had different components for alerting, data sending, and visualization. You need to install tools to collect data from servers. It must be installed on Windows or Linux servers. During installation, ensure that the configuration file is correct to prevent issues. Once data is collected, it can be sent to InfluxDB. For visualization, you can use open-source tools like Grafana."
"Enables people to understand what the business problem is and how the technology helps."
"For now, the tool doesn't break down or stop, so it is quite stable."
"It is good for search-based tasks, providing solid search results and aggregate results."
"The solution's best feature is how it differs from traditional SQL databases. It's hard to map people and find those near me in SQL, which requires long, complex queries. Neo4j Graph Database makes this easier with simpler queries. It also supports more data types, like JSON, which SQL doesn't."
"Creates the ability to visualize outputs."
"As a graph database, I am surprised at their performance and response time."
 

Cons

"It is quite complicated to configure and install. Integrating the platform into an information system is always a challenge, especially when starting with on-premise implementation."
"This is a very expensive solution."
"Currently, we are using many other tools such as Spark and Blade Job to improve the performance."
"Cloudera's support is extremely bad and cannot be relied on."
"There is a maximum of a one-gigabyte block size, which is an area of storage that can be improved upon."
"The areas of improvement depend on the scale of the project. For banking customers, security features and an essential budget for commercial licenses would be the top priority. Data regulation could be the most crucial for a project with extensive data or an extra use case."
"The security of this solution could be improved. There should also be a way to basically have a blockchain enabled storage with the HDFS."
"The competitors provide better functionalities."
"InfluxDB can improve by including new metrics on other technologies. They had some changes recently to pool data from endpoints but the functionality is not good enough in the industry."
"One area for improvement is the querying language. InfluxDB deprecated FluxQL, which was intuitive since developers are already familiar with standard querying."
"I've tried both on-premises and cloud-based deployments, and each has its limitations."
"In terms of features that I would like to see or have, in the community version, some features are not available. I would like to have clustering and authentication in the community version."
"However, I cannot ignore the challenges I faced while configuring the database with my message brokers, whether Rabbit or Kafka, because the documentation is not properly provided."
"If it gets a little bit more into the metric side, then it would really be great, similar to Prometheus."
"Sometimes, when we write too much data within a minute, the data count becomes excessive, reaching perhaps 100,000 or 500,000 data points, and InfluxDB gives a timeout exception, which we must handle in our application."
"It is challenging to get long-running backups while running InfluxDB in a Microsoft Azure Kubernetes cluster."
"For me, when the tool was deployed on an on-premises model, it was a little bit difficult the first time."
"There are concerns about performance and whether the tool can necessarily scale to provide the solution."
"The only problem is that the community is quite small."
"So far, we have not had any issues and are happy with the product in general."
"The tool could improve by having more resources, especially for Golang, which we use. It lacks good basic libraries and doesn't have an ORM (Object-Relational Mapping) tool, which many NoSQL databases have. We thought about building an ORM for the Neo4j Graph Database but are too busy."
 

Pricing and Cost Advice

"When comparing with Oracle Sybase and SQL, it's cheaper. It's not expensive."
"The product’s price depends from project to project."
"The tool is not expensive."
"The price is very high. The solution is expensive."
"The pricing must be improved."
"The solution is fairly expensive."
"I haven't bought a license for this solution. I'm only using the Apache license version."
"The solution is expensive."
"InfluxDB is open-source, but there are additional costs for scaling."
"We are using the open-source version of InfluxDB."
"The tool is an open-source product."
"InfluxDB recently increased its price. It is very expensive now."
"The tool is not expensive."
"The solution is open source so that you can use it for free. They also offer an enterprise version with its billing. If your company is earning well, I suggest using the enterprise version. Otherwise, you can deploy it on your own cloud and pay based on usage."
report
Use our free recommendation engine to learn which NoSQL Databases solutions are best for your needs.
881,757 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
21%
Marketing Services Firm
9%
Computer Software Company
8%
Comms Service Provider
6%
Manufacturing Company
10%
University
9%
Computer Software Company
9%
Financial Services Firm
9%
Financial Services Firm
17%
Computer Software Company
10%
Energy/Utilities Company
9%
Comms Service Provider
6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
By reviewers
Company SizeCount
Small Business16
Midsize Enterprise9
Large Enterprise31
By reviewers
Company SizeCount
Small Business6
Midsize Enterprise3
Large Enterprise8
No data available
 

Questions from the Community

What do you like most about Cloudera Distribution for Hadoop?
The tool can be deployed using different container technologies, which makes it very scalable.
What is your experience regarding pricing and costs for Cloudera Distribution for Hadoop?
The price for Cloudera is average, yet it is very good compared to other solutions. It can be deployed on-premises, u...
What needs improvement with Cloudera Distribution for Hadoop?
If they could support modifying the data more easily than the current implementation, it would be beneficial.
What do you like most about InfluxDB?
InfluxDB is a database where you can insert data. However, it would be best if you had different components for alert...
What needs improvement with InfluxDB?
Although I didn't encounter any significant challenges, I think that if there was a NoSQL version of InfluxDB, that w...
What is your primary use case for InfluxDB?
My main use case for InfluxDB involved working on a LEO satellite KPI monitoring application, where I gathered latenc...
What is your experience regarding pricing and costs for Neo4j?
The solution is open source so that you can use it for free. They also offer an enterprise version with its billing. ...
What needs improvement with Neo4j Graph Database?
The only problem is that the community is quite small.
What is your primary use case for Neo4j Graph Database?
We have used Neo4j in microservices. In one of the microservices, we used Neo4j since we have some requirements simil...
 

Overview

 

Sample Customers

37signals, Adconion,adgooroo, Aggregate Knowledge, AMD, Apollo Group, Blackberry, Box, BT, CSC
ebay, AXA, Mozilla, DiDi, LeTV, Siminars, Cognito, ProcessOut, Recommend, CATS, Smarsh, Row 44, Clustree, Bleemeo
Walmart, Telenor, Wazoku, Adidas, Cerved, GameSys, eBay, Schleich, ICIJ, die Bayerisch, Megree, InfoJobs, LinkedIn
Find out what your peers are saying about MongoDB, Microsoft, ScyllaDB and others in NoSQL Databases. Updated: February 2026.
881,757 professionals have used our research since 2012.