No more typing reviews! Try our Samantha, our new voice AI agent.

Cloudera Distribution for Hadoop vs ScyllaDB comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Jan 7, 2025

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Cloudera Distribution for H...
Ranking in NoSQL Databases
10th
Average Rating
8.0
Reviews Sentiment
6.3
Number of Reviews
51
Ranking in other categories
Hadoop (2nd)
ScyllaDB
Ranking in NoSQL Databases
5th
Average Rating
7.8
Reviews Sentiment
7.0
Number of Reviews
12
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of June 2026, in the NoSQL Databases category, the mindshare of Cloudera Distribution for Hadoop is 5.5%, up from 2.2% compared to the previous year. The mindshare of ScyllaDB is 6.2%, down from 10.0% compared to the previous year. It is calculated based on PeerSpot user engagement data.
NoSQL Databases Mindshare Distribution
ProductMindshare (%)
ScyllaDB6.2%
Cloudera Distribution for Hadoop5.5%
Other88.3%
NoSQL Databases
 

Featured Reviews

SA
Head of Advaced Analytics & Intelligence; AGM at Alinma Bank
Integration of multiple features supports data analytics and processing
Cloudera Distribution for Hadoop provides numerous features and capabilities combined into one platform.The solution offers power processing and supports different file systems and query engines. It provides parallel processing for handling many requests. The platform includes role-based access control in Cloudera Distribution for Hadoop. It secures the data itself and provides users with different roles and privileges.
Manikandan Gunasekaran - PeerSpot reviewer
Director of Engineering at Ola
Reliable data management with great reliability and performance
From a sales pitch standpoint, it needs to deliver on promises of better ROI and compaction. Additionally, ticketing and support systems could be improved due to the time it takes to get answers. There's also an issue with compatibility when attempting to switch back from the enterprise to the community version.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"For the clusters using CM, we are able to more tightly control and manage the configuration of all nodes in the clusters."
"With a single script, we are able to run the jobs within minutes, which is an advantage."
"Cloudera Manager is the most valuable feature for its ease of use, features, ease of upgrade and install components."
"We're now able to store large volumes of data through Cloudera Distribution for Hadoop. We're able to push large volumes of data to the platform, and that used to be a challenge, especially when storing a terabyte of information. This is the area where Cloudera Distribution for Hadoop improved the organization."
"Implementing a Hadoop cluster has become relatively straight-forward using CDH."
"The tool's most interesting features are the distributed file system and unstructured data processing capability. Because we have a lot of unstructured data, like XML and social media logs, these features make it more valuable than the usual data warehousing solutions."
"The product is completely secure."
"CDH has a wide variety of proprietary tools that we use, like Impala, and from that perspective, it's quite useful as opposed to something open-source, as we get a lot of value from Cloudera's proprietary tools."
"The performance aspects of Scylla are good, as always... A good point about Scylla is that it can be used extensively."
"The documentation is good. It integrates easily with our existing data infrastructure."
"It is lightweight, and it requires less infrastructure."
"I like how fast it is to query data from the ScyllaDB node!"
"ScyllaDB allows fine-tuning of the table structure. Speed is probably the most critical factor because we perform a lot of heavy data ingestion. One of its core features is its ability to handle high volumes and maintain speed when accessing data. Additionally, high availability and partitioning are built-in features of ScyllaDB."
"ScyllaDB is very fast, and I can use it for so many things."
"The best features of ScyllaDB are how it synchronizes data and its failover system. There's a unique formula to decide the number of nodes you need and the minimum required, which I find helpful. It also offers encryption and supports APIs, making it great for distributed systems and scaling databases across different regions. While it's easy to use, having prior experience helps configure it properly. There are many configurations; if you don't understand them, you might mess up the design. So, understanding your system's needs, like whether it requires more read or write operations, is crucial for setting up the correct configuration."
"Firstly, if I update something, it's most likely to finish within milliseconds."
 

Cons

"The Data Science Workbench doesn't support multiple languages. It needs to support multiple programming languages."
"While the deployed product is generally functional, there are instances where it presents difficulties."
"The user infrastructure and user interface needs to be improved, as well as the performance. The GUI needs to be better."
"Cloudera's support is extremely bad and cannot be relied on."
"There are better solutions out there that have more features than this one."
"I would like to see an improvement in how the solution helps me to handle the whole cluster."
"The tool doesn't support reporting, and relational databases are still the major source of reporting data. Apache Iceberg will be launched soon within the Cloudera cluster for analytical purposes. The Cloudera Machine Learning aspect could be tuned and enhanced to enable us to host some predictive analytics machine learning and AI use cases."
"Cloudera 5 is currently very unstable. Between two Cloudera 5 clusters, we have an incident at least twice a week due to what are now outstanding bugs."
"From a sales pitch standpoint, it needs to deliver on promises of better ROI and compaction."
"The documentation is not well established for new developers."
"The documentation of Scylla is an area with shortcomings and needs to be improved."
"Some of the regular commands in NoSQL do not work."
"We faced several challenges while integrating ScyllaDB into our AWS environment. One common issue was that a security port wasn’t opened on one node, preventingdata synchronization across clusters. We noticed the data wasn’t syncing correctly when we saw different record counts in other regions. After investigating, we found that the port was closed in one AWS region. Once we opened the port, the data synchronization across all nodes resumed as expected."
"If you don't have the best computing resources, then it's not easy to set up. In such cases, we have to run ScyllaDB in developer mode."
"The product needs to add more features and improve the response time of the support team."
"Data export, along with how we can purchase the data periodically, needs to be improved so that the storage is within control. Then, we could optimize it even better."
 

Pricing and Cost Advice

"When comparing with Oracle Sybase and SQL, it's cheaper. It's not expensive."
"Cloudera Distribution for Hadoop is expensive, with support costs involved."
"I believe we pay for a three-year license."
"The pricing must be improved."
"The solution is expensive."
"I wouldn't recommend CDH to others because of its high cost."
"I haven't bought a license for this solution. I'm only using the Apache license version."
"The product’s price depends from project to project."
"It is an expensive tool compared to its competitor."
"It's free."
"The paid version of ScyllaDB is not that expensive. The main advantage of the paid version is direct support from the ScyllaDB team, which can resolve issues faster—typically within a day, compared to two to three days with the free version. The paid version also offers better guidance and support, while the free version has good documentation and is more high-level. I’d rate their support team nine out of ten because of the quick responses from their community."
"I believe that there is a yearly licensing cost and that it's expensive."
"It's a bit expensive."
report
Use our free recommendation engine to learn which NoSQL Databases solutions are best for your needs.
900,838 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
23%
Construction Company
10%
Marketing Services Firm
8%
Manufacturing Company
6%
Outsourcing Company
12%
Financial Services Firm
8%
Comms Service Provider
8%
Transportation Company
7%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
By reviewers
Company SizeCount
Small Business16
Midsize Enterprise9
Large Enterprise32
By reviewers
Company SizeCount
Small Business3
Midsize Enterprise2
Large Enterprise8
 

Questions from the Community

What is your experience regarding pricing and costs for Cloudera Distribution for Hadoop?
The price for Cloudera is average, yet it is very good compared to other solutions. It can be deployed on-premises, unlike competitors' cloud-only solutions.
What needs improvement with Cloudera Distribution for Hadoop?
If they could support modifying the data more easily than the current implementation, it would be beneficial.
What is your primary use case for Cloudera Distribution for Hadoop?
We use Cloudera Distribution for Hadoop for many use cases including analytics, storing huge data sets, and various data processing tasks.
What is your experience regarding pricing and costs for Scylla?
From what I’ve seen (and experienced), ScyllaDB pricing is very dependent on how you deploy it, and that’s where most of the confusion comes from.
What needs improvement with Scylla?
From a sales pitch standpoint, it needs to deliver on promises of better ROI and compaction. Additionally, ticketing and support systems could be improved due to the time it takes to get answers. T...
What is your primary use case for Scylla?
We dump a lot of our data, such as every entry created with respect to when a user rides a scooter, every record gets updated to ScyllaDB. It is used as a single source of truth and it manages mass...
 

Overview

 

Sample Customers

37signals, Adconion,adgooroo, Aggregate Knowledge, AMD, Apollo Group, Blackberry, Box, BT, CSC
IBM, Investing.com, mParticle, Comcast, GE, Fanatics, Ola, CERN, adgear, Samsung
Find out what your peers are saying about Cloudera Distribution for Hadoop vs. ScyllaDB and other solutions. Updated: June 2026.
900,838 professionals have used our research since 2012.