Try our new research platform with insights from 80,000+ expert users

Cloudera Distribution for Hadoop vs SingleStore comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Cloudera Distribution for H...
Average Rating
8.0
Reviews Sentiment
6.3
Number of Reviews
51
Ranking in other categories
Hadoop (1st), NoSQL Databases (8th)
SingleStore
Average Rating
8.8
Reviews Sentiment
7.4
Number of Reviews
6
Ranking in other categories
Database as a Service (DBaaS) (11th), Vector Databases (16th)
 

Mindshare comparison

While both are Databases solutions, they serve different purposes. Cloudera Distribution for Hadoop is designed for Hadoop and holds a mindshare of 19.1%, down 27.2% compared to last year.
SingleStore, on the other hand, focuses on Database as a Service (DBaaS), holds 2.7% mindshare, up 1.2% since last year.
Hadoop Market Share Distribution
ProductMarket Share (%)
Cloudera Distribution for Hadoop19.1%
Apache Spark17.1%
HPE Data Fabric14.6%
Other49.199999999999996%
Hadoop
Database as a Service (DBaaS) Market Share Distribution
ProductMarket Share (%)
SingleStore2.7%
Amazon RDS17.5%
MongoDB Atlas13.6%
Other66.2%
Database as a Service (DBaaS)
 

Featured Reviews

Rok Dolinsek - PeerSpot reviewer
Enables on-premise implementation with powerful data processing capabilities
This is the only solution that is possible to install on-premise. Cloudera provides a hybrid solution that combines compute on cloud or on-premises. It includes all machine learning algorithms in the Spark machine learning library. All functionalities needed for a big data platform and ETL are on the platform, eliminating the need for other tools. It is scalable, ready for vertical scaling, and very powerful, offering numerous functionalities and configurations for generative AI.
Yasin Sarı - PeerSpot reviewer
High-speed data processing, seamless scalability, and excellent high availability making it an optimal choice for those prioritizing performance and efficiency in a database solution
There's a noteworthy consideration when it comes to collecting massive amounts of data. It is not the optimal choice for direct data collection through queries, and it's more suited for aggregation tasks. Attempting to use it for direct extraction, for instance, might lead to memory-related challenges. While MySQL version five might lack extensive SQL capabilities, SingleStore also has its constraints, requiring simpler SQL writing. This becomes evident when seeking advanced functionalities like window functions or JSON functions, where SingleStore doesn't offer an extensive toolkit, necessitating a more straightforward approach to SQL.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"Cloudera Distribution for Hadoop provides numerous features and capabilities combined into one platform, offers power processing, supports different file systems and query engines, and provides parallel processing for handling many requests."
"The solution's most valuable feature is the enterprise data platform."
"The most valuable feature is Kubernetes."
"We experienced many issues when we started working with Hadoop 3.0 in the Cloudera 6.0 version, so there are a lot of things that need to improve. I believe they are working on that."
"We're now able to store large volumes of data through Cloudera Distribution for Hadoop. We're able to push large volumes of data to the platform, and that used to be a challenge, especially when storing a terabyte of information. This is the area where Cloudera Distribution for Hadoop improved the organization."
"We had a data warehouse before all the data. We can process a lot more data structures."
"Cloudera is a very manageable solution with good support."
"The most valuable feature is Impala, the querying engine, which is very fast."
"The most valuable feature is the ability to create pipelines, streamline and extract data from the pipelines."
"MemSQL supports the MySQL protocol, and many functions are similar, so the learning curve is very short."
"The ability to store data in memory is a standout feature, enhanced by robust failover mechanisms."
"The product can automatically reinstall and reconfigure in case of a shutdown."
"The paramount advantage is the exceptional speed."
"It's a distributed relational database, so it does not have a single server, it has multiple servers. Its architecture itself is fast because it has multiple nodes to distribute the workload and process large amounts of data."
 

Cons

"The user infrastructure and user interface needs to be improved, as well as the performance. The GUI needs to be better."
"The Cloudera training has deteriorated significantly."
"It is quite complicated to configure and install. Integrating the platform into an information system is always a challenge, especially when starting with on-premise implementation."
"The procedure for operations could be simplified."
"The competitors provide better functionalities."
"The initial setup of Cloudera is difficult."
"Cloudera Distribution for Hadoop is not always completely stable in some cases, which can be a concern for big data solutions."
"The performance of some analytics engines provided by Cloudera is not that good."
"For new customers, it's very tough to start. Their documentation isn't organized, and there's no online training available. SingleStore is working on it, but that's a major drawback."
"Poor key distribution can significantly impact performance, requiring a backward approach in design rather than adding tables incrementally."
"We don't get good discounts in Pakistan."
"It is not the optimal choice for direct data collection through queries, and it's more suited for aggregation tasks."
"There should be more pipelines available because I think that if MemSQL can connect to other services, that would be great."
"Having the ability to migrate servers using a single command would be extremely beneficial."
 

Pricing and Cost Advice

"The pricing must be improved."
"The price could be better for the product."
"I wouldn't recommend CDH to others because of its high cost."
"When comparing with Oracle Sybase and SQL, it's cheaper. It's not expensive."
"I haven't bought a license for this solution. I'm only using the Apache license version."
"The tool is expensive...For the SMB market or customers whose environments are not that complex and do not have multiple systems running, Cloudera might not be a good option."
"The price is very high. The solution is expensive."
"Cloudera Distribution for Hadoop is expensive, with support costs involved."
"I would advise users to try the free 128GB version."
"The product's licensing is not expensive. It is comparable."
"They have two main options: cloud installation and bare-metal installation, each with different pricing models."
"SingleStore is a bit expensive."
"Using it for analytical purposes can be cost-effective in the long run, especially in terms of infrastructure."
report
Use our free recommendation engine to learn which Hadoop solutions are best for your needs.
873,003 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Educational Organization
19%
Financial Services Firm
18%
Computer Software Company
11%
Energy/Utilities Company
6%
Financial Services Firm
32%
Computer Software Company
11%
Comms Service Provider
6%
University
5%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
By reviewers
Company SizeCount
Small Business16
Midsize Enterprise9
Large Enterprise31
By reviewers
Company SizeCount
Small Business4
Large Enterprise3
 

Questions from the Community

What do you like most about Cloudera Distribution for Hadoop?
The tool can be deployed using different container technologies, which makes it very scalable.
What is your experience regarding pricing and costs for Cloudera Distribution for Hadoop?
The price for Cloudera is average, yet it is very good compared to other solutions. It can be deployed on-premises, unlike competitors' cloud-only solutions.
What needs improvement with Cloudera Distribution for Hadoop?
If they could support modifying the data more easily than the current implementation, it would be beneficial.
What do you like most about SingleStore DB?
The paramount advantage is the exceptional speed.
What is your experience regarding pricing and costs for SingleStore DB?
Using it for analytical purposes can be cost-effective in the long run, especially in terms of infrastructure. While building an on-premise cluster incurs an initial cost for servers with ample RAM...
What needs improvement with SingleStore DB?
There's a noteworthy consideration when it comes to collecting massive amounts of data. It is not the optimal choice for direct data collection through queries, and it's more suited for aggregation...
 

Overview

 

Sample Customers

37signals, Adconion,adgooroo, Aggregate Knowledge, AMD, Apollo Group, Blackberry, Box, BT, CSC
400+ customers including: 6sense, Adobe, Akamai, Ant Money, Arcules, CARFAX, Cigna, Cisco, Comcast, DELL, DBS Bank, Dentsu, DirectlyApply, EY, Factors.AI, Fathom Analytics, FirstEnergy, GE, Goldman Sachs, Heap, Hulu, IMAX, impact.com, Kroger, LG, LiveRamp, Lumana, Nvidia, OpenDialog, Outreach, Palo Alto Networks, PicPay, RBC, Samsung, SegMetrics, Siemens, SiteImprove, SiriusXM, SK Telecom, SKAI, SONY, STC, SunRun, TATA, Thorn, ZoomInfo.
Find out what your peers are saying about Cloudera, Apache, Amazon Web Services (AWS) and others in Hadoop. Updated: November 2025.
873,003 professionals have used our research since 2012.