Cloudera Distribution for Hadoop Valuable Features
This is the only solution that is possible to install on-premise. Cloudera provides a hybrid solution that combines compute on cloud or on-premises. It includes all machine learning algorithms in the Spark machine learning library.
All functionalities needed for a big data platform and ETL are on the platform, eliminating the need for other tools. It is scalable, ready for vertical scaling, and very powerful, offering numerous functionalities and configurations for generative AI.
View full review »The tool's most interesting features are the distributed file system and unstructured data processing capability. Because we have a lot of unstructured data, like XML and social media logs, these features make it more valuable than the usual data warehousing solutions.
Data warehouse solutions mainly use structured, regular, and formatted data, but Cloudera Distribution for Hadoop can handle unstructured data. This is the most interesting part. Also, the huge amount of data can be tuned in HDFS rather than relational databases. Cloudera Distribution for Hadoop can be a promising solution for distributed file systems, real-time processing, batch mode processing, AI, and machine learning use cases.
We are using several security features in the solution. These include Linux's security implementations and its built-in firewall. We also rely on single sign-on and encryption—at rest and in transit—for sensitive data. It has access, ensuring that not everyone can use every service; for example, some users can access Hive, others Impala, and others hBase, depending on their privileges.
We also use LDAP to track who registers or logs into the cluster. Additionally, we use key nodes to manage firewalls between Cloudera Manager or the Cloudera cluster and other data sources.
View full review »Big data, from the perspective of the end-user, is almost similar to an open-source solution. However, in the case of Cloudera Distribution, we have support from the Cloudera site, which is good. Additionally, we also engage an external company, which acts as a system integrator, to provide partial involvement in the maintenance and installation of Cloudera Distribution for Hadoop implementations. Nonetheless, our team also works with it, not solely as end users but also when they are developing something on this platform.
Buyer's Guide
Cloudera Distribution for Hadoop
July 2025

Learn what your peers think about Cloudera Distribution for Hadoop. Get advice and tips from experienced pros sharing their opinions. Updated: July 2025.
861,803 professionals have used our research since 2012.
The product is completely secure. It meets our protection needs. We have a dedicated on-premise cluster. Every year, the vendor introduces new versions and supports many tools that are available. They have different hosts. They have a private cloud and a public cloud base.
The best part of the tool is that it is able to expand horizontally and vertically when its customer wants to grow the business. The tool can be deployed using different container technologies, which makes it very scalable.
The solution has data managers. You can manage all services from one place in an integrated manner. You don't have to manage the other services separately by Spark, etc.
View full review »The data science aspect of the solution is valuable.
LS
Lotar Schin
Head of Big Data and Analytics Competency center at OTP Bank Hungary
The solution has good features connected to end-to-end security. It's difficult to ensure a safe environment so this is a primary feature and the main reason we use Cloudera is because it's cloud-independent. That's very important because we are considering partially moving our workloads to cloud and not tying ourselves to a given vendor. For instance, if you were to go with Microsoft Azure, it would be impossible to move to another cloud provider if you're not satisfied with their pricing or quality of service.
View full review »Data Lake is mature. Their integration across the Hadoop ecosystem and the interoperability across other distributions is mature. The technical skill set for Cloudera is available. They enhance their roadmap quarterly and provide new features to enhance current functionalities and capabilities. They are capitalizing on their product and have a clear roadmap.
View full review »It offers a pre-build distribution. Even the support is there for the solution, whatever the issue is that you face. And apart from that, they have a platinum best practice to manage tools and all these things.
View full review »The Cloudera Distribution for Hadoop is valuable.
View full review »The feature I found most valuable in Cloudera Distribution for Hadoop is the Cloudera Manager. It's a good component because it makes log management easy. It's really useful as a management and monitoring console.
View full review »The file system is a valuable feature.
View full review »The product provides many APIs to connect with other applications. The product provides better data processing features than other tools.
View full review »The most valuable feature is that I can use CDH for almost all use cases across all industries, including the financial sector, public sector, private retailers, and so on.
View full review »It allows us to store huge amounts of data, which is an advantage.
They have BI (Business Intelligence) tools. There are many AI tools.
We are able to connect and analyze the data to get reports. The reports are very good.
The main advantage is the storage is less expensive.
View full review »We find CDSW useful and plan to use it as a one-stop application for model build and training. Currently, we use Zeppelin notebook and we want to gravitate to a single application for notebooks.
View full review »YM
Yevgen Manzhulyanov
CEO at AM-BITS LLC
It is a good enterprise platform. It is easier and more stable. Additionally, it has the best proxy, security, and support features compared to open-source products.
View full review »AK
AkramKhan
Senior Data Architect Manager at Unifonic
The best feature is the layer shared experience. If you have a cluster available on-prem or in the cloud, you can manage that security layer using the shared SDX and it provides flexibility. New features are constantly being added.
View full review »Cloudera is a very manageable solution with good support.
View full review »The product as a whole is good.
View full review »The most valuable feature is Kubernetes.
View full review »MG
Mohamed Gomaa
Data engineer at a tech services company with 11-50 employees
Cloudera is always developing new tools and supports a wide range of tools. We also really like the Cloudera community. You can have any question and will have your answer within a few hours. Cloudera is better than other competitors because they acquired Hortonworks.
View full review »The most valuable feature is Impala, the querying engine, which is very fast. We have been able to work with one terabyte of data in less than 20 minutes. The speed makes it easy for us to process all of the data that comes in, in time.
The support is very good.
All of the data has automatic triple replication in order to secure integrity.
View full review »The feature that we've used quite intensively is Spark, in how it specifically can speed up some of the data to assist with processing.
View full review »The search function is the most valuable aspect of the solution.
View full review »I like the combination of all the tools that allow me to provide solutions and enable me to solve the use cases I'm working on. You need tools or components to foresee everything, and they are all in our emails. Sometimes you try several of them, and sometimes one will work better than the other. So you have to test the tools to see what works for you.
The features I find most valuable is that the solution is that it is easy to install and to work with. It starts with the installation and from there on the management is very simple and centralized.
View full review »SC
Sumit Chaudhuri
Lead Consultant - Product Development at FIS (http://www.fisglobal.com/)
Keeping multi copies of the file and tools of map reduce like PIG, HIVE due to their flexibility it is easy to develop the application with less or almost no knowledge of Java and Sql. And capability to handle huge data size.
View full review »- Cloudera Manager for administering the Hadoop cluster
- Cloudera specific solutions like Impala
- Extensive documentation
- Good user community
Cloudera Manager is the most valuable feature for it’s ease of use, features, ease of upgrade and install components. CM can also be use to set up high availability within minutes. Others features like Hive, Pig, Impala, Flume and Spark are also valuable.
View full review »Faster runtime for batch jobs.
View full review »The Cloudera Manager administrator webpage simplifies the administration tasks and helps to maintain a global overview of the cluster performance.
View full review »Enterprise resource management, ease of use in terms of integration within the Hadoop ecosystem related products, and security.
View full review »Very solid. Excellent user experience. good documentation. The Cloudera Manager is definitely a deal breaker. Packaging for Ubuntu is great for all the components.
View full review »Mostly HUE, Impala, Sqoop, and Hive. The impala-shell command is number one.
View full review »- Cloudera Manager
- Impala
- Sentry
The features I find most valuable are--
- Enterprise security features (authentication, authorization, data governance, and data protection)
- Proactive support
- Training
The features we've found most valuable are--
- Fast processing of data
- Easy to manipulate using HiveQL
It automates the installation and configuration of Hadoop and different Big Data services.
View full review »The most valuable feature for me are--
- Sentry - provides granular-level security
- Impala - open-source, MPP database
The features most valuable to me are--
- Installation (very easy initial setup)
- Configuration
- Ability to update configuration through UI
- Cluster rolling restarts
- Cluster wide configuration management
The solution's most valuable feature is the enterprise data platform.
View full review »Cloudera, as a whole, is designed to provide organizations with solutions for big data. Cloudera is not one single component. It has many components related to storage, analytics, queries, and processing. All of these components work together to support big data implementation and analytics.
View full review »Buyer's Guide
Cloudera Distribution for Hadoop
July 2025

Learn what your peers think about Cloudera Distribution for Hadoop. Get advice and tips from experienced pros sharing their opinions. Updated: July 2025.
861,803 professionals have used our research since 2012.