Cloudera Distribution for Hadoop is valued for its easy installation and management, comprehensive security features like Sentry and encryption, and excellent data processing capabilities with Impala and Cloudera Manager. It supports large-scale data management with tools like Hive, Pig, Spark, and integrates smoothly across different environments. Users appreciate its scalability, extensive documentation, proactive support, and efficient role-based access control, enabling effective management of big data with robust analytics and machine learning capabilities.
- "The most valuable feature is that I can use CDH for almost all use cases across all industries, including the financial sector, public sector, private retailers, and so on."
- "We're now able to store large volumes of data through Cloudera Distribution for Hadoop, and we're able to push large volumes of data to the platform, which used to be a challenge, especially when storing a terabyte of information."
- "For enterprise organizations that can bear the cost, it's a good solution."
Cloudera Distribution for Hadoop has room for improvement in multiple areas such as stability, processing speed, and integration capabilities. Users find the licensing structure expensive and suggest enhancements to training materials and support. They report challenges with documentation, deployment complexity, and user interface. Additionally, better cloud integration, data science support, and price adjustments are recommended to make it more competitive. Issues with data compatibility and a lack of certain features also pose difficulties for many organizations.
- "Cloudera's support is extremely bad and cannot be relied on."
- "Cloudera Distribution for Hadoop has a limited feature list and a lot of costs involved."
- "The only thing that needs improvement is the cost, it's a very expensive solution and one of the main reasons companies are not attracted to the product."