

Find out in this report how the two AI Data Analysis solutions compare in terms of features, pricing, service and support, ease of deployment, and ROI.
We saved licensing costs when we decommissioned some of our data platforms and moved to this platform.
In terms of return on investment, I see great changes in operational effectiveness measured by RTO when comparing on-premises solutions with cloud solutions.
The clearest positive impact of Cloudera Data Platform is the time saved and the improved performance.
Using Cohesity DataProtect is easier to manage, and it simplifies various components into one architecture, reducing the need for extensive human resources to manage backups.
I would rate the customer support of Cloudera Data Platform ten out of ten.
I have communicated with technical support, and they are responsive and helpful.
Cloudera support is timely and responsive, adhering to the SLAs they provide.
The support can depend on the region, and for larger customers, I advise having a Technical Account Manager for better assistance.
For the support, I can provide a rating of four only because they initially provide some steps, but later say they are not sure, which is a problem in a production environment.
CDP allows for easy, mostly automated scalability where I can schedule job workflows, fine-tune system resource metrics, and add nodes with just a click.
They have the cloud burst feature available where if the on-premises capacity is not sufficient at a point in time, you can run that Spark job on the cloud itself.
The ability to scale processing capacity on demand for batch jobs without impacting other workloads, and support for a growing number of concurrent users and teams accessing the platform simultaneously are significant advantages.
Cohesity DataProtect is built on a scale-out architecture, which means it can effectively scale to meet various needs.
Sometimes the end user is not experienced or does not have the expertise specific to Cloudera, which makes the platform very difficult to manage properly.
Sometimes a node goes down, but it automatically returns to a healthy state.
Cloudera Data Platform is pretty stable in my experience; there are not any downtime or reliability issues.
On the whole, any problems were more related to hardware limitations rather than issues with Cohesity DataProtect itself.
We aim to address these issues with a Kubernetes-based platform that will simplify the task of upgrading services.
Cloudera Data Platform should include additional capabilities and features similar to those offered by other data management solutions like Azure and Databricks.
Cloudera Data Platform can be improved by addressing the feasibility of using it in the cloud; there are some complexities around the components used in cloud by Cloudera Data Platform that are not really convenient.
While there are improvements to be made, such as providing support for older systems like IBM iSeries and Tandem systems from HP, the solution overall shifts from older methods to modern practices.
There is room to improve the user interface of Cohesity DataProtect for more intuitive navigation.
Initially, CDH had a straightforward pricing model based on nodes, but CDP includes factors like processors, cores, terabytes, and drives, making it difficult to calculate costs.
We find Cloudera Data Platform to be cost-effective.
So far, I would say that it is competitive pricing that we have received.
I find Cohesity DataProtect to be expensive.
By using the Hadoop Distributed File System (HDFS) for distributed storage, we have 1.5 petabytes of physical storage with 500 terabytes of effective storage due to a replication factor of three.
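The storage figures in the quote above follow from simple arithmetic: with a replication factor of three, every block is stored three times, so usable capacity is roughly the raw capacity divided by three. A minimal sketch of that calculation, assuming decimal units (1 PB = 1,000 TB) and ignoring any other overhead; the function name `effective_capacity_tb` is hypothetical and used only for illustration:

```python
def effective_capacity_tb(raw_tb: float, replication_factor: int) -> float:
    """Usable capacity when every block is stored `replication_factor` times."""
    if replication_factor < 1:
        raise ValueError("replication factor must be at least 1")
    return raw_tb / replication_factor


# 1.5 PB of physical storage, expressed in decimal terabytes (assumption).
raw_tb = 1500
print(effective_capacity_tb(raw_tb, 3))  # 500.0
```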
The Ranger integration makes it more flexible and reliable for me by allowing control over data access, specifying who can access at what level, such as table level, masking, or data layer level.
What stands out the most in Cloudera Manager is SDX, which provides centralized control for governance, security, and data lineage across multiple sources.
The platform is based on a scale-out architecture with each node having compute, RAM, SSD, and HDD.
Global deduplication ensures that only unique data blocks are stored, significantly reducing storage consumption.
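To illustrate the general idea behind the deduplication described above, here is a toy sketch of a content-addressed block store. It is not Cohesity's implementation: the `DedupStore` class, the fixed 4-byte block size, and the use of SHA-256 digests as block identifiers are all assumptions made for illustration only.

```python
import hashlib


class DedupStore:
    """Toy block store that keeps each unique block only once (illustrative only)."""

    def __init__(self, block_size: int = 4):
        self.block_size = block_size
        self.blocks = {}  # maps block digest -> block bytes

    def write(self, data: bytes) -> list:
        """Split data into fixed-size blocks and return the list of block digests."""
        recipe = []
        for i in range(0, len(data), self.block_size):
            block = data[i:i + self.block_size]
            digest = hashlib.sha256(block).hexdigest()
            # Store the block only if this content has not been seen before.
            self.blocks.setdefault(digest, block)
            recipe.append(digest)
        return recipe

    def read(self, recipe: list) -> bytes:
        """Rebuild the original data from its list of block digests."""
        return b"".join(self.blocks[d] for d in recipe)

    def stored_bytes(self) -> int:
        """Total bytes physically held after deduplication."""
        return sum(len(b) for b in self.blocks.values())


store = DedupStore(block_size=4)
first = store.write(b"AAAABBBBAAAA")   # 12 logical bytes
second = store.write(b"BBBBCCCC")      # 8 logical bytes
print(store.stored_bytes())            # 12 bytes stored for 20 logical bytes
```

In this sketch, two writes totalling 20 logical bytes occupy 12 bytes, because the repeated `AAAA` and `BBBB` blocks are stored once and shared across both writes.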
Some of the most valuable features of Cohesity DataProtect for me include instant mass restore, anomaly detection, and its ability to handle large data volumes effectively.

| Product | Market Share |
|---|---|
| Cohesity DataProtect | 0.5% |
| Cloudera Data Platform | 0.7% |
| Other | 98.8% |

| Company Size | Count |
|---|---|
| Small Business | 8 |
| Midsize Enterprise | 7 |
| Large Enterprise | 26 |

| Company Size | Count |
|---|---|
| Small Business | 19 |
| Midsize Enterprise | 22 |
| Large Enterprise | 43 |

Cloudera Data Platform offers a powerful fusion of Hadoop technology and user-centric tools, enabling seamless scalability and open-source flexibility. It supports large-scale data operations with tools like Ranger and Cloudera Data Science Workbench, offering efficient cluster management and containerization capabilities.
Designed to support extensive data needs, Cloudera Data Platform encompasses a comprehensive Hadoop stack, which includes HDFS, Hive, and Spark. Its integration with Ambari provides user-friendliness in management and configuration. Despite its strengths in scalability and security, Cloudera Data Platform requires enhancements in multi-tenant implementation, governance, and UI, while attribute-level encryption and better HDFS namenode support are also needed. Stability, especially regarding the Hue UI, financial costs, and disaster recovery are notable challenges. Additionally, integration with cloud storage and deployment methods could be more intuitive to enhance user experience, along with more effective support and community engagement.
Cloudera Data Platform is implemented extensively across industries like hospitality for data science activities, including managing historical data. Its adaptability extends to operational analytics for sectors like oil & gas, finance, and healthcare, often enhanced by Hortonworks Data Platform for data ingestion and analytics tasks.
What is Cohesity DataProtect?
Cohesity DataProtect is a sophisticated, software-defined backup and recovery solution created for cloud environments. Cohesity DataProtect is built to hyperscale and is one of the most thorough policy-based protection solutions available on the market today.
Cohesity DataProtect consolidates multiple point products into a single software platform that can be deployed on-premises or consumed as a service.
Top Features:
Hyperscale made easy: Cohesity DataProtect improves data protection by doing away with the need for backup silos and administers backup and recovery with a single user-friendly, easy-to-understand interface. Cohesity DataProtect provides extensive 24/7 enterprise-class protection for a large, varying set of sources, including virtual and physical servers, NAS and SaaS workloads, relational and distributed databases, and traditional and containerized applications.
Super fast recovery: Cohesity DataProtect offers near-zero recovery point objectives (RPOs) and near-instant recovery time objectives (RTOs) to satisfy your business service-level agreements (SLAs). Using Cohesity Helios’ unified data plane and control plane, you can immediately search and recover data on any Cohesity cluster, located anywhere. DataProtect minimizes downtime by instantly mass restoring any number of virtual machines (VMs) to any point in time, and lowers data protection costs by as much as 70%.
Backup as a Service: Take advantage of the elasticity of the public cloud and the cost-effective value of Cohesity DataProtect when delivered as a service. When you choose an OPEX option, you can eliminate the need for on-premises hardware. A SaaS option lets you configure your backup workloads simply and immediately begin protecting your critical data and applications.
It’s as easy as: sign-up, connect, and protect!
Ransomware protection: Cohesity DataProtect offers recovery at scale. The solution includes machine learning-based anomaly detection to help stop incidents before they happen. Cohesity DataProtect can provide immutable backups, DataLock write once, read many (WORM) protection, encryption, and role-based access control (RBAC).
Cohesity DataProtect integrates well with many of today’s top solutions.
Reviews from Real Users
One reviewer, who is head of IT infrastructure at Kampmann, says, “Cohesity really is a Next-Gen Data Management Software,” and goes on to indicate that Cohesity “works flawlessly with easy replication of backups and numerous supported backup sources.”
Another user, who is a senior network engineer at a legal firm, relates that "Cohesity is a robust and feature-rich solution."
We monitor all AI Data Analysis reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.