
Cloudera Data Platform vs Databricks comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

ROI

Cloudera Data Platform sentiment score: 4.6
Cloudera Data Platform users report improved performance, cost savings, better data access, and easier management, though few give specifics.
Databricks sentiment score: 6.6
Organizations benefit from Databricks' cost-effectiveness and efficiency, though some find immediate gains hard to evaluate in their specific contexts.
There are licensing costs that have been saved when we moved some of the data platforms, decommissioned them, and moved on to this platform.
Data engineer at a tech vendor with 10,001+ employees
In terms of return on investment, I see great changes in operational effectiveness measured by RTO when comparing on-premises solutions with cloud solutions.
Cloud Data Administrator at a financial services firm with 10,001+ employees
A specific example of the positive impact of Cloudera Data Platform is the clearly saved time and improved performance, which is the main result of it.
Data Platform Specialist at Lutech
For a lot of different tasks, including machine learning, it is a nice solution.
Senior Data Engineer at a logistics company with 51-200 employees
When it comes to big data processing, I prefer Databricks over other solutions.
Head CEO at bizmetric
 

Customer Service

Cloudera Data Platform sentiment score: 6.2
Cloudera's support varies: paid service is efficient, standard responses can be delayed, and users report mixed experiences with communication and assistance.
Databricks sentiment score: 7.1
Databricks customer service is praised for responsiveness and expertise, despite occasional delays and communication issues via Microsoft.
I would rate the customer support of Cloudera Data Platform ten out of ten.
Principal Consultant Data Analytics at an outsourcing company with 5,001-10,000 employees
I have communicated with technical support, and they are responsive and helpful.
Data Architect at ubl
Cloudera support is timely and responsive, adhering to the SLAs they provide.
Cloud Data Administrator at a financial services firm with 10,001+ employees
Whenever we reach out, they respond promptly.
Senior Data Engineer at a logistics company with 51-200 employees
As of now, we are raising issues and they are providing solutions without any problems.
Data Platform Architect at KELLANOVA
I rate the technical support as fine because they have levels of technical support available, especially partners who get really good support from Databricks on new features.
Data Engineer at CRAFT Tech
 

Scalability Issues

Cloudera Data Platform sentiment score: 6.3
Cloudera Data Platform is lauded for robust scalability and cloud integration, though some users encounter upgrade challenges and costly on-premises options.
Databricks sentiment score: 7.4
Databricks provides excellent scalability, supporting diverse data sizes and sectors with high-performance cloud infrastructure and cost-effective management.
CDP allows for easy, mostly automated scalability where I can schedule job workflows, fine-tune system resource metrics, and add nodes with just a click.
Cloud Data Administrator at a financial services firm with 10,001+ employees
They have the cloud burst feature available where if the on-premises capacity is not sufficient at a point in time, you can run that Spark job on the cloud itself.
Data engineer at a tech vendor with 10,001+ employees
Integration with other tools works well for us and we successfully scaled the solution after two to three years without any issues.
Data Architect at ubl
The patches have sometimes caused issues leading to our jobs being paused for about six hours.
Senior Data Engineer at a logistics company with 51-200 employees
Databricks is an easily scalable platform.
Data Platform Architect at KELLANOVA
I would rate the scalability of this solution as very high, about nine out of ten.
Data Engineer at CRAFT Tech
 

Stability Issues

Cloudera Data Platform sentiment score: 6.4
Cloudera Data Platform is generally stable, reliable, and capable of handling substantial data, with occasional issues in complex setups.
Databricks sentiment score: 7.7
Databricks is stable and reliable, with high performance and robustness, despite occasional minor issues resolved quickly.
Sometimes the end user is not experienced or does not have all the expertise related to Cloudera specifically, making it very difficult to manage properly.
Data architect at SentientAI, Karachi
Sometimes a node goes down, but it automatically returns to a healthy state.
Cloud Data Administrator at a financial services firm with 10,001+ employees
Cloudera Data Platform is pretty stable in my experience; there are not any downtime or reliability issues.
Data engineer at a tech vendor with 10,001+ employees
They release patches that sometimes break our code.
Senior Data Engineer at a logistics company with 51-200 employees
Although it is too early to definitively state the platform's stability, we have not encountered any issues so far.
Data Platform Architect at KELLANOVA
Databricks is definitely a very stable product and reliable.
Data Engineer at a tech vendor with 1,001-5,000 employees
 

Room For Improvement

Cloudera Data Platform needs improvement in multi-tenancy, governance, UI, cloud integration, performance, and AI support.
Databricks users desire advanced visualization, better integration, enhanced documentation, predictive analytics features, and improved user experience and tools.
We aim to address these issues with a Kubernetes-based platform that will simplify the task of upgrading services.
Senior Architect at a comms service provider with 1,001-5,000 employees
Cloudera Data Platform should include additional capabilities and features similar to those offered by other data management solutions like Azure and Databricks.
Data Architect at ubl
Cloudera Data Platform can be improved by addressing the feasibility of using it in the cloud; there are some complexities around the components used in cloud by Cloudera Data Platform that are not really convenient.
ML Engineer - Director at a financial services firm with 10,001+ employees
Adjusting features like worker nodes and node utilization during cluster creation could mitigate these failures.
Data Engineer at an engineering company with 1,001-5,000 employees
We prefer using a small to mid-sized cluster for many jobs to keep costs low, but this sometimes doesn't support our operations properly.
Senior Data Engineer at a logistics company with 51-200 employees
We use MLflow for managing MLOps; however, further improvement would be beneficial, especially for large language models and related tools.
Solution Architect at Mercedes-Benz AG
 

Setup Cost

Cloudera Data Platform is seen as cost-effective and affordable despite a complex pricing model, though deployment may benefit from professional services.
Databricks' pricing is seen as high for large data volumes but competitive for batch processing on cloud platforms.
Initially, CDH had a straightforward pricing model based on nodes, but CDP includes factors like processors, cores, terabytes, and drives, making it difficult to calculate costs.
Senior Architect at a comms service provider with 1,001-5,000 employees
We find Cloudera Data Platform to be cost-effective.
Cloud Data Administrator at a financial services firm with 10,001+ employees
So far, I would say that it is competitive pricing that we have received.
Data engineer at a tech vendor with 10,001+ employees
It is not a cheap solution.
Data Platform Architect at KELLANOVA
 

Valuable Features

Cloudera Data Platform excels in scalability, integration, and security, offering user-friendly, cost-efficient management of massive data volumes.
Databricks simplifies large-scale analytics with user-friendly UI, powerful integrations, and scalable features for enhanced performance and collaboration.
By using the Hadoop File System for distributed storage, we have 1.5 petabytes of physical storage with 500 terabytes of effective storage due to a replication factor of three.
Senior Architect at a comms service provider with 1,001-5,000 employees
The Ranger integration makes it more flexible and reliable for me by allowing control over data access, specifying who can access at what level, such as table level, masking, or data layer level.
Cloud Data Administrator at a financial services firm with 10,001+ employees
What stands out the most in Cloudera Manager is SDX, which provides centralized control for governance, security, and data lineage across multiple sources.
Data Platform Specialist at Lutech
Databricks' capability to process data in parallel enhances data processing speed.
Data Engineer at an engineering company with 1,001-5,000 employees
The platform allows us to leverage cloud advantages effectively, enhancing our AI and ML projects.
Data Platform Architect at KELLANOVA
The Unity Catalog is for data governance, and the Delta Lake is to build the lakehouse.
Data Engineer at CRAFT Tech
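
The storage figures quoted in this section (1.5 petabytes of physical storage, 500 terabytes effective, replication factor of three) follow from a simple division. A minimal Python sketch of that arithmetic, assuming decimal units (1 PB = 1,000 TB) and ignoring any overhead other than replication; the function name is hypothetical and is not part of any Cloudera or Hadoop API:

```python
def effective_storage_tb(physical_tb: float, replication_factor: int) -> float:
    """Return usable capacity when every block is stored replication_factor times."""
    if replication_factor < 1:
        raise ValueError("replication_factor must be at least 1")
    return physical_tb / replication_factor


# Figures from the reviewer quote: 1.5 PB physical, replication factor 3.
physical_tb = 1500.0  # assumption: 1.5 PB expressed as 1,500 decimal TB
print(effective_storage_tb(physical_tb, 3))  # 500.0
```

Real clusters usually reserve additional space for temporary data and operating headroom, so usable capacity in practice may be lower than this estimate.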
 

Categories and Ranking

Cloudera Data Platform
Ranking in Data Management Platforms (DMP): 4th
Average Rating: 7.6
Reviews Sentiment: 5.5
Number of Reviews: 36
Ranking in other categories: Cloud Master Data Management (MDM) (8th), AI Data Analysis (16th)

Databricks
Ranking in Data Management Platforms (DMP): 5th
Average Rating: 8.2
Reviews Sentiment: 7.0
Number of Reviews: 91
Ranking in other categories: Cloud Data Warehouse (9th), Data Science Platforms (1st), Streaming Analytics (1st)
 

Featured Reviews

T Sarwar - PeerSpot reviewer
Data architect at SentientAI, Karachi
Has enabled efficient big data processing and querying but remains complex to manage and configure
Cloudera Data Platform should use fewer tools and remove the complexity between them. It should make it easier for the end user to change the configuration and understand it better. The UI tool for jobs in Cloudera Data Platform can be improved to provide a proper image of ETL jobs and detailed consolidated graphs to monitor Spark-based Hue jobs.
ShubhamSharma7 - PeerSpot reviewer
Data Engineer at an engineering company with 1,001-5,000 employees
Capability to integrate diverse coding languages in a single notebook greatly enhances workflow
Databricks offers various languages that I can use, whether it's PySpark, Scala, or R. I can leverage all these languages in a single notebook, which is beneficial for clients as they can access various tools in one place whenever needed. This is quite significant. I usually work with PySpark based on client requirements. After coding, I feed the Databricks notebooks into the ADF pipeline for updates. Databricks' capability to process data in parallel enhances data processing speed. Furthermore, I can connect our Databricks notebook directly with Power BI and other visualization tools like Qlik. Once we develop code, it allows us to transform raw data into visualizations for clients using analysis diagrams, which is very helpful.
879,310 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews

Cloudera Data Platform
Performing Arts: 10%
Manufacturing Company: 8%
Financial Services Firm: 8%
Transportation Company: 7%

Databricks
Financial Services Firm: 18%
Computer Software Company: 9%
Manufacturing Company: 9%
Healthcare Company: 6%
 

Company Size

By reviewers

Cloudera Data Platform
Company Size: Count
Small Business: 8
Midsize Enterprise: 7
Large Enterprise: 25

Databricks
Company Size: Count
Small Business: 25
Midsize Enterprise: 12
Large Enterprise: 56
 

Questions from the Community

What is your experience regarding pricing and costs for Hortonworks Data Platform?
The experience with pricing, setup cost, and licensing is very good.
What needs improvement with Hortonworks Data Platform?
Cloudera Data Platform can be improved in several areas. I recently attended their roadmap session. Whatever limitations they have identified involve moving data from on-premises to cloud as a sing...
What is your primary use case for Hortonworks Data Platform?
My main use case for Cloudera Data Platform is dealing with large volumes of data and primarily handling unstructured data by combining structured and unstructured data on this platform. I use Clou...
Which do you prefer - Databricks or Azure Machine Learning Studio?
Databricks gives you the option of working with several different languages, such as SQL, R, Scala, Apache Spark, or Python. It offers many different cluster choices and excellent integration with ...
How would you compare Databricks vs Amazon SageMaker?
We researched AWS SageMaker, but in the end, we chose Databricks. Databricks is a Unified Analytics Platform designed to accelerate innovation projects. It is based on Spark so it is very fast. It...
Which would you choose - Databricks or Azure Stream Analytics?
Databricks is an easy-to-set-up and versatile tool for data management, analysis, and business analytics. For analytics teams that have to interpret data to further the business goals of their orga...
 

Also Known As

Cloudera Data Platform: No data available
Databricks: Databricks Unified Analytics, Databricks Unified Analytics Platform, Redash
 


Sample Customers

Cloudera Data Platform: Information Not Available
Databricks: Elsevier, MyFitnessPal, Sharethrough, Automatic Labs, Celtra, Radius Intelligence, Yesware
Find out what your peers are saying about Cloudera Data Platform vs. Databricks and other solutions. Updated: December 2025.