

Find out in this report how the two Cloud Data Warehouse solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI.
When it comes to big data processing, I prefer Databricks over other solutions.
For a lot of different tasks, including machine learning, it is a nice solution.
It's not structured support, which is why we don't use purely open-source projects without additional structured support.
As of now, we are raising issues and they are providing solutions without any problems.
Whenever we reach out, they respond promptly.
I rate the technical support as fine because they have levels of technical support available, especially partners who get really good support from Databricks on new features.
It is a distributed file system and scales reasonably well as long as it is given sufficient resources.
Databricks is an easily scalable platform.
I would rate the scalability of this solution as very high, about nine out of ten.
The patches have sometimes caused issues leading to our jobs being paused for about six hours.
Continuous management in the way of upgrades and technical management is necessary to ensure that it remains effective.
Although it is too early to definitively state the platform's stability, we have not encountered any issues so far.
They release patches that sometimes break our code.
Databricks is definitely a very stable product and reliable.
The problem with Apache Hadoop arose when the guys that originally set it up left the firm, and the group that later owned it didn't have enough technical resources to properly maintain it.
We could use their job clusters, however, that increases costs, which is challenging for us as a startup.
This feature, if made publicly available, may act as a game-changer, considering many global organizations use SAP data for their ERP requirements.
If I could right-click to copy absolute paths or to read files directly into a data frame, it would standardize and simplify the process.
It is not a cheap solution.
Hadoop is a distributed file system, and it scales reasonably well provided you give it sufficient resources.
Apache Hadoop helps us in cases of hardware failure because it works 24/7, and sometimes servers crash in the field.
The Unity Catalog is for data governance, and the Delta Lake is to build the lakehouse.
The platform allows us to leverage cloud advantages effectively, enhancing our AI and ML projects.
Databricks' capability to process data in parallel enhances data processing speed.




| Company Size | Count | 
|---|---|
| Small Business | 14 | 
| Midsize Enterprise | 8 | 
| Large Enterprise | 21 | 
| Company Size | Count | 
|---|---|
| Small Business | 25 | 
| Midsize Enterprise | 12 | 
| Large Enterprise | 56 | 










Databricks offers a scalable, versatile platform that integrates seamlessly with Spark and multiple languages, supporting data engineering, machine learning, and analytics in a unified environment.
Databricks stands out for its scalability, ease of use, and powerful integration with Spark, multiple languages, and leading cloud services like Azure and AWS. It provides tools such as the Notebook for collaboration, Delta Lake for efficient data management, and Unity Catalog for data governance. While enhancing data engineering and machine learning workflows, it faces challenges in visualization and third-party integration, with pricing and user interface navigation being common concerns. Despite needing improvements in connectivity and documentation, it remains popular for tasks like real-time processing and data pipeline management.
What features make Databricks unique?In the tech industry, Databricks empowers teams to perform comprehensive data analytics, enabling them to conduct extensive ETL operations, run predictive modeling, and prepare data for SparkML. In retail, it supports real-time data processing and batch streaming, aiding in better decision-making. Enterprises across sectors leverage its capabilities for creating secure APIs and managing data lakes effectively.
We monitor all Cloud Data Warehouse reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.