Find out in this report how the two Cloud Data Warehouse solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI.
Our stakeholders and clients have expressed satisfaction with Azure Data Factory's efficiency and cost-effectiveness.
For a lot of different tasks, including machine learning, it is a nice solution.
When it comes to big data processing, I prefer Databricks over other solutions.
The technical support from Microsoft is rated an eight out of ten.
The technical support is responsive and helpful
The technical support for Azure Data Factory is generally acceptable.
Whenever we reach out, they respond promptly.
As of now, we are raising issues and they are providing solutions without any problems.
I rate the technical support as fine because they have levels of technical support available, especially partners who get really good support from Databricks on new features.
Azure Data Factory is highly scalable.
The patches have sometimes caused issues leading to our jobs being paused for about six hours.
Databricks is an easily scalable platform.
I would rate the scalability of this solution as very high, about nine out of ten.
The solution has a high level of stability, roughly a nine out of ten.
They release patches that sometimes break our code.
Although it is too early to definitively state the platform's stability, we have not encountered any issues so far.
Databricks is definitely a very stable product and reliable.
Incorporating more dedicated API sources to specific services like HubSpot CRM or Salesforce would be beneficial.
Sometimes, the compute fails to process data if there is a heavy load suddenly, and it doesn't scale up automatically.
There is a problem with the integration with third-party solutions, particularly with SAP.
Adjusting features like worker nodes and node utilization during cluster creation could mitigate these failures.
We prefer using a small to mid-sized cluster for many jobs to keep costs low, but this sometimes doesn't support our operations properly.
We use MLflow for managing MLOps, however, further improvement would be beneficial, especially for large language models and related tools.
The pricing is cost-effective.
It is considered cost-effective.
It is not a cheap solution.
It connects to different sources out-of-the-box, making integration much easier.
The platform excels in handling major datasets, particularly when working with Power BI for reporting purposes.
Regarding the integration feature in Azure Data Factory, the integration part is excellent; we have major source connectors, so we can integrate the data from different data sources and also perform basic transformation while transforming, which is a great feature in Azure Data Factory.
Databricks' capability to process data in parallel enhances data processing speed.
The platform allows us to leverage cloud advantages effectively, enhancing our AI and ML projects.
The Unity Catalog is for data governance, and the Delta Lake is to build the lakehouse.
Product | Market Share (%) |
---|---|
Azure Data Factory | 6.8% |
Databricks | 8.3% |
Other | 84.9% |
Company Size | Count |
---|---|
Small Business | 31 |
Midsize Enterprise | 19 |
Large Enterprise | 55 |
Company Size | Count |
---|---|
Small Business | 25 |
Midsize Enterprise | 12 |
Large Enterprise | 56 |
Azure Data Factory efficiently manages and integrates data from various sources, enabling seamless movement and transformation across platforms. Its valuable features include seamless integration with Azure services, handling large data volumes, flexible transformation, user-friendly interface, extensive connectors, and scalability. Users have experienced improved team performance, workflow simplification, enhanced collaboration, streamlined processes, and boosted productivity.
Databricks offers a scalable, versatile platform that integrates seamlessly with Spark and multiple languages, supporting data engineering, machine learning, and analytics in a unified environment.
Databricks stands out for its scalability, ease of use, and powerful integration with Spark, multiple languages, and leading cloud services like Azure and AWS. It provides tools such as the Notebook for collaboration, Delta Lake for efficient data management, and Unity Catalog for data governance. While enhancing data engineering and machine learning workflows, it faces challenges in visualization and third-party integration, with pricing and user interface navigation being common concerns. Despite needing improvements in connectivity and documentation, it remains popular for tasks like real-time processing and data pipeline management.
What features make Databricks unique?In the tech industry, Databricks empowers teams to perform comprehensive data analytics, enabling them to conduct extensive ETL operations, run predictive modeling, and prepare data for SparkML. In retail, it supports real-time data processing and batch streaming, aiding in better decision-making. Enterprises across sectors leverage its capabilities for creating secure APIs and managing data lakes effectively.
We monitor all Cloud Data Warehouse reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.