

Databricks and Darwin compete in the data analytics and machine learning platform category. Databricks has the upper hand with its versatility and scalability, while Darwin excels in automated model generation.
Features: Databricks offers robust capabilities such as Delta data format optimization, collaborative notebooks, and efficient machine learning libraries. It supports multiple programming languages and integrates with Azure Machine Learning. Darwin provides automated model generation, streamlining the process for non-data scientists to build and iterate models efficiently.
Room for Improvement: Databricks could enhance its visualization capabilities and deeper integration with Power BI and Tableau. Its pricing is high, and integration with data sources could be improved. Darwin's dashboards need to be more user-friendly for broader use, and its ability to handle unsupervised models requires enhancement.
Ease of Deployment and Customer Service: Databricks supports various deployment options, including public, private, and hybrid clouds but faces scalability challenges. Its customer service has mixed reviews. Darwin's documentation reduces the need for technical support but has limited deployment flexibility, primarily operating on public cloud environments.
Pricing and ROI: Databricks' pay-per-use model can be expensive but offers ease of use and scalability, positively impacting ROI. Darwin is considered cost-effective, especially compared to hiring data scientists, providing value through streamlined model development and deployment while typically only incurring licensing costs.
When it comes to big data processing, I prefer Databricks over other solutions.
For a lot of different tasks, including machine learning, it is a nice solution.
Whenever we reach out, they respond promptly.
As of now, we are raising issues and they are providing solutions without any problems.
I rate the technical support as fine because they have levels of technical support available, especially partners who get really good support from Databricks on new features.
The patches have sometimes caused issues leading to our jobs being paused for about six hours.
I would rate the scalability of this solution as very high, about nine out of ten.
Databricks is an easily scalable platform.
They release patches that sometimes break our code.
Databricks is definitely a very stable product and reliable.
Although it is too early to definitively state the platform's stability, we have not encountered any issues so far.
We use MLflow for managing MLOps, however, further improvement would be beneficial, especially for large language models and related tools.
They're now coming up with their IBI dashboard, and I think they're on the right track to improve that even further.
It would be beneficial to have utilities where code snippets are readily available.
It is not a cheap solution.
The Unity Catalog is for data governance, and the Delta Lake is to build the lakehouse.
Databricks' capability to process data in parallel enhances data processing speed.
The platform allows us to leverage cloud advantages effectively, enhancing our AI and ML projects.
| Product | Market Share (%) |
|---|---|
| Databricks | 13.9% |
| Darwin | 0.5% |
| Other | 85.6% |

| Company Size | Count |
|---|---|
| Small Business | 6 |
| Large Enterprise | 2 |
| Company Size | Count |
|---|---|
| Small Business | 25 |
| Midsize Enterprise | 12 |
| Large Enterprise | 56 |
SparkCognition builds leading artificial intelligence solutions to advance the most important interests of society. We help customers analyze complex data, empower decision making, and transform human and industrial productivity with award-winning machine learning technology and expert teams focused on defense, IIoT, and finance.
Databricks offers a scalable, versatile platform that integrates seamlessly with Spark and multiple languages, supporting data engineering, machine learning, and analytics in a unified environment.
Databricks stands out for its scalability, ease of use, and powerful integration with Spark, multiple languages, and leading cloud services like Azure and AWS. It provides tools such as the Notebook for collaboration, Delta Lake for efficient data management, and Unity Catalog for data governance. While enhancing data engineering and machine learning workflows, it faces challenges in visualization and third-party integration, with pricing and user interface navigation being common concerns. Despite needing improvements in connectivity and documentation, it remains popular for tasks like real-time processing and data pipeline management.
What features make Databricks unique?In the tech industry, Databricks empowers teams to perform comprehensive data analytics, enabling them to conduct extensive ETL operations, run predictive modeling, and prepare data for SparkML. In retail, it supports real-time data processing and batch streaming, aiding in better decision-making. Enterprises across sectors leverage its capabilities for creating secure APIs and managing data lakes effectively.
We monitor all Data Science Platforms reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.