Databricks Reviews

Name: Databricks
Brand: Databricks
Rating: 4.1 (94 reviews)

Vendor: Databricks

4.1 out of 5

94 reviews
96% willing to recommend

Leave a review

What is Databricks?

Databricks offers a scalable, versatile platform that integrates seamlessly with Spark and multiple languages, supporting data engineering, machine learning, and analytics in a unified environment.

Get the Databricks Buyer's Guide and find out what your peers are saying about Databricks, Informatica Intelligent Data Management Cloud (IDMC), Teradata and more!

Databricks is the #1 ranked solution in top Data Science Platforms, #1 ranked solution in Streaming Analytics tools, #4 ranked solution in top Cloud Data Warehouse solutions, and #5 ranked solution in top Data Management Platforms. PeerSpot users give Databricks an average rating of 8.2 out of 10. Databricks is most commonly compared to Informatica Intelligent Data Management Cloud (IDMC): Databricks vs Informatica Intelligent Data Management Cloud (IDMC). Databricks is popular among the large enterprise segment, accounting for 64% of users researching this solution on PeerSpot. The top industry researching this solution are professionals from a financial services firm, accounting for 18% of all views.

Helped 900,644 peers since 2012

Featured Databricks reviews

SimonRobinson

Governance And Engagement Lead

I believe we could improve Databricks integration with cloud service providers. The impact of our current integration has not been particularly good, and it's becoming very expensive for us. The inefficiencies in our implementation, such as not shutting down warehouses when they're not in use or reserving the right number of credits, have led to increased costs. We made several beginner mistakes, such as not taking advantage of incremental loading and running overly complicated queries all the time. We should be using ETL tools to help us instead of doing it directly in Databricks. We need more experienced professionals to manage Databricks effectively, as it's not as forgiving as other platforms such as Snowflake. I think introducing customer repositories would facilitate easier implementation with Databricks.

Read full review

Satyam Wagh

Consultant at Nice Software Solutions

Databricks already provides monthly updates and continuously works on delivering new features while enhancing existing ones. However, the platform could become easier to use. While instruction-led workshops are available, offering more free instructional workshops would allow a wider audience to access and learn about Databricks. Additionally, providing use cases would help beginners gain more knowledge and hands-on experience. Regarding my experience, I was initially unfamiliar with the platform and had to conduct research and learn through various videos. I did find some instruction-led classes, but several of those required payment. The platform should provide more free resources to enable a broader audience to access and learn about Databricks. The platform itself is user-friendly and easy to use without complex issues, so I believe it does not need improvement in its core functionality. Rather, supporting aspects can be enhanced.

Read full review

ShubhamSharma7

Data Engineer at a engineering company with 1,001-5,000 employees

Databricks offers various courses that I can use, whether it's PySpark, Scala, or R. I can leverage all these courses in a single notebook, which is beneficial for clients as they can access various tools in one place whenever needed. This is quite significant. I usually work with PySpark based on client requirements. After coding, I feed the Databricks notebooks into the ADF pipeline for updates. Databricks' capability to process data in parallel enhances data processing speed. Furthermore, I can connect our Databricks notebook directly with Power BI and other visualization tools like Qlik. Once we develop code, it allows us to transform raw data into visualizations for clients using analysis diagrams, which is very helpful.

Read full review

Databricks mindshare

Product category:

As of June 2026, the mindshare of Databricks in the Cloud Data Warehouse category stands at 9.7%, up from 9.1% compared to the previous year, according to calculations based on PeerSpot user engagement data.

Cloud Data Warehouse Mindshare Distribution
Product	Mindshare (%)
Databricks	9.7%
Snowflake	15.1%
Teradata	8.8%
Other	66.4%

Cloud Data Warehouse

PeerResearch reports based on Databricks reviews

Type	Title	Date
Category	Cloud Data Warehouse	Jun 23, 2026	Download
Product	Reviews, tips, and advice from real users	Jun 23, 2026	Download
Comparison	Databricks vs Snowflake	Jun 23, 2026	Download
Comparison	Databricks vs Teradata	Jun 23, 2026	Download
Comparison	Databricks vs BigQuery	Jun 23, 2026	Download

Valuable Features

Databricks excels with its user-friendly interface, scalable Spark clusters, and versatile code environment supporting SQL, Python, and R. Notebooks enhance collaboration and facilitate complex data tasks. The integration of Delta Lake and Unity Catalog adds significant data governance benefits. Its machine learning capabilities and built-in libraries streamline big data management. Databricks' seamless cloud integration provides efficient data processing, making it ideal for data engineering, machine learning, and real-time analytics.

"The features of Databricks have impacted my organization positively, as it has done so very efficiently since we switched from several platforms to using this one."
"Databricks tech support has been great every time I've dealt with them, and their team is highly knowledgeable."
"Databricks has a scalable Spark cluster creation process, and the creators of Databricks are also the creators of Spark, and they are the industry leaders in terms of performance."

Room for Improvement

Databricks can enhance visualization capabilities, provide more machine learning libraries, and improve integration with other platforms. Users seek better UI, clearer communication for updates, and pricing structures. Scalability, data governance, and support for non-technical users could be improved. The tool would benefit from a more user-friendly experience and better integration with SQL and BI tools. More learning resources, including trial versions and free instructional workshops, are desired. Debugging and API deployment processes need refinement.

"I think the aspects of Databricks that should be improved are that it could be faster and that I would like to be able to run direct queries from the server."
"Databricks is still having some stability issues."
"I would like it if Databricks adopted an interface more like R Studio."

ROI

Organizations report cost savings and increased efficiency with Databricks, migrating from more expensive systems like Hadoop and RDBMS. They praise its ease of use, shorter setup times, and ability to scale queries effectively. However, some find it challenging to quantify ROI immediately, especially when transitioning or in nonprofit scenarios. Overall, users experience reduced costs, significant time savings, and easier data management, although some are still assessing its financial impact.

Pricing

Databricks employs a flexible, tiered licensing model with a pay-as-you-go option, making it appealing for cloud deployments, especially on Azure due to competitive pricing. While generally considered cost-effective for batch processing, there are concerns about higher costs for large data volumes, streaming cases, and resource mismanagement. Some users find Databricks expensive, with costs depending on usage and compute volume. Pricing varies widely, from affordable to costly, influenced by data size and processing needs.

"It is an expensive tool. The licensing model is a pay-as-you-go one."
"The product pricing is moderate."
"Databricks are not costly when compared with other solutions' prices."

Popular Use Cases

Databricks is used for large-scale data processing, analytics, and machine learning across industries like retail, manufacturing, and finance. Organizations utilize it for batch and stream data processing, ETL operations, and creating data pipelines. Databricks facilitates data transformation, integrating with Azure and AWS environments, and enhances tasks such as SQL querying, predictive analytics, and AI solution development. Users benefit from its capabilities in managing, preparing, and analyzing data efficiently with scalable resources.

Service and Support

Databricks customer service is well-regarded, with users noting proactive support and helpful resources. Technical support receives mixed feedback; some users experience timely and professional responses, while others mention slower interactions, possibly due to partnerships or missing direct agreements. On average, users rate technical support positively, though some face initial communication issues. Documentation is praised for its clarity, reducing the need for direct support.

Deployment

Databricks's initial setup is generally straightforward due to good documentation and cloud-based deployment. Some find it easy, particularly with Azure integration, though complexity can vary based on business needs or lack of experience. Initial deployment time ranges from minutes to weeks, depending on the project scope and team expertise. Training or third-party assistance improves ease of setup. While most report an easy process, specific configurations or new architectures can increase complexity.

Scalability

Databricks offers massively scalable capabilities, efficiently leveraging cloud advantages, enabling rapid cluster expansion. Teams find it easy to scale horizontally and vertically, despite some cost concerns. Users appreciate its elasticity and support for both large and small workloads with high ratings. Automation features are valued, though some manual settings are required. Organizations consistently describe significant user growth potential and rely on Databricks for managing extensive data sets with scalability marked as a standout feature.

Stability

Databricks is consistently described as stable and reliable. Users generally report minimal disruptions, with occasional minor issues that are typically resolved quickly. Ratings frequently range from seven to nine out of ten for stability. Some mention the necessity for more testing and adaptations during updates or when handling large data volumes, yet the platform is praised for its robust performance, reliability, and powerful capabilities in data processing tasks.

These insights are based on the in-depth reviews provided by peers to help you make a better buying decision.

Download our Databricks Buyer's Guide for additional reliable information.

Review data by company size

By reviewers
Company Size	Count
Small Business	26
Midsize Enterprise	12
Large Enterprise	50

By reviewers

By visitors reading reviews
Company Size	Count
Small Business	681
Midsize Enterprise	382
Large Enterprise	1924

By visitors reading reviews

Top industries

By visitors reading reviews

Financial Services Firm

18%

Manufacturing Company

10%

Computer Software Company

Healthcare Company

Insurance Company

Government

Retailer

Comms Service Provider

Outsourcing Company

University

Energy/Utilities Company

Educational Organization

Construction Company

Media Company

Real Estate/Law Firm

Transportation Company

Wholesaler/Distributor

Pharma/Biotech Company

Non Profit

Hospitality Company

Consumer Goods Company

Performing Arts

Legal Firm

Recreational Facilities/Services Company

Marketing Services Firm

Logistics Company

Aerospace/Defense Firm

Renewables & Environment Company

Compare Databricks with alternative products

Learn more about Databricks

Databricks stands out for its scalability, ease of use, and powerful integration with Spark, multiple languages, and leading cloud services like Azure and AWS. It provides tools such as the Notebook for collaboration, Delta Lake for efficient data management, and Unity Catalog for data governance. While enhancing data engineering and machine learning workflows, it faces challenges in visualization and third-party integration, with pricing and user interface navigation being common concerns. Despite needing improvements in connectivity and documentation, it remains popular for tasks like real-time processing and data pipeline management.

What features make Databricks unique?

Notebook: Enables collaborative work among team members.
Delta Lake: Optimizes data management operations.
Unity Catalog: Provides governance over data assets.
Cloud Integration: Seamlessly connects with major cloud platforms.

What benefits can users expect from Databricks?

Versatility: Supports diverse applications in data science and engineering.
Performance: Delivers efficient handling of large-scale analytics tasks.
Collaboration: Enhances teamwork in data projects.
Unified Environment: Centralizes machine learning and analytics activities.

In the tech industry, Databricks empowers teams to perform comprehensive data analytics, enabling them to conduct extensive ETL operations, run predictive modeling, and prepare data for SparkML. In retail, it supports real-time data processing and batch streaming, aiding in better decision-making. Enterprises across sectors leverage its capabilities for creating secure APIs and managing data lakes effectively.

Databricks was previously known as Databricks Unified Analytics, Databricks Unified Analytics Platform, Redash.

Databricks customers

Elsevier, MyFitnessPal, Sharethrough, Automatic Labs, Celtra, Radius Intelligence, Yesware

Related questions

Which do you prefer - Databricks or Azure Machine Learning Studio?

How would you compare Databricks vs Amazon SageMaker?

Which would you choose - Databricks or Azure Stream Analytics?

Which product would you choose for a data science team: Databricks vs Dataiku?

Which ETL or Data Integration tool goes the best with Amazon Redshift?

What are the main differences between Data Lake and Data Warehouse?

What are the benefits of having separate layers or a dedicated schema for each layer in ETL?

What are the key reasons for choosing Snowflake as a data lake over other data lake solutions?

Are there any general guidelines to allocate table space quota to different layers in ETL?

What cloud data warehouse solution do you recommend?

Databricks Reviews Summary
Author info	Rating	Review Summary
Governance And Engagement Lead	3.5	I value Databricks' Unity Catalog and Python for data export. However, our implementation is expensive and inefficient due to beginner mistakes, lack of experienced staff, and poor cloud integration. We haven't seen ROI yet despite the platform's potential.
Consultant at Nice Software Solutions	4.5	I found Databricks excellent for ETL pipelines and AI integration, drastically reducing processing time and costs with features like Unity Catalog. I recommend it, though more free learning resources for beginners would be beneficial.
Data Engineer at a engineering company with 1,001-5,000 employees	4.0	As a data engineer, I've used Databricks on Azure for three years, valuing its multilingual notebooks and parallel processing. However, frequent cluster failures are a significant weakness I've encountered, despite its cost-saving benefits.
Data Platform Architect at KELLANOVA	3.5	I use Databricks on AWS for AI/ML, finding its Unified Catalog, serverless computing, and scalability valuable. While initial setup was complex and pricing is high, it's an advanced platform I recommend for analytics, rating it 7-8/10.
Head CEO at bizmetric	4.5	I use Databricks as an excellent, all-encompassing platform for end-to-end ML and data engineering, leveraging features like Unity Catalog and MLflow. While it offers great scalability and cost-effectiveness, DBFS file path handling could be improved with better utilities.
Senior Data Engineer at a logistics company with 51-200 employees	3.5	I find Databricks a solid, all-in-one solution with good support, but managing costs with job clusters and dealing with disruptive patches and challenging migrations are significant issues, affecting operational stability.
Solution Architect at Mercedes-Benz AG	4.5	I find Databricks valuable for its collaborative notebooks and cost-saving features, especially for ETL and analytics. While API and model deployment could improve, particularly for LLMs, I rate it highly and haven't faced scalability issues.
Analista	5.0	I use Databricks primarily for SQL queries, finding its web-based, secure nature highly efficient and beneficial for collaboration. While I'd like it to be faster and support direct server queries, I appreciate its stability, scalability, and excellent support, leading to great ROI by saving time.
Data Engineer at a tech vendor with 1,001-5,000 employees	4.5	I use Databricks for stable, high big data processing, finding it more cost-effective and performant than AWS alternatives. Though very reliable and rated 9/10, I still hope for further cost reductions.
Data Engineer at CRAFT Tech	4.0	I highly recommend Databricks, finding it excellent for data lakehouses and AI/ML, especially with Delta Lake and Unity Catalog. Its stability, scalability, and easy setup are great, though dashboards need improvement. Overall, it's a top solution.

SimonRobinson

Governance And Engagement Lead

Jan 12, 2026

Improved data governance has enabled sensitive data tracking but cost management still needs work

What is our primary use case?

My usual use case for Databricks as an end-user mostly involves exporting data. This typically entails writing directly into a web interface to get the data out, so probably with Python.

What is most valuable?

The most significant benefit Databricks has brought to my company is the Unity Catalog. Previously, with our data warehouse, we weren't able to track where sensitive data was. The Unity Catalog has been a big improvement, even though we haven't gotten the rest right.

The user interface is very useful, especially in writing directly into a web interface.

From my perspective, the ability to export data effectively and use Python within Databricks are key valuable features.

What needs improvement?

We made several beginner mistakes, such as not taking advantage of incremental loading and running overly complicated queries all the time. We should be using ETL tools to help us instead of doing it directly in Databricks. We need more experienced professionals to manage Databricks effectively, as it's not as forgiving as other platforms such as Snowflake.

I think introducing customer repositories would facilitate easier implementation with Databricks.

For how long have I used the solution?

I have been working with Databricks for the last six months.

What do I think about the stability of the solution?

As a platform, Databricks is fine. However, our implementation isn't particularly reliable.

We've suffered from the lack of professionals with previous experience, which makes it difficult to dig ourselves out of the situation we've found ourselves in.

What do I think about the scalability of the solution?

The scalability level of Databricks at the moment exceeds our needs. It's not a problem for us.

The sky's the limit with Databricks.

How are customer service and support?

I have addressed technical support about our issue with Databricks. It was the team that engaged with them, and I believe our development teams also reach out for support, though I'm not sure what level of support they get.

Previously, when using Snowflake, we had customer reps who were really knowledgeable and helped us to avoid beginner mistakes. With Databricks, it seems we could have benefited from similar support. Our implementation team had no experience and made obvious mistakes. It may be that we opted not to have that support, but I believe we should have.

How would you rate customer service and support?

Positive

Which solution did I use previously and why did I switch?

Before Databricks, I used SQL Server.

The big decision to switch from SQL Server to Databricks was motivated by the lack of auditing, lineage, and tracking sensitive data in SQL Server, along with a need for more flexibility.

How was the initial setup?

I did not participate in the initial setup of Databricks.

What about the implementation team?

We use a consultancy, Avanade, for our Databricks implementation. They had previously done a Databricks implementation for another part of our organization. Our implementation team lacked experience which resulted in several beginner mistakes.

What was our ROI?

So far, we're not measuring any return on investment, such as saving time, money, or resources with Databricks. We're still in the phase where our old system and the new system are running simultaneously, so everything is twice as expensive and much effort is doubled. We haven't progressed far enough yet to realize any ROI.

What's my experience with pricing, setup cost, and licensing?

I believe that in terms of credits for Databricks, we're spending between £15,000 and £20,000 a month.

I think Databricks is priced correctly. If we managed our resources better, we wouldn't be paying anywhere near that amount. The issue is with our management of resources.

Which other solutions did I evaluate?

No other options were considered because we used the consultancy Avanade, who had done a previous Databricks implementation for another part of our organization. We used them to recreate our implementation.

What other advice do I have?

I'm probably not the best person to discuss certain aspects of Databricks since I haven't explored it deeply and am not part of the team developing it.

We haven't utilized Databricks' machine learning capabilities.

From my company, data ingestion and transformation are done with Databricks, though I don't do it directly.

I don't use Databricks' features for managing data, such as data lake and warehouse operations.

Most of our current work with Databricks isn't really live yet, so measuring savings in time and money or identifying any return on investment isn't applicable right now.

I would rate this review a 7 overall.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Microsoft Azure

Satyam Wagh

Consultant at Nice Software Solutions

Feb 3, 2026

Unified data workflows have cut ticket processing times and are driving faster business insights

What is our primary use case?

My main use case for Databricks involves the pipelines and ETL processes that we are implementing. Following the Medallion architecture with Gold, Silver, and Bronze layers, we filter the data, perform transformations, and integrate AI. Databricks has made this process significantly easier.

I worked for an airline company where they experienced substantial delays in data processing. When a passenger booked a ticket, it took 20 to 25 minutes for the transaction to reflect in the system. Using Databricks, we compressed that time from 10 to 6 minutes initially and eventually reduced it to just a few seconds. After setting up all the pipelines and leveraging Databricks features to enhance and accelerate the process, this project became truly impactful and time-based, resulting in reduced processing time and ultimately increased profit for the airline company.

What is most valuable?

The best features Databricks offers are Unity Catalog, Databricks Workflow, Databricks AI, Agentic AI, and the automated pipelines that utilize AI. The AI models are very easy to create and deploy in just a few seconds. These are helpful and user-friendly tools.

I find myself using Unity Catalog most frequently because it provides a unified governance solution for all data and AI needs on Databricks, offering centralized access control, auditing, lineage, and data discovery capabilities across the platform. The main features include access control, security compliance standard models, built-in auditing, and lineage tracking. Most of my projects have involved integrating Unity Catalog into systems and providing overall security, including a migration project to transition to Unity Catalog.

The platform's unified data intelligence capabilities allow teams to analyze, manage, and activate data at scale, leading to faster time to insights, cleaner data pipelines, and significant savings on infrastructure and engineering efforts. Databricks eliminates data silos, accelerates the time to insight, empowers all data personnel, and provides built-in governance and security. It also supports AI and ML, which is an added advantage in today's AI-driven field.

What needs improvement?

Regarding my experience, I was initially unfamiliar with the platform and had to conduct research and learn through various videos. I did find some instruction-led classes, but several of those required payment. The platform should provide more free resources to enable a broader audience to access and learn about Databricks. The platform itself is user-friendly and easy to use without complex issues, so I believe it does not need improvement in its core functionality. Rather, supporting aspects can be enhanced.

For how long have I used the solution?

I have been working as a data engineer for four years. Initially, I was a software engineer, but my career has progressed as a data engineer over this four-year period.

What was our ROI?

Definitely. As I mentioned regarding my airline project, it was impactful because the cost was reduced by 60 to 70 percent. The company was initially using Azure Blob storage, and in Databricks, the cluster and associated infrastructure were cheaper than other platforms. This reduction in both time and money resulted in real-time impact and significant cost savings.

What other advice do I have?

For advice for others considering Databricks, it is important to start by understanding its place in the data ecosystem and how it fits into your specific needs. Key points to consider include familiarizing yourself with Databricks, learning the basics, starting with data engineering, and incorporating ETL processes. You can then dive deeper into Databricks features such as notebooks, clusters, and jobs. Achieving certification enhances your skills validation. For best practices, it is critical to optimize performance and minimize complexity while continuously learning to stay competitive in the data field. Following these steps will be very beneficial for anyone pursuing a career as a data engineer and Databricks engineer.

Databricks is a truly essential platform for data engineering needs, and I recommend it to anyone looking to advance in the data engineering field. It is a very important platform and tool for every data engineer. I encourage everyone to learn and explore this product and to maximize its potential. I rate this product a 9 out of 10.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Microsoft Azure

ShubhamSharma7

Data Engineer at a engineering company with 1,001-5,000 employees

Jan 3, 2025

Capability to integrate diverse coding languages in a single notebook greatly enhances workflow

What is our primary use case?

I am working as a data engineer at Fractal. On a daily basis, I work on Azure Cloud, and I use Databricks frequently. We have EDF pipelines and utilize Synapse for our daily tasks.

What is most valuable?

I usually work with PySpark based on client requirements. After coding, I feed the Databricks notebooks into the ADF pipeline for updates. Databricks' capability to process data in parallel enhances data processing speed. Furthermore, I can connect our Databricks notebook directly with Power BI and other visualization tools like Qlik. Once we develop code, it allows us to transform raw data into visualizations for clients using analysis diagrams, which is very helpful.

What needs improvement?

As a data engineer, I see cluster failure in our Databricks user databases as a major issue. I am unsure why, however, our flow, typically involving three to four notebooks, sometimes leads to cluster failure. Despite attempts to identify the problem, there are times when the reason remains unclear. Adjusting features like worker nodes and node utilization during cluster creation could mitigate these failures.

For how long have I used the solution?

I have been using the solution for three years now.

What do I think about the stability of the solution?

Cluster failure is one of the biggest weaknesses I notice in our Databricks.

Which solution did I use previously and why did I switch?

Databricks is beneficial for cost-saving since clients I work for transitioned from AWS Cloud to Azure Cloud for this reason.

How was the initial setup?

The initial setup is very straightforward for us.

What's my experience with pricing, setup cost, and licensing?

I am not very aware of the pricing. We use three to four clusters in our project. Increasing the number or size of clusters, such as adding more workers, would result in higher costs. That's why we limit ourselves to four clusters for our business.

Which other solutions did I evaluate?

In terms of cost efficiency, it's very useful because our clients switched from AWS Cloud to Azure Databricks to save costs.

What other advice do I have?

I would rate the overall product eight out of ten.

Everything is probably good as far as I have used it, but there's room for improvement in cluster integration. Enhancing cluster capabilities while keeping costs lower would be beneficial.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Microsoft Azure

Prabhakar Bonam

Data Platform Architect at KELLANOVA

Apr 28, 2025

Cloud platform enables advanced collaboration but new SAP data feature could enhance its capabilities

What is our primary use case?

I am currently working as an IT architect. We have an AWS analytics platform, a cloud-based platform. We use Databricks for our AI/ML requirements and also the Databricks platform. For the past year, we have been using Databricks for our data scientist community to build their apps.

What is most valuable?

Databricks has a Unified Catalog that assists with secured access and governance. Additionally, serverless computing is crucial for our computing needs. Its collaboration features, such as data sharing capabilities, are also outstanding. Overall, the platform allows us to leverage cloud advantages effectively, enhancing our AI and ML projects.

What needs improvement?

I heard that a new feature is being developed for SAP that can bring SAP data directly into the platform for generating reports. This feature, if made publicly available, may act as a game-changer, considering many global organizations use SAP data for their ERP requirements.

For how long have I used the solution?

We have been using Databricks for one year.

What was my experience with deployment of the solution?

We did not experience deployment issues as we have an internal database AWS cloud admin team. We worked directly with Databricks personnel to configure our requirements.

What do I think about the stability of the solution?

Although it is too early to definitively state the platform's stability, we have not encountered any issues so far. However, since we are still in the process of building much on the platform, we are still observing its stability.

What do I think about the scalability of the solution?

Databricks is an easily scalable platform. It leverages the cloud advantage directly, and scalability is one of the great features of the cloud platform.

How are customer service and support?

We have a customer portal where we can raise issues. As of now, we are raising issues and they are providing solutions without any problems.

How would you rate customer service and support?

Neutral

Which solution did I use previously and why did I switch?

Before Databricks, we were using the R Studio platform for our advanced analytics requirements. We switched to Databricks because it is an advanced platform compared to R Studio.

How was the initial setup?

The initial setup of Databricks could be complex. We set it up with AWS as the backend. It's crucial to have both cloud-specific knowledge and how to configure its features in the platform.

What about the implementation team?

We have our internal database AWS cloud admin team. We did not use external integrators or consultants.

What was our ROI?

We haven't yet seen an ROI from this solution, as we are still using and observing the platform.

What's my experience with pricing, setup cost, and licensing?

There are some issues with pricing specific to our accounts, and Databricks is investigating. It is not a cheap solution.

Which other solutions did I evaluate?

We previously used R Studio before switching to Databricks.

What other advice do I have?

I would recommend Databricks as an advanced analytics platform. It offers features to quickly pull data from various sources required for report generation. Overall, I rate this solution as seven or eight out of ten.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)

IshwarSukheja

Head CEO at bizmetric

Jan 10, 2025

Unified platform simplifies end-to-end processes with intuitive data access solutions

What is our primary use case?

I use Databricks for various purposes, including data engineering, MLOps, machine learning training and deployment, the entire ML cycle, and dashboards. It serves different purposes for different projects.

What is most valuable?

Unity Catalog is a feature I am currently using extensively. I am migrating many projects to Unity Catalog. MLflow, which I use for model registering and creating the lineage of models, is also valuable.

Additionally, Databricks serves as a single platform for conducting the entire end-to-end lifecycle of machine learning models or AI ops. I don't need to switch between various tools, making it an all-encompassing solution for development and research. I use the lake house and utilize features effectively.

What needs improvement?

There has been a significant evolution in databases. One area of improvement is the Databricks File System (DBFS), where command-line challenges arise when accessing files. Standardization of file paths on the system could help, as engineers sometimes struggle.

It would be beneficial to have utilities where code snippets are readily available. This would allow engineers to easily click a snippet and import it into the notebook, enabling quick modifications to variables or paths for fetching files, such as reading data from DBFS files. If I could right-click to copy absolute paths or to read files directly into a data frame, it would standardize and simplify the process.

For how long have I used the solution?

I have used the solution for five years plus.

What do I think about the stability of the solution?

I would rate stability seven to eight out of ten.

What do I think about the scalability of the solution?

I would rate scalability seven to eight out of ten.

How are customer service and support?

I do not have any issues that require support. Many resources are available online.

How would you rate customer service and support?

Neutral

How was the initial setup?

I use infrastructure as code on the cloud to deploy the infrastructure. I have all the Git repositories and code repositories for deploying the code and models in the workspace. My setup includes a shared workspace, shared clusters, and integration with Unity Catalog.

What about the implementation team?

I have a team of 100 engineers working with me, and I head the Center of Excellence (COE).

What was our ROI?

I believe it is competitive across clouds. When it comes to big data processing, I prefer Databricks over other solutions. Cost-wise, it is very competitive. The setup process is straightforward, thanks to the use of Spark clusters. This allows for faster turnaround times with Databricks.

What other advice do I have?

The product rating is nine out of ten.

Databricks serves as a single platform that can handle numerous end-to-end machine learning tasks. The configuration is simple, scalability is excellent, and monitoring cluster utilization facilitates informed business decisions.

It's easy to schedule jobs, pipelines, and handle multiple use cases in parallel, providing countless benefits.

Which deployment model are you using for this solution?

Hybrid Cloud

Parag Bhosale

Senior Data Engineer at a logistics company with 51-200 employees

Jan 8, 2025

Integrating engineering and learning, but cost challenges arise with cluster management

What is our primary use case?

I usually handle data ingestion and create warehouses. I also assist other teams, such as analytics, to create reports or perform other tasks.

What is most valuable?

Having one solution for everything, from data engineering to machine learning, is beneficial since everything comes under one hood.

What needs improvement?

We often use a single cluster to ingest Databricks, which Databricks doesn't recommend. They suggest using a no-cluster solution like job clusters. This can be overwhelming for us because we started smaller.

We prefer using a small to mid-sized cluster for many jobs to keep costs low, but this sometimes doesn't support our operations properly. We need to stay in sync with the DVR versions, and migrations can pose challenges. For example, issues arose when we moved a cluster from a previous version to the latest one. We could use their job clusters, however, that increases costs, which is challenging for us as a startup. Maintaining this infrastructure can be a headache.

For how long have I used the solution?

I have worked at a couple of companies, not just the current one, and I have about 20 to 25 months of experience with Databricks.

What do I think about the stability of the solution?

They release patches that sometimes break our code. These patches are supposed to fix issues, but sometimes they cause disruptions.

What do I think about the scalability of the solution?

The patches have sometimes caused issues leading to our jobs being paused for about six hours. Fortunately, nothing important is currently running on Databricks, however, if there were, it would be a significant issue.

How are customer service and support?

They are good. My company has a contract with them that includes good support. Whenever we reach out, they respond promptly.

How would you rate customer service and support?

Neutral

What was our ROI?

With the benefits we receive, the price is reasonable. However, it's important to have good use cases. If it's just for data ingestion, it might not be the best solution price-wise. For a lot of different tasks, including machine learning, it is a nice solution.

What other advice do I have?

I would rate the solution seven out of ten. That rating also depends on how we have the contract with Databricks.

It's still a solid and good rating. I work as a data engineer and Databricks engineer.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Microsoft Azure

Rama Subba Reddy Thavva

Solution Architect at Mercedes-Benz AG

Jan 8, 2025

Shared notebooks and scheduling enhance cost efficiency

What is our primary use case?

We work on three platforms. Databricks is hosted on Azure for us, so we work with ADFS, Azure Data Factory, and also the AWS Cloud. We work for some customers.

What is most valuable?

The notebooks and the ability to share them with collaborators are valuable, as multiple developers can use a single cluster. This reduces costs. The scheduling part is managed by Databricks itself, for example, when it is idle, it will automatically turn off. All these features are handled by Databricks, reducing costs. We do not need to schedule separately.

For example, on AWS EC2, we have to create a Lambda function or use System Manager templates to schedule EC2 and EMRs. Here, it is taken care of, saving significant resources.

Additionally, notebooks can be shared within the development team which saves effort. Developers can share their notebooks. Git and Azure DevOps integration on the Databricks side is also very helpful.

What needs improvement?

The API deployment and model deployment are not easy on the Databricks side. We use MLflow for managing MLOps, however, further improvement would be beneficial, especially for large language models and related tools. Moreover, the API deployment should be simplified for ease of deployment and consumption.

For how long have I used the solution?

I have been using Databricks for approximately two and a half to three years.

What do I think about the scalability of the solution?

We have not faced any shortages so far. The clusters are available on demand, thus we have not encountered any scalability issues.

How are customer service and support?

We mostly had limited data support required from Databricks. Whenever we did need support, within two or three days the problem was solved. I would rate them ten out of ten.

How would you rate customer service and support?

Positive

What about the implementation team?

We bought it as a service, which is why we never implemented it ourselves. We do not have any implementation team.

Which other solutions did I evaluate?

For companies focused solely on data transformation, transferring data between databases, and not tackling machine learning or deep learning problems, I recommend ADF. It would be sufficient and cost-saving compared to a full-fledged solution like Databricks. However, for data analytics and solving ETL problems, one should consider Databricks.

What other advice do I have?

I would rate it nine out of ten.

reviewer2846955

Analista

May 28, 2026

Web-based SQL workflows have become more secure and have saved significant query time

What is our primary use case?

My main use case for Databricks is running SQL queries. I use Databricks in my day-to-day work by doing SQL queries directly in Databricks using the Genius platform to better correct the queries instead of doing queries on another platform.

What is most valuable?

The best features offered by Databricks include the fact that it is on the web, that it does not depend on installing any software, and most importantly, the security that prevents connection to anyone else who is not logged in.

Regarding the security and web access I mentioned, I have noticed concrete benefits related to collaboration and data protection within my team, such as it being very secure and the fact that every time we enter the platform, it does the same credential verification.

The features of Databricks have impacted my organization positively, as it has done so very efficiently since we switched from several platforms to using this one. After implementing Databricks in my organization, I have observed that it has been more efficient with my team.

What needs improvement?

I think the aspects of Databricks that should be improved are that it could be faster and that I would like to be able to run direct queries from the server. I have not seen any other improvements that I think are needed in Databricks.

What do I think about the stability of the solution?

Databricks is stable.

What do I think about the scalability of the solution?

I rate the scalability of Databricks as excellent.

How are customer service and support?

Databricks customer support is very good. I would give Databricks customer support a rating of ten.

Which solution did I use previously and why did I switch?

I did not use any other solution before Databricks.

What was our ROI?

I have seen a return on investment, as time is greatly saved and processes are faster.

What's my experience with pricing, setup cost, and licensing?

My experience with pricing, implementation costs, and licensing is that it is very efficient and very fast.

What other advice do I have?

My advice to others considering using Databricks is that it is the best platform with artificial intelligence. I give this review an overall rating of ten.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Microsoft Azure

AvivCohen

Data Engineer at a tech vendor with 1,001-5,000 employees

May 28, 2025

Experiencing smooth performance and cost advantages over previous tools

What is our primary use case?

The use case for Databricks is that we use the clustering for high big data processing within the cluster.

What is most valuable?

I think it is difficult to determine which feature of Databricks I enjoy the most since there are many valuable features.

What's valuable about Databricks to my organization is that it is more cost-effective and provides better performance than the current AWS tools and services they offer.

What needs improvement?

I am uncertain about specific improvements for Databricks.

It would be beneficial to make Databricks even more cost-effective.

For how long have I used the solution?

I have been using Databricks for two years.

What do I think about the stability of the solution?

My experience with Databricks has been smooth, and I haven't encountered any issues.

Databricks is definitely a very stable product and reliable.

How are customer service and support?

I have not used Databricks customer service or support.

How would you rate customer service and support?

Positive

Which solution did I use previously and why did I switch?

Before Databricks, I used Batch processing, Fargate, and possibly Kubernetes.

I switched from my previous solutions because they were either too expensive or too difficult to configure.

Which other solutions did I evaluate?

I have considered other solutions besides Databricks, such as Snowflake, but we haven't explored it extensively yet.

We are still early in our Snowflake experience, so we don't know the pros and cons compared to Databricks.

What other advice do I have?

My deployment model for Databricks is limited as I'm not a heavy user.

I am not the person who purchased Databricks, but it was possibly acquired through the AWS Marketplace.

I may not have utilized Databricks machine learning capabilities.

My experience with the pricing and licensing model is that it remains relatively expensive. Though it's less expensive than AWS, we still need a more cost-effective solution.

I would rate Databricks overall a nine out of ten.

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)

Lax Kas

Data Engineer at CRAFT Tech

May 15, 2025

Unifying data for analytical insights with smooth AI and machine learning integration

What is our primary use case?

A typical use case for the solution is to build the data lakehouse for the client because they have a variety of source systems, and they want to unify that data into the lakehouse platform, where they want to use the data for analytical purposes and insights.

What is most valuable?

The most valuable features of Databricks are especially the Delta Lake and the Unity Catalog; those are the main features. The Unity Catalog is for data governance, and the Delta Lake is to build the lakehouse. Currently, they're coming up with workflow jobs, along with other supporting elements to create an end-to-end solution.

What needs improvement?

In my opinion, areas of Databricks that have room for improvement involve the dashboards. Until recently, everyone used third-party systems such as Power BI to connect to Databricks for dashboards and reports, but they're now coming up with their IBI dashboard, and I think they're on the right track to improve that even further.

For how long have I used the solution?

I have approximately four years of experience working with Databricks.

What do I think about the stability of the solution?

I would rate the stability of Databricks as highly stable, around nine out of ten.

What do I think about the scalability of the solution?

I would rate the scalability of this solution as very high, about nine out of ten.

How are customer service and support?

I rate the technical support as fine because they have levels of technical support available, especially partners who get really good support from Databricks on new features. For us, it's so far so good with no problems, and I would rate the support quality as eight out of ten.

How would you rate customer service and support?

Positive

How was the initial setup?

The initial setup of the Databricks solution is reasonably fair enough. It doesn't give any trouble to implement the solution, and I think it's fairly easy to set up and work on Databricks.

What was our ROI?

I can't say if there's seen an ROI from the solution because I do not have exposure in that area, although I think the people who decided to implement Databricks might have done all this analysis and POCs.

What other advice do I have?

My relationship with the vendor is that I'm not a partner of Databricks; I work for a client where we use the Databricks software for implementing the solutions.

My clients are usually enterprise-level organizations, but the area where they're implementing is medium level here, although it might go into enterprise level in the future.

Regarding the price of Databricks, I don't involve myself in those decisions.

I think Databricks is very good at facilitating AI and machine learning projects; they implement AI and machine learning models very well, and clients can run their models on Databricks. I believe they are in a better place compared to competitors such as Snowflake, and they are tying up with important companies such as SAP and Palantir.

Based on my experience, I would recommend Databricks to other people. Overall, I would rate this solution as one of the best, about eight out of ten, although I might not know some of the pitfalls; it's based on use case to use case, but for us, it's working well.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Microsoft Azure

Title	Rating	Mindshare	Recommending
Informatica Intelligent Data Management Cloud (IDMC)	4.0	N/A	92%	215 interviews Add to research
Teradata	4.1	8.8%	88%	83 interviews Add to research

Databricks Reviews

What is Databricks?

Featured Databricks reviews

Databricks mindshare

PeerResearch reports based on Databricks reviews

Valuable Features

Room for Improvement

ROI

Pricing

Popular Use Cases

Service and Support

Deployment

Scalability

Stability

Review data by company size

Top industries

Compare Databricks with alternative products

Learn more about Databricks

Databricks customers

Related questions

Product Categories

Popular Comparisons

What is our primary use case?

What is most valuable?

What needs improvement?

For how long have I used the solution?

What do I think about the stability of the solution?

What do I think about the scalability of the solution?

How are customer service and support?

How would you rate customer service and support?

Which solution did I use previously and why did I switch?

How was the initial setup?

What about the implementation team?

What was our ROI?

What's my experience with pricing, setup cost, and licensing?

Which other solutions did I evaluate?

What other advice do I have?

Which deployment model are you using for this solution?

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

What is our primary use case?

What is most valuable?

What needs improvement?

For how long have I used the solution?

What was our ROI?

What other advice do I have?

Which deployment model are you using for this solution?

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

What is our primary use case?

What is most valuable?

What needs improvement?

For how long have I used the solution?

What do I think about the stability of the solution?

Which solution did I use previously and why did I switch?

How was the initial setup?

What's my experience with pricing, setup cost, and licensing?

Which other solutions did I evaluate?

What other advice do I have?

Which deployment model are you using for this solution?

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

What is our primary use case?

What is most valuable?

What needs improvement?

For how long have I used the solution?

What was my experience with deployment of the solution?

What do I think about the stability of the solution?

What do I think about the scalability of the solution?

How are customer service and support?

How would you rate customer service and support?

Which solution did I use previously and why did I switch?

How was the initial setup?

What about the implementation team?

What was our ROI?

What's my experience with pricing, setup cost, and licensing?

Which other solutions did I evaluate?

What other advice do I have?

Which deployment model are you using for this solution?

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

What is our primary use case?

What is most valuable?

What needs improvement?