We use Databricks to define tool data and have many use cases to analyze and distribute the data.
Head of Referential and Big Data at a financial services firm with 5,001-10,000 employees
A highly scalable unified data platform that provides data access to any type of user
Pros and Cons
- "I like cloud scalability and data access for any type of user."
- "It would be better if it were faster. It can be slow, and it can be super fast for big data. But for small data, sometimes there is a sub-second response, which can be considered slow. In the next release, I would like to have automatic creation of APIs because they don't have it at the moment, and I spend a lot of time building them."
What is our primary use case?
How has it helped my organization?
Data is open to everyone; they can access it through many channels, including notebooks or SQL. That on its own democratizes the data.
What is most valuable?
I like cloud scalability and data access for any type of user.
What needs improvement?
It would be better if it were faster. It can be slow, and it can be super fast for big data. But for small data, sometimes there is a sub-second response, which can be considered slow.
In the next release, I would like to have automatic creation of APIs because they don't have it at the moment, and I spend a lot of time building them.
Buyer's Guide
Databricks
June 2025

Learn what your peers think about Databricks. Get advice and tips from experienced pros sharing their opinions. Updated: June 2025.
856,873 professionals have used our research since 2012.
For how long have I used the solution?
I have been using Databricks for roughly one and a half years.
What do I think about the stability of the solution?
Stability is excellent.
What do I think about the scalability of the solution?
Databricks is scalable. You can use the power of the cloud to scale your cluster size, either CPU or memory. The data doesn't work like a standard database, so you don't have it based on files, and you don't copy the data. It's super scalable. It's only the computing that you have to scale with the data.
We probably have 40 users with roles like developers, business analysts, and data scientists. We have big plans to increase the usage and have more departments using it.
How are customer service and support?
Technical support has helped us.
On a scale from one to ten, I would give technical support a five.
How would you rate customer service and support?
Positive
Which solution did I use previously and why did I switch?
We used Cloudera before switching to Databricks.
How was the initial setup?
The initial setup was fairly okay. It takes about two minutes to deploy this solution. It's all code, so we click a button, and then it's done.
On a scale from one to five, I would give the initial setup a four.
What about the implementation team?
We set up and deployed this solution.
What was our ROI?
On a scale from one to five, I would give our ROI a three.
What's my experience with pricing, setup cost, and licensing?
We only pay for the Azure compute behind the solution. If you want to compute, you have to have a database layer and Azure below.
On a scale from one to five, I would give their pricing a two.
Which other solutions did I evaluate?
We looked at other options such as Snowflake and Cloudera on the cloud,
What other advice do I have?
I would tell potential users that they need proper cloud engineers and a
cloud infrastructure team to use this solution.
On a scale from one to ten, I would give Databricks a nine.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Microsoft Azure
Disclosure: My company does not have a business relationship with this vendor other than being a customer.

Lead Analytics at a manufacturing company with 10,001+ employees
Useful machine learning and easy to scale
Pros and Cons
- "In the manufacturing industry, Databricks can be beneficial to use because of machine learning. It is useful for tasks, such as product analysis or predictive maintenance."
- "The stability of the clusters or the instances of Databricks would be better if it was a much more stable environment. We've had issues with crashes."
What is our primary use case?
Our team is currently utilizing machine learning for various applications, and a few members are also exploring Databrick's use for ML operations.
What is most valuable?
In the manufacturing industry, Databricks can be beneficial to use because of machine learning. It is useful for tasks, such as product analysis or predictive maintenance.
For how long have I used the solution?
I have been using Databricks for approximately six months
What do I think about the stability of the solution?
The stability of the clusters or the instances of Databricks would be better if it was a much more stable environment. We've had issues with crashes.
What do I think about the scalability of the solution?
The scalability of Databricks is good as long as you have a data lake, and it's easy to scale.
We have approximately 50 users using this solution in my company.
How are customer service and support?
We have a different team who handles the support. I do not have contact with Databricks support.
Which solution did I use previously and why did I switch?
I have not used a similar solution to Databricks.
What was our ROI?
I have seen an ROI using Databricks.
What's my experience with pricing, setup cost, and licensing?
I rate the price of Databricks as eight out of ten.
What other advice do I have?
Having a good understanding of physical security in relation to cybersecurity in an OT (Operational Technology) environment would be beneficial, and utilizing an existing data lake prior to implementing a Databricks initiative would greatly aid in its success.
I rate Databricks an eight out of ten.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Buyer's Guide
Databricks
June 2025

Learn what your peers think about Databricks. Get advice and tips from experienced pros sharing their opinions. Updated: June 2025.
856,873 professionals have used our research since 2012.
Lead Architect at Birlasoft IndiaLtd.
Data analytics platform that supports large volumes of data and related activities
Pros and Cons
- "This solution offers a lake house data concept that we have found exciting. We are able to have a large amount of data in a data lake and can manage all relational activities."
- "The connectivity with various BI tools could be improved, specifically the performance and real time integration."
What is most valuable?
This solution offers a lake house data concept that we have found exciting. We are able to have a large amount of data in a data lake and can manage all relational activities. All asset complaints properties are available and this is very useful to ensure the quality of all data.
What needs improvement?
The connectivity with various BI tools could be improved, specifically the performance and real time integration. There is also some improvement required in the semantic layers to manage the data match as well as the data warehouse features.
In a future release, we would like to have features to better manage all ML development activities.
For how long have I used the solution?
I have been using this solution for three years.
What do I think about the stability of the solution?
This is a stable solution, especially compared to other technology on the market.
What do I think about the scalability of the solution?
It is a scalable solution but this depends on the platform that is being used. If you use a cloud platform such as Azure, it offers scalability. However, some platforms will not support scalability using Databricks.
We have around 20 users in our development team using Databricks.
How are customer service and support?
The customer service and support for this solution is good.
How would you rate customer service and support?
Positive
How was the initial setup?
The initial setup is pretty simple and requires minimal configuration compared to other technology.
What's my experience with pricing, setup cost, and licensing?
I would rate the pricing for this solution a four out of five. This does depend on the environment or the infrastructure that one is using. There is a difference in pricing between using Azure or being on-premises.
Which other solutions did I evaluate?
Azure Synapse is a competitor that we evaluated but it is not mature enough to provide better performance than Databricks. We choose Databricks due to the ability to have a lot of data in Data Lakes and the Data Warehouse. We are also able to run data science activities using ML flow.
What other advice do I have?
If you are looking for custom model development and a lot of data management in a cloud agnostic manner, then Databricks is a good solution.
I would rate this solution an eight out of ten.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Microsoft Azure
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Manager, Customer Journey at a retailer with 10,001+ employees
You can connect multiple data sources and share work easily
Pros and Cons
- "I like how easy it is to share your notebook with others. You can give people permission to read or edit. I think that's a great feature. You can also pull in code from GitHub pretty easily. I didn't use it that often, but I think that's a cool feature."
- "I would like it if Databricks adopted an interface more like R Studio. When I create a data frame or a table, R Studio provides a preview of the data. In R Studio, I can see that it created a table with so many columns or rows. Then I can click on it and open a preview of that data."
What is our primary use case?
I use Databricks for customer marketing analytics.
What is most valuable?
Databricks lets you schedule jobs pretty easily, and you can use SQL, Spark SQL, Python, or R. It also allows you to save a table or view.
I like that you can connect to multiple data sources. Most of our data is stored in the Azure data lake, but my previous company connected to SQL databases or even blob storage.
They've improved on many features. I don't do data engineering, but I had an issue a couple of years ago at my two companies ago. It took a long time to read and save tables, but I think the new Delta feature helped.
I like how easy it is to share your notebook with others. You can give people permission to read or edit. I think that's a great feature. You can also pull in code from GitHub pretty easily. I didn't use it that often, but I think that's a cool feature.
What needs improvement?
I would like it if Databricks adopted an interface more like R Studio. When I create a data frame or a table, R Studio provides a preview of the data. In R Studio, I can see that it created a table with so many columns or rows. Then I can click on it and open a preview of that data.
Because I work in analytics and not data engineering, I think that's probably the biggest one. There are better graphical tools, so I don't think Databricks can compete. You can do a simple graph, and it's not that great. However, I don't think they can ever stack up to Tableau, so it's probably not worth it to improve upon that.
For how long have I used the solution?
I've been using Databricks for two years.
What do I think about the stability of the solution?
Databricks is stable.
What do I think about the scalability of the solution?
Databricks is scalable.
How are customer service and support?
Databricks tech support has been great every time I've dealt with them. Their team is highly knowledgeable.
How was the initial setup?
Setting up Databricks is easy. I set it up at my previous company. That was on Azure as well, but they utilized a third-party team with expertise in Databricks to ensure everything was optimized.
What other advice do I have?
I rate Databricks 10 out of 10. I recommend taking advantage of Databricks support or a third-party provider to ensure it's set up optimally. I don't know if it's an additional service you must pay for, but we always had access to Databricks support in my last company.
I think that's worth the money because there are so many different scenarios with distributed computing. Even people who study analytics may not understand the ins and out of Spark. It's worth it to have a service contract for support.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Microsoft Azure
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Director - Data Engineering expert at Sankir Technologies
Is user friendly and has great performance, but documentation needs improvement
Pros and Cons
- "Databricks has a scalable Spark cluster creation process. The creators of Databricks are also the creators of Spark, and they are the industry leaders in terms of performance."
- "If I want to create a Databricks account, I need to have a prior cloud account such as an AWS account or an Azure account. Only then can I create a Databricks account on the cloud. However, if they can make it so that I can still try Databricks even if I don't have a cloud account on AWS and Azure, it would be great. That is, it would be nice if it were possible to create a pseudo account and be provided with a free trial. It is very essential to creating a workforce on Databricks. For example, students or corporate staff can then explore and learn Databricks."
What is our primary use case?
I use Databricks to explore new features and provide the industry visibility and scalability of Databricks to the companies that I work with.
I create proof of concepts for companies. As a consultant, I also create training courses on Databricks. If a company wants to leverage a service provided by Databricks and needs to train people, they use our courses.
What is most valuable?
Databricks has a scalable Spark cluster creation process. The creators of Databricks are also the creators of Spark, and they are the industry leaders in terms of performance.
Databricks has made great strides in terms of performance.
It is very user friendly. I like the ease of creating a Spark cluster, submitting a job, or creating a notebook.
The UI has also changed for the better compared to what it was two years ago.
What needs improvement?
If I want to create a Databricks account, I need to have a prior cloud account such as an AWS account or an Azure account. Only then can I create a Databricks account on the cloud. However, if they can make it so that I can still try Databricks even if I don't have a cloud account on AWS and Azure, it would be great. That is, it would be nice if it were possible to create a pseudo account and be provided with a free trial. It is very essential to creating a workforce on Databricks. For example, students or corporate staff can then explore and learn Databricks.
It's a big ask to have people jump through a lot of hoops to get approval to create a Databricks cluster just to explore it, but if they can try it on their own with a free trial without an underlying cloud account it would be more convenient.
Documentation can be improved as well. There are so many versions of documents. For example, when I tried to create a DBU vault and secrets file, I had to go through multiple versions of documents. This could be improved so that the documentation is easy to use.
For how long have I used the solution?
I've been using this solution for about two years.
What do I think about the stability of the solution?
Stability wise, it's quite okay. In my experience, it doesn't crash.
What do I think about the scalability of the solution?
I have not used autoscaling because it consumes a lot of money and because my experience has been alright. In some cases, though, it is tied to the quota of the underlying infrastructure. I have not tested the scalability to its fullest extent, but with the workloads I run, it has been fine.
How are customer service and support?
When I wanted to create an AWS account and contacted technical support via email, I never received a response. Recently, however, I think they have improved their support a little bit, and I did get a call in response to my question. Overall, I've not faced any issues with the person I had to contact directly.
How was the initial setup?
The initial setup is not very easy, but it's medium in complexity.
What's my experience with pricing, setup cost, and licensing?
Databricks is a very expensive solution. Pricing is an area that could definitely be improved. They could provide a lower end compute and probably reduce the price.
What other advice do I have?
I would rate Databricks at seven on a scale from one to ten. If you compare it to Snowflake, for example, Snowflake doesn't mandate an underlying cloud account. It creates one on its own. That's a subtle convenience that Snowflake has and one that Databricks could also build.
Snowflake's documentation is easy to use in comparison to that of Databricks.
Which deployment model are you using for this solution?
Public Cloud
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Owner at a marketing services firm with 1-10 employees
The data governance has been absolutely efficient in between other kinds of solutions
Pros and Cons
- "Databricks' Lakehouse architecture has been most useful for us. The data governance has been absolutely efficient in between other kinds of solutions."
- "I would like it if Databricks made it easier to set up a project."
What is our primary use case?
We use Databricks for video streaming and security purposes.
What is most valuable?
Databricks' Lakehouse architecture has been most useful for us. The data governance has been absolutely efficient in between other kinds of solutions.
What needs improvement?
I would like it if Databricks made it easier to set up a project. The use case determines which services we are going to use. You have the application engine, and you generate a potential budget for your workloads, so you can understand what you are going to do, what you are going to use, and what you will invest in.
Because I'm deploying on the Google Cloud Platform, measuring the investment, value, and use case is extremely difficult. So I leave it and move on without the risk. It would be easier if I had one page where you can see three columns: one for the use cases of a specific architecture, a second one for the prices based on the volume of data or machine time, and the third column for the budget. That would make it easier to know if I am using the appropriate architecture for the right solution.
I have seen something like that in Microsoft Azure, but obviously Microsoft Azure costs a lot of money. Amazon has something like that, but it's very complicated to use.
For how long have I used the solution?
We've been using Databricks for about five years.
What do I think about the stability of the solution?
Databricks is very stable and powerful.
What do I think about the scalability of the solution?
It was simple to make Databricks scalable. We found that we could set up an alert to tell us if we needed more resources, money, or time from our team. We're alerted when the system detects some trigger for any use of the instance. If you have another alert from your side, that would be extremely useful because it takes a lot of time to develop that kind of trigger.
How are customer service and support?
Databricks technical support was lovely. We don't need it so much, but the few questions we had were answered immediately.
How was the initial setup?
I am not a data engineer because I just started data science at the company, but it was straightforward and clear for the architect to set up. He provided me with that idea because he realized it would take time if we had use cases. You can select and change the data or add some modules or products. You have all the technology to do so.
What other advice do I have?
I rate Databricks eight out of 10. I like to move my customers into Databricks, but I take care of the internal system infrastructure so they can continue to use familiar software or operating systems and databases. They have a lot of doubts because they don't know the solution. We need to train them, explain things, and show the solution's potential value.
Generally, companies try to keep the same flavor when they migrate. For example, if they are using many Microsoft products, they want to work with Azure. If they are open to other options, they go with GCP or AWS. However, Databricks doesn't have enough customers here in my market because it's not a visible brand. Azure, GCP, and AWS are highly visible here, so the local teams are friendly with the three brands.
Which deployment model are you using for this solution?
Private Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Other
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Machine Learning Engineer at a mining and metals company with 10,001+ employees
Highly scalable, stable and good technical support
Pros and Cons
- "Databricks is a scalable solution. It is the largest advantage of the solution."
- "The interface of Databricks could be easier to use when compared to other solutions. It is not easy for non-data scientists. The user interface is important before we had to write code manually and as solutions move to "No code AI" it is critical that the interface is very good."
What is our primary use case?
We were using Databricks to build an AI solution. We are only evaluating it, we have approximately three people that tried it out. Later we choose another solution, we did not fully deploy Databricks.
How has it helped my organization?
Before I used Databricks it took me a long time to do some functions and now with Databricks I can do them much quicker. It scales very well.
What needs improvement?
The interface of Databricks could be easier to use when compared to other solutions. It is not easy for non-data scientists. The user interface is important before we had to write code manually and as solutions move to "No code AI" it is critical that the interface is very good.
For how long have I used the solution?
I have used Databricks within the last 12 months.
What do I think about the stability of the solution?
The solution is stable.
What do I think about the scalability of the solution?
Databricks is a scalable solution. It is the largest advantage of the solution.
How are customer service and support?
We have been in contact with the technical support of Databricks, they were good.
Which solution did I use previously and why did I switch?
We have used a lot of different solutions, such as Watson and DataIQ.
How was the initial setup?
The initial setup is easy. However, I do not know much about the implementation because the company does it.
What about the implementation team?
We did the implementation of the solution.
What other advice do I have?
If companies want scalability, they should choose Databricks.
I rate Databricks a nine out of ten.
Which deployment model are you using for this solution?
Public Cloud
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Very elastic, easy to scale, and a straightforward setup
Pros and Cons
- "It's easy to increase performance as required."
- "Instead of relying on a massive instance, the solution should offer micro partition levels. They're working on it, however, they need to implement it to help the solution run more effectively."
What is our primary use case?
We work with clients in the insurance space mostly. Insurance companies need to process claims. Their claim systems run under Databricks, where we do multiple transformations of the data.
What is most valuable?
The elasticity of the solution is excellent.
The storage, etc., can be scaled up quite easily when we need it to.
It's easy to increase performance as required.
The solution runs on Spark very well.
What needs improvement?
Instead of relying on a massive instance, the solution should offer micro partition levels. They're working on it, however, they need to implement it to help the solution run more effectively.
They're currently coming out with a new feature, which is Date Lake. It will come with a new layer of data compliance.
For how long have I used the solution?
We've been using the solution for two years.
What do I think about the stability of the solution?
I don't see any issues with stability going down to the cluster. It would certainly be fine if it's maintained. It's highly available even if things are dropped. It will still be up and running. I would describe it as very reliable. We don't have issues with crashing. There aren't bugs and glitches that affect the way it works.
What do I think about the scalability of the solution?
The system is extremely scalable. It's one of its greatest features and a big selling point. If a company needs to scale or expand, they can do so very easily.
We require daily usage from the solution even though we don't directly work with Databricks on a day to day basis. Due to the fact that we schedule everything we need and it will trigger work that needs to be done, it's used often. Do you need to log into the database console every day? No. You just need to configure it one time and that's it. Then it will deliver everything needed in the time required.
How are customer service and technical support?
We use Microsoft support, so we are enterprise customers for them. We raise a service request for Databricks, however, we use Microsoft. Overall, we've been satisfied with the support we've been given. They're responsive to our needs.
Which solution did I use previously and why did I switch?
We work with multiple clients and this solution is just one of the examples of products we work with. We use several others as well, depending on the client.
It's all wrappers between the same underlying systems. For example, Spark. It's all open-source. We've worked with them as well as the wrappers around it, whether the company was labeled Databrary, IBM insights, Cloudera, etc. These wrappers are all on the same open-source system.
If we with Azure data, we take over Databricks. Otherwise, we have to create a VM separately. Those things are not needed because Azure is already providing those things for us.
How was the initial setup?
The situation may have been a bit different for me than for many users or organizations. I've been in this industry for more than 15 or 17 years. I have a lot of experience. I also took the time to do some research and preparation for the setup. It was straightforward for me.
The deployment with Microsoft usually can be done in 20 minutes. However, it can take 40 to 45 minutes to complete. An organization only requires one person to upload the data and have complete access to the account.
What about the implementation team?
I deployed the solution myself. I didn't require any assistance, so I didn't enlist any resellers or consultants to help with the process.
What's my experience with pricing, setup cost, and licensing?
The solution is expensive. It's not like a lot of competitors, which are open-source.
What other advice do I have?
There isn't really a version, per se.
It's a popular service. I'd recommend the solution. The solution is cloud-agnostic right now, so it really can go into any cloud. It's the users who will be leveraging installed environments that can have these services, no matter if they are using Azure or Ubiquiti, or other systems.
I don't think you can find any other tool or any other service that is faster them Databricks. I don't see that right now. It's your best option.
Overall, I'd rate the solution eight out of ten. The reason I'm not giving it full marks is that it's expensive compared to open source alternatives. Also, the configuration is difficult, so sometimes you need to spend a couple of hours to get it right.
Which deployment model are you using for this solution?
Public Cloud
Disclosure: My company does not have a business relationship with this vendor other than being a customer.

Buyer's Guide
Download our free Databricks Report and get advice and tips from experienced pros
sharing their opinions.
Updated: June 2025
Popular Comparisons
Teradata
KNIME
Microsoft Azure Machine Learning Studio
Amazon SageMaker
Alteryx
Dataiku
Confluent
Apache Kafka
IBM SPSS Statistics
Altair RapidMiner
Azure Stream Analytics
Vertica
Amazon Kinesis
Apache Flink
Dremio
Buyer's Guide
Download our free Databricks Report and get advice and tips from experienced pros
sharing their opinions.
Quick Links
Learn More: Questions:
- Which do you prefer - Databricks or Azure Machine Learning Studio?
- How would you compare Databricks vs Amazon SageMaker?
- Which would you choose - Databricks or Azure Stream Analytics?
- Which product would you choose for a data science team: Databricks vs Dataiku?
- Which ETL or Data Integration tool goes the best with Amazon Redshift?
- What are the main differences between Data Lake and Data Warehouse?
- What are the benefits of having separate layers or a dedicated schema for each layer in ETL?
- What are the key reasons for choosing Snowflake as a data lake over other data lake solutions?
- Are there any general guidelines to allocate table space quota to different layers in ETL?
- What cloud data warehouse solution do you recommend?