Our primary use case of this product is for our customers who are running large systems and looking for an API -- a quick, easy integration with their own system. We use Databricks to create a secure API interface. I'm vice president of data science and we are customers of Databricks.
Vice President at a tech services company with 51-200 employees
Very easy to use and requires minimal coding and customizations
Pros and Cons
- "Easy to use and requires minimal coding and customizations."
- "Databricks is quite easy to use and requires less coding and customizations than a solution like AWS SageMaker, enabling more people to efficiently build and host their ML code while leveraging the already integrated MLflow to track and monitor all our different experiments."
- "Doesn't provide a lot of credits or trial options."
- "I present a lot of projects in various forums and seminars and there aren't a lot of credits and trial options with Databricks. Even if we want to explore, we're not able to and that's a challenge."
What is our primary use case?
What is most valuable?
Databricks is quite easy to use and requires less coding and customizations than a solution like AWS SageMaker which I'd previously used on a lot of projects. Databricks enables more people to efficiently build and host their ML code. Another great aspect is that MLflow is already integrated with Databricks which makes a big difference. It enables us to track and monitor all our different experiments. We have mostly used the MLflow part and generic notebooks with the ML building machine learning model, as well as using Pytorch for some of our medical imaging. We were able to quickly deploy both these features without requiring anything extra.
What needs improvement?
I'm struggling a little because I wanted to do some POC solutions. I present a lot of projects in various forums and seminars and there aren't a lot of credits and trial options with Databricks. Even if we want to explore, we're not able to and that's a challenge. The solution is quite expensive.
For how long have I used the solution?
I've been using this solution for a year.
Buyer's Guide
Databricks
May 2026
Learn what your peers think about Databricks. Get advice and tips from experienced pros sharing their opinions. Updated: May 2026.
896,942 professionals have used our research since 2012.
What do I think about the stability of the solution?
It's currently stable although we have not yet tested it with a huge volume of data. We'll focus on the performance and model serving capability in the near future. We're still carrying out performance testing, developing the models and figuring out the infrastructure.
What do I think about the scalability of the solution?
Scalability is quite good because we just used 128 GB of resources. It's quite easy to scale.
How was the initial setup?
It was relatively simple, we didn't face any challenges. Deployment takes around two days.
Which other solutions did I evaluate?
We did a PSU in Azure ML Studio which is quite a good solution, easy to deploy and use. It's almost a no-code platform. We've also found Azure ML Studio to be quite cost-effective.
What other advice do I have?
I would recommend trying Databricks because it's cloud agnostic. A lot of customers currently use Azure but want to build something on their own down the track. Databricks makes that easy with its integration with other cloud customers. If somebody wants to build something on their infrastructure or their own virtual cloud, this is a good platform.
I rate the solution eight out of 10 because of the issue I'm having with a lack of trial options.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Associate Principal - Data Engineering at a tech services company with 10,001+ employees
It's a unified platform that lets you do streaming and batch processing in the same place
Pros and Cons
- "I like that Databricks is a unified platform that lets you do streaming and batch processing in the same place. You can do analytics, too. They have added something called Databricks SQL Analytics, allowing users to connect to the data lake to perform analytics. Databricks also will enable you to share your data securely. It integrates with your reporting system as well."
- "Databricks is a unified platform that provides features like streaming and batch processing so all the data scientists, analysts, and engineers can collaborate on a single platform and it has all the features you need, so you don't need to go for any other tool."
- "Databricks may not be as easy to use as other tools, but if you simplify a tool too much, it won't have the flexibility to go in-depth. Databricks is completely in the programmer's hands. I prefer flexibility rather than simplicity."
What is our primary use case?
We build data solutions for the banking industry. Previously, we worked with AWS, but now we are on Azure. My role is to assess the current legacy applications and provide cloud alternatives based on the customers' requirements and expectations.
Databricks is a unified platform that provides features like streaming and batch processing. All the data scientists, analysts, and engineers can collaborate on a single platform. It has all the features, you need, so you don't need to go for any other tool.
What is most valuable?
I like that Databricks is a unified platform that lets you do streaming and batch processing in the same place. You can do analytics, too. They have added something called Databricks SQL Analytics, allowing users to connect to the data lake to perform analytics. Databricks also will enable you to share your data securely. It integrates with your reporting system as well.
The Unity Catalog provides you with the data links and material capabilities. These are some of the unique features that fulfill all the requirements of the banking domain.
What needs improvement?
Every tool has room for improvement. Normally what happens, a solution will claim it can do ETL and everything else, but you encounter some limitations when you actually start. Then you keep on interacting with the vendor, and they continue to upgrade it. For example, we haven't fully implemented Databricks Unity Catalog, a newly introduced feature. We need to check how it works and then accordingly, there can be improvements in that also.
Databricks may not be as easy to use as other tools, but if you simplify a tool too much, it won't have the flexibility to go in-depth. Databricks is completely in the programmer's hands. I prefer flexibility rather than simplicity.
For how long have I used the solution?
I have been using Databricks for a year.
What do I think about the scalability of the solution?
Databricks relies on scalability and performance. Every cloud vendor prioritizes scalability, high availability, performance, and security. These are the most important reasons to move to the cloud.
How was the initial setup?
Deploying Databricks on the cloud is straightforward. It's not like an on-premise solution, where you must create a cluster and all those other prerequisites for big data.
I don't think it's challenging to maintain, but you need an expert programmer because Databricks isn't GUI-based. With GUI-based tools, building ETLs is drag-and-drop. Databricks entirely relies on coding, so you need skilled programmers to building your code, ETLs, etc.
What's my experience with pricing, setup cost, and licensing?
The price of Databricks is based on the computing volume. You also need to pay storage costs for the cloud where you're hosting Databricks, whether it is AWS, Azure, or Google.
What other advice do I have?
I rate Databricks nine out of 10. Databricks is one of the best tools on the market.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Microsoft Azure
Disclosure: My company has a business relationship with this vendor other than being a customer. Implementer
Buyer's Guide
Databricks
May 2026
Learn what your peers think about Databricks. Get advice and tips from experienced pros sharing their opinions. Updated: May 2026.
896,942 professionals have used our research since 2012.
STI Data Leader at grupo gtd
Easy to use with a free community version and helpful documentation
Pros and Cons
- "The solution offers a free community version."
- "I like the simplicity and ease of use."
- "We'd like a more visual dashboard for analysis It needs better UI."
What is most valuable?
I like the simplicity and ease of use.
You can deploy the solution to many clouds easily.
The initial setup is straightforward.
The solution offers a free community version.
What needs improvement?
The auto models can be improved.
We can create auto models like Microsoft Azure Machine Learning. In Azure Machine Learning, they have these features, for example, for auto models or code, or by code. They need this in Databricks.
We need more connectors between on-premises and the cloud.
We'd like a more visual dashboard for analysis It needs better UI.
For how long have I used the solution?
I've used the solution for one and a half months.
What do I think about the stability of the solution?
The solution is very stable. There are no bugs or glitches. It doesn't crash or freeze.
What do I think about the scalability of the solution?
Scalability is no problem. At the beginning, we created a cluster, for example, and if we need more performance in the future, for example, or to accelerate the training, we can change the cluster. It's quite straightforward.
We have five people using the solution.
In one or two years, we'd like to promote the solution to clients and increase usage. Right now, the way it is used is limited. I know that some banks and aeronautics companies use it.
How are customer service and support?
In terms of technical support, for now, we use the community.
Which solution did I use previously and why did I switch?
We are also aware of KNIME, Azure Machine Learning, and Anaconda. In Anaconda, we use many frameworks, for example.
We started with other platforms, like Azure Machine Learning due to the fact that, with AutoML, it's easy to use. However, now that we have more skills, we need other tools or platforms like Databricks. It's a good platform to deploy and develop machine learning in employees.
How was the initial setup?
The implementation is quite easy. It's not complex or difficult. The first time, I did it using a tutorial which was quite helpful. Later, I took a course. I know it quite well.
The deployment only takes a few days.
You only need to deploy or maintain the solution.
What about the implementation team?
We did not need any outside assistance in terms of setting up the solution.
What's my experience with pricing, setup cost, and licensing?
For us, this product is free. We use the community version.
I am interested in using the enterprise version, however. Whether we use it or not depends on the projects and customers we get.
What other advice do I have?
I work with a solution provider. We are a Databrick customer.
We are not partners of Databricks. Only we are partnered with Microsoft Azure and Amazon AWS.
We are using the latest version of the solution. However, I do not know the exact version number.
I still need time with the solution before providing advice to others. I need to prepare the capacity internally. So far, it's been great.
I'd rate the solution eight out of ten.
Which deployment model are you using for this solution?
Public Cloud
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Head of Referential and Big Data at a financial services firm with 5,001-10,000 employees
A highly scalable unified data platform that provides data access to any type of user
Pros and Cons
- "I like cloud scalability and data access for any type of user."
- "Data is open to everyone; they can access it through many channels, including notebooks or SQL."
- "It would be better if it were faster. It can be slow, and it can be super fast for big data. But for small data, sometimes there is a sub-second response, which can be considered slow. In the next release, I would like to have automatic creation of APIs because they don't have it at the moment, and I spend a lot of time building them."
- "It would be better if it were faster. It can be slow, and it can be super fast for big data."
What is our primary use case?
We use Databricks to define tool data and have many use cases to analyze and distribute the data.
How has it helped my organization?
Data is open to everyone; they can access it through many channels, including notebooks or SQL. That on its own democratizes the data.
What is most valuable?
I like cloud scalability and data access for any type of user.
What needs improvement?
It would be better if it were faster. It can be slow, and it can be super fast for big data. But for small data, sometimes there is a sub-second response, which can be considered slow.
In the next release, I would like to have automatic creation of APIs because they don't have it at the moment, and I spend a lot of time building them.
For how long have I used the solution?
I have been using Databricks for roughly one and a half years.
What do I think about the stability of the solution?
Stability is excellent.
What do I think about the scalability of the solution?
Databricks is scalable. You can use the power of the cloud to scale your cluster size, either CPU or memory. The data doesn't work like a standard database, so you don't have it based on files, and you don't copy the data. It's super scalable. It's only the computing that you have to scale with the data.
We probably have 40 users with roles like developers, business analysts, and data scientists. We have big plans to increase the usage and have more departments using it.
How are customer service and support?
Technical support has helped us.
On a scale from one to ten, I would give technical support a five.
How would you rate customer service and support?
Positive
Which solution did I use previously and why did I switch?
We used Cloudera before switching to Databricks.
How was the initial setup?
The initial setup was fairly okay. It takes about two minutes to deploy this solution. It's all code, so we click a button, and then it's done.
On a scale from one to five, I would give the initial setup a four.
What about the implementation team?
We set up and deployed this solution.
What was our ROI?
On a scale from one to five, I would give our ROI a three.
What's my experience with pricing, setup cost, and licensing?
We only pay for the Azure compute behind the solution. If you want to compute, you have to have a database layer and Azure below.
On a scale from one to five, I would give their pricing a two.
Which other solutions did I evaluate?
We looked at other options such as Snowflake and Cloudera on the cloud,
What other advice do I have?
I would tell potential users that they need proper cloud engineers and a
cloud infrastructure team to use this solution.
On a scale from one to ten, I would give Databricks a nine.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Microsoft Azure
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Data Engineering Manager at a pharma/biotech company with 10,001+ employees
A great and easy-to-use platform for data engineers and data scientists who rely on a large dataset to do advanced analytics reporting
Pros and Cons
- "The most valuable feature is the Spark cluster which is very fast for heavy loads, big data processing and Pi Spark."
- "It would be great if Databricks could integrate all the cloud platforms."
What is our primary use case?
We use Databricks for data science work in projects that create data pipelines, pre-processing, data wrangling, big data cluster management and ML, machine learning and deep learning tasks.
How has it helped my organization?
Databricks collaborates very well with the Azure platform, Dataiku, and enterprise AI tool. Databricks is a new connection to pull the data or connect to the Spark cluster. It is helpful for us to instance it or distribute the load through the Spark cluster, and it is very user-friendly.
What is most valuable?
The most valuable feature is the Spark cluster which is very fast for heavy loads, big data processing and Pi Spark.
What needs improvement?
Databricks as a solution is integrated with Azure, but Google Cloud has some restrictions. I'm not sure about AWS Cloud, but it would be great if Databricks could integrate all the cloud platforms. Regarding additional features, we would like to see them mostly on the data engineering side, where we have a Spark cluster and some inbuilt ML. In addition, pre-processing steps will be useful.
For how long have I used the solution?
We have been using this solution for two years and are using the latest update.
What do I think about the stability of the solution?
It is a stable solution as long as the Microsoft Azure Platform is stable too.
What do I think about the scalability of the solution?
It is a scalable solution, both vertically and horizontally, which is good. My organization is big, and we have a lot of users. In my department, we have about 15 people using Databricks.
How are customer service and support?
We have not escalated any issues to technical support, but we initially struggled with configuration and the settings of Hive metastore, but we resolved it. I rate the technical support a nine out of ten.
How would you rate customer service and support?
Positive
Which solution did I use previously and why did I switch?
We were using the looped EMR elastic MapReduce from AWS before using Databricks. We switched to Databricks because the whole platform changed from AWS to Azure platform, and Databricks comes as a package.
How was the initial setup?
The initial setup was easy to complete and not complex. It may initially be challenging for a new user, but it improves over time. The CICD pipeline works well with the Microsoft Azure platform because the continuous integration, development and deployment come with the Git integration. It makes it easier for Databricks and the CICD. The deployment should be improved from the perspective of auto ML functionality, so it doesn't have intensive automation learning capability.
We don't use Databricks directly because we work on a data science project. It requires an auto ML and inbuilt machine learning capability. We found capabilities like the large language model using NLP and other deep learning models that are not that intensive. It is meant for data engineering purposes rather than data science purposes. It'll be great if Databricks could be intensive for data science.
We used a third-party, Dataiku platform for the deployment, where we connected to Databricks and completed the ML ops. We required about three people for deployment, and it is easy to maintain the solution.
What was our ROI?
We have seen an ROI but cannot differentiate because it also comes with the Azure platform.
What's my experience with pricing, setup cost, and licensing?
I do not have details about the pricing.
What other advice do I have?
I rate this solution a nine out of ten. Regarding advice, Databricks is a very good platform, popular and easy to use daily for data engineers and data scientists who rely on a large dataset to do advanced analytics reporting. It's a very good tool.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Co-founder/Senior Data Scientist at Hence
Responsive support, integrates and scales well
Pros and Cons
- "The most valuable feature of Databricks is the integration of the data warehouse and data lake, and the development of the lake house. Additionally, it integrates well with Spark for processing data in production."
- "The solution could be improved by adding a feature that would make it more user-friendly for our team. The feature is simple, but it would be useful. Currently, our team is more familiar with the language R, but Databricks requires the use of Jupyter Notebooks which primarily supports Python. We have tried using RStudio, but it is not a fully integrated solution. To fully utilize Databricks, we have to use the Jupyter interface. One feature that would make it easier for our team to adopt the Jupyter interface would be the ability to select a specific variable or line of code and execute it within a cell. This feature is available in other Jupyter Notebooks outside of Databricks and in our own IDE, but it is not currently available within Databricks. If this feature were added, it would make the transition to using Databricks much smoother for our team."
What is our primary use case?
We are using Databricks for machine learning workloads specifically.
Databricks aligns well with our skillset and overall approach. We sought out their solution specifically for a big data application we are currently working on, as we needed a platform capable of handling large amounts of data and building models. Additionally, the fact that they use open-source software and can integrate data warehouse and data lake systems was particularly appealing, as we have encountered such issues in the past. We determined that Databricks would be an effective solution for our needs.
What is most valuable?
The most valuable feature of Databricks is the integration of the data warehouse and data lake, and the development of the lake house. Additionally, it integrates well with Spark for processing data in production.
What needs improvement?
The solution could be improved by adding a feature that would make it more user-friendly for our team. The feature is simple, but it would be useful. Currently, our team is more familiar with the language R, but Databricks requires the use of Jupyter Notebooks which primarily supports Python. We have tried using RStudio, but it is not a fully integrated solution. To fully utilize Databricks, we have to use the Jupyter interface. One feature that would make it easier for our team to adopt the Jupyter interface would be the ability to select a specific variable or line of code and execute it within a cell. This feature is available in other Jupyter Notebooks outside of Databricks and in our own IDE, but it is not currently available within Databricks. If this feature were added, it would make the transition to using Databricks much smoother for our team.
The most important feature other than the Jupyter interface would be to have the RStudio interface inside Databricks. This would be perfect.
For how long have I used the solution?
We have been using Databricks for approximately one year.
What do I think about the stability of the solution?
The stability of Databricks is good.
I rate the stability of Databricks a nine out of ten.
What do I think about the scalability of the solution?
Databricks is scalable.
I rate the scalability of Databricks a nine out of ten.
How are customer service and support?
I have been receiving responsive answers from Databricks's support. I have been pleased with the support.
I rate the support from Databricks a ten out of ten.
How would you rate customer service and support?
Positive
How was the initial setup?
The initial setup of Databricks is simple. I did not experience any challenges. The time it takes for the deployment is approximately four hours.
I rate the initial setup of Databricks.
What about the implementation team?
We did the deployment of the solution in-house. There were three people involved in the deployment. A data engineer, data analyst, and machine learning engineer.
What's my experience with pricing, setup cost, and licensing?
We have only incurred the cost of our AWS cloud services. This is because during this period, Databricks provided us with an extended evaluation period, and we have not spent much money yet. We are just starting to incur costs this month, I will know more later on the full cost perspective.
We only pay standard fees for the solution.
What other advice do I have?
We use a data engineer, data analyst, and machine learning engineer for the maintenance of the solution.
I rate Databricks a nine out of ten.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Amazon Web Services (AWS)
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Lead Analytics at a manufacturing company with 10,001+ employees
Useful machine learning and easy to scale
Pros and Cons
- "In the manufacturing industry, Databricks can be beneficial to use because of machine learning. It is useful for tasks, such as product analysis or predictive maintenance."
- "The stability of the clusters or the instances of Databricks would be better if it was a much more stable environment. We've had issues with crashes."
What is our primary use case?
Our team is currently utilizing machine learning for various applications, and a few members are also exploring Databrick's use for ML operations.
What is most valuable?
In the manufacturing industry, Databricks can be beneficial to use because of machine learning. It is useful for tasks, such as product analysis or predictive maintenance.
For how long have I used the solution?
I have been using Databricks for approximately six months
What do I think about the stability of the solution?
The stability of the clusters or the instances of Databricks would be better if it was a much more stable environment. We've had issues with crashes.
What do I think about the scalability of the solution?
The scalability of Databricks is good as long as you have a data lake, and it's easy to scale.
We have approximately 50 users using this solution in my company.
How are customer service and support?
We have a different team who handles the support. I do not have contact with Databricks support.
Which solution did I use previously and why did I switch?
I have not used a similar solution to Databricks.
What was our ROI?
I have seen an ROI using Databricks.
What's my experience with pricing, setup cost, and licensing?
I rate the price of Databricks as eight out of ten.
What other advice do I have?
Having a good understanding of physical security in relation to cybersecurity in an OT (Operational Technology) environment would be beneficial, and utilizing an existing data lake prior to implementing a Databricks initiative would greatly aid in its success.
I rate Databricks an eight out of ten.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Data Engineer Analyst at Metyis
Highly scalable, easy to use, and performs well
Pros and Cons
- "The most valuable feature of Databricks is the notebook, data factory, and ease of use."
- "When I used the support, I had communication problems because of the language barrier with the agent. The accent was difficult to understand."
What is our primary use case?
I am using Databricks in my company.
What is most valuable?
The most valuable feature of Databricks is the notebook, data factory, and ease of use.
For how long have I used the solution?
I have been using Databricks for approximately nine months.
What do I think about the stability of the solution?
The performance and stability of Databricks are good. It is quick and I have not had problems.
What do I think about the scalability of the solution?
Databricks is highly scalable.
We have 200 people using the solution in my organization.
How are customer service and support?
When I used the support, I had communication problems because of the language barrier with the agent. The accent was difficult to understand.
Which solution did I use previously and why did I switch?
I have not worked with another solution prior to Databricks.
What's my experience with pricing, setup cost, and licensing?
The price of Databricks is reasonable compared to other solutions.
What other advice do I have?
I rate Databricks an eight out of ten.
Which deployment model are you using for this solution?
Public Cloud
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Buyer's Guide
Download our free Databricks Report and get advice and tips from experienced pros
sharing their opinions.
Updated: May 2026
Product Categories
Cloud Data Warehouse Data Science Platforms Data Management Platforms (DMP) Streaming AnalyticsPopular Comparisons
Teradata
Qlik Talend Cloud
Palantir Foundry
Azure Data Factory
SAP Business Data Cloud
Snowflake
KNIME Business Hub
Coralogix
IBM SPSS Statistics
Alteryx
Dataiku
Denodo
Amazon SageMaker
Microsoft Azure Machine Learning Studio
Confluent
Buyer's Guide
Download our free Databricks Report and get advice and tips from experienced pros
sharing their opinions.
Quick Links
Learn More: Questions:
- Which do you prefer - Databricks or Azure Machine Learning Studio?
- How would you compare Databricks vs Amazon SageMaker?
- Which would you choose - Databricks or Azure Stream Analytics?
- Which product would you choose for a data science team: Databricks vs Dataiku?
- Which ETL or Data Integration tool goes the best with Amazon Redshift?
- What are the main differences between Data Lake and Data Warehouse?
- What are the benefits of having separate layers or a dedicated schema for each layer in ETL?
- What are the key reasons for choosing Snowflake as a data lake over other data lake solutions?
- Are there any general guidelines to allocate table space quota to different layers in ETL?
- What cloud data warehouse solution do you recommend?



















