It's mainly used for data science, data analytics, visualization, and industrial analytics.
Data Architect at Three Ireland (Hutchison) - Infrastructure
Processes large data for data science and data analytics purposes
Pros and Cons
- "Specifically for data science and data analytics purposes, it can handle large amounts of data in less time. I can compare it with Teradata. If a job takes five hours with Teradata databases, Databricks can complete it in around three to three and a half hours."
- "There is room for improvement in visualization."
What is our primary use case?
What is most valuable?
Specifically for data science and data analytics purposes, it can handle large amounts of data in less time. I can compare it with Teradata. If a job takes five hours with Teradata databases, Databricks can complete it in around three to three and a half hours.
So that's why it's quite convenient to use for data science, for training machine learning models. By using more computing power, you can make it even faster.
What needs improvement?
There is room for improvement in visualization.
For how long have I used the solution?
I used it for two years. I worked with the latest update.
Buyer's Guide
Databricks
March 2026
Learn what your peers think about Databricks. Get advice and tips from experienced pros sharing their opinions. Updated: March 2026.
885,286 professionals have used our research since 2012.
What do I think about the stability of the solution?
I would rate the stability a nine out of ten. I didn't face performance drops.
What do I think about the scalability of the solution?
I would rate the scalability an eight out of ten.
How are customer service and support?
Databrick's support is great. If we need any support, they are very quick with it. And they genuinely want you to use Databricks. So, whatever we ask them, they come up with multiple solutions to problem statements. That's really good.
Overall, the customer service and support are very good.
Which solution did I use previously and why did I switch?
I personally prefer using Databricks. However, we also considered using Snowflake, but the pricing was different. It's price per query.
So, as per your storage, a data scientist or a data analytics team needs to query again and again, which does not suit a data-heavy organization.
What was our ROI?
It's a good return on investment for Databricks from a delivery perspective. Delivered multiple dashboards. So, it's quite a good return on investment. And being a small organization, everyone can use Databricks, and cost-wise, it's also good for small organizations.
Which other solutions did I evaluate?
If the company is a startup, Databricks might be suitable. If a big company needs a lot of storage, Teradata might be best for them. It depends on the situation.
What other advice do I have?
Overall, I would rate the solution a eight out of ten. I would definitely recommend this solution for small organizations.
Which deployment model are you using for this solution?
Private Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Principal Consultant/Manager at Tenzing
Processes tremendous data easily
Pros and Cons
- "The processing capacity is tremendous in the database."
- "There is room for improvement in the documentation of processes and how it works."
What is our primary use case?
Our primary use case is in our project; we are dealing with Duo Special Data, where we need a lot of computing resources. Here, the traditional warehouse cannot handle the amount of data we are using, and this is where Databricks comes into the picture.
What is most valuable?
The processing capacity is tremendous in the database. We are dealing with Azure as storage, so we have not faced any challenges. And also the connectors to different data sources. Moreover, it is not a language-dependent tool. Therefore, development also takes place faster. It is one of the best features of Databricks.
What needs improvement?
There is room for improvement in the documentation of processes and how it works. I was trying to get one of the certifications, so I saw an area of improvement there.
For how long have I used the solution?
I have been using Databricks for eight to nine months.
What do I think about the stability of the solution?
It is a stable product for us. We didn't see any challenges.
What do I think about the scalability of the solution?
There are around 30 to 35 users in our organization.
How was the initial setup?
The initial setup was easy because the third-party team made the clusters for us.
What about the implementation team?
A third-party team enabled the cluster to make the setup easy for us.
What other advice do I have?
I would advise using it based on the use case because it easily handles big data. It is your go-to tool if you are dealing with massive data.
Overall, I would rate the solution a nine out of ten. The tool performs well in various use cases, availability of documentation online, and compatibility with big data systems like GCP, Azure, or AWS.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Microsoft Azure
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Buyer's Guide
Databricks
March 2026
Learn what your peers think about Databricks. Get advice and tips from experienced pros sharing their opinions. Updated: March 2026.
885,286 professionals have used our research since 2012.
Engineering Leader at Walmart
Fantastic features such as interactive clusters that perform at top speed
Pros and Cons
- "The solution's features are fantastic and include interactive clusters that perform at top speed when compared to other solutions."
- "The solution's features are fantastic and include interactive clusters that perform at top speed when compared to other solutions."
- "CI/CD needs additional leverage and support."
- "CI/CD needs additional leverage and support."
What is our primary use case?
Our company uses the solution's Spark module for big data analytics as a processing engine.
We do not use the module as a streaming engine. The historic perception is that Spark is for batches, machine learning, analytics, and big data processing but not for streaming and that is exactly how we use it.
What is most valuable?
The solution's features are fantastic and include interactive clusters that perform at top speed when compared to other solutions.
The ATC monitoring experience and the maturity of the APIs are very good.
What needs improvement?
CI/CD needs additional leverage and support. Community forums are helpful for gaining knowledge but the solution should provide specific documentation.
Streaming services such as Flink should be amplified and better supported.
There are not many connectors to MapReduce.
For how long have I used the solution?
I have been using the solution for seven years.
What do I think about the stability of the solution?
The solution is mature and stable compared to other products.
What do I think about the scalability of the solution?
The solution is scalable with no issues from a computer perspective.
How are customer service and support?
I received support for initial challenges and it was very good. The support team was very professional and provided the answers I needed.
I rate support an eight out of ten.
How would you rate customer service and support?
Positive
Which solution did I use previously and why did I switch?
I previously used Cloud-Bricks.
How was the initial setup?
The initial setup is easy for me because I access the solution on a web browser.
What about the implementation team?
Unilever had a specific team for implementing and managing the solution.
Walmart had a team of ten engineers for implementation and a couple of engineers for management.
What was our ROI?
We receive an ROI for our batch constructs.
What's my experience with pricing, setup cost, and licensing?
The solution is a good value for batch processing and huge workloads.
The price might be high for use cases that are for streaming or strictly data science.
Which other solutions did I evaluate?
I have evaluated multiple options including Cloud-Brick and Dataproc for price versus performance, technical support, and CI/CD approach.
I started as a consumer and used the solution for on-premises deployment with Unilever from a data science perspective. At that time, the solution was in its beta stage but viewed as good, far ahead of its competition, and expensive. The key comparison used to be HDInsight or Adobe Cluster for cloud data and the solution was thought of as a cluster service rather than for unified analytics.
I moved along on my journey to Walmart where I was building their platform and compared it to the solution from a cloud perspective and a cluster service with notebooks. Consumers at the time were using Project Lightspeed and ATC for streaming. Spark was used as a micro-batching engine for machine learning, analytics, and big data processing. At some point, the solution became preferred and more than 100 staff members were leveraging its use.
I found that the solution had interesting features that I liked such as its notebook, interactive clusters with fast speed, and the ATC monitoring experience. I did not like the solution from a CI/CD perspective because it had a rigidity in terms of the approval process.
The solution grew from that original space and, by the time I had moved to Microsoft, was partnered with Microsoft Azure. An integration with ADF and other products solved the CI/CD issues for me.
I am now leading streaming platforms for Walmart so my interest is in the solution's streaming capabilities. I began building a streaming platform using Spark PM in Microsoft so the solution was its key competitor. Then the solution launched a vectorized machine on Photon for the Spark engine. Its performance was a key factor in moving from Microsoft because it performed much better than other products including opensource Spark, Microsoft Synapse Spark, and Dataproc.
What other advice do I have?
It is important to do POCs and run tests to control the meter that also controls the price. The meter can go really high from a computing perspective if POCs and settings are not streamlined.
I rate the solution an eight out of ten.
Which deployment model are you using for this solution?
On-premises
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Tech Lead Consultant | Manager Data Engineering at Ekimetrics
Simple to set up, fast to deploy, and with regular product updates
Pros and Cons
- "We can scale the product."
- "I would love an integration in my desktop IDE. For now, I have to code on their webpage."
What is our primary use case?
We're using it to provide a unified development experience for all our data experts, including all data engineers, data scientists, and IT engineers. With the Databrick Platform we allows teams to collaborate easily towards building Data Science models for our clients. The development environment allows us to ingest data from various data sources, scale the data processing and expose them either trough API or through enriched datasets made available to web app or dashboard leveraging the serverless capacities of SQL warehouse endpoints.
How has it helped my organization?
Databricks allowed us to offer an homogeneous development environment accross different accounts and domains, and also across different clouds. The upskilling of our employees is far more linear and faster, while removing the complexity of infrastructure management. This lead to an increased collaboration between domain thanks to a better onboarding experience, more performant pipelines and a smoother industrialization process. Overall client satisfaction has increased and the time to first insight has been reduced.
What is most valuable?
The shared experience of collaborative notebooks is probably the most useful aspect since, as an expert, it allows me to help my juniors debug their books and their code live. I can do some live coding with them or help them find the errors very efficiently.
It has become very simple to set up thanks to its official Terraform provider and the open-source modules made available on GitHub.
I love Databricks due to the fact that we can now deploy it in 15 minutes and it's ready to use. That's very nice since we often help our clients in deploying their first Data Platform with Databricks.
The solution is stable, with LTS Runtimes that have proven to remain stable over the years.
What needs improvement?
I would love to be able to declare my workflows as-code, in an Airflow-like way. This would help creating more robust ingestion python modules we can test, share and update within the company.
We would also love to have access to cluster metrics in a programmatic way, so that we can analyse hardware logs and identify potential bottlenecks to optimize.
Lastly, the latest VS Code extension has proven to be useful and appreciated by the community, as it allows to develop locally and benefits from traditional software best-practices tools like pre-commits for example.
For how long have I used the solution?
I've been using the solution for more than four years now, in the context of PoC to full end-to-end Data Platform deployment.
What do I think about the stability of the solution?
The product is very stable. I've been using it for three years now, and I have projects that have been running for three years without any big issues.
What do I think about the scalability of the solution?
It's very scalable. I have a project that started as a proof of concept on connected cars. We had 100 cars to track at first - just for the proof of concept. Now we have millions of cars that are being tracked. It scales very well. We have terabytes of data every day and it doesn't even flinch.
How are customer service and support?
I've had very good experiences with technical support where they answer me in a couple of hours. Sometimes it takes a bit longer. It's usually a matter of days, so it's very good overall.
Even if it took a bit of time, I got my answer. They never left me without an answer or a solution.
How would you rate customer service and support?
Positive
How was the initial setup?
The implementation is very simple to set up. That's why we choose it over many other tools. Its Terraform provider is our way-to-go for the initial setup has we are reusing templates to get a functional workspace in minutes.
Usually, we have two to five data engineers handling the maintenance and running of our solutions.
What about the implementation team?
We deploy it in-house.
What's my experience with pricing, setup cost, and licensing?
The solution is a bit expensive. That said, it's worth it. I see it as an Apple product. For example, the iPhone is very expensive, yet you get what you pay for.
The cost depends on the size of your data. If you have lots of data, it's going to be more expensive since your paper compute units will be more. My smallest project is around a hundred euros, and my most expensive is just under a thousand euros a week. That is based on terabytes of data processed each month.
Which other solutions did I evaluate?
We looked into Azure Synapse as an alternative, as well as Azure ML and Vertex on GCP. Vertex AI would be the main alternative.
Some people consider Snowflake a competitor; however, we can't deploy Snowflake ourselves just like we deploy Databricks ourselves. We use that as an advantage when we sell Databricks to our clients. We say, "If you go with us, we are going to deploy Databricks in your environment in 15 minutes," and they really like it.
Lately Fabric was released and can offer quite a similar product as Databricks. Yet, the user experience, the CI/CD capabilities and the frequent release cycle of Databricks remains a strong advantage.
What other advice do I have?
We're a partner.
We use the solution on various clouds. Mostly it is Aure. However, we also have Google and AWS as well.
One of the big advantages is that it works across domains. I'm responsible for a data engineering team. However, I work on the same platform with data scientists, and I'm very close to my IT team, who is in charge of the data access and data access control, and they can manage all the accesses from one point to all the data assets. It's very useful for me as a data engineer. I'm sure that my IT director would say it's very useful for him too. They managed to build a solution that can very easily cross responsibilities. It unifies all the challenges in one place and solves them all mostly.
I'd rate the solution nine out of ten.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Microsoft Azure
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Financial Analyst 4 (Supply Chain & Financial Analytics) at Juniper Networks
Easy to collaborate with other team members who are working on it
Pros and Cons
- "Databricks is hosted on the cloud. It is very easy to collaborate with other team members who are working on it. It is production-ready code, and scheduling the jobs is easy."
- "Databricks would have more collaborative features than it has. It should have some more customization for the jobs."
What is our primary use case?
We use the solution for reliability engineering, where we apply ML and Deep Learning models to identify the fear failure patterns across different geographies and products.
What is most valuable?
Databricks is hosted on the cloud. It is very easy to collaborate with other team members who are working on it. It is production-ready code, and scheduling the jobs is easy.
What needs improvement?
Databricks would have more collaborative features than it has. It should have some more customization for the jobs. Also, it has an average dashboarding tool. They can bring advanced features so we don't depend on other BI tools to build a dashboard. We are using Tableau to create a dashboard. If Databricks has more advanced features, we can entirely use Databricks.
For how long have I used the solution?
I have been using Databricks for one year.
What do I think about the stability of the solution?
The product is stable. It has been giving consistent outputs without any major issues.
What do I think about the scalability of the solution?
The solution is hosted on the cloud. It supports high scalability features.
10-20 users are using this solution.
How are customer service and support?
There was a training session from Databricks where they explained how to use it. We never had to contact them because they had already given us proper training on the platform.
Which solution did I use previously and why did I switch?
I have used Alteryx before. We switched to Databricks because it can compute and turn your code into production-ready code in very few seconds. Also, the stability is relatively high.
How was the initial setup?
The initial setup is easy.
What about the implementation team?
We have a dedicated team for the deployment.
What other advice do I have?
Delta Lake is a free system. We practically work on the data that we get from Snowflake. Databricks are returned to the model outputs that are returned to Delta Lake. It is easy for us to collaborate using Delta Lake, and the computation speed is also quite high for Delta Lake.
The learning curve for Databricks is not very steep. It's pretty easy, and you will find a lot of materials online. So, if you are comfortable coding in Python, it's very straightforward. There is nothing to worry about when using Databricks.
Overall, I rate the solution a ten out of ten.
Which deployment model are you using for this solution?
Public Cloud
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
A scalable and cost-effective solution that has excellent translation features and can be used for data analytics
Pros and Cons
- "It is a cost-effective solution."
- "The product should provide more advanced features in future releases."
What is our primary use case?
We use the solution for data analytics of industrial data.
What is most valuable?
We extensively use the product’s notebooks, jobs, and triggers. We can create activities. Wherever translation is required, we use Databricks. The product fulfills our customer requirements. It is a cost-effective solution.
What needs improvement?
The product should provide more advanced features in future releases.
For how long have I used the solution?
I have been using the solution for six months.
What do I think about the stability of the solution?
Our data was not too huge. It worked well. It is easily adaptable.
What do I think about the scalability of the solution?
The tool is scalable. We can make it available for a larger audience.
How was the initial setup?
The initial setup is not that difficult. I rate the ease of setup a seven out of ten. The solution is cloud-based. We use native services like Data Factory for orchestration. Sometimes, the customers require us to use Amazon as the cloud provider instead of Azure.
What's my experience with pricing, setup cost, and licensing?
The pricing is average.
What other advice do I have?
There are many services which are coming up. They are still in the preview stage. Overall, I rate the product an eight out of ten.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Microsoft Azure
Disclosure: My company has a business relationship with this vendor other than being a customer. Partner
Computer Scientist at Adobe
Pumps up performance and the processing power; comes with helpful Lakehouse and SQL environments
Pros and Cons
- "When we have a huge volume of data that we want to process with speed, velocity, and volume, we go through Databricks."
- "I believe that this product could be improved by becoming more user-friendly."
What is our primary use case?
Our primary use case is for data analytics. Essentially, we use it for the financial reporting for Adobe.
How has it helped my organization?
The way Databricks has improved my organization is definitely through giving us improved performance and the processing power. We are usually never able to achieve it using traditional data warehouses. When we have a huge volume of data that we want to process with speed, velocity, and volume, we go through Databricks.
What is most valuable?
The features I found most helpful with Databricks are the Lakehouse and SQL environments.
What needs improvement?
I believe that this product could be improved by becoming more user-friendly.
In the next release, I would like to see more flexibility in the dashboard. It has plenty of features but it can be enhanced so that it matches with other visualization tools, like Power BI and Tableau. Also, the integrations with other tools could be better.
For how long have I used the solution?
I have been using Databricks for the last three years.
What do I think about the stability of the solution?
I would rate the stability of Databricks an eight, on a scale from one to 10, with one being the worst and 10 being the best.
What do I think about the scalability of the solution?
I would rate the scalability of this solution a nine, on a scale from one to 10, with one being the worst and 10 being the best. I would say there are around 2,000 to 3,000 users of this solution in our organization.
How are customer service and support?
I've been in contact with the Databricks support team and received timely support from them. I would rate their support an eight, on a scale from one to 10, with one being the worst and 10 being the best.
How would you rate customer service and support?
Positive
Which solution did I use previously and why did I switch?
Prior to Databricks, we initially used Hadoop. Afterwards, we used HANA, SAP HANA, and the Microsoft SQL Server.
How was the initial setup?
The initial setup was relatively straightforward. I would rate it nine, on a scale from one to 10, with one being the easiest and 10 being the hardest.
There is no need to worry about the deployment as it can be done quickly. It is relatively automated. We used Terraform for auto-deployment, which happens in Azure. With Terraform, there are two options. As option one, you can deploy manually by creating services. For option two, you use Terraform and automate. Terraform is like infrastructure as a code where you can code the deployment part using it.
There were two or three persons involved in the deployment of this solution.
What other advice do I have?
The new version of the Databricks solution requires code maintenance. This is done by the platform team.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Principal at a computer software company with 5,001-10,000 employees
Has advanced modeling and machine-learning features; highly scalable, with no stability issues
Pros and Cons
- "What I like about Databricks is that it's one of the most popular platforms that give access to folks who are trying not just to do exploratory work on the data but also go ahead and build advanced modeling and machine learning on top of that."
- "I have had some issues with some of the Spark clusters running on Databricks, where the Spark runtime and clusters go up and down, which is an area for improvement."
What is our primary use case?
I've worked with Databricks primarily in the pharmaceuticals and life sciences space, which means a lot of work on patient-level data and the predictive analytics around that.
Another use case for Databricks is in the manufacturing industry. I'm a consultant, so the use cases for the product vary, but my primary use case for it is in the pharma space.
What is most valuable?
From a data science and applied analytics perspective, what I like about Databricks is that it's probably one of the most popular platforms that give access to folks who are trying not just to do exploratory work on the data but also go ahead and build advanced modeling and machine learning on top of that, and then go ahead and make that available for dissemination of insights. For example, you can save all data and build out endpoints, so business analysts and users can access that data through a dashboard.
During the process, I also like that Databricks allows you to do portion control to keep track of your operations on the data and maintain that lineage to create reproducible results.
The most significant Databricks advantage is that you can do everything within the platform. You don't need to exit the platform because it's a one-stop shop that can help you do all processes.
The solution is top-notch from a data science, applied ML, or advanced analytics perspective.
What needs improvement?
I have had some issues with some of the Spark clusters running on Databricks, where the Spark runtime and clusters go up and down, which is an area for improvement. Still, I am generally unaware of any super-critical issues.
For how long have I used the solution?
My experience with Databricks is two and a half years.
What do I think about the stability of the solution?
Databricks stability is an eight out of ten because I never had issues with its stability.
What do I think about the scalability of the solution?
Databricks has high scalability. Most of my work on the solution has been in the pharma space, which has massive data sets, so it's a nine out of ten, scalability-wise.
How are customer service and support?
I've never dealt with the Databricks technical support team.
How was the initial setup?
I don't have experience setting up Databricks because that's generally taken care of by the IT, data, or software engineering team before the data science team comes in and starts leveraging the platform. I have yet to experience setting up the Databricks environment personally. However, I have had experience setting up clusters, which was pretty straightforward. Still, in the overall environment of an enterprise-wide system, I have yet to gain experience setting Databricks up.
What's my experience with pricing, setup cost, and licensing?
The cost for Databricks depends on the use case. I work on it as a consultant, so I'm using the client's Databricks, so it depends on how big the client is. If it's a global organization, that cost varies versus a smaller organization that has just adopted the platform and is trying to onboard a small team of five people. It depends.
What other advice do I have?
I'm a data scientist, so I frequently use Databricks and Domino Data Science Platform.
I'm a consultant, so every client has a different version or a different runtime in Databricks, so the versions used would vary per client.
The deployment for the solution is on the cloud, predominantly on AWS or Azure.
My clients adopted Databricks as the platform of choice, and with different use cases and more teams coming on board, the usage of Databricks will increase. I don't see that going down. It can only go up.
My advice to anyone looking into implementing Databricks is that it should be one of your top choices, especially if you're looking to focus on data processing, standard ETL operations, advanced analytics, or the ML type of work.
I'd rate the solution as nine out of ten. It checks almost all the boxes that modern applications need to have.
My organization is an active partner and implementer of Databricks, but it doesn't resell the solution.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Amazon Web Services (AWS)
Disclosure: My company has a business relationship with this vendor other than being a customer. Partner
Buyer's Guide
Download our free Databricks Report and get advice and tips from experienced pros
sharing their opinions.
Updated: March 2026
Product Categories
Cloud Data Warehouse Data Science Platforms Data Management Platforms (DMP) Streaming AnalyticsPopular Comparisons
Teradata
Azure Data Factory
Palantir Foundry
Qlik Talend Cloud
Snowflake
KNIME Business Hub
IBM SPSS Statistics
Dataiku
Alteryx
Amazon SageMaker
Microsoft Azure Machine Learning Studio
Coralogix
Confluent
Altair RapidMiner
OpenText Analytics Database (Vertica)
Buyer's Guide
Download our free Databricks Report and get advice and tips from experienced pros
sharing their opinions.
Quick Links
Learn More: Questions:
- Which do you prefer - Databricks or Azure Machine Learning Studio?
- How would you compare Databricks vs Amazon SageMaker?
- Which would you choose - Databricks or Azure Stream Analytics?
- Which product would you choose for a data science team: Databricks vs Dataiku?
- Which ETL or Data Integration tool goes the best with Amazon Redshift?
- What are the main differences between Data Lake and Data Warehouse?
- What are the benefits of having separate layers or a dedicated schema for each layer in ETL?
- What are the key reasons for choosing Snowflake as a data lake over other data lake solutions?
- Are there any general guidelines to allocate table space quota to different layers in ETL?
- What cloud data warehouse solution do you recommend?






















