We use this solution for advanced civilization power.
Vice President - Data Engineering and Analytics at a financial services firm with 10,001+ employees
A good, but expensive, web-based platform for automated cluster management with some coding limitations
Pros and Cons
- "We like that this solution can handle a wide variety and velocity of data engineering, either in batch mode or real-time."
- "We looked at both Snowflake and BigQuery as a comparison with this solution, and we chose this product as it offered more scalability and a higher level of security, which is extremely important in our banking environment."
- "This solution only supports queries in SQL and Python, which is a bit limiting."
- "This is a fairly expensive solution for any service outside of the basic package, and costs can add up quite quickly if there are large scaling requirements."
What is our primary use case?
What is most valuable?
We like that this solution can handle a wide variety and velocity of data engineering, either in batch mode or real-time.
This product allows us to write the email models in a way that allows us to take the advantage of the parallel scaling computer window backend on any of the satellite services.
What needs improvement?
This solution only supports queries in SQL and Python, which is a bit limiting.
This is a fairly expensive solution for any service outside of the basic package, and costs can add up quite quickly if there are large scaling requirements.
What do I think about the stability of the solution?
This is a stable solution in our experience.
Buyer's Guide
Databricks
March 2026
Learn what your peers think about Databricks. Get advice and tips from experienced pros sharing their opinions. Updated: March 2026.
885,286 professionals have used our research since 2012.
What do I think about the scalability of the solution?
We have found that part of the beauty of this platform is that it is easy to scale and expand.
How are customer service and support?
The support for this product uses Microsoft as a middle man, and due to this there have been times when we experienced communication delays, as well as misunderstandings of what our issues are.
How was the initial setup?
The initial setup for this solution is very simple.
What's my experience with pricing, setup cost, and licensing?
The basic version of this solution is now open-source, so there are no license costs involved. However, there is a charge for any advanced functionality and this can be quite expensive.
Which other solutions did I evaluate?
We looked at both Snowflake and BigQuery as a comparison with this solution. We choose this product as it offered more scalability and a higher level of security, which is extremely important in our banking environment.
What other advice do I have?
We would rate this solution an eight out of ten.
Which deployment model are you using for this solution?
On-premises
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
A stable, scalable solution that simplifies the development process but needs more debuggers and components
Pros and Cons
- "The simplicity of development is the most valuable feature."
- "The simplicity of development is the most valuable feature."
- "Databricks has a lack of debuggers, and it would be good to see more components."
- "Databricks has a lack of debuggers, and it would be good to see more components."
What is our primary use case?
We use the solution for data engineering.
How has it helped my organization?
The tool helps us manage large amounts of data.
What is most valuable?
The simplicity of development is the most valuable feature.
What needs improvement?
Databricks has a lack of debuggers, and it would be good to see more components.
Another issue is that the D4 data format keeps changing on our cluster. This doesn't affect me much because I use functions to define it, but it is very frustrating for some more casual users. One day the output will be in a particular format, and then it becomes an object without us changing the cluster configuration. As a small team, we don't have the capacity to dig deeply into the issue, which has been frustrating.
For how long have I used the solution?
We have been using the solution for three years.
What do I think about the stability of the solution?
The solution's stability is good.
What do I think about the scalability of the solution?
The product is scalable. We're a small organization with 12 users, and we don't currently have any plans to increase our usage.
What was our ROI?
We see an ROI from Databricks.
What other advice do I have?
I would rate the solution seven out of ten.
It's a good solution and more for handling large amounts of data. Databricks is better as a batch processing system than as an interactive system. The performance is a little disappointing because the memory processing is supposed to be excellent, but it's not as competitive as some other solutions out there in this regard. Even classical databases can respond and process faster.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Buyer's Guide
Databricks
March 2026
Learn what your peers think about Databricks. Get advice and tips from experienced pros sharing their opinions. Updated: March 2026.
885,286 professionals have used our research since 2012.
Sr Data Engineer at PIMCO
Supports several coding languages, good performance, and facilitates team collaboration
Pros and Cons
- "The load distribution capabilities are good, and you can perform data processing tasks very quickly."
- "The most valuable feature is the versatility of the ecosystem."
- "In the future, I would like to see Data Lake support. That is something that I'm looking forward to."
- "The cost of this solution is high, on the expensive side."
What is our primary use case?
Our primary use case is ETL.
How has it helped my organization?
Using Databricks enables us to use the Data Mesh methodology, where every team performs their own ETL.
What is most valuable?
The most valuable feature is the versatility of the ecosystem. You can write code in SQL, Python, or Java.
The load distribution capabilities are good, and you can perform data processing tasks very quickly.
You can save and share notebooks between different teams.
The interface is easy to use.
What needs improvement?
The cost of this solution is high, on the expensive side.
In the future, I would like to see Data Lake support. That is something that I'm looking forward to.
For how long have I used the solution?
I worked with Databricks for approximately two years in my previous company.
What do I think about the scalability of the solution?
This is a very scalable solution. We have twenty-five data engineers that use it, and we may grow our usage.
How are customer service and support?
The technical support is okay. I would rate them a seven out of ten.
How would you rate customer service and support?
Neutral
Which solution did I use previously and why did I switch?
We did not use another similar solution prior to Databricks.
How was the initial setup?
The cloud-based deployment is simple.
If you use an on-premises deployment then there is more to do.
What about the implementation team?
We deployed it with our in-house team.
There is no maintenance required.
What was our ROI?
We have seen a return on our investment with Databricks.
What's my experience with pricing, setup cost, and licensing?
Price-wise, I would rate Databricks a three out of five.
Which other solutions did I evaluate?
When we looked into Databricks, we evaluated Azure Data Factory and some of the others on the market. We found that Databricks was one of the easiest ones to use.
What other advice do I have?
My advice for anybody that is looking into Databricks is not to use the on-premises deployment. Instead, use the cloud-based setup.
In summary, this is a good product.
I would rate this solution an eight out of ten.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Microsoft Azure
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Team Lead at a tech services company with 1,001-5,000 employees
Gives us the ability to write analytics code in multiple languages
Pros and Cons
- "Databricks provides a consistent interface for data engineers to work with data in a consistent language on a single integrated platform for ingesting, processing, and serving data to the end user."
- "Databricks is a one-stop shop for everything data related, and it can scale with you."
- "Databricks would benefit from enhanced metrics and tighter integration with Azure's diagnostics."
- "I would like to see improvement with the UI. It is functional and useful, but it's a bit clunky at times."
What is our primary use case?
We use Databricks for batch data processing and stream data processing.
How has it helped my organization?
Databricks provides a consistent interface for data engineers to work with data in a consistent language on a single integrated platform for ingesting, processing, and serving data to the end user.
What is most valuable?
The flexibility of Databricks is the most valuable feature. It gives us the ability to write analytics code in multiple languages.
There is a single workspace for different data roles like data engineers, machine learning engineers, and the end user, who can connect to the same system.
Databricks computes separate from storage, so you are not coupled with the underlying data sets, allowing for multiple processes and multiple programs to be written on the same code.
What needs improvement?
I would like to see improvement with the UI. It is functional and useful, but it's a bit clunky at times. It should be more user-friendly.
In future releases, Databricks would benefit from enhanced metrics and tighter integration with Azure's diagnostics.
For how long have I used the solution?
I have been using Databricks for eight months.
What do I think about the stability of the solution?
Databricks is very stable.
What do I think about the scalability of the solution?
The scalability of this solution is good. In our organization, users include analysts, data engineers, and data scientists.
How are customer service and support?
I would give Databrick service and support a four and a half out of five overall.
How would you rate customer service and support?
Positive
Which solution did I use previously and why did I switch?
Prior to using Databricks, we used Azure Stream Analytics. We made the switch because of the scalability and integrated platform.
How was the initial setup?
The initial setup of Databricks is more complex. I would rate it a four out of five on the complexity of the setup. It took two days to deploy the solution.
What about the implementation team?
We used a third party for some of the implementations of Databricks. The number of staff required to deploy and maintain this solution depends on the number of processes you have. Due to the cloud nature of the technology, it is easy to deploy and maintain.
What's my experience with pricing, setup cost, and licensing?
The licensing of Databricks is a tiered licensing regime, so it is flexible. I feel their pricing is a five out of five.
What other advice do I have?
Databricks is a one-stop shop for everything data related, and it can scale with you.
I would rate this solution a 9.5 out of 10 overall.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Microsoft Azure
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Head CEO at bizmetric
A user-friendly and customizable solution that offers excellent integration
Pros and Cons
- "The solution is built from Spark and has integration with MLflow, which is important for our use case."
- "Databricks is also user-friendly, providing customizable codes and models that allow people to experiment quickly."
- "The ability to customize our own pipelines would enhance the product, similar to what's possible using ML files in Microsoft Azure DevOps."
- "The ability to customize our own pipelines would enhance the product, similar to what's possible using ML files in Microsoft Azure DevOps."
What is our primary use case?
Our use case is confidential, but I can say we use it for a deep learning model for machine learning.
What is most valuable?
The solution is built from Spark and has integration with MLflow, which is important for our use case.
Databricks is also user-friendly, providing customizable codes and models that allow people to experiment quickly.
Integration of Delta Lake is another useful feature.
What needs improvement?
Writing pandas-profiling reports could be easier.
The ability to customize our own pipelines would enhance the product, similar to what's possible using ML files in Microsoft Azure DevOps.
For how long have I used the solution?
I have been using this product for one and a half years.
What do I think about the stability of the solution?
For now the solution seems stable.
What do I think about the scalability of the solution?
The solution is easy to scale horizontally and it has a useful auto-scaling feature. For vertical scaling, you need to bring the system down and make some adjustments.
On my current project I have a team of 30 members under me, including data engineers and data science people. Our data science, engineering, and MLOps projects are expanding, so we are planning to do some vertical scaling to increase the team size to over 100 members. In our company, we are trying to certify more and more people in Databricks because it's cloud-agnostic.
How are customer service and support?
We have never needed to contact customer support, online resources have been sufficient to solve our problems.
How was the initial setup?
The initial setup of the solution is straightforward, once you understand the UI it is easy to implement. I would rate Databricks a four out of five for ease of setup.
One migration project took two to three months, including writing all the code and implementing end-to-end pipelines.
We are planning to deploy the solution in stages over the next 15 months to completely implement MLOps for our organization.
What's my experience with pricing, setup cost, and licensing?
I'm not involved in the financing, but I can say that the solution seemed reasonably priced compared to the competitors. Similar products are usually in the same price range. With five being affordable and one being expensive, I would rate Databricks a four out of five.
I find that deployed systems work out cheaper than having to operate manually, which appeals to our customers.
What other advice do I have?
I would rate this solution an eight out of ten.
There is an issue where clusters are automatically deleted after termination or after 100 days of non-usage. This could be more user-friendly, and they could include an enabler to pin the clusters you want to keep, instead of having to go and research why clusters got deleted after implementing the product. That documentation needs to be right in front of the user to avoid issues.
I definitely recommend this product to other users.
Which deployment model are you using for this solution?
Hybrid Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Microsoft Azure
Disclosure: My company has a business relationship with this vendor other than being a customer. Partner
Technical Architect at Infosys
Enables us to find anomalies and apply rules to the streaming data
Pros and Cons
- "The ability to stream data and the windowing feature are valuable."
- "The ability to stream data and the windowing feature are valuable."
- "Support for Microsoft technology and the compatibility with the .NET framework is somewhat missing."
- "Databricks is still having some stability issues."
What is our primary use case?
We use this solution for finding anomalies and applying the rules to the streaming data.
There are around 50 people using this solution in my organization, including data scientists.
What is most valuable?
The ability to stream data and the windowing feature are valuable. There are a number of targeted integration points, so that is a difference between Stream Analytics and Databricks. The integrations input or output are better in Databricks. It's accessible to use any of the Python or even Java. I can use the third party, deploy it, and use it.
What needs improvement?
Support for Microsoft technology and the compatibility with the .NET framework is somewhat missing. There should be reliability between these two. Databricks is based on open sources. If it's more synchronous between the Microsoft technology and the programming languages, it'll be better. Python has better languages, but compatibility would be a great help.
I would like to have better support for Microsoft technology and better language components.
With Azure or Cosmo DB, I can store other data links or time series data tables. That would be a great help for analytics in real time.
For how long have I used the solution?
I have been using Databricks for eight months.
What do I think about the scalability of the solution?
The scalability is fine. We had thousands of devices and were sending data infrequently, so that worked for us. If the amount increases, the windowing function and job schedule may not perform as expected.
How are customer service and support?
I would rate technical support 4 out of 5. We had some issues with setup, and they were finally solved but it was after following up a few times.
Which solution did I use previously and why did I switch?
Azure Stream Analytics is easy to use and easy to deploy. It's a little bit better. Databricks is still having some stability issues. Azure Stream Analytics has a few input and output sources, and it's scalable to all types of third party or interfaces.
How was the initial setup?
Setup was complex. There were some issues with setting up a database and installing the third party component on top of services. I would rate the setup 3 out of 5.
What about the implementation team?
Implementation was done in-house.
What's my experience with pricing, setup cost, and licensing?
The cost is around $600,000 for 50 users.
I would rate the price 2 out of 5.
What other advice do I have?
I would rate this solution 8 out of 10.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Helps users with data processing and analytics
Pros and Cons
- "The tool helps with data processing and analytics with large-scale data or big data since it is associated with managing data at a large scale."
- "The biggest problem associated with the product is that it is quite pricey."
What is our primary use case?
I use Databricks to manage the setting up of data lakes for SaaS.
What needs improvement?
The biggest problem associated with the product is that it is quite pricey. We cannot find a better solution than Databricks in the market currently.
For how long have I used the solution?
I have been using Databricks for a year.
What's my experience with pricing, setup cost, and licensing?
It is an expensive tool. The licensing model is a pay-as-you-go one.
What other advice do I have?
The tool helps with data processing and analytics with large-scale data or big data since it is associated with managing data at a large scale.
For my general use cases, I would say that I am not a technical person, so I cannot explain to you how the tool helps with the area of data engineering tasks.
There is another team in my company that is involved in the use of machine learning and AI features in Databricks. My team is mostly into operations. The tool is used in a multi-country project.
For example, in my company, they make some shopping decisions related to solutions based on what is the product chosen by the whole company.
I rate the tool an eight out of ten.
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Strategic Alliances& Ecosystems Manager at a outsourcing company with 501-1,000 employees
Helps to have a good data presence but needs to incorporate learning aspects
Pros and Cons
- "Databricks has helped us have a good presence in data."
- "The product should incorporate more learning aspects. It needs to have a free trial version that the team can practice."
What is our primary use case?
The product has helped in data fabrication.
How has it helped my organization?
Databricks has helped us have a good presence in data.
What needs improvement?
The product should incorporate more learning aspects. It needs to have a free trial version that the team can practice.
For how long have I used the solution?
I have been using the product for more than six months.
What do I think about the stability of the solution?
I rate Databricks' an eight out of ten.
What do I think about the scalability of the solution?
I rate the tool's scalability an eight out of ten.
How was the initial setup?
The transition to Databricks was smooth.
What's my experience with pricing, setup cost, and licensing?
Databricks' price is high.
What other advice do I have?
I rate the solution a nine out of ten.
Disclosure: My company has a business relationship with this vendor other than being a customer.
Buyer's Guide
Download our free Databricks Report and get advice and tips from experienced pros
sharing their opinions.
Updated: March 2026
Product Categories
Cloud Data Warehouse Data Science Platforms Data Management Platforms (DMP) Streaming AnalyticsPopular Comparisons
Teradata
Azure Data Factory
Palantir Foundry
Qlik Talend Cloud
Snowflake
KNIME Business Hub
IBM SPSS Statistics
Dataiku
Alteryx
Amazon SageMaker
Microsoft Azure Machine Learning Studio
Coralogix
Confluent
Altair RapidMiner
OpenText Analytics Database (Vertica)
Buyer's Guide
Download our free Databricks Report and get advice and tips from experienced pros
sharing their opinions.
Quick Links
Learn More: Questions:
- Which do you prefer - Databricks or Azure Machine Learning Studio?
- How would you compare Databricks vs Amazon SageMaker?
- Which would you choose - Databricks or Azure Stream Analytics?
- Which product would you choose for a data science team: Databricks vs Dataiku?
- Which ETL or Data Integration tool goes the best with Amazon Redshift?
- What are the main differences between Data Lake and Data Warehouse?
- What are the benefits of having separate layers or a dedicated schema for each layer in ETL?
- What are the key reasons for choosing Snowflake as a data lake over other data lake solutions?
- Are there any general guidelines to allocate table space quota to different layers in ETL?
- What cloud data warehouse solution do you recommend?






















