Try our new research platform with insights from 80,000+ expert users
Monalisha Nayak - PeerSpot reviewer
Senior Data Engineer at a energy/utilities company with 10,001+ employees
Real User
Top 5
Nov 17, 2024
Transformative data analytics with enhanced AI functionalities and good value for money
Pros and Cons
  • "It offers AI functionalities that assist with code management and machine learning processes."
  • "While Databricks is generally a robust solution, I have noticed a limitation with debugging in the Delta Live Table, which could be improved."

What is our primary use case?

Databricks is used for transformations and streaming data processing. We utilize it primarily for data analytics, including the use of Delta Lake and Delta Life tables for ETL processes, dashboards for analysis, and the Unity catalog for role management.

How has it helped my organization?

Databricks improves our data analysis tasks with its powerful functionality, offering real-time analytics and machine learning features that help improve model accuracy. It is easy to use, which helps in saving time and, ultimately, costs.

What is most valuable?

The most valuable features of Databricks include the Delta Lake, a user-friendly interface, Delta Life tables for ETL, dashboard features for analysis, and the Unity catalog for role management. It also offers AI functionalities that assist with code management and machine learning processes.

What needs improvement?

While Databricks is generally a robust solution, I have noticed a limitation with debugging in the Delta Live Table, which could be improved. The issue with Delta type tables not loading into multiple places in a single pipeline has been fixed recently.

Buyer's Guide
Databricks
December 2025
Learn what your peers think about Databricks. Get advice and tips from experienced pros sharing their opinions. Updated: December 2025.
879,259 professionals have used our research since 2012.

For how long have I used the solution?

I have been working with Databricks for four years.

How are customer service and support?

We regularly contact Databricks support and are satisfied with their service. I would rate them eight out of ten.

How would you rate customer service and support?

Positive

How was the initial setup?

The initial setup was straightforward after the first week. Deployment processes became quick and efficient using Git.

What's my experience with pricing, setup cost, and licensing?

In terms of cost-effectiveness, Databricks is worth the money.

What other advice do I have?

I'd rate the solution nine out of ten.

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Solution Architect at a insurance company with 10,001+ employees
Real User
Jan 4, 2023
A nice interface with good features for turning off clusters to save on computing
Pros and Cons
  • "There are good features for turning off clusters."
  • "It would be nice to have more guidance on integrations with ETLs and other data quality tools."

What is our primary use case?

Our company uses the solution for big data and as an interface for analytics. 

We also create custom APIs to get data and provide SQL endpoints so users can access it over traditional tools like JDVC or ODBC. 

We use the solution on AWS and Azure. The data lake is wide open for departmental use. We have ten departments and two or three people from each department access the solution. 

How has it helped my organization?

The platform as a service allows us to ramp up a new database pretty fast. We deploy some of the infrastructure as a code. End users can access data immediately and connect with Power BI for reporting. 

What is most valuable?

There are good features for turning off clusters. Basically, if we aren't using it, then it is turned off. When a user starts accessing, it starts up so we save on computing. 

Our data lake team likes the interface very much because it is straightforward. Of, course you need to understand the different clusters when they are started. 

There are nice features for matching the learning and analytics. 

The security features allow us to integrate with the active directory and assign different people to different databases. 

The solution has good a good interface with Python. 

There is good integration with Azure so we can access the solution over the standard Azure interface and use the storage pro measure. 

What needs improvement?

It would be nice to have more guidance on integrations with ETLs and other data quality tools. The solution is not really a product for ETL or data quality so we use other DBT tools. 

For how long have I used the solution?

I have been using the solution for four months but my company has been using it for one year. 

What do I think about the stability of the solution?

The solution is very stable with no issues so I rate stability a ten out of ten. 

What do I think about the scalability of the solution?

The solution is scalable to the cluster size and Azure storage. 

Scalability is rated an eight out of ten. 

How are customer service and support?

I have not used technical support. 

The company has regular calls with Databricks and they are pretty good but are more on the technical presale side. 

Which solution did I use previously and why did I switch?

We previously used Azure's data lake product and possibly some Hortonworks. 

How was the initial setup?

The setup is not easy but also is not too complicated. An infrastructure needs to be set up first. We use Azure storage or SQL S3 and create private end points. 

This is maybe a little more complex or a bit different than other databases in the cloud. For a traditional setup, you need to also think about file systems and disks. Here, you just transform it into the storage and private end point. 

The first setup might be a bit of a struggle until you learn and understand what is necessary. 

What about the implementation team?

We implemented the solution in-house with support from Databricks. Two team members were involved in the implementation. 

Three team members handle ongoing development and maintenance. 

What's my experience with pricing, setup cost, and licensing?

The solution is affordable. 

What other advice do I have?

The solution is pretty good because it uses Azure's data lake storage. It is basically the tool on top that provides the SQL interface and APIs for Python. I like the solution because it enables people to work with it.

I rate the solution a nine out of ten. 

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Buyer's Guide
Databricks
December 2025
Learn what your peers think about Databricks. Get advice and tips from experienced pros sharing their opinions. Updated: December 2025.
879,259 professionals have used our research since 2012.
Business Architect at a computer software company with 5,001-10,000 employees
Real User
Dec 16, 2022
Very quick run time but there are some limitations for legacy integrations
Pros and Cons
  • "The solution is an impressive tool for data migration and integration."
  • "The solution has some scalability and integration limitations when consolidating legacy systems."

What is our primary use case?

Our company uses the solution for series-based and panel-based migrations. We collect and store user requirements, use apps to fetch data, and provide customers with better data for business reports. There are 30 to 40 users in our company.

What is most valuable?

The solution is an impressive tool for data migration and integration. 

The run time is very quick.

What needs improvement?

The solution has some scalability and integration limitations when consolidating legacy systems.

For how long have I used the solution?

I have been using the solution for two years. 

What do I think about the stability of the solution?

The solution is stable. 

What do I think about the scalability of the solution?

It is not really scalability but more about the combination of the structure, consolidation, and different formats we can split and merge. We do a lot of things while storing the target operational model. Snowflake is more flexible and scalable in that regard. 

How are customer service and support?

We have contacted technical support a lot about replicating values in PDF files. So far, they have not been able to provide a viable solution. 

How was the initial setup?

The setup is of average difficulty but tougher than Snowflake. 

Deployment is easy and run time is quick. 

What about the implementation team?

We implemented the solution in-house.

One resource manages services for end-to-end monitoring and maintenance activities. 

What's my experience with pricing, setup cost, and licensing?

The solution is based on a licensing model. Updates occur automatically by the task base. 

Which other solutions did I evaluate?

Snowflake is quite impressive in comparison to the solution because there is flexibility in the way you consolidate. In contrast, the solution has some scalability and integration limitations when consolidating legacy systems. Tool wise, Snowflake is easy from the technical perspective because connectors are included.  

We are evaluating options for one particular use case. The customer wants to replicate values from PDFs and enter them in the data model. We contacted the solution's technical support but do not yet have a viable answer. There are gaps in what we do and how we capture. The only option right now is for the customer to manually upload values that we integrate using Synapse to consolidate report data. We haven't yet found another tool that maps to meet our customer's requirement. 

What other advice do I have?

I rate the solution a seven out of ten. 

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Other
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Heba Ismail - PeerSpot reviewer
Senior Data Engineer at a computer software company with 1,001-5,000 employees
Real User
Top 10
Nov 6, 2024
Enhancing data integration and processing across cloud services with seamless transformations
Pros and Cons
  • "It helps integrate data science and machine learning capabilities."
  • "Performance could be improved."

What is our primary use case?

I work in a project where I build data pipelines using Azure Data Factory. I ingest data from on-premises to Azure Data Lake. After that, I perform transformations using Databricks notebooks and Spark, building the Databricks bronze, silver, and gold layers. We export reports from the gold layer.

How has it helped my organization?

Recently, we started using Databricks in our organization. It helps integrate data science and machine learning capabilities.

What is most valuable?

The Unity Catalog is a central governance for all data around the workspaces, and also Databricks' integration capabilities with cloud services like Azure Event Hub and Azure Data Factory. It is user-friendly for data processing, and Spark is a strong language for big data processing.

What needs improvement?

Performance could be improved. It is crucial to check coding, configure Spark correctly, implement caching, and monitor performance metrics to enhance performance.

For how long have I used the solution?

I have used Databricks for over two years.

What do I think about the stability of the solution?

I would rate stability as eight out of ten. It is quite stable.

What do I think about the scalability of the solution?

Databricks is perfect for scalability. It is easy to scale clusters.

How are customer service and support?

I haven't faced any issues requiring customer support, so I don't have experience with their customer support.

How would you rate customer service and support?

Positive

Which solution did I use previously and why did I switch?

We used Informatica before, which is perfect for data management solutions. We started using Databricks for its capabilities in data science and machine learning.

How was the initial setup?

I would rate the initial setup as nine out of ten. It is quite easy for someone experienced with Spark.

What's my experience with pricing, setup cost, and licensing?

For my company, it's okay to upgrade to Databricks because it's comparable in price to Informatica. It is not considered expensive for the company.

Which other solutions did I evaluate?

For machine learning, I used Python and its libraries manually. Prior to Databricks, there was no special tool used for these purposes.

What other advice do I have?

If a company focuses on data science and machine learning, I recommend using Databricks. It's a great solution in this field. For data management needs, Informatica is advantageous due to its comprehensive tools.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Microsoft Azure
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Karan  Sharma - PeerSpot reviewer
Data Analyst at a financial services firm with 10,001+ employees
Real User
Aug 21, 2023
An easy to setup tool that provides its users with an insight into the metadata of the data they process
Pros and Cons
  • "The initial setup phase of Databricks was good."
  • "Scalability is an area with certain shortcomings. The solution's scalability needs improvement."

What is our primary use case?

My company uses Databricks to process real-time and batch data with its streaming analytics part. We use Databricks' Unified Data Analytics Platform, for which we have Azure as a solution to bring the unified architecture on top of that to handle the streaming load for our platform.

What is most valuable?

The most valuable feature of the solution stems from the fact that it is quite fast, especially regarding features like its computation and atomicity parts of reading data on any solution. We have a storage account, and we can read the data on the go and use that since we now have the unity catalog in Databricks, which is quite good for giving you an insight into the metadata of the data you're going to process. There are a lot of things that are quite nice with Databricks.

What needs improvement?

Scalability is an area with certain shortcomings. The solution's scalability needs improvement.

For how long have I used the solution?

I have been using Databricks for a few years. I use the solution's latest version. Though currently my company is a user of the solution, we are planning to enter into a partnership with Databricks.

What do I think about the stability of the solution?

It is a stable solution. Stability-wise, I rate the solution an eight to nine out of ten.

What do I think about the scalability of the solution?

It is a scalable solution. Scalability-wise, I rate the solution an eight to nine out of ten.

My company has a team of 50 to 60 people who use the solution.

How are customer service and support?

Sometimes, my company does need support from the technical team of Databricks. The technical team of Databricks has been good and helpful. I rate the technical support an eight out of ten.

How would you rate customer service and support?

Positive

How was the initial setup?

The initial setup phase of Databricks was good. You can spin up clusters and integrate those with DevOps as well. Databricks it's quite nice owing to its user-friendly UI, DPP, and workspaces.

The solution is deployed on the cloud.

The time taken for the deployment depends on the workload.

What's my experience with pricing, setup cost, and licensing?

I cannot judge whether the product is expensive or cheap since I am unaware of the prices of the other products, which are competitors of Databricks. The licensing costs of Databricks depend on how many licenses we need, depending on which Databricks provides a lot of discounts.

What other advice do I have?

It is a state-of-the-art product revolutionizing data analytics and machine learning workspaces. Databricks are a complete solution when it comes to working with data.

I rate the overall product an eight out of ten.

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Jeremy Salt - PeerSpot reviewer
Sr. Data Quality Analyst at a computer software company with 11-50 employees
Real User
Jan 22, 2023
Can use different technologies to do data analysis and can quickly get data
Pros and Cons
  • "Databricks makes it really easy to use a number of technologies to do data analysis. In terms of languages, we can use Scala, Python, and SQL. Databricks enables you to run very large queries, at a massive scale, within really good timeframes."
  • "Databricks has added some alerts and query functionality into their SQL persona, but the whole SQL persona, which is like a role, needs a lot of development. The alerts are not very flexible, and the query interface itself is not as polished as the notebook interface that is used through the data science and machine learning persona. It is clunky at present."

What is our primary use case?

We use it for data analysis and testing of high volume web user behavioral data.

What is most valuable?

Databricks makes it really easy to use a number of technologies to do data analysis. In terms of languages, we can use Scala, Python, and SQL. Databricks enables you to run very large queries, at a massive scale, within really good timeframes.

I'm starting to build a solution using Delta Live Tables and Delta Live pipelines, and it is proving to be exceptionally easy to use. I have also been able to quickly implement a pipeline.

What needs improvement?

Databricks has added some alerts and query functionality into their SQL persona, but the whole SQL persona, which is like a role, needs a lot of development. The alerts are not very flexible, and the query interface itself is not as polished as the notebook interface that is used through the data science and machine learning persona. It is clunky at present.

For how long have I used the solution?

I've been using Databricks for a year.

What do I think about the stability of the solution?

It is a stable and reliable solution. I'd rate stability at eight out of ten.

What do I think about the scalability of the solution?

Databricks is absolutely scalable, and I'd rate scalability at eight out of ten. We probably have between 60 and 100 users in our organization, and we hope to increase usage in the future.

How are customer service and support?

The technical support staff we have worked with have been amazing. They helped us initially with our Delta Live pipelines. I would give them a rating of ten out of ten.

How would you rate customer service and support?

Positive

Which solution did I use previously and why did I switch?

I have previously worked with Apache Hadoop, and Databricks is definitely a better product. It's much easier to get data quickly in Databricks. As a result, a lot of the drudgery is taken away. Whereas with Hadoop, it's a bit more tricky to get data together.

What's my experience with pricing, setup cost, and licensing?

We're charged on what the data throughput is and also what the compute time is.

What other advice do I have?

I'd strongly recommend giving Databricks a try. We have found it to be a fantastic tool that has accelerated some of our solutions. We're an AI-heavy shop, and there are a lot of data scientists using the MLflow capabilities. I hear a lot of good things from that side as well. From a data analysis point of view, Databricks has been fantastic, and I would rate it at eight on a scale from one to ten.

Which deployment model are you using for this solution?

Public Cloud
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Kevin McAllister - PeerSpot reviewer
Executive Manager at a computer software company with 10,001+ employees
Real User
Jan 20, 2023
Excellent data transformation but data-serving performance could be better
Pros and Cons
  • "Databricks' most valuable feature is the data transformation through PySpark."
  • "Databricks' performance when serving the data to an analytics tool isn't as good as Snowflake's."

What is our primary use case?

We mainly use Databricks to process ingest and do the ELT processes of data to get it ready for analytics and to serve the data to ThoughtSpot, which calls queries and Databricks to get the data.

How has it helped my organization?

We didn't have any good tooling for ELT processing prior to Databricks. We were using Microsoft HD Insight, but it was taking too long to process the data. When we changed our data-processing ELT processes over to Databricks, the amount of time to process the data was reduced to a fraction of what HD Insight used, so we were able to run jobs much faster.

What is most valuable?

Databricks' most valuable feature is the data transformation through PySpark.

What needs improvement?

Databricks' performance when serving the data to an analytics tool isn't as good as Snowflake's. In the next release, Databricks should include a better data-sharing platform to facilitate data sharing between companies.

For how long have I used the solution?

I've been using Databricks for three years.

What do I think about the stability of the solution?

Databricks' stability has been great, and I would rate it eight out of ten.

What do I think about the scalability of the solution?

Databricks is very scalable because it's very easy to spin up multiple clusters, but the cost of doing that is tremendous. I'd rate its scalability nine out of ten, but you'll pay for it.

How are customer service and support?

The technical support has been really bad, but that's because we don't have a direct agreement with Databricks.

How would you rate customer service and support?

Neutral

Which solution did I use previously and why did I switch?

I previously used HD Insight from Microsoft, but it took many, many hours to process data, so we switched to Databricks.

How was the initial setup?

The initial setup was pretty complex and required three people.

What about the implementation team?

We used an in-house team with some consulting help.

What was our ROI?

We've had a low ROI from Databricks.

What's my experience with pricing, setup cost, and licensing?

I would rate Databricks' pricing seven out of ten.

What other advice do I have?

I would advise anyone thinking of implementing Databricks to know their use case. For example, if you're looking for a big data repository to query data and do ELT processing, I recommend looking at other platforms, like Snowflake. However, if you're going to do AI and machine learning, then Databricks is probably stronger in that area. Overall, I would rate Databricks seven out of ten.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Microsoft Azure
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
PankajKumar13 - PeerSpot reviewer
Computer Scientist at a computer software company with 10,001+ employees
Real User
Jan 2, 2023
Pumps up performance and the processing power; comes with helpful Lakehouse and SQL environments
Pros and Cons
  • "When we have a huge volume of data that we want to process with speed, velocity, and volume, we go through Databricks."
  • "I believe that this product could be improved by becoming more user-friendly."

What is our primary use case?

Our primary use case is for data analytics. Essentially, we use it for the financial reporting for Adobe.

How has it helped my organization?

The way Databricks has improved my organization is definitely through giving us improved performance and the processing power. We are usually never able to achieve it using traditional data warehouses. When we have a huge volume of data that we want to process with speed, velocity, and volume, we go through Databricks.

What is most valuable?

The features I found most helpful with Databricks are the Lakehouse and SQL environments.

What needs improvement?

I believe that this product could be improved by becoming more user-friendly. 

In the next release, I would like to see more flexibility in the dashboard. It has plenty of features but it can be enhanced so that it matches with other visualization tools, like Power BI and Tableau. Also, the integrations with other tools could be better.

For how long have I used the solution?

I have been using Databricks for the last three years.

What do I think about the stability of the solution?

I would rate the stability of Databricks an eight, on a scale from one to 10, with one being the worst and 10 being the best.

What do I think about the scalability of the solution?

I would rate the scalability of this solution a nine, on a scale from one to 10, with one being the worst and 10 being the best. I would say there are around 2,000 to 3,000 users of this solution in our organization.

How are customer service and support?

I've been in contact with the Databricks support team and received timely support from them. I would rate their support an eight, on a scale from one to 10, with one being the worst and 10 being the best.

How would you rate customer service and support?

Positive

Which solution did I use previously and why did I switch?

Prior to Databricks, we initially used Hadoop. Afterwards, we used HANA, SAP HANA, and the Microsoft SQL Server.

How was the initial setup?

The initial setup was relatively straightforward. I would rate it nine, on a scale from one to 10, with one being the easiest and 10 being the hardest.

There is no need to worry about the deployment as it can be done quickly. It is relatively automated. We used Terraform for auto-deployment, which happens in Azure. With Terraform, there are two options. As option one, you can deploy manually by creating services. For option two, you use Terraform and automate. Terraform is like infrastructure as a code where you can code the deployment part using it.

There were two or three persons involved in the deployment of this solution.

What other advice do I have?

The new version of the Databricks solution requires code maintenance. This is done by the platform team.

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Buyer's Guide
Download our free Databricks Report and get advice and tips from experienced pros sharing their opinions.
Updated: December 2025
Buyer's Guide
Download our free Databricks Report and get advice and tips from experienced pros sharing their opinions.