No more typing reviews! Try our Samantha, our new voice AI agent.
Mullai Selvan - PeerSpot reviewer
Project Manager at MAQ Software
Real User
Aug 2, 2022
Integrates well, is scalable, and high availability
Pros and Cons
  • "The most valuable feature of Databricks is the integration with Microsoft Azure."
  • "We chose Databricks because the processing power was better and it was a better fit for our use case."
  • "Databricks can improve by making the documentation better."

What is our primary use case?

I am using Databricks for creating business intelligence solutions.

What is most valuable?

The most valuable feature of Databricks is the integration with Microsoft Azure.

What needs improvement?

Databricks can improve by making the documentation better.

For how long have I used the solution?

I have been using Databricks for approximately one year.

Buyer's Guide
Databricks
May 2026
Learn what your peers think about Databricks. Get advice and tips from experienced pros sharing their opinions. Updated: May 2026.
896,942 professionals have used our research since 2012.

What do I think about the stability of the solution?

Databricks is stable.

What do I think about the scalability of the solution?

The scalability of Databricks is good.

We have approximately 500 users using this solution in my organization.

How are customer service and support?

I have not used the support from Databricks.

Which solution did I use previously and why did I switch?

We previously used Microsoft stacks. We chose Databricks because the processing power was better and it was a better fit for our use case.

How was the initial setup?

The initial setup of Databricks was not straightforward. We had to do trial and error and we learned as we went along.

I rate the initial setup of Databricks a four out of five.

What about the implementation team?

We did the implementation of Databricks in-house. The solution requires ongoing maintenance.

What other advice do I have?

I would recommend this solution to others.

My advice to others is for them to first do a small proof of concept and then see how it works out and then take it from there.

I rate Databricks an eight out of ten.

Which deployment model are you using for this solution?

Public Cloud
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Lead Architect at Birlasoft IndiaLtd.
Real User
Jul 11, 2022
Data analytics platform that supports large volumes of data and related activities
Pros and Cons
  • "This solution offers a lake house data concept that we have found exciting, as we are able to have a large amount of data in a data lake and can manage all relational activities, with all asset complaints properties available to ensure the quality of all data."
  • "The connectivity with various BI tools could be improved, specifically the performance and real time integration."

What is most valuable?

This solution offers a lake house data concept that we have found exciting. We are able to have a large amount of data in a data lake and can manage all relational activities. All asset complaints properties are available and this is very useful to ensure the quality of all data.

What needs improvement?

The connectivity with various BI tools could be improved, specifically the performance and real time integration. There is also some improvement required in the semantic layers to manage the data match as well as the data warehouse features.

In a future release, we would like to have features to better manage all ML development activities.

For how long have I used the solution?

I have been using this solution for three years. 

What do I think about the stability of the solution?

This is a stable solution, especially compared to other technology on the market.

What do I think about the scalability of the solution?

It is a scalable solution but this depends on the platform that is being used. If you use a cloud platform such as Azure, it offers scalability. However, some platforms will not support scalability using Databricks.

We have around 20 users in our development team using Databricks. 

How are customer service and support?

The customer service and support for this solution is good. 

How would you rate customer service and support?

Positive

How was the initial setup?

The initial setup is pretty simple and requires minimal configuration compared to other technology.

What's my experience with pricing, setup cost, and licensing?

I would rate the pricing for this solution a four out of five. This does depend on the environment or the infrastructure that one is using. There is a difference in pricing between using Azure or being on-premises. 

Which other solutions did I evaluate?

Azure Synapse is a competitor that we evaluated but it is not mature enough to provide better performance than Databricks. We choose Databricks due to the ability to have a lot of data in Data Lakes and the Data Warehouse. We are also able to run data science activities using ML flow.

What other advice do I have?

If you are looking for custom model development and a lot of data management in a cloud agnostic manner, then Databricks is a good solution.

I would rate this solution an eight out of ten. 

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Microsoft Azure
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Buyer's Guide
Databricks
May 2026
Learn what your peers think about Databricks. Get advice and tips from experienced pros sharing their opinions. Updated: May 2026.
896,942 professionals have used our research since 2012.
it_user1498449 - PeerSpot reviewer
Senior Data Engineer at TCS
Real User
Jun 19, 2022
Supports multiple languages, plenty of Python libraries, but user-interface could improve
Pros and Cons
  • "Databricks is a unified solution that we can use for streaming. It is supporting open source languages, which are cloud-agnostic. When I do database coding if any other tool has a similar language pack to Excel or SQL, I can use the same knowledge, limiting the need to learn new things. It supports a lot of Python libraries where I can use some very easily."
  • "The query plan is not easy with Databrick's job level. If I want to tune any of the code, it is not easily available in the blogs as well."

What is our primary use case?

We are using Databricks to receive the data from Data Lake where we are processing it and doing the transformation, and cleansing. Once it is processed, we are sending the data to the Azure SQL database.

What is most valuable?

Databricks is a unified solution that we can use for streaming. It is supporting open source languages, which are cloud-agnostic. When I do database coding if any other tool has a similar language pack to Excel or SQL, I can use the same knowledge, limiting the need to learn new things. It supports a lot of Python libraries where I can use some very easily.

What needs improvement?

The query plan is not easy with Databrick's job level. If I want to tune any of the code, it is not easily available in the blogs as well.

For how long have I used the solution?

I have been using Databricks for approximately three years.

What do I think about the stability of the solution?

Databricks is stable.

What do I think about the scalability of the solution?

The salability of Databricks is good. However, if I want to use the higher clusters and high concurrency clusters, you will need to wait more time to spin up the clusters. 

We have different teams. Among them, I'm part of the data analytics where, our team, almost 10 people are using it. But I'm not sure about the rest of the teams.

We are using Databricks extensively. We have a team of 10 using the solution.

How was the initial setup?

The initial setup of Databricks is not straightforward. You need to create VLANs, VPNs, and networks. We are two ways of deployment, we are having the legacy PowerShell for the deployment and the template method to deploy the Databricks code to higher levels.

We have not integrated Databricks directly into the DevOps architecture. We are downloading the notebooks manually and we are uploading them.

What's my experience with pricing, setup cost, and licensing?

The billing of Databricks can be difficult and should improve.

Which other solutions did I evaluate?

We have evaluated Azure Synapse and SQL. Both Databricks and Azure Synapse are similar, the UI is the only difference. SQL and Databricks are the same, and one of the largest setbacks is the processing of a lot of data takes a long time.

What other advice do I have?

I rate Databricks a seven out of ten.

Which deployment model are you using for this solution?

Public Cloud
Disclosure: My company has a business relationship with this vendor other than being a customer. Partner
PeerSpot user
Head of Credit Risk and Data at Cegid Invoice and Financing
Real User
May 26, 2022
It's a reasonably priced all-in-one platform that enables us to build a lakehouse framework
Pros and Cons
  • "Databricks gives us the ability to build a lakehouse framework and do everything implicit to this type of database structure. We also like the ability to stream events. Databricks covers a broad spectrum, from reporting and machine learning to streaming events. It's important for us to have all these features in one platform."
  • "I'm not the guy that I'm working with Databricks on a daily basis. I'm on the management team. However, my team tells me there are limitations with streaming events. The connectors work with a small set of platforms. For example, we can work with Kafka, but if we want to move to an event-driven solution from AWS, we cannot do it. We cannot connect to all the streaming analytics platforms, so we are limited in choosing the best one."
  • "However, my team tells me there are limitations with streaming events. The connectors work with a small set of platforms."

What is our primary use case?

We primarily use Databricks for reporting and machine learning.

What is most valuable?

Databricks gives us the ability to build a lakehouse framework and do everything implicit to this type of database structure. We also like the ability to stream events. Databricks covers a broad spectrum, from reporting and machine learning to streaming events. It's important for us to have all these features in one platform.

What needs improvement?

I'm not the guy that I'm working with Databricks on a daily basis. I'm on the management team. However, my team tells me there are limitations with streaming events. The connectors work with a small set of platforms. For example, we can work with Kafka, but if we want to move to an event-driven solution from AWS, we cannot do it. We cannot connect to all the streaming analytics platforms, so we are limited in choosing the best one.

Also, this is an all-in-one platform, but it might be preferable if there were an a la carte model where we could select the best tool in each class for reporting, machine learning, etc. I'm not yet sure if this strategy is the best one. 

For how long have I used the solution?

We've been using Databricks since the start of the year.

What do I think about the stability of the solution?

Databricks is quite stable. We haven't had any issues with stability. It's always working perfectly with no downtime.

What do I think about the scalability of the solution?

Databricks is based on Spark, which is based on Scala. These languages aren't easy to handle, and it's challenging to find people who know them well. At the same time, a couple of other vendors that work on top of Databricks are low-code platforms. We have to work around Databrick's lack of scalability by using low-code platforms that work on top of Databricks to give us scalability.

How are customer service and support?

I'll give Databricks support 10 out of 10. They are always prompt even though we didn't buy a support package. They have done an excellent job.

How would you rate customer service and support?

Positive

How was the initial setup?

Setting up Databricks is a bit complex, and the initial deployment took a few days—closer to a week. Of course, not everyone is working full-time on this. There are intervals when people are doing other stuff. 

What was our ROI?

It's too soon to tell what kind of return we're getting because we just started using it, and we're still migrating.

What's my experience with pricing, setup cost, and licensing?

The cost of Databricks is in the lower range compared to other solutions. That was one of the main reasons we chose Databricks over other vendors and platforms.  

We pay as we go, so there isn't a fixed price. It's charged by the unit. I don't have any details detail about how they measure this, but it should be a mix between processing and quantity of data handled. We run a simulation based on our use cases, which gives us an estimate. We've been monitoring this, and the costs have met our expectations. 

What other advice do I have?

I give Databricks nine out of 10. The solution has met all our expectations. I'd recommend it to a friend. It's a reasonably priced all-in-one solution that gives us data lake and lakehouse capabilities. Those were the primary reasons we chose Databricks.

Which deployment model are you using for this solution?

Public Cloud
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Elizabeth Ho - PeerSpot reviewer
Manager, Customer Journey at a retailer with 10,001+ employees
Real User
May 21, 2022
You can connect multiple data sources and share work easily
Pros and Cons
  • "I like how easy it is to share your notebook with others. You can give people permission to read or edit. I think that's a great feature. You can also pull in code from GitHub pretty easily. I didn't use it that often, but I think that's a cool feature."
  • "Databricks tech support has been great every time I've dealt with them, and their team is highly knowledgeable."
  • "I would like it if Databricks adopted an interface more like R Studio. When I create a data frame or a table, R Studio provides a preview of the data. In R Studio, I can see that it created a table with so many columns or rows. Then I can click on it and open a preview of that data."
  • "I would like it if Databricks adopted an interface more like R Studio."

What is our primary use case?

I use Databricks for customer marketing analytics.

What is most valuable?

Databricks lets you schedule jobs pretty easily, and you can use SQL, Spark SQL, Python, or R. It also allows you to save a table or view. 

I like that you can connect to multiple data sources. Most of our data is stored in the Azure data lake, but my previous company connected to SQL databases or even blob storage. 

They've improved on many features. I don't do data engineering, but I had an issue a couple of years ago at my two companies ago. It took a long time to read and save tables, but I think the new Delta feature helped. 

I like how easy it is to share your notebook with others. You can give people permission to read or edit. I think that's a great feature. You can also pull in code from GitHub pretty easily. I didn't use it that often, but I think that's a cool feature.

What needs improvement?

I would like it if Databricks adopted an interface more like R Studio. When I create a data frame or a table, R Studio provides a preview of the data. In R Studio, I can see that it created a table with so many columns or rows. Then I can click on it and open a preview of that data. 

Because I work in analytics and not data engineering, I think that's probably the biggest one. There are better graphical tools, so I don't think Databricks can compete. You can do a simple graph, and it's not that great. However, I don't think they can ever stack up to Tableau, so it's probably not worth it to improve upon that. 

For how long have I used the solution?

I've been using Databricks for two years.

What do I think about the stability of the solution?

Databricks is stable.

What do I think about the scalability of the solution?

Databricks is scalable.

How are customer service and support?

Databricks tech support has been great every time I've dealt with them. Their team is highly knowledgeable. 

How was the initial setup?

Setting up Databricks is easy. I set it up at my previous company. That was on Azure as well, but they utilized a third-party team with expertise in Databricks to ensure everything was optimized. 

What other advice do I have?

I rate Databricks 10 out of 10. I recommend taking advantage of Databricks support or a third-party provider to ensure it's set up optimally. I don't know if it's an additional service you must pay for, but we always had access to Databricks support in my last company. 

I think that's worth the money because there are so many different scenarios with distributed computing. Even people who study analytics may not understand the ins and out of Spark. It's worth it to have a service contract for support.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Microsoft Azure
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Sanjay Bheemasenarao - PeerSpot reviewer
Director - Data Engineering expert at Sankir Technologies
Real User
Mar 19, 2022
Is user friendly and has great performance, but documentation needs improvement
Pros and Cons
  • "Databricks has a scalable Spark cluster creation process, and the creators of Databricks are also the creators of Spark, and they are the industry leaders in terms of performance."
  • "If I want to create a Databricks account, I need to have a prior cloud account such as an AWS account or an Azure account. Only then can I create a Databricks account on the cloud. However, if they can make it so that I can still try Databricks even if I don't have a cloud account on AWS and Azure, it would be great. That is, it would be nice if it were possible to create a pseudo account and be provided with a free trial. It is very essential to creating a workforce on Databricks. For example, students or corporate staff can then explore and learn Databricks."
  • "Databricks is a very expensive solution. Pricing is an area that could definitely be improved."

What is our primary use case?

I use Databricks to explore new features and provide the industry visibility and scalability of Databricks to the companies that I work with.

I create proof of concepts for companies. As a consultant, I also create training courses on Databricks. If a company wants to leverage a service provided by Databricks and needs to train people, they use our courses.

What is most valuable?

Databricks has a scalable Spark cluster creation process. The creators of Databricks are also the creators of Spark, and they are the industry leaders in terms of performance.

Databricks has made great strides in terms of performance. 

It is very user friendly. I like the ease of creating a Spark cluster, submitting a job, or creating a notebook.

The UI has also changed for the better compared to what it was two years ago.

What needs improvement?

If I want to create a Databricks account, I need to have a prior cloud account such as an AWS account or an Azure account. Only then can I create a Databricks account on the cloud. However, if they can make it so that I can still try Databricks even if I don't have a cloud account on AWS and Azure, it would be great. That is, it would be nice if it were possible to create a pseudo account and be provided with a free trial. It is very essential to creating a workforce on Databricks. For example, students or corporate staff can then explore and learn Databricks.

It's a big ask to have people jump through a lot of hoops to get approval to create a Databricks cluster just to explore it, but if they can try it on their own with a free trial without an underlying cloud account it would be more convenient.

Documentation can be improved as well. There are so many versions of documents. For example, when I tried to create a DBU vault and secrets file, I had to go through multiple versions of documents. This could be improved so that the documentation is easy to use.

For how long have I used the solution?

I've been using this solution for about two years.

What do I think about the stability of the solution?

Stability wise, it's quite okay. In my experience, it doesn't crash.

What do I think about the scalability of the solution?

I have not used autoscaling because it consumes a lot of money and because my experience has been alright. In some cases, though, it is tied to the quota of the underlying infrastructure. I have not tested the scalability to its fullest extent, but with the workloads I run, it has been fine.

How are customer service and support?

When I wanted to create an AWS account and contacted technical support via email, I never received a response. Recently, however, I think they have improved their support a little bit, and I did get a call in response to my question. Overall, I've not faced any issues with the person I had to contact directly.

How was the initial setup?

The initial setup is not very easy, but it's medium in complexity.

What's my experience with pricing, setup cost, and licensing?

Databricks is a very expensive solution. Pricing is an area that could definitely be improved. They could provide a lower end compute and probably reduce the price.

What other advice do I have?

I would rate Databricks at seven on a scale from one to ten. If you compare it to Snowflake, for example, Snowflake doesn't mandate an underlying cloud account. It creates one on its own. That's a subtle convenience that Snowflake has and one that Databricks could also build.

Snowflake's documentation is easy to use in comparison to that of Databricks. 

Which deployment model are you using for this solution?

Public Cloud
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Jorge Alvarado - PeerSpot reviewer
Owner at a marketing services firm with 1-10 employees
Real User
Jan 24, 2022
The data governance has been absolutely efficient in between other kinds of solutions
Pros and Cons
  • "Databricks' Lakehouse architecture has been most useful for us, and the data governance has been absolutely efficient in between other kinds of solutions."
  • "I would like it if Databricks made it easier to set up a project."

What is our primary use case?

We use Databricks for video streaming and security purposes.

What is most valuable?

Databricks' Lakehouse architecture has been most useful for us. The data governance has been absolutely efficient in between other kinds of solutions.

What needs improvement?

I would like it if Databricks made it easier to set up a project. The use case determines which services we are going to use. You have the application engine, and you generate a potential budget for your workloads, so you can understand what you are going to do, what you are going to use, and what you will invest in.

Because I'm deploying on the Google Cloud Platform, measuring the investment, value, and use case is extremely difficult. So I leave it and move on without the risk. It would be easier if I had one page where you can see three columns: one for the use cases of a specific architecture, a second one for the prices based on the volume of data or machine time, and the third column for the budget. That would make it easier to know if I am using the appropriate architecture for the right solution.

I have seen something like that in Microsoft Azure, but obviously Microsoft Azure costs a lot of money. Amazon has something like that, but it's very complicated to use.

For how long have I used the solution?

We've been using Databricks for about five years.

What do I think about the stability of the solution?

Databricks is very stable and powerful.

What do I think about the scalability of the solution?

It was simple to make Databricks scalable. We found that we could set up an alert to tell us if we needed more resources, money, or time from our team. We're alerted when the system detects some trigger for any use of the instance. If you have another alert from your side, that would be extremely useful because it takes a lot of time to develop that kind of trigger. 

How are customer service and support?

Databricks technical support was lovely. We don't need it so much, but the few questions we had were answered immediately.

How was the initial setup?

I am not a data engineer because I just started data science at the company, but it was straightforward and clear for the architect to set up. He provided me with that idea because he realized it would take time if we had use cases. You can select and change the data or add some modules or products. You have all the technology to do so.

What other advice do I have?

I rate Databricks eight out of 10. I like to move my customers into Databricks, but I take care of the internal system infrastructure so they can continue to use familiar software or operating systems and databases. They have a lot of doubts because they don't know the solution. We need to train them, explain things, and show the solution's potential value. 

Generally, companies try to keep the same flavor when they migrate. For example, if they are using many Microsoft products, they want to work with Azure. If they are open to other options, they go with GCP or AWS. However, Databricks doesn't have enough customers here in my market because it's not a visible brand. Azure, GCP, and AWS are highly visible here, so the local teams are friendly with the three brands.

Which deployment model are you using for this solution?

Private Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Other
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Coordenador Financeiro at Icatu
Real User
Jan 10, 2022
Good technical support, but is difficult to set up and integrate
Pros and Cons
  • "The technical support is good."
  • "Our company makes comprehensive use of the solution to consolidate data and do a certain amount of reporting and analytics."
  • "The initial setup is difficult."
  • "Data governance should be addressed. We have some trouble connecting all the governance solutions with Databricks, which means the integrative capabilities are problematic."

What is our primary use case?

I believe we are using the new version.

Our company makes comprehensive use of the solution to consolidate data and do a certain amount of reporting and analytics. All the data consumers use Databricks to develop the information.

What needs improvement?

Data governance should be addressed. We have some trouble connecting all the governance solutions with Databricks. This means the integrative capabilities are problematic. 

The initial setup is difficult. 

For how long have I used the solution?

We have been using Databricks for a year-and-a-half.

What do I think about the stability of the solution?

The solution is stable. 

What do I think about the scalability of the solution?

The solution is scalable. 

How are customer service and support?

The technical support is good. 

Which solution did I use previously and why did I switch?

As we are talking about a corporate solution, the deployment of Databricks lasted longer than the one day it took for Alteryx. 

We used Alteryx prior to Databricks and continue to do so, it being the only other solution we have employed. We use the two with different software. 

How was the initial setup?

The initial setup is difficult. 

While I don't know exactly how long the deployment took, I do know that it lasted longer than the one day needed for Alteryx. 

What about the implementation team?

I believe we used a partner for the deployment, although I cannot say for certain, as this is not within my purview. 

I don't know how many people are needed for maintenance and deployment. 

What's my experience with pricing, setup cost, and licensing?

As the licensing is not within my purview, I am not in a position to comment on this. 

What other advice do I have?

My company makes use of the solution. It is employed by my data team and the technology one. I do not have personal experience using the solution. 

The solution is deployed on base, on data. 

I am not aware of how many people make use of it. 

I rate Databricks as a seven out of ten. 

Which deployment model are you using for this solution?

Private Cloud
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Buyer's Guide
Download our free Databricks Report and get advice and tips from experienced pros sharing their opinions.
Updated: May 2026
Buyer's Guide
Download our free Databricks Report and get advice and tips from experienced pros sharing their opinions.