
Databricks vs Redpanda comparison

 

Comparison Buyer's Guide

Executive Summary

Updated on Dec 17, 2024

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Databricks
Ranking in Streaming Analytics: 1st
Average Rating: 8.2
Reviews Sentiment: 7.0
Number of Reviews: 89
Ranking in other categories: Cloud Data Warehouse (7th), Data Science Platforms (1st)

Redpanda
Ranking in Streaming Analytics: 15th
Average Rating: 8.8
Reviews Sentiment: 7.8
Number of Reviews: 4
Ranking in other categories: No ranking in other categories
 

Featured Reviews

ShubhamSharma7 - PeerSpot reviewer
Capability to integrate diverse coding languages in a single notebook greatly enhances workflow
Databricks offers various languages that I can use, whether it's PySpark, Scala, or R. I can leverage all these languages in a single notebook, which is beneficial for clients as they can access various tools in one place whenever needed. This is quite significant. I usually work with PySpark based on client requirements. After coding, I feed the Databricks notebooks into the ADF pipeline for updates. Databricks' capability to process data in parallel enhances data processing speed. Furthermore, I can connect our Databricks notebook directly with Power BI and other visualization tools like Qlik. Once we develop code, it allows us to transform raw data into visualizations for clients using analysis diagrams, which is very helpful.
Vishal M Godi - PeerSpot reviewer
High-performance message brokering with excellent documentation and an easy setup
The industry standard for this kind of platform is Kafka. Confluent Kafka has acquired it. Kafka is an open-source platform built by Apache. Confluent is the commercial version of it. The major improvement of Redpanda over Kafka is firstly, good documentation. Redpanda's documentation is very easily understandable, and they have a lot of examples. In addition to that, most of the setups include using another technology called Docker, which I am very familiar with. Setting up technologies using Docker is very convenient to me, and Redpanda has provided many templates for that. Redpanda has its own built-in metrics exporter, making it easier to monitor and check performance. What makes Redpanda superior is its performance since it's written in C++. C++ is pretty much the standard for high-performance applications.

Quotes from Members


Pros

"The ability to stream data and the windowing feature are valuable."
"Databricks covers end-to-end data analytics workflow in one platform, this is the best feature of the solution."
"The setup was straightforward."
"Can cut across the entire ecosystem of open source technology to give an extra level of getting the transformatory process of the data."
"Databricks' most valuable features are the workspace and notebooks. Its integration, interface, and documentation are also good."
"The most valuable feature of Databricks is the notebook, data factory, and ease of use."
"I like that Databricks is a unified platform that lets you do streaming and batch processing in the same place. You can do analytics, too. They have added something called Databricks SQL Analytics, allowing users to connect to the data lake to perform analytics. Databricks also will enable you to share your data securely. It integrates with your reporting system as well."
"We can scale the product."
"The UI is modern."
"What makes Redpanda superior is its performance since it's written in C++. C++ is pretty much the standard for high-performance applications."
"The cost savings have been significant."
"I tested it with ten-plus nodes, and it's highly scalable."
 

Cons

"The API deployment and model deployment are not easy on the Databricks side."
"I have had some issues with some of the Spark clusters running on Databricks, where the Spark runtime and clusters go up and down, which is an area for improvement."
"The solution has some scalability and integration limitations when consolidating legacy systems."
"As a data engineer, I see cluster failure in our Databricks user databases as a major issue."
"The query plan is not easy with Databricks' job level. If I want to tune any of the code, it is not easily available in the blogs as well."
"Implementation of Databricks is still very code heavy."
"Can be improved by including drag-and-drop features."
"Support for Microsoft technology and the compatibility with the .NET framework is somewhat missing."
"Recently, for the documentation, they've built their own AI chatbot, which is focused on giving you answers based on their documentation. While using that, I did not find it to be very good."
"The command-line tools need to be improved. To quickly check the status of the topics and all."
"The version control mechanism must be improved."
"When it comes to self-hosting, their documentation could be improved."
 

Pricing and Cost Advice

"It is an expensive tool. The licensing model is a pay-as-you-go one."
"The solution is a good value for batch processing and huge workloads."
"We implement this solution on behalf of our customers who have their own Azure subscription and they pay for Databricks themselves. The pricing is more expensive if you have large volumes of data."
"Licensing on site I would counsel against, as on-site hardware issues tend to really delay and slow down delivery."
"The billing of Databricks can be difficult and should improve."
"I rate the price of Databricks as eight out of ten."
"The solution is based on a licensing model."
"We pay as we go, so there isn't a fixed price. It's charged by the unit. I don't have any details about how they measure this, but it should be a mix between processing and quantity of data handled. We run a simulation based on our use cases, which gives us an estimate. We've been monitoring this, and the costs have met our expectations."
"It's free. Everybody can use it, only support is paid."
"Redpanda is cheaper than its competitors."
849,963 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm: 18%
Computer Software Company: 10%
Manufacturing Company: 9%
Healthcare Company: 6%
No data available
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
 

Questions from the Community

Which do you prefer - Databricks or Azure Machine Learning Studio?
Databricks gives you the option of working with several different languages, such as SQL, R, Scala, Apache Spark, or Python. It offers many different cluster choices and excellent integration with ...
How would you compare Databricks vs Amazon SageMaker?
We researched AWS SageMaker, but in the end, we chose Databricks. Databricks is a Unified Analytics Platform designed to accelerate innovation projects. It is based on Spark so it is very fast. It...
Which would you choose - Databricks or Azure Stream Analytics?
Databricks is an easy-to-set-up and versatile tool for data management, analysis, and business analytics. For analytics teams that have to interpret data to further the business goals of their orga...
What is your experience regarding pricing and costs for Redpanda?
Redpanda is actually a commercial platform, but they do provide free versions as well. I've been working only with the free versions.
What needs improvement with Redpanda?
Recently, for the documentation, they've built their own AI chatbot, which is focused on giving you answers based on their documentation. While using that, I did not find it to be very good. Maybe ...
What is your primary use case for Redpanda?
I have worked with Redpanda for the past two to three months. Mainly in the tech industry or software industry, there's a huge rise of streaming data. Redpanda serves as a very reliable and fast me...
 

Also Known As

Databricks Unified Analytics, Databricks Unified Analytics Platform, Redash
No data available
 

Sample Customers

Elsevier, MyFitnessPal, Sharethrough, Automatic Labs, Celtra, Radius Intelligence, Yesware
Information Not Available
Find out what your peers are saying about Databricks vs. Redpanda and other solutions. Updated: April 2025.