Try our new research platform with insights from 80,000+ expert users

Databricks vs Infobright DB comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Databricks
Average Rating
8.2
Reviews Sentiment
7.0
Number of Reviews
89
Ranking in other categories
Cloud Data Warehouse (7th), Data Science Platforms (1st), Streaming Analytics (1st)
Infobright DB
Average Rating
7.6
Reviews Sentiment
6.3
Number of Reviews
10
Ranking in other categories
Relational Databases Tools (37th), Data Warehouse (27th)
 

Featured Reviews

ShubhamSharma7 - PeerSpot reviewer
Capability to integrate diverse coding languages in a single notebook greatly enhances workflow
Databricks offers various courses that I can use, whether it's PySpark, Scala, or R. I can leverage all these courses in a single notebook, which is beneficial for clients as they can access various tools in one place whenever needed. This is quite significant. I usually work with PySpark based on client requirements. After coding, I feed the Databricks notebooks into the ADF pipeline for updates. Databricks' capability to process data in parallel enhances data processing speed. Furthermore, I can connect our Databricks notebook directly with Power BI and other visualization tools like Qlik. Once we develop code, it allows us to transform raw data into visualizations for clients using analysis diagrams, which is very helpful.
SD
If you need a real big data solution, look for a distributed solution that actually has a proven track record.
This version of Infobright has zero support for distributed scalability. The internal smart grid employed for each table has a major flaw in that the data size cannot be expunged until 2GB of data is reached at the column-level. This is a major flaw, making usage in a big-data scenario impossible. This means that you can delete as many records from a database table as you want. However, unless the 2GB aggregate size threshold was reached for some of the columns in the table, no reduction in disk space usage will occur. Only the data from the columns that reached 2GB will actually decrease. Other columns below 2GB in size do not leave the disk. I spent countless hours trying to find some workaround for this. I have nightmares of my e-mail inbox full of unsolvable questions about data size reduction from our field engineers.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"It helps integrate data science and machine learning capabilities."
"The built-in optimization recommendations halved the speed of queries and allowed us to reach decision points and deliver insights very quickly."
"It is fast, it's scalable, and it does the job it needs to do."
"It can send out large data amounts."
"The simplicity of development is the most valuable feature."
"I like how easy it is to share your notebook with others. You can give people permission to read or edit. I think that's a great feature. You can also pull in code from GitHub pretty easily. I didn't use it that often, but I think that's a cool feature."
"Imageflow is a visual tool that helps make it easier for business people to understand complex workflows."
"We have the ability to scale, collaborate and do machine learning."
"It has very amazing smart grid query feature for very fast aggregate queries across millions of rows"
 

Cons

"The integration and query capabilities can be improved."
"Databricks' technical support takes a while to respond and could be improved."
"Databricks could improve in some of its functionality."
"Instead of relying on a massive instance, the solution should offer micro partition levels. They're working on it, however, they need to implement it to help the solution run more effectively."
"There is room for improvement in the documentation of processes and how it works."
"There would also be benefits if more options were available for workers, or the clusters of the two points."
"The solution could improve by providing better automation capabilities. For example, working together with more of a DevOps approach, such as continuous integration."
"The data visualization for this solution could be improved. They have started to roll out a data visualization tool inside Databricks but it is in the early stages. It's not comparable to a solution like Power BI, Luca, or Tableau."
"Only the data from the columns that reached 2GB will actually decrease. Other columns below 2GB in size do not leave the disk."
 

Pricing and Cost Advice

"I rate the price of Databricks as eight out of ten."
"Databricks are not costly when compared with other solutions' prices."
"We pay as we go, so there isn't a fixed price. It's charged by the unit. I don't have any details detail about how they measure this, but it should be a mix between processing and quantity of data handled. We run a simulation based on our use cases, which gives us an estimate. We've been monitoring this, and the costs have met our expectations."
"The cost is around $600,000 for 50 users."
"We're charged on what the data throughput is and also what the compute time is."
"My smallest project is around a hundred euros, and my most expensive is just under a thousand euros a week. That is based on terabytes of data processed each month."
"Databricks uses a price-per-use model, where you can use as much compute as you need."
"I am based in South Africa, where it is expensive adapting to the cloud, and then there is the price for the tool itself."
"Our pricing was based on server instances and it was actually very cheap compared to Oracle. I guess you get what you pay for."
report
Use our free recommendation engine to learn which Cloud Data Warehouse solutions are best for your needs.
850,028 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
18%
Computer Software Company
10%
Manufacturing Company
9%
Healthcare Company
6%
No data available
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

Which do you prefer - Databricks or Azure Machine Learning Studio?
Databricks gives you the option of working with several different languages, such as SQL, R, Scala, Apache Spark, or Python. It offers many different cluster choices and excellent integration with ...
How would you compare Databricks vs Amazon SageMaker?
We researched AWS SageMaker, but in the end, we chose Databricks. Databricks is a Unified Analytics Platform designed to accelerate innovation projects. It is based on Spark so it is very fast. It...
Which would you choose - Databricks or Azure Stream Analytics?
Databricks is an easy-to-set-up and versatile tool for data management, analysis, and business analytics. For analytics teams that have to interpret data to further the business goals of their orga...
Ask a question
Earn 20 points
 

Comparisons

No data available
 

Also Known As

Databricks Unified Analytics, Databricks Unified Analytics Platform, Redash
Infobright
 

Overview

 

Sample Customers

Elsevier, MyFitnessPal, Sharethrough, Automatic Labs, Celtra, Radius Intelligence, Yesware
REZ-1, SonicWALL, IntegriChain, Fuseforward International Inc., Polystar, Live Rail, Mavenir Systems, JDSU Partners, Bango
Find out what your peers are saying about Databricks vs. Infobright DB and other solutions. Updated: April 2025.
850,028 professionals have used our research since 2012.