Try our new research platform with insights from 80,000+ expert users

Dataiku vs Starburst Galaxy comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Dataiku
Ranking in Data Science Platforms
6th
Average Rating
8.2
Reviews Sentiment
7.0
Number of Reviews
13
Ranking in other categories
No ranking in other categories
Starburst Galaxy
Ranking in Data Science Platforms
9th
Average Rating
9.8
Reviews Sentiment
1.0
Number of Reviews
9
Ranking in other categories
Streaming Analytics (12th)
 

Mindshare comparison

As of November 2025, in the Data Science Platforms category, the mindshare of Dataiku is 10.5%, down from 11.5% compared to the previous year. The mindshare of Starburst Galaxy is 0.8%, down from 0.9% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Data Science Platforms Market Share Distribution
ProductMarket Share (%)
Dataiku10.5%
Starburst Galaxy0.8%
Other88.7%
Data Science Platforms
 

Featured Reviews

RichardXu - PeerSpot reviewer
The platform organizes workflows visually and efficiently
One of the valuable features of Dataiku is the workflow capability. It allows us to organize a workflow efficiently. The platform has a visual interface, making it much easier for educated professionals to organize their work. This feature is useful because it simplifies tasks and eliminates the need for a data scientist. If you are knowledgeable about AI, you can directly write using primitive tools like Pantera flow, PyTorch, and Scikit-learn. However, Dataiku makes this process much easier.
Stephen-Howard - PeerSpot reviewer
Federated querying delivers integrated data at record speed and reduces processing time
The biggest win has been the ability to combine data from multiple sources and deliver it to the business at record speed. This capability has allowed us to query directly through Starburst Galaxy, enabling teams to access integrated data that would otherwise be hard to pull together. This has reduced both our ETL processing time and storage costs. We are answering questions that would have been hard, if not impossible, to answer previously because the data came from disparate, disconnected sources.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"Traceability is vital since I manage many cohorts, and collaboration is key as I have multiple engineers substituting for one another."
"I believe the return on investment looks positive."
"The solution is quite stable."
"If many teams are collaborating and sharing Jupyter notebooks, it's very useful."
"Data Science Studio's data science model is very useful."
"I like the interface, which is probably my favorite part of the solution. It is really user-friendly for an IT person."
"Our clients can easily drag and drop components and use them on the spot."
"The advantage is that you can focus on machine learning while having access to what they call 'recipes.' These recipes allow me to preprocess and prepare data without writing any code."
"Starburst Galaxy has improved our organization by unifying access to all major data sources, reducing the need for complex ETL processes."
"Starburst Galaxy serves as our primary SQL-based data processing engine, a strategic decision driven by its seamless integration with our AWS cloud infrastructure and its ability to deliver high performance with low-latency responses."
"Starburst on Trino, combined with our SQL-native data transformation tool SQLMesh, has delivered anywhere from a two to five times improvement in compute performance across our transformation DAG."
"Starburst has provided us with virtually guaranteed performance on complex queries across datasets that are in the tens of gigabytes which complete in seconds."
"Starburst has provided us with virtually guaranteed performance on complex queries across datasets that are in the tens of gigabytes which complete in seconds."
"Starburst Galaxy is becoming a cornerstone of our data platform, empowering us to make smarter and faster decisions across the organization."
"The most fundamental feature is the query engine, which is much faster than any of the competitors; Starburst is able to finish most queries within 10 seconds, which is especially important for many non-technical employees."
"Starburst Galaxy has improved our organization by unifying access to all major data sources, reducing the need for complex ETL processes."
 

Cons

"In the next release of this solution, I would like to see deep learning better integrated into the tool and not simply an extension or plugin."
"We still encounter some integration issues."
"The license is very expensive. It would be great to have an intermediate license for basic treatments that do not require extensive experience."
"There is room for improvement in terms of allowing for more code-based features."
"Server up-time needs to be improved. Also, query engines like Spark and Hive need to be more stable."
"Although known for Big Data, the processing time to process 1.8 billion records was terribly slow (five days)."
"Dataiku still needs some coding, and that could be a difference where business data scientists would go for DataRobot more than Dataiku."
"The license is very expensive."
"I would like Starburst to leverage AI to improve usability. Data lakes are complicated and difficult for users to explore."
"Cluster startup time is another pain point, typically 3 to 5 minutes, which is not the worst with proper planning but can be annoying for ad-hoc work."
"Multi-tenancy could be improved. In order to have multiple environments for SSO, we maintain multiple tenants that are connected to different AWS accounts via the Marketplace."
"The most persistent issue is the cluster spin-up time."
"Cluster startup time can be slow, sometimes taking over a minute."
 

Pricing and Cost Advice

"Pricing is pretty steep. Dataiku is also not that cheap."
"The annual licensing fees are approximately €20 ($22 USD) per key for the basic version and €40 ($44 USD) per key for the version with everything."
Information not available
report
Use our free recommendation engine to learn which Data Science Platforms solutions are best for your needs.
872,837 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
17%
Manufacturing Company
10%
Computer Software Company
9%
Energy/Utilities Company
6%
Financial Services Firm
28%
Computer Software Company
14%
Government
8%
Consumer Goods Company
6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
By reviewers
Company SizeCount
Small Business4
Midsize Enterprise1
Large Enterprise8
By reviewers
Company SizeCount
Small Business4
Midsize Enterprise2
Large Enterprise1
 

Questions from the Community

What is your experience regarding pricing and costs for Dataiku Data Science Studio?
I find the pricing of Dataiku quite affordable for our customers, as they are usually large companies. However, it is a pricey solution and I primarily recommend it to bigger companies.
What needs improvement with Dataiku Data Science Studio?
In terms of enhancing collaboration within my team, I would not say Dataiku is the best one because it's so expensive. We are not able to provide it to everyone. There are very few people who have ...
What is your primary use case for Dataiku Data Science Studio?
My main use cases in Dataiku include ensuring a strong data pipeline ingestion. We have people from data management, so we need to take care of the pipeline, their data quality, data drifting, all ...
What is your experience regarding pricing and costs for Starburst Galaxy?
I recommend experimenting with different cluster sizes to determine what works best for your particular use case.
What needs improvement with Starburst Galaxy?
Multi-tenancy could be improved. In order to have multiple environments for SSO, we maintain multiple tenants that are connected to different AWS accounts via the Marketplace. On the AWS side this ...
What is your primary use case for Starburst Galaxy?
I use the solution for processing large simulation datasets into aggregated datasets that can either be used for real-time data analysis or stored for later analysis.
 

Comparisons

 

Also Known As

Dataiku DSS
No data available
 

Overview

 

Sample Customers

BGL BNP Paribas, Dentsu Aegis, Link Mobility Group, AramisAuto
Information Not Available
Find out what your peers are saying about Dataiku vs. Starburst Galaxy and other solutions. Updated: September 2025.
872,837 professionals have used our research since 2012.