Try our new research platform with insights from 80,000+ expert users

Alteryx vs Cloudera Data Science Workbench vs Databricks comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Mindshare comparison

As of June 2025, in the Data Science Platforms category, the mindshare of Alteryx is 6.1%, down from 7.6% compared to the previous year. The mindshare of Cloudera Data Science Workbench is 1.2%, down from 1.6% compared to the previous year. The mindshare of Databricks is 16.5%, down from 19.7% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Data Science Platforms
 

Featured Reviews

Theresa McLaughlin - PeerSpot reviewer
Quick development enables seamless data processing despite occasional support issues
There were times when the product would fail during development without an apparent reason. The support structure changed; initially, we received great support, however, it later became less reliable due to licensing issues and a tiered support system. Licensing negotiations were problematic, affecting our product usage. For instance, our licenses were temporarily lost during negotiations when an agreement couldn't be reached.
Ismail Peer - PeerSpot reviewer
Useful for data science modeling but improvement is needed in MLOps and pricing
If you don't configure CDSW well, then it might be not useful for you. Deploying the tool can vary in complexity, but most of the time, it's relatively simple and straightforward. Triggering a job from data to production is easy, as the platform automates the deployment process. However, ensuring optimal resource allocation is essential for smooth operations.
ShubhamSharma7 - PeerSpot reviewer
Capability to integrate diverse coding languages in a single notebook greatly enhances workflow
Databricks offers various courses that I can use, whether it's PySpark, Scala, or R. I can leverage all these courses in a single notebook, which is beneficial for clients as they can access various tools in one place whenever needed. This is quite significant. I usually work with PySpark based on client requirements. After coding, I feed the Databricks notebooks into the ADF pipeline for updates. Databricks' capability to process data in parallel enhances data processing speed. Furthermore, I can connect our Databricks notebook directly with Power BI and other visualization tools like Qlik. Once we develop code, it allows us to transform raw data into visualizations for clients using analysis diagrams, which is very helpful.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"Its initial setup is easy."
"I think the most valuable feature for Alteryx in a health facility is that it permits cleaning, organizing, and merging of databases such as Excel and Access."
"It has everything that one needs. Whatever you want to do with the data can be done with Alteryx."
"The product's most valuable features include its ease of use for non-technical users and machine learning capabilities."
"It offered quick development, with the ability to process large datasets."
"I believe that the ability to leverage the gallery for scalability, as well as the general data blending functionality, is most beneficial to our core-based users."
"Alteryx is a low-code platform, and that's the biggest reason why we chose it."
"I like the solution's velocity, the speed with which it processes data, and its ease of use."
"I appreciate CDSW's ability to logically segregate environments, such as data, DR, and production, ensuring they don't interfere with each other. The deployment of machine learning is fast and easy to manage. Its API calls are also fast."
"The Cloudera Data Science Workbench is customizable and easy to use."
"The Delta Lake data type has been the most useful part of this solution. Delta Lake is an opensource data type and it was implemented and invented by Databricks."
"The most valuable features of the solution are the hardware and the resources it quickly provides without much hassle."
"Databricks gives you the flexibility of using several programming languages independently or in combination to build models."
"There are good features for turning off clusters."
"The initial setup is pretty easy."
"The most valuable feature is the Spark cluster which is very fast for heavy loads, big data processing and Pi Spark."
"The ease of use and its accessibility are valuable."
"I like how easy it is to share your notebook with others. You can give people permission to read or edit. I think that's a great feature. You can also pull in code from GitHub pretty easily. I didn't use it that often, but I think that's a cool feature."
 

Cons

"The tool could include more native connectors, such as for global ERPs, instead of requiring additional fees for these connections."
"There are a few hiccups with specific data sets and languages or formats that the data comes in. That may be a minor problem, but we can work through it. We had some issues looking at XML format in added data, but it wasn't significant."
"In the database, it should be more functional and connect to more big data, especially using IPI."
"Alteryx can improve the model management and deployment processing of large workloads."
"Alteryx can improve in data science. They have to have more features and components in the data science aspect because they claim to be a data science tool. However, in order to be more competitive, they have to improve on their data science propositions. Thre are other solutions on the market, such as other players in the market, Data2Go or DataIQ, and Alteryx needs to catch up."
"The GUI interface functions but it could stand to be updated to a more modern look and feel."
"It's a technical product and those that don't have proper training will have to deal with a steep learning curve."
"Even when it already includes some AI models, this area could be improved."
"Running this solution requires a minimum of 12GB to 16GB of RAM."
"The tool's MLOps is not good. It's pricing also needs to improve."
"It would be great if Databricks could integrate all the cloud platforms."
"The product should provide more advanced features in future releases."
"The initial setup is difficult."
"Some of the error messages that we receive are too vague, saying things like "unknown exception", and these should be improved to make it easier for developers to debug problems."
"The product could be improved regarding the delay when switching to higher-performing virtual machines compared to other platforms."
"I believe that this product could be improved by becoming more user-friendly."
"The connectivity with various BI tools could be improved, specifically the performance and real time integration."
"Instead of relying on a massive instance, the solution should offer micro partition levels. They're working on it, however, they need to implement it to help the solution run more effectively."
 

Pricing and Cost Advice

"The solution has a more costly license than other tools in the market."
"The price for Alteryx Designer is reasonable but the price for Alteryx Server for universal collaborations is too expensive."
"​Very transparent.​"
"Its price should be lower. The key thing that we see is that talking about ROI is an important element at the time of purchase. Cost becomes a factor in every discussion. Justifying the ROI for these kinds of workflows is always a challenge, and the only way to counter the challenge is by addressing the pricing."
"It can be a bit pricey, especially after the first year."
"The designer license costs 5000 euros. The server edition is 1000 euros."
"Opt for the three year subscription. It is 20% less than the yearly one."
"The designer has a list price of $5,995 USD."
"The product is expensive."
"The price is okay. It's competitive."
"Databricks' cost could be improved."
"The solution is a good value for batch processing and huge workloads."
"The solution uses a pay-per-use model with an annual subscription fee or package. Typically this solution is used on a cloud platform, such as Azure or AWS, but more people are choosing Azure because the price is more reasonable."
"We find Databricks to be very expensive, although this improved when we found out how to shut it down at night."
"Databricks is a very expensive solution. Pricing is an area that could definitely be improved. They could provide a lower end compute and probably reduce the price."
"The cost is around $600,000 for 50 users."
"We're charged on what the data throughput is and also what the compute time is."
report
Use our free recommendation engine to learn which Data Science Platforms solutions are best for your needs.
856,873 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
24%
Computer Software Company
10%
Manufacturing Company
9%
Healthcare Company
5%
Financial Services Firm
33%
Healthcare Company
9%
Manufacturing Company
9%
Computer Software Company
8%
Financial Services Firm
18%
Computer Software Company
10%
Manufacturing Company
9%
Healthcare Company
6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
 

Questions from the Community

What is the Biggest Difference Between Alteryx and IBM SPSS Modeler?
One of the differences is that with Alteryx you can use it as an ETL and analytics tool. Please connect with me direc...
What is the Biggest Difference Between Alteryx and IBM SPSS Modeler?
Alteryx is an extremely easy and flexible data tool, flexible in terms of drag and drop toolset and also has python, ...
What is the Biggest Difference Between Alteryx and IBM SPSS Modeler?
I am not familiar with IBM SPSS Modeler, therefore, I cannot compare these two products. Regarding Alteryx I can say...
What do you like most about Cloudera Data Science Workbench?
I appreciate CDSW's ability to logically segregate environments, such as data, DR, and production, ensuring they don'...
What needs improvement with Cloudera Data Science Workbench?
The tool's MLOps is not good. It's pricing also needs to improve.
What is your primary use case for Cloudera Data Science Workbench?
We have different use cases. Our banking use case uses machine learning to identify customer life events and recommen...
Which do you prefer - Databricks or Azure Machine Learning Studio?
Databricks gives you the option of working with several different languages, such as SQL, R, Scala, Apache Spark, or ...
How would you compare Databricks vs Amazon SageMaker?
We researched AWS SageMaker, but in the end, we chose Databricks. Databricks is a Unified Analytics Platform designe...
Which would you choose - Databricks or Azure Stream Analytics?
Databricks is an easy-to-set-up and versatile tool for data management, analysis, and business analytics. For analyti...
 

Also Known As

No data available
CDSW
Databricks Unified Analytics, Databricks Unified Analytics Platform, Redash
 

Overview

 

Sample Customers

AnalyticsIq Inc., belk, BloominBrands Inc., Cardinalhealth, Cineplex, Dairy Queen
IQVIA, Rush University Medical Center, Western Union
Elsevier, MyFitnessPal, Sharethrough, Automatic Labs, Celtra, Radius Intelligence, Yesware
Find out what your peers are saying about Databricks, Amazon Web Services (AWS), Knime and others in Data Science Platforms. Updated: June 2025.
856,873 professionals have used our research since 2012.