Try our new research platform with insights from 80,000+ expert users

Cloudera Data Science Workbench vs Databricks vs KNIME comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Mindshare comparison

As of June 2025, in the Data Science Platforms category, the mindshare of Cloudera Data Science Workbench is 1.2%, down from 1.6% compared to the previous year. The mindshare of Databricks is 16.5%, down from 19.7% compared to the previous year. The mindshare of KNIME is 12.0%, up from 10.1% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Data Science Platforms
 

Featured Reviews

Ismail Peer - PeerSpot reviewer
Useful for data science modeling but improvement is needed in MLOps and pricing
If you don't configure CDSW well, then it might be not useful for you. Deploying the tool can vary in complexity, but most of the time, it's relatively simple and straightforward. Triggering a job from data to production is easy, as the platform automates the deployment process. However, ensuring optimal resource allocation is essential for smooth operations.
ShubhamSharma7 - PeerSpot reviewer
Capability to integrate diverse coding languages in a single notebook greatly enhances workflow
Databricks offers various courses that I can use, whether it's PySpark, Scala, or R. I can leverage all these courses in a single notebook, which is beneficial for clients as they can access various tools in one place whenever needed. This is quite significant. I usually work with PySpark based on client requirements. After coding, I feed the Databricks notebooks into the ADF pipeline for updates. Databricks' capability to process data in parallel enhances data processing speed. Furthermore, I can connect our Databricks notebook directly with Power BI and other visualization tools like Qlik. Once we develop code, it allows us to transform raw data into visualizations for clients using analysis diagrams, which is very helpful.
Laurence Moseley - PeerSpot reviewer
Has a drag-and-drop interface and AI capabilities
It's difficult to pinpoint one single feature because KNIME has so many. For starters, it's very easy to learn. You can get started with just a one-hour video. The drag-and-drop interface makes it user-friendly. For example, if you want to read an Excel file, drag the "read Excel file" node from the repository, configure it by specifying the file location, and run it. This gives you a table with all your data. Next, you can clean the data by handling missing values, selecting specific columns you want to analyze, and then proceeding with your analysis, such as regression or correlation. KNIME has over 4,500 nodes available for download. In addition, KNIME offers various extensions. For instance, the text processing extension allows you to process text data efficiently. It's more powerful than other tools like NVivo and Palantir. KNIME also has AI capabilities. If you're unsure about the next step, the AI assistant can suggest the most frequently used nodes based on your previous work. Another valuable feature is the integration with Python. If you need to perform a task that requires Python, you can simply add a Python node, write the necessary code,

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"I appreciate CDSW's ability to logically segregate environments, such as data, DR, and production, ensuring they don't interfere with each other. The deployment of machine learning is fast and easy to manage. Its API calls are also fast."
"The Cloudera Data Science Workbench is customizable and easy to use."
"A very valuable feature is the data processing, and the solution is specifically good at using the Spark ecosystem."
"Easy to use and requires minimal coding and customizations."
"Databricks is a unified solution that we can use for streaming. It is supporting open source languages, which are cloud-agnostic. When I do database coding if any other tool has a similar language pack to Excel or SQL, I can use the same knowledge, limiting the need to learn new things. It supports a lot of Python libraries where I can use some very easily."
"Databricks is based on a Spark cluster and it is fast. Performance-wise, it is great."
"Imageflow is a visual tool that helps make it easier for business people to understand complex workflows."
"The capacity of use of the different types of coding is valuable. Databricks also has good performance because it is running in spark extra storage, meaning the performance and the capacity use different kinds of codes."
"Specifically for data science and data analytics purposes, it can handle large amounts of data in less time. I can compare it with Teradata. If a job takes five hours with Teradata databases, Databricks can complete it in around three to three and a half hours."
"The most valuable feature of Databricks is the notebook, data factory, and ease of use."
"KNIME is quite scalable, which is one of the most important features that we found."
"It's a very powerful and simple tool to use."
"I've never had any problems with stability."
"Clear view of the data at every step of ETL process enables changing the flow as needed."
"There are a lot of connectors available in KNIME."
"It provides very fast problem solving and I don't need to do much coding in it. I just drag and drop."
"The product is user-friendly."
"The most valuable feature is the data wrangling, which is what I mainly use it for."
 

Cons

"The tool's MLOps is not good. It's pricing also needs to improve."
"Running this solution requires a minimum of 12GB to 16GB of RAM."
"The data visualization for this solution could be improved. They have started to roll out a data visualization tool inside Databricks but it is in the early stages. It's not comparable to a solution like Power BI, Luca, or Tableau."
"I think setting up the whole account for one person and giving access are areas that can be difficult to manage and should be made a little easier."
"The tool should improve its integration with other products."
"The initial setup is difficult."
"Databricks would benefit from enhanced metrics and tighter integration with Azure's diagnostics."
"Databricks' technical support takes a while to respond and could be improved."
"The product should incorporate more learning aspects. It needs to have a free trial version that the team can practice."
"The solution could be improved by adding a feature that would make it more user-friendly for our team. The feature is simple, but it would be useful. Currently, our team is more familiar with the language R, but Databricks requires the use of Jupyter Notebooks which primarily supports Python. We have tried using RStudio, but it is not a fully integrated solution. To fully utilize Databricks, we have to use the Jupyter interface. One feature that would make it easier for our team to adopt the Jupyter interface would be the ability to select a specific variable or line of code and execute it within a cell. This feature is available in other Jupyter Notebooks outside of Databricks and in our own IDE, but it is not currently available within Databricks. If this feature were added, it would make the transition to using Databricks much smoother for our team."
"I would prefer to have more connectivity."
"Data visualization needs improvement."
"KNIME is not good at visualization."
"Both RapidMiner and KNIME should be made easier to use in the field of deep learning."
"It's pretty straightforward to understand. So, if you understand what the pipeline is, you can use the drag-and-drop functionality without much training. Doing the same thing in Python requires so much more training. That's why I use KNIME."
"The graphic features of KNIME need improvement"
"In the last update, KNIME started hiding a lot of the nodes. It doesn't mean hiding, but you need to know what you're looking for. Before that, you had just a tree that you could click, and you could get an overview of what kind of nodes do I have. Right now, it's like you need to know which node you need, and then you can start typing, but it's actually more difficult to find them."
"KNIME needs to provide more documentation and training materials, including webinars or online seminars."
 

Pricing and Cost Advice

"The product is expensive."
"I do not exactly know the costs, but one of our clients pays between $100 USD and $200 USD monthly."
"We only pay for the Azure compute behind the solution."
"The solution is based on a licensing model."
"I'm not involved in the financing, but I can say that the solution seemed reasonably priced compared to the competitors. Similar products are usually in the same price range. With five being affordable and one being expensive, I would rate Databricks a four out of five."
"Databricks' cost could be improved."
"The price is okay. It's competitive."
"Databricks are not costly when compared with other solutions' prices."
"We're charged on what the data throughput is and also what the compute time is."
"This is an open-source solution that is free to use."
"KNIME Business Hub is expensive for small companies."
"While there are certain limitations in functionality, you can still utilize it efficiently free of charge."
"Scaling to the on-premises version requires a licensing fee per user that is a bit expensive in comparison to R, Python, and SAS."
"The client versions are mostly free, and we pay only for the KNIME server version. It's not a cheap solution."
"It is expensive to procure the license."
"KNIME desktop is free, which is great for analytics teams. Server is well priced, depending on how much support is required."
"It's an open-source solution."
report
Use our free recommendation engine to learn which Data Science Platforms solutions are best for your needs.
857,162 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
33%
Healthcare Company
9%
Manufacturing Company
9%
Computer Software Company
8%
Financial Services Firm
18%
Computer Software Company
10%
Manufacturing Company
9%
Healthcare Company
6%
Financial Services Firm
12%
Manufacturing Company
11%
Computer Software Company
9%
University
8%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
 

Questions from the Community

What do you like most about Cloudera Data Science Workbench?
I appreciate CDSW's ability to logically segregate environments, such as data, DR, and production, ensuring they don'...
What needs improvement with Cloudera Data Science Workbench?
The tool's MLOps is not good. It's pricing also needs to improve.
What is your primary use case for Cloudera Data Science Workbench?
We have different use cases. Our banking use case uses machine learning to identify customer life events and recommen...
Which do you prefer - Databricks or Azure Machine Learning Studio?
Databricks gives you the option of working with several different languages, such as SQL, R, Scala, Apache Spark, or ...
How would you compare Databricks vs Amazon SageMaker?
We researched AWS SageMaker, but in the end, we chose Databricks. Databricks is a Unified Analytics Platform designe...
Which would you choose - Databricks or Azure Stream Analytics?
Databricks is an easy-to-set-up and versatile tool for data management, analysis, and business analytics. For analyti...
What do you like most about KNIME?
Since KNIME is a no-code platform, it is easy to work with.
What is your experience regarding pricing and costs for KNIME?
I rate the product’s pricing a seven out of ten, where one is cheap and ten is expensive.
What needs improvement with KNIME?
I have seen the potential to interact with Python, which is currently a bit limited. I am interested in the newer ver...
 

Also Known As

CDSW
Databricks Unified Analytics, Databricks Unified Analytics Platform, Redash
KNIME Analytics Platform
 

Overview

 

Sample Customers

IQVIA, Rush University Medical Center, Western Union
Elsevier, MyFitnessPal, Sharethrough, Automatic Labs, Celtra, Radius Intelligence, Yesware
Infocom Corporation, Dymatrix Consulting Group, Soluzione Informatiche, MMI Agency, Estanislao Training and Solutions, Vialis AG
Find out what your peers are saying about Databricks, Amazon Web Services (AWS), Knime and others in Data Science Platforms. Updated: June 2025.
857,162 professionals have used our research since 2012.