Dataiku vs RapidMiner comparison

Sponsored
 

Comparison Buyer's Guide

Executive Summary
 

Categories and Ranking

IBM SPSS Statistics
Sponsored
Ranking in Data Science Platforms
9th
Average Rating
8.0
Number of Reviews
36
Ranking in other categories
Data Mining (3rd)
Dataiku
Ranking in Data Science Platforms
7th
Average Rating
8.2
Number of Reviews
7
Ranking in other categories
No ranking in other categories
RapidMiner
Ranking in Data Science Platforms
6th
Average Rating
8.6
Number of Reviews
22
Ranking in other categories
Predictive Analytics (3rd)
 

Mindshare comparison

As of July 2024, in the Data Science Platforms category, the mindshare of IBM SPSS Statistics is 3.0%, up from 2.3% compared to the previous year. The mindshare of Dataiku is 12.9%, up from 6.8% compared to the previous year. The mindshare of RapidMiner is 6.5%, up from 6.3% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Data Science Platforms
Unique Categories:
Data Mining
21.6%
No other categories found
Predictive Analytics
13.7%
 

Featured Reviews

Ali Bin Tahir - PeerSpot reviewer
Mar 1, 2024
Provides comprehensive data analysis and has a simple setup process
We use the product to conduct multiple and diverse statistical analyses across various datasets The software offers consistency across multiple research projects helping us with predictive analytics capabilities. The product’s most valuable capability is to handle large datasets and ensure…
GN
Nov 28, 2019
Good data preparation tools and integrates well with BigQuery
From an administrative point of view, I would like to be able to communicate with the users who are logged into the system. For example, I would like to be able to send a broadcast message that says "I am shutting down the system." I would like to see more organization and better cohesion within the tool. In the next release of this solution, I would like to see deep learning better integrated into the tool and not simply an extension or plugin. I would like to have a better way to manage images and sound. The error messages are not self explanatory and can sometimes be difficult to understand.
AA
Aug 30, 2023
Easy to use and has a huge community
RapidMiner interacts well with data. It is valuable, easy to use, and easy to look at, and that is awesome.  It is easy to use and has a huge community that I can rely on for help. Moreover, it is interactive.  In terms of the UI and SaaS, the user interface with KNIME is more appealing than…

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"Custom tables and macros: They allow us to create useful reports quickly for a broad audience."
"The most valuable features mainly include factor analysis, correlation analysis, and geographic analysis."
"The solution has numerous valuable features. We particularly like custom tabs. It's very useful. We end up analyzing a lot of software data, so features related to custom tabs are really helpful."
"The most valuable features are the small learning curve and its ability to hold a lot of data."
"They have many existing algorithms that we can use and use effectively to analyze and understand how to put our data to work to improve what we do."
"The most valuable feature of IBM SPSS Statistics is all the functionality it provides. Additionally, it is simple to do the five-way analysis that you can into multidimensional setup space. It's the multidimensional space facility that is most useful."
"It offers very good visualization."
"Capability analysis is one of the main and valuable functions. We also do some hypothesis testing in Minitab and summary stats. These are the functions that we find very useful."
"The solution is quite stable."
"The most valuable feature of this solution is that it is one tool that can do everything, and you have the ability to very easily push your design to prediction."
"The most valuable feature is the set of visual data preparation tools."
"I like the interface, which is probably my favorite part of the solution. It is really user-friendly for an IT person."
"Data Science Studio's data science model is very useful."
"If many teams are collaborating and sharing Jupyter notebooks, it's very useful."
"Cloud-based process run helps in not keeping the systems on while processes are running."
"Extremely easy to use with its GUI-based functionality and large compatibility with various data sources. Also, maintenance processes are much more automated than ever, with fewer errors."
"The documentation for this solution is very good, where each operator is explained with how to use it."
"What I like about RapidMiner is its all-in-one nature, which allows me to prepare, extract, transform, and load data within the same tool."
"The GUI capabilities of the solution are excellent. Their Auto ML model provides for even non-coder data scientists to deploy a model."
"I like not having to write all solutions from code. Being able to drag and drop controls, enables me to focus on building the best model, without needing to search for syntax errors or extra libraries."
"The solution is very intuitive and powerful."
"Using the GUI, I can have models and algorithms drag and drop nodes."
"The solution is stable."
"Scalability is not really a concern with RapidMiner. It scales very well and can be used in global implementations."
 

Cons

"In some cases, the product takes time to load a large dataset. They could improve this particular area."
"The reports could be better."
"I know that SPSS is a statistical tool but it should also include a little bit of analytical behavior. You can call it augmented analysis or predictive analysis. The bottom line is it should have more graphical and analytical capabilities."
"If there is any self-generation data collection plan (DCP), it would be helpful in gathering data. It would also be useful if there is a function to scale it up to, let's say, UiPath and have it consolidate and integrate into a UiPath solution."
"It could provide even more in the way of automation as there are many opportunities."
"I would like SPSS to improve its integration with other data-filing IBM tools. I also think its duration with data, utilization, and graphics could be better."
"SPSS slows down the computer or the laptop if the data is huge; then you need a faster computer."
"SPSS is a tool that's been around since the late 60s, and it's the universal worldwide standard for quantitative social science data analysis. That said, it does seem a bit strange to me that the graphical output functions are so clunky after all these years. The output of charts and graphs that SPSS produces is hideous."
"Server up-time needs to be improved. Also, query engines like Spark and Hive need to be more stable."
"In the next release of this solution, I would like to see deep learning better integrated into the tool and not simply an extension or plugin."
"There were stability issues: 1) SQL operations, such as partitioning, had bugs and showed wrong results. 2) Due to server downtime, scheduled processes used to fail. 3) Access to project folders was compromised (privacy issue) with wrong people getting access to confidential project folders."
"Although known for Big Data, the processing time to process 1.8 billion records was terribly slow (five days)."
"I think it would help if Data Science Studio added some more features and improved the data model."
"The ability to have charts right from the explorer would be an improvement."
"I find that it is a little slow during use. It takes more time than I would expect for operations to complete."
"Dataiku still needs some coding, and that could be a difference where business data scientists would go for DataRobot more than Dataiku."
"In the Mexican or Latin American market, it's kind of pricey."
"In terms of the UI and SaaS, the user interface with KNIME is more appealing than RapidMiner."
"If they could include video tutorials, people would find that quite helpful."
"RapidMiner would be improved with the inclusion of more machine learning algorithms for generating time-series forecasting models."
"I would like to see more integration capabilities."
"The visual interface could use something like the-drag-and-drop features which other products already support. Some additional features can make RapidMiner a better tool and maybe more competitive."
"About twenty-five percent of my problems involve image processing, and I found RapidMiner lacking in this domain. While we work on OCR and similar tasks, RapidMiner hasn't been as engaged in that field as other models. Some other models also support email processing, but RapidMiner doesn't offer this feature."
"I would like to see all users have access to all of the deep learning models, and that they can be used easily."
 

Pricing and Cost Advice

"SPSS is an expensive piece of software because it's incredibly complex and has been refined over decades, but I would say it's fairly priced."
"More affordable training for new staff members."
"Our licence is on a yearly renewal basis. While pricing is not the primary concern in our evaluation, as products are assessed by whether they can meet our user needs and expertise, the cost can be a limiting factor in the number of licences we procure."
"While the pricing of the product may be higher, the accompanying service and features justify the investment."
"The pricing of the modeler is high and can reduce the utility of the product for those who can not afford to adopt it."
"I rate the tool's pricing a five out of ten."
"The price of this solution is a little bit high, which was a problem for my company."
"It's quite expensive, but they do a special deal for universities."
"Pricing is pretty steep. Dataiku is also not that cheap."
"The annual licensing fees are approximately €20 ($22 USD) per key for the basic version and €40 ($44 USD) per key for the version with everything."
"For the university, the cost of the solution is free for the students and teachers."
"I used an educational license for this solution, which is available free of charge."
"The client only has to pay the licensing costs. There are not any maintenance or hidden costs in addition to the license."
"Although we don't pay licensing fees because it is being used within the university, my understanding is that the cost is between $5,000 and $10,000 USD per year."
"I'm not fully aware of RapidMiner's price because we had licenses provided, but from my analysis, it's moderately priced, not too high or too low. It's worth the investment."
report
Use our free recommendation engine to learn which Data Science Platforms solutions are best for your needs.
793,295 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
University
17%
Educational Organization
13%
Computer Software Company
9%
Financial Services Firm
8%
Financial Services Firm
18%
Educational Organization
15%
Manufacturing Company
9%
Computer Software Company
7%
Computer Software Company
11%
University
11%
Educational Organization
10%
Financial Services Firm
9%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What do you like most about IBM SPSS Statistics?
The software offers consistency across multiple research projects helping us with predictive analytics capabilities.
What is your experience regarding pricing and costs for IBM SPSS Statistics?
While the pricing of the product may be higher, the accompanying service and features justify the investment. However...
What needs improvement with IBM SPSS Statistics?
In some cases, the product takes time to load a large dataset. They could improve this particular area.
What is your experience regarding pricing and costs for Dataiku Data Science Studio?
Pricing is pretty steep. Dataiku is also not that cheap. It depends on the client and how much they want to spend tow...
What needs improvement with Dataiku Data Science Studio?
The no-code/low-code aspect, where DataRobot doesn't need much coding at all. Dataiku still needs some coding, and th...
What is your primary use case for Dataiku Data Science Studio?
My current client has Dataiku. We do sentiment analysis and some small large language models right now. We use Dataik...
What do you like most about RapidMiner?
RapidMiner is a no-code machine learning tool. I can install it on my local machine and work with smaller datasets. I...
What is your experience regarding pricing and costs for RapidMiner?
I'm not fully aware of RapidMiner's price because we had licenses provided, but from my analysis, it's moderately pri...
What needs improvement with RapidMiner?
The product must provide data-cleaning features. I could not use RapidMiner for data cleaning in one of my projects a...
 

Also Known As

SPSS Statistics
Dataiku DSS
No data available
 

Learn More

Video not available
 

Overview

 

Sample Customers

LDB Group, RightShip, Tennessee Highway Patrol, Capgemini Consulting, TEAC Corporation, Ironside, nViso SA, Razorsight, Si.mobil, University Hospitals of Leicester, CROOZ Inc., GFS Fundraising Solutions, Nedbank Ltd., IDS-TILDA
BGL BNP Paribas, Dentsu Aegis, Link Mobility Group, AramisAuto
PayPal, Deloitte, eBay, Cisco, Miele, Volkswagen
Find out what your peers are saying about Dataiku vs. RapidMiner and other solutions. Updated: July 2024.
793,295 professionals have used our research since 2012.