No more typing reviews! Try our Samantha, our new voice AI agent.

Cloudera Data Platform vs Cloudera DataFlow comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Cloudera Data Platform
Average Rating
7.6
Reviews Sentiment
5.5
Number of Reviews
37
Ranking in other categories
Cloud Master Data Management (MDM) (7th), Data Management Platforms (DMP) (4th), AI Data Analysis (8th)
Cloudera DataFlow
Average Rating
7.4
Reviews Sentiment
6.5
Number of Reviews
5
Ranking in other categories
Streaming Analytics (19th)
 

Mindshare comparison

Cloudera Data Platform and Cloudera DataFlow aren’t in the same category and serve different purposes. Cloudera Data Platform is designed for Data Management Platforms (DMP) and holds a mindshare of 9.0%, up 1.4% compared to last year.
Cloudera DataFlow, on the other hand, focuses on Streaming Analytics, holds 1.9% mindshare, up 1.0% since last year.
Data Management Platforms (DMP) Mindshare Distribution
ProductMindshare (%)
Cloudera Data Platform9.0%
Palantir Foundry15.6%
Informatica Intelligent Data Management Cloud (IDMC)10.1%
Other65.3%
Data Management Platforms (DMP)
Streaming Analytics Mindshare Distribution
ProductMindshare (%)
Cloudera DataFlow1.9%
Apache Flink10.9%
Databricks9.0%
Other78.2%
Streaming Analytics
 

Featured Reviews

T Sarwar - PeerSpot reviewer
Data architect at SentientAI, Karachi
Has enabled efficient big data processing and querying but remains complex to manage and configure
Cloudera Data Platform should use fewer tools and remove the complexity between them. It should make it easier for the end user to change the configuration and understand it better. The UI tool for jobs in Cloudera Data Platform can be improved to provide a proper image of ETL jobs and detailed consolidated graphs to monitor Spark-based Hue jobs.
Mohamed-Saied - PeerSpot reviewer
Senior Data Architect at Teradata Corporation
Efficient data integration and workflow scheduling elevate project performance
Cloudera DataFlow is used as an ETL or ELT solution within Cloudera's data pipeline. Our organization heavily relies on it for data ingestion, transformation, and warehousing. It is also used daily for operational tasks, and it integrates well within Cloudera's ecosystem for high performance and…

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The data platform is pretty neat. The workflow is also really good."
"Cloudera Data Platform has impacted my organization positively by providing cost-saving benefits, which is the North Star because of which we have shifted to it."
"The Hortonworks solution is so stable. It is working as a production system, without any error, without any downtime. If I have downtime, it is mostly caused by the hardware of the computers."
"Its ability to scale out seamlessly with little to no effort is very valuable to us."
"From a product standpoint, their Ambari UI is incredibly valuable for cluster monitoring."
"It is one of the better technology in terms of Hadoop."
"The most valuable part of this product is what Cloudera Data Science Workbench can do as a whole for modeling and analysis."
"Now, using this solution, it is much cheaper to have all of the data available for searching, not in real-time, but whenever there is a pending request."
"This solution is very scalable and robust."
"DataFlow's performance is okay."
"This solution is very scalable and robust."
"The initial setup was not so difficult"
"The most effective features are data management and analytics."
"Cloudera DataFlow is fully compatible with Cloudera's ecosystem and offers high efficiency through native connectors for various ecosystems."
 

Cons

"I would like to see more support for containers such as Docker and OpenShift."
"We face downtime and reliability issues many times a week with Cloudera Data Platform because it is a very complex system and all configurations are managed by the end user."
"The version control of the software is also an issue."
"The technical support is okay, but not excellent."
"Customer Service: 3/10 Technical Support: 3/10"
"It would also be nice if there were less coding involved."
"It requires too much coding work; we're not good Java and Python developers."
"There have been some governance initiatives, but they are far from production ready."
"It is not easy to use the R language. Though I don't know if it's possible, I believe it is possible, but it is not the best language for machine learning."
"Cloudera DataFlow's UI interface could be enhanced significantly. Memory handling can also be improved to be better than it is today."
"Although their workflow is pretty neat, it still requires a lot of transformation coding; especially when it comes to Python and other demanding programming languages."
"It's an outdated legacy product that doesn't meet the needs of modern data analysts and scientists."
"Although their workflow is pretty neat, it still requires a lot of transformation coding; especially when it comes to Python and other demanding programming languages."
 

Pricing and Cost Advice

"Currently, we are using the product in a sandbox environment, and there is no licensing. We might choose a licensing option once we get the results."
"It is priced well and it is affordable"
"DataFlow isn't expensive, but its value for money isn't great."
report
Use our free recommendation engine to learn which Data Management Platforms (DMP) solutions are best for your needs.
885,444 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Manufacturing Company
11%
Construction Company
9%
Marketing Services Firm
9%
Performing Arts
7%
No data available
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
By reviewers
Company SizeCount
Small Business8
Midsize Enterprise7
Large Enterprise26
No data available
 

Questions from the Community

What is your experience regarding pricing and costs for Hortonworks Data Platform?
The experience with pricing, setup cost, and licensing is very good.
What needs improvement with Hortonworks Data Platform?
Areas for improvement with Cloudera Data Platform could be the initial learning curve that can be a step for teams new to big data economy systems. Platform setup and configuration require careful ...
What is your primary use case for Hortonworks Data Platform?
Cloudera Data Platform on AWS was adopted as the core enterprise data platform, covering the full data lifecycle from ingestion to analytics and advanced use cases. Cloudera Data Platform was used ...
What do you like most about Cloudera DataFlow?
The most effective features are data management and analytics.
What needs improvement with Cloudera DataFlow?
Cloudera DataFlow's UI interface could be enhanced significantly. Memory handling can also be improved to be better than it is today.
What is your primary use case for Cloudera DataFlow?
Cloudera DataFlow is used as an ETL or ELT solution within Cloudera's data pipeline. Our organization heavily relies on it for data ingestion, transformation, and warehousing. It is also used daily...
 

Also Known As

No data available
CDF, Hortonworks DataFlow, HDF
 

Overview

 

Sample Customers

Information Not Available
Clearsense
Find out what your peers are saying about Palantir, Informatica, Denodo and others in Data Management Platforms (DMP). Updated: March 2026.
885,444 professionals have used our research since 2012.