User Reviews of Apache Spark & Cloudera Distribution for Hadoop

Updated March 2024

Would you like to learn about products from people using them now? Simplify your research with trusted advice from people like you.

Apache Spark review

Ilya Afanasyev
Senior Software Development Engineer at Yahoo!
Reliable, able to expand, and handles large amounts of data well
We use batch processing. It works well with our formats and file versions, and there's a lot of functionality. Each hour, our pipeline copies the changes from MongoDB to a specific file. Each time the pipeline copied all of the data, it would do so for all of the tables, including those without changes. Tables have a...
Cloudera Distribution for Hadoop review

Thishen Govender
BI Manager at Discovery Health
Includes several useful proprietary tools
Integration is one of the main things we struggle with because we're working with several other environments. For example, we've got an MPP environment outside the Hadoop environment. Many cloud-based platforms like Azure are fully integrated with technology that gives you MPP, machine learning, and data lakes all in one environment. We've...

Since 2012, we've had 765,234 professionals use our research.
