No more typing reviews! Try our Samantha, our new voice AI agent.

Cloudera Distribution for Hadoop vs IBM InfoSphere BigInsights [EOL] comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Cloudera Distribution for H...
Average Rating
8.0
Reviews Sentiment
6.3
Number of Reviews
51
Ranking in other categories
Hadoop (2nd), NoSQL Databases (12th)
IBM InfoSphere BigInsights ...
Average Rating
7.6
Number of Reviews
7
Ranking in other categories
No ranking in other categories
 

Featured Reviews

SA
Head of Advaced Analytics & Intelligence; AGM at Alinma Bank
Integration of multiple features supports data analytics and processing
Cloudera Distribution for Hadoop provides numerous features and capabilities combined into one platform.The solution offers power processing and supports different file systems and query engines. It provides parallel processing for handling many requests. The platform includes role-based access control in Cloudera Distribution for Hadoop. It secures the data itself and provides users with different roles and privileges.
it_user743022 - PeerSpot reviewer
BigData Consultant at a tech services company with 10,001+ employees
Served our customers better by giving real-time suggestions and proactive maintenance, however the UI was not interactive
* The UI was not interactive: Responses used to be very slow and hang up at times. * The UI was not really helping to track the real-time jobs and its logs. * You can bring in a better UI for job management and health checks. * Developer API documentation needs improvement.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The solution is reliable and stable, it fits our requirements."
"This is the only solution that is possible to install on-premise."
"CDH has a wide variety of proprietary tools that we use, like Impala. So from that perspective, it's quite useful as opposed to something open-source. We get a lot of value from Cloudera's proprietary tools."
"Cloudera is a very manageable solution with good support."
"The solution has good features connected to end-to-end security."
"Implementing a Hadoop cluster has become relatively straight-forward using CDH."
"The tool can be deployed using different container technologies, which makes it very scalable."
"Cloudera is one of the best solutions for on-prem."
"It integrates with JSqsh, enabling us to submit long-running exports from the shell."
"It gives us the option of extending our analytics system."
"The thing that I have found most valuable in this solution is the BIQSQL implementation which is fully SQL ANSI compliant."
"This helped us to serve our customers better by giving real-time suggestions and proactive maintenance."
"This is a very helpful product, with continuous improvements by IBM and a great customer service which enables easy access to valuable information for both Hadoop developers and system administrators."
"Watson is the perfect engine for text analysis for us, but in 2014 it doesn’t support the Russian language."
"InfoSphere Streams was the one core product from the platform in which we were using. We were building a real-time response system and we built it on InfoSphere Streams."
"Definitely a product worth evaluating, esp if you are an IBM shop and if done on Bluemix, it gives a jump start on protoypes/POCs."
 

Cons

"The solution does not support multiple languages very well and this means users need to create work-arounds to implement some solutions."
"Without the big data environment, we cannot store all of this data live. We have billions of records and terabytes of storage to be used. It's not an option actually for us to have a big data environment."
"It is quite complicated to configure and install. Integrating the platform into an information system is always a challenge, especially when starting with on-premise implementation."
"There are better solutions out there that have more features than this one."
"The one thing that we struggled with predominately was support. Because it was relatively new, support was always a big issue and I think it's still a bit of an ongoing concern with the team currently managing it."
"While the deployed product is generally functional, there are instances where it presents difficulties."
"On same ground I didn't see much training materials from Cloudera."
"It needs more standardized documentation on Hive."
"I encountered issues with having the appropriate documentation resources, as well as getting the right stability when explored virtualized environments based on Virtualbox and HyperV software."
"The UI was not interactive: Responses used to be very slow and hang up at times."
"I have found a lot of issues in Fluid Query and BigInsights Applications to move data in the enterprise version."
"I'd like to see faster execution time, especially for simple queries that don't touch on many rows and don't involve many operations (Joins, Unions, Groupbys)."
"The UI was not interactive: Responses used to be very slow and hang up at times."
"Initial setup is rather complex in comparison with Cloudera."
"Unfortunately the stability of the platform was an issue."
"For our business customer pricing is very important motivation, so I can advise change licensing policy from “by volume in the cluster” to “number of machines in the cluster”."
 

Pricing and Cost Advice

"It is an expensive product."
"I haven't bought a license for this solution. I'm only using the Apache license version."
"I wouldn't recommend CDH to others because of its high cost."
"The pricing must be improved."
"The price could be better for the product."
"I believe we pay for a three-year license."
"The solution is expensive."
"Cloudera requires a license to use."
Information not available
report
Use our free recommendation engine to learn which Hadoop solutions are best for your needs.
895,990 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
23%
Marketing Services Firm
9%
Comms Service Provider
6%
Healthcare Company
6%
No data available
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
By reviewers
Company SizeCount
Small Business16
Midsize Enterprise9
Large Enterprise32
By reviewers
Company SizeCount
Small Business3
Large Enterprise4
 

Questions from the Community

What is your experience regarding pricing and costs for Cloudera Distribution for Hadoop?
The price for Cloudera is average, yet it is very good compared to other solutions. It can be deployed on-premises, unlike competitors' cloud-only solutions.
What needs improvement with Cloudera Distribution for Hadoop?
If they could support modifying the data more easily than the current implementation, it would be beneficial.
What is your primary use case for Cloudera Distribution for Hadoop?
We use Cloudera Distribution for Hadoop for many use cases including analytics, storing huge data sets, and various data processing tasks.
Ask a question
Earn 20 points
 

Also Known As

No data available
InfoSphere BigInsights
 

Overview

 

Sample Customers

37signals, Adconion,adgooroo, Aggregate Knowledge, AMD, Apollo Group, Blackberry, Box, BT, CSC
Coherent Path Inc., Optibus, Delhaize America, Diyotta Inc., Ernst & Young, Teikoku Databank Ltd., NCSU, Vestas
Find out what your peers are saying about Apache, Cloudera, Amazon Web Services (AWS) and others in Hadoop. Updated: April 2026.
895,990 professionals have used our research since 2012.