Try our new research platform with insights from 80,000+ expert users
it_user347787 - PeerSpot reviewer
Lead Instructor at a tech company with 501-1,000 employees
Vendor
It has fairly matured tools like Cloudera Navigator and Cloudera Manager, but it is lacking Spark SQL support.

What is most valuable?

The features I find most valuable are--

  • Enterprise security features (authentication, authorization, data governance, and data protection)
  • Proactive support 
  • Training

How has it helped my organization?

  • Providing robust infrastructure
  • Fairly matured tools like Cloudera Navigator, Cloudera Manager, etc. 
  • Professional support enabled us to provide great customer service
  • Our clients are able to perform proactive maintenance in an efficient manner

What needs improvement?

Spark with R integration is missing. Also, it is lacking Spark SQL support.

For how long have I used the solution?

I've used it for over eight months.

Buyer's Guide
Cloudera Distribution for Hadoop
June 2025
Learn what your peers think about Cloudera Distribution for Hadoop. Get advice and tips from experienced pros sharing their opinions. Updated: June 2025.
856,873 professionals have used our research since 2012.

What was my experience with deployment of the solution?

We faced issues in deploying Azure with Cloudera. Our machine hard disks were getting corrupted whenever we used to get patches on weekends. Now these have been resolved.

How are customer service and support?

They offer excellent support.

How was the initial setup?

It was complex because we were doing first time deployment of Cloudera on Azure. Also complexity was high due to lot of security features.

What about the implementation team?

We are Big Data consultants, so we implement it.

Which other solutions did I evaluate?

Cloudera is a leader in providing distributions for Hadoop so it was no brainer for us to decide.

What other advice do I have?

There were initial hiccups when deploying Cloudera on Azure but now this combo is working fine in production, so you can go for it.

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
it_user347592 - PeerSpot reviewer
Senior Analyst - Strategy Analytics at a consultancy with 10,001+ employees
Real User
We were able to utilize data which was untapped previously, but the documentation on Hive could be more standardized.

What is most valuable?

The features we've found most valuable are--

  • Fast processing of data
  • Easy to manipulate using HiveQL

How has it helped my organization?

We were able to utilize data which was untapped previously. We've got great use cases now to drive business revenue.

What needs improvement?

It needs more standardized documentation on Hive.

For how long have I used the solution?

I've used it for two and a half years.

How are customer service and technical support?

Customer Service:

It's great.

Technical Support:

The level of technical support is great.

Which solution did I use previously and why did I switch?

No previous solution was used, and senior management chose to bring it in.

How was the initial setup?

I was not directly involved in deployment.

What about the implementation team?

It was done by the vendor team, who were great.

What other advice do I have?

It's good for Big Data analytics.

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Buyer's Guide
Cloudera Distribution for Hadoop
June 2025
Learn what your peers think about Cloudera Distribution for Hadoop. Get advice and tips from experienced pros sharing their opinions. Updated: June 2025.
856,873 professionals have used our research since 2012.
PeerSpot user
Software Design Engineer at a marketing services firm with 501-1,000 employees
Vendor
It automates the installation and configuration of Hadoop, but it should not provide generic logs for failed installations.

What is most valuable?

It automates the installation and configuration of Hadoop and different Big Data services.

What needs improvement?

We're currently trying to perform a failed installation and it's little bit difficult. It should restart the installation where it left off.

For how long have I used the solution?

I've used it for two years.

What was my experience with deployment of the solution?

  • In some cases, logs are clear about failed services.
  • While deploying in some failed steps it should not provide generic logs.

How are customer service and technical support?

7/10 - they have forums where they will answer your query within a day.

Which solution did I use previously and why did I switch?

We previously used Hortonworks and changed because Cloudera is simpler and more interactive.

How was the initial setup?

It was very straightforward.

What about the implementation team?

We did it in-house. They have good technical support to help with implementation.

What's my experience with pricing, setup cost, and licensing?

We use the free version, and they provide everything we need.

What other advice do I have?

Implement the free version as it provides enough services. If you want a backup service, or any extra service, then you can implement the enterprise version.

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
it_user347565 - PeerSpot reviewer
Lead Bigdata Developer at a tech services company with 10,001+ employees
Real User
We used it to build an enterprise data hub, but Apache Kudu needs improvement.

Valuable Features:

The most valuable feature for me are--

  • Sentry - provides granular-level security
  • Impala - open-source, MPP database

Improvements to My Organization:

We used it to build an enterprise data hub.

Room for Improvement:

Apache Kudu needs improvement. It's a real-time updatable database.

Implementation Team:

We used a vendor team to implement the solution.

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
it_user347535 - PeerSpot reviewer
Software Engineer at a tech services company with 501-1,000 employees
Consultant
It provides the ability to update configuration through the UI. I think licensing by size of data managed would be a useful improvement.

Valuable Features

The features most valuable to me are--

  • Installation (very easy initial setup)
  • Configuration
  • Ability to update configuration through UI

Improvements to My Organization

It made Hadoop easy to use and made it easy to get started.

Room for Improvement

The licensing was by node. I think licensing by size of data managed would be a useful improvement.

Use of Solution

I used Cloudera Manager to evaluate Hadoop and HBase for one year.

Deployment Issues

No issues encountered.

Stability Issues

No issues encountered.

Scalability Issues

No issues encountered.

Customer Service and Technical Support

Customer Service:

It's excellent.

Technical Support:

It's excellent.

Initial Setup

It was very easy.

Implementation Team

It was implemented in-house.

Other Solutions Considered

We compared it to Amazon EMR but found Cloudera Manager to be more functional.

Other Advice

It's a great product and must be evaluated if you are planning to use Hadoop..

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
it_user347172 - PeerSpot reviewer
System Engineer at a tech company with 10,001+ employees
Vendor
For the clusters using CM, we are able to more tightly control and manage the configuration of all nodes in the clusters. But, it has HBase 1.0 stability issues and processing speed needs improvement.

What is most valuable?

  • Cluster rolling restarts 
  • Cluster wide configuration management

How has it helped my organization?

For the clusters using CM, we are able to more tightly control and manage the configuration of all nodes in the clusters. 

We are currently running six production clusters totaling 900+ nodes, and are building three more clusters. Knowing that if someone has some custom configuration on a node that they haven’t communicated out, and that I can ignore that configuration and bring that node into line with where we’ve decided to run the cluster, is very beneficial.

What needs improvement?

HBase 1.0 stability issues and processing speed is a major area for improvement. Right now, our Cloudera 5 clusters run four to seven times slower than our Cloudera 4 clusters using our storm and kafka topologies, which causes real-time processing to be a major challenge.

CM’s API is very limited and difficult when used on multiple clusters in the same CM instance

For how long have I used the solution?

We've used it for approximately two years. We also use Cloudera Manager, which is 6/10.

What was my experience with deployment of the solution?

No issues encountered.

What do I think about the stability of the solution?

Cloudera 5 is currently very unstable. Between two Cloudera 5 clusters, we have an incident at least twice a week due to what are now outstanding bugs.

What do I think about the scalability of the solution?

It's very easy to deploy and scale as large as you want. Once created on the CM management cluster, is difficult to scale up as needed, as you add more clusters to the same CM instance.

Which solution did I use previously and why did I switch?

No previous solution was used.

How was the initial setup?

We were already running one production cluster with approximately 75 nodes when I joined, so I’m not familiar with what was needed to get the initial production cluster up. Once I joined, I assisted in standing up the additional nodes and clusters using our chef automation.

What about the implementation team?

In house via chef automation. Chef, or similar systems, makes it much simpler to stand up large scale clusters. That said, I have not used or evaluated vendor team implementation methods.

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
it_user2700 - PeerSpot reviewer
Architect at a marketing services firm with 501-1,000 employees
Vendor
Cloudera Manager Hadoop Cluster Installation Evaluation

I decided to give Cloudera's Manager software a try, and was pleasantly surprised at how simple it becomes to deploy a substantial Hadoop cluster.

I began by creating an automated kickstart installer for RHEL 6.2 (booting off a custom isolinux image created for this purpose), with all of the required packages, so that from server power on to creating a 20+ node cluster takes less than 15 minutes. The limitation for the number of concurrent node installs is based on network and disk i/o bottlenecks on the deployment server. If you wanted to PXE boot the cluster in a production environment, you would want a bank of servers behind a load balancer, optimally.

Once the Manager is installed on the master node, you simply log into the administration webpage, and from there, add all of the hosts to deploy the cluster on. One nice discovery was that it takes advantage of regular expressions for host names or IP addresses, so you can literally create a cluster containing hundreds of nodes with a trivial amount of effort.

Once the software is deployed, you can select the roles for each of the servers. It's an incredibly painless deployment. That being said, it is not without its flaws.

One of the primary flaws is that all of the configuration and log files are in non-standard locations, and are split in non-standard ways. It's obvious from the way that the files are arranged that it simplifies programmatic deployment. It also makes it a bit harder for a human who is used to standard Hadoop deployments to figure out where everything is located.

And finally, I discovered a bug with one of the packaged software products, Oozie. One of the resource files, oozie-bundle-0.1.xsd contains an invalid regular expression on line 22. I haven't tracked down the behavior, but for some reason JDK 1.6.30 will parse that invalid regex, but JDK 1.7U2 will exit with errors. Naturally, I was running JDK 1.7U2, so it took me a little extra time to debug the problem.

Overall, I quite liked Cloudera's Manager. It's certainly one of the better cluster deployment products I've seen.

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
it_user217290 - PeerSpot reviewer
it_user217290Senior DBA Consultant at a tech services company with 10,001+ employees
Real User

Hi

Can I have Cloudera's Manager software for free to test and deploy it on a sandBox to work on a POC purposes.

reviewer1324029 - PeerSpot reviewer
IT expert at a comms service provider with 201-500 employees
Real User
Reliable, stable, but difficult to use
Pros and Cons
  • "The solution is reliable and stable, it fits our requirements."
  • "The procedure for operations could be simplified."

What is our primary use case?

We are in the testing phase of Cloudera Distribution for Hadoop, and we will be in production soon.

What needs improvement?

The procedure for operations could be simplified.

For how long have I used the solution?

I have used Cloudera Distribution for Hadoop within the past 12 months.

What do I think about the stability of the solution?

The solution is reliable and stable, it fits our requirements.

How was the initial setup?

The implementation of Cloudera Distribution for Hadoop is not easy. It works on multiple nodes and can be complex for testing. The whole process took us one and a half days.

What about the implementation team?

We used a local system integrator for the implementation. We had approximately five people for the implementation.

We have not had to do maintenance of the solution because we are still in the testing phase.

What other advice do I have?

My advice to others is this solution can be complex.

I rate Cloudera Distribution for Hadoop a seven out of ten.

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Buyer's Guide
Download our free Cloudera Distribution for Hadoop Report and get advice and tips from experienced pros sharing their opinions.
Updated: June 2025
Product Categories
Hadoop NoSQL Databases
Buyer's Guide
Download our free Cloudera Distribution for Hadoop Report and get advice and tips from experienced pros sharing their opinions.