it_user364431 - PeerSpot reviewer
Consultant at a tech consulting company with 51-200 employees
Consultant
The Cloudera Hadoop manager eased the work of orchestrating scripts.

What is most valuable?

Very solid. Excellent user experience. Good documentation. Cloudera Manager is definitely a deal maker. Packaging for Ubuntu is great for all the components.

How has it helped my organization?

Before the introduction of Cloudera Manager (which actually works), all the orchestration was done with scripts and Chef, and inexperienced team members had difficulty participating in maintenance. The Cloudera Hadoop manager eased the work.

What needs improvement?

More customization, better documentation for the API (basically it's the same for all Cloudera Hadoop components).

For how long have I used the solution?

I've used it for two years.

Buyer's Guide
Cloudera Distribution for Hadoop
May 2024
Learn what your peers think about Cloudera Distribution for Hadoop. Get advice and tips from experienced pros sharing their opinions. Updated: May 2024.
769,630 professionals have used our research since 2012.

What was my experience with deployment of the solution?

No issues encountered.

What do I think about the stability of the solution?

No issues encountered.

What do I think about the scalability of the solution?

No issues encountered.

How are customer service and support?

Didn't use dedicated service or support. The documentation is a bit of a mess, but it is decent and sufficient.

How was the initial setup?

Straightforward. The CDH VirtualBox VM with its preconfigured environment is helpful for demonstration purposes.

What about the implementation team?

We did it in-house.

Which other solutions did I evaluate?

We also looked at Hortonworks, but chose Cloudera because of my familiarity with it.

What other advice do I have?

Do a comparison with Hortonworks, as it's always good to compare against another major vendor.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
it_user347565 - PeerSpot reviewer
Lead Bigdata Developer at a tech services company with 10,001+ employees
Real User
We used it to build an enterprise data hub, but Apache Kudu needs improvement.

Valuable Features:

The most valuable features for me are:

  • Sentry - provides granular-level security
  • Impala - open-source, MPP database

Improvements to My Organization:

We used it to build an enterprise data hub.

Room for Improvement:

Apache Kudu needs improvement. It's a real-time updatable database.

Implementation Team:

We used a vendor team to implement the solution.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
Technical Presales Engineer at a tech services company with 51-200 employees
Reseller
Top 20
Provides extensive data storage capacity and ensures better performance
Pros and Cons
  • "The solution's most valuable feature is the enterprise data platform."
  • "They should focus on upgrading their technical capabilities in the market."

What is our primary use case?

We use the solution to maintain our legacy data warehouse for better performance and more extensive storage.

What is most valuable?

The solution's most valuable feature is the enterprise data platform.

What needs improvement?

They should work on the solution's pricing. Also, finding resources with good experience in the solution is difficult. Thus, they should upgrade their technical capabilities in the market. 

They should add features like AutoML and AutoDev for enhanced machine-learning experiences. In addition, they should consider developing an integration capability similar to Informatica for an end-to-end enterprise solution.

For how long have I used the solution?

We have been using the solution for one year.

How are customer service and support?

The solution's customer support team could be better. We received their assistance only with installation and configuration.

What's my experience with pricing, setup cost, and licensing?

The solution is expensive. The license costs around 10k.

What other advice do I have?

Cloudera is a cost-effective solution if you need more storage space. In this case, I advise you to opt for it. I rate the solution as an eight out of ten.

Which deployment model are you using for this solution?

On-premises
Disclosure: My company has a business relationship with this vendor other than being a customer: Reseller
Senior Consultant & Training at a tech services company with 51-200 employees
Consultant
The valuable combination of all the tools enables me to solve the use cases I'm working on
Pros and Cons
  • "We experienced many issues when we started working with Hadoop 3.0 in the Cloudera 6.0 version, so there are a lot of things that need to improve. I believe they are working on that."
  • "We experienced many issues when we started working with Hadoop 3.0 in the Cloudera 6.0 version, so there are a lot of things that need to improve."

What is our primary use case?

I've been working on the software installation from the beginning. We have a client in global supply chain, so we get information from Telefonica's sales and distribution. Getting all that information into this system allows us to process it, get KPIs, and create outgoing information for business intelligence tools.

At the cloud provider enterprise, we collect information from the gamers, such as delays, response times, and data from the games. It allows us to see whether gamers are having trouble, high latency, or any other kind of issue. They test that and get information about the issues in order to solve them.

What is most valuable?

I like the combination of all the tools that allow me to provide solutions and enable me to solve the use cases I'm working on. You need tools or components to foresee everything, and they are all in our emails. Sometimes you try several of them, and sometimes one will work better than the other. So you have to test the tools to see what works for you. 

What needs improvement?

We experienced many issues when we started working with Hadoop 3.0 in the Cloudera 6.0 version, so there are a lot of things that need to improve. I believe they are working on that. 

For how long have I used the solution?

I've been using this solution for about a year and a half now.

How was the initial setup?

It's been quite easy to install. We only had to follow the instructions and there weren't many problems. That's important for us.

What other advice do I have?

I will rate this solution a nine out of ten because nothing is ever perfect. You will always face problems, but I'm quite happy with Cloudera. 

Disclosure: I am a real user, and this review is based on my own experience and opinions.
it_user374703 - PeerSpot reviewer
Data Consultant with 10,001+ employees
Vendor
Features like Hive, Pig, Impala, Flume and Spark are valuable to us.

Valuable Features

Cloudera Manager is the most valuable feature for its ease of use, features, and ease of upgrading and installing components. CM can also be used to set up high availability within minutes. Other features like Hive, Pig, Impala, Flume and Spark are also valuable.

Improvements to My Organization

It's improved our storage, and the availability of analytics tools such as Hive, Pig, Impala, and Spark helps us tremendously.

Room for Improvement

I'd like to see improvements to Impala. Also, it needs a more integrated environment with Spark, data warehouses, storage systems, and the cloud. Additionally, I'd want more UIs for components of the ecosystem, preferably centralized in a gateway.

Use of Solution

I've used it for 3.5 years.

Deployment Issues

For experimental and production clusters alike, use Cloudera Manager right from the beginning. RPM installation is good for learning.

Stability Issues

It has compatibility issues if installed on specialized hardware such as EMC Isilon or if node managers and data nodes are not co-located. For production, draw up a detailed plan for managing a local repo for installation and upgrades. Never install from the internet for production clusters.

Customer Service and Technical Support

Most of the clusters are for experimentation and don't require support. For production clusters, implementations go through major vendors, who handle support.

Initial Setup

It depends on the mode of installation. Cloudera Manager is always more straightforward and manageable. Avoid RPM installation as much as possible. Lay out plans with the system admin for upgrades and for commissioning and decommissioning nodes. Investigate the impact and consequences of having HBase and Hadoop in the same cluster or in separate clusters: what are the impacts on system administration, cost, upgrades, data migrations, resources, etc.?

The complexity kicks in when performing parameter configurations. Find out what the use cases are: are they disk I/O bound or compute bound, is there a lot of structured data or of unstructured data for text analytics, etc.?

Implementation Team

Both vendor team and in-house, depending on the cluster size and use cases. Some customers may require a certain number of certified personnel, which is something to think about when choosing a partner.

Other Advice

Be prepared for a fast-changing landscape in how Hadoop works under the hood and how it is used. Each major release usually involves a change of file system and data structure. How would these changes impact data migration? Ask questions such as whether to upgrade or create a new cluster. Make plans for training and skill upgrades.

Disclosure: My company has a business relationship with this vendor other than being a customer: We're a system integration partner.
it_user347592 - PeerSpot reviewer
Senior Analyst - Strategy Analytics at a consultancy with 10,001+ employees
Real User
We were able to utilize data which was untapped previously, but the documentation on Hive could be more standardized.

What is most valuable?

The features we've found most valuable are:

  • Fast processing of data
  • Easy to manipulate using HiveQL

How has it helped my organization?

We were able to utilize data which was untapped previously. We've got great use cases now to drive business revenue.

What needs improvement?

It needs more standardized documentation on Hive.

For how long have I used the solution?

I've used it for two and a half years.

How are customer service and technical support?

Customer Service:

It's great.

Technical Support:

The level of technical support is great.

Which solution did I use previously and why did I switch?

No previous solution was used, and senior management chose to bring it in.

How was the initial setup?

I was not directly involved in deployment.

What about the implementation team?

It was done by the vendor team, who were great.

What other advice do I have?

It's good for Big Data analytics.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
IT expert at a comms service provider with 201-500 employees
Real User
Reliable, stable, but difficult to use
Pros and Cons
  • "The solution is reliable and stable, it fits our requirements."
  • "The procedure for operations could be simplified."

What is our primary use case?

We are in the testing phase of Cloudera Distribution for Hadoop, and we will be in production soon.

What needs improvement?

The procedure for operations could be simplified.

For how long have I used the solution?

I have used Cloudera Distribution for Hadoop within the past 12 months.

What do I think about the stability of the solution?

The solution is reliable and stable, it fits our requirements.

How was the initial setup?

The implementation of Cloudera Distribution for Hadoop is not easy. It works on multiple nodes and can be complex for testing. The whole process took us one and a half days.

What about the implementation team?

We used a local system integrator for the implementation. We had approximately five people for the implementation.

We have not had to do maintenance of the solution because we are still in the testing phase.

What other advice do I have?

My advice to others is that this solution can be complex.

I rate Cloudera Distribution for Hadoop a seven out of ten.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
it_user356769 - PeerSpot reviewer
Director of Data Architecture at a financial services firm with 501-1,000 employees
Vendor
It has enabled us to move BI out of our OLTP database and build a data warehouse, but Spark, although under rapid development, needs improvement.

What is most valuable?

  • Cloudera Manager
  • Impala
  • Sentry

How has it helped my organization?

It has enabled us to move BI out of our OLTP database and build a data warehouse.

What needs improvement?

Some areas are under rapid development, like Spark.

For how long have I used the solution?

I've used it for three years.

What was my experience with deployment of the solution?

No issues with the current version.

What do I think about the stability of the solution?

No issues with the current version.

What do I think about the scalability of the solution?

No issues with the current version.

How are customer service and technical support?

Customer Service:

It's excellent.

Technical Support:

It's excellent.

Which solution did I use previously and why did I switch?

We switched because Cloudera just works.

How was the initial setup?

Cloudera Manager greatly simplifies initial setup.

What about the implementation team?

In-house.

What other advice do I have?

Make sure you have clearly articulated, doable use cases before you start.

Disclosure: I am a real user, and this review is based on my own experience and opinions.