2019-01-23T17:11:00Z

What is your primary use case for Cloudera Distribution for Hadoop?

Miriam Tover - PeerSpot reviewer
  • 0
  • 4
PeerSpot user
25

25 Answers

Hamid M. Hamid - PeerSpot reviewer
Real User
Top 5Leaderboard
2024-01-09T11:02:00Z
Jan 9, 2024

There are multiple use cases of Cloudera. It is a big data platform where we collect all the data and connect other sources to get data from multiple sources. Cloudera has a Data Lake.

Search for a product comparison
Miodrag-Stanic - PeerSpot reviewer
Real User
Top 10
2023-12-19T13:37:00Z
Dec 19, 2023

We share company data leaks based on cloud data on their clusters.

Thishen Govender - PeerSpot reviewer
Real User
Top 10
2023-11-27T11:32:15Z
Nov 27, 2023

We use it for machine learning.

EricLin - PeerSpot reviewer
Real User
Top 10
2023-10-26T08:23:34Z
Oct 26, 2023

I use the solution because my data is too big. It is almost 100 TB.

Miodrag Milojevic - PeerSpot reviewer
Real User
Top 5Leaderboard
2023-07-21T11:07:17Z
Jul 21, 2023

Cloudera Distribution for Hadoop is used for our data lake and big data solutions.

MA
Reseller
Top 20
2023-03-27T13:44:29Z
Mar 27, 2023

We use the solution to maintain our legacy data warehouse for better performance and more extensive storage.

Learn what your peers think about Cloudera Distribution for Hadoop. Get advice and tips from experienced pros sharing their opinions. Updated: March 2024.
765,234 professionals have used our research since 2012.
Hamid M. Hamid - PeerSpot reviewer
Real User
Top 5Leaderboard
2022-11-21T14:49:00Z
Nov 21, 2022

We used this solution as a data platform.

LS
Real User
Top 20
2022-11-04T13:34:09Z
Nov 4, 2022

We use this solution as a data lake, pre-processing the large amount of data we have for further consumption by relational databases or advanced analytics. We use HDFS and Spark for that purpose and we are using Cloudera Machine Learning, a Jupyter Notebook-like environment with model monitoring opportunities, model catalog, and things like that. We are customers of Cloudera and I'm head of big data and the analytics competency center.

Mohammed Hamad - PeerSpot reviewer
Real User
Top 5
2022-05-20T12:33:27Z
May 20, 2022

I primarily use CDH for data storage and regular dashboard reports.

KG
Real User
Top 20
2022-04-29T11:53:00Z
Apr 29, 2022

In my previous organization, we used Cloudera Distribution for Hadoop for compiling website logs and application logs. We used it for log analytics.

AK
Consultant
Top 5
2022-04-07T05:06:59Z
Apr 7, 2022

This product is a framework for edge AI, it comes with multiple ecosystems as a project. I'm a senior data architect manager and we are consultants. We offer Cloudera to our customers but we don't have a partnership with them.

ND
Real User
2022-01-05T07:24:49Z
Jan 5, 2022

We are in the testing phase of Cloudera Distribution for Hadoop, and we will be in production soon.

Suresh_Srinivasan - PeerSpot reviewer
Real User
2021-12-28T11:34:00Z
Dec 28, 2021

We use Cloudera Distribution for file storage. This solution is deployed on-premise.

KG
Real User
2021-03-09T16:58:10Z
Mar 9, 2021

We use this solution to process data. When using an SQL Server you have to build indexes and you need to fine-tune the data. We import the data that is in the SQL Source. With a single script, we are able to run the jobs within minutes, which is an advantage. We are using the Power BI model for the business convention. The performance in Power BI will be reduced if you incorporate more calculations. Those calculations are captured in the Hadoop layer and processed.

GW
Real User
2021-01-07T00:06:48Z
Jan 7, 2021

We use the solution for the data warehousing.

RS
Real User
2020-09-13T07:02:21Z
Sep 13, 2020

We are using this solution for storing Big Data in one centralized location.

EricLin - PeerSpot reviewer
Real User
Top 10
2020-04-01T07:25:54Z
Apr 1, 2020

We are a solution provider and this is one of the systems that we implement for our clients. Our clients for this product are in the financial industry and they use it to perform cost analysis tasks.

MG
Real User
2020-03-25T15:24:00Z
Mar 25, 2020

Our primary use case for this solution is to host a big amount of data in our platform, processing, analysis and all of this stuff on the platform.

NK
Real User
2020-03-09T08:07:55Z
Mar 9, 2020

We are dealing with data from the telecom industry. We were using an Oracle system but our volume has increased. We now have a lot of real-time data that needs to be transformed so that it can be made available and used.

it_user900987 - PeerSpot reviewer
Real User
2019-07-16T05:40:00Z
Jul 16, 2019

We primarily use it only for big data support for analytical applications.

AD
Consultant
2019-07-16T05:40:00Z
Jul 16, 2019

I've been working on the software installation from the beginning, and we have a client for global supply change, so we get information from Telefonica's sales and distributions. Getting all that information into this system allows us to process it, get KPIs, and create outgoing information for business intelligence tools. In the cloud provider enterprise we get all the information from the gamers, like delays, response, and information from the games. It allows us to see if gamers are having trouble, high latency or any other kind of issue. They test that and get information about the issues in order to solve them.

DS
Real User
2019-07-16T05:40:00Z
Jul 16, 2019

I'm part of the IT team at my company, and our primary use case of this solution is building infrastructure for advanced analytics, where we copy data from our data warehouse that is now our relational database. We copy it to the Cloudera Distribution for Hadoop and then analyze it with Python and machine learning.

MI
Real User
2019-07-14T10:21:00Z
Jul 14, 2019

We primarily use the solution for external storage.

Thishen Govender - PeerSpot reviewer
Real User
Top 10
2019-07-14T10:21:00Z
Jul 14, 2019

We make recommendations to clients for using different models of this solution to handle data intelligently.

SC
Consultant
2019-01-23T17:11:00Z
Jan 23, 2019

Our core product is an insurance product and the actuarial module is quite complex. SMEs so far collect data from various sources into Excel sheets and through macros do the analytics which is a very crude form of doing the analysis. So we thought to use big data for such analysis.

Cloudera Distribution for Hadoop is the world's most complete, tested, and popular distribution of Apache Hadoop and related projects. CDH is 100% Apache-licensed open source and is the only Hadoop solution to offer unified batch processing, interactive SQL, and interactive search, and role-based access controls. More enterprises have downloaded CDH than all other such distributions combined.
Download Cloudera Distribution for Hadoop ReportRead more