
Apache Hadoop pros and cons

Vendor: Apache

4.0 out of 5

41 reviews
89% willing to recommend

Pros & Cons summary

Apache Hadoop offers scalability and elasticity, enabling seamless expansion for projects. Its capacity to ingest large volumes of unstructured and semi-structured data supports quick analytics integration. It is cost-effective because it is open source, and it handles vast data volumes efficiently. Despite these advantages, Apache Hadoop faces challenges such as failing queries when memory is insufficient, a steep learning curve, complex integration, and weak security measures. Its upgrade path and querying processes also need improvement.


Prominent pros & cons

PROS

Apache Hadoop effectively centralizes data management and processing, significantly reducing maintenance and development time.

Its scalability and elastic nature allow for on-demand expansion and contraction, making it ideal for Proof of Concept projects.

The system handles vast data volumes with ease, providing powerful ingestion tools and seamless integration with a range of technologies.

Apache Hadoop's open-source nature offers cost-effectiveness, allowing users to avoid reliance on third-party vendors.

The platform's resilience and fault tolerance ensure uninterrupted operation, even in cases of hardware failure.

CONS

It cannot handle queries when memory is insufficient, although this can be worked around by processing data in chunks.

Real-time data processing is weak, which contributes to its difficulty in implementation and operation.

Apache Hadoop lacks robust technical support and extensive community resources for implementing new features or addressing technical issues.

The integration of Apache Hadoop with different business processes poses a challenge due to the need for significant technical expertise and configuration efforts.

Apache Hadoop's security features require enhancement to effectively manage large volumes of data.

Apache Hadoop Pros review quotes

NR

Financial Advisor at a financial services firm with 10,001+ employees

Mar 27, 2025

Hadoop is a distributed file system, and it scales reasonably well provided you give it sufficient resources.


CM

Database Administrator at Lacoste

Jul 1, 2024

I recommend it for the telecom sector. I know it well, and it's a good fit.


Software Development Consultant at Synechron

Jun 25, 2024

It is a reliable product.


Free Report: Apache Hadoop Reviews and More

Learn what your peers think about Apache Hadoop. Get advice and tips from experienced pros sharing their opinions. Updated: June 2026.

902,588 professionals have used our research since 2012.

Kenechukwu Murphy Ezeoka

IT Support Specialist at Convergys Corporation

Sep 4, 2024

Its flexibility in handling and storing large volumes of data is particularly beneficial, as is its resilience, which ensures data redundancy and fault tolerance.


Head of Data at an energy/utilities company with 51-200 employees

Jul 9, 2024

The platform's quick data processing capabilities have been instrumental in supporting our AI-driven projects.


Akhilesh Chipre

Senior Associate Consultant at Applied Materials

Apr 11, 2024

It's primarily open source. You can handle huge data volumes and create your own views, workflows, and tables. I can also use it for real-time data streaming.


Syed Afroz Pasha

Head Of Data Governance at Alibaba Group

Feb 27, 2024

Hadoop File System is compatible with almost all the query engines.


Miodrag Milojevic

Senior Data Architect at Yettel

Aug 1, 2023

It is a file system for data collection. There are nodes in this cluster that contain all the information, directories, and other files. The nodes are based on the MySQL database.


Juliet Hoimonthi

Manager at Robi Axiata Limited

Jul 5, 2022

What I like about Apache Hadoop is that it's for big data, in particular big data analysis, and it's the easier solution. I like the data processing feature for AI/ML use cases the most because some solutions allow me to collect data from relational databases, while Hadoop provides me with more options for newer technologies.


reviewer2324613

Data Architect at a computer software company with 51-200 employees

Dec 29, 2023

It's open-source, so it's very cost-effective.


Apache Hadoop Cons review quotes

NR

Financial Advisor at a financial services firm with 10,001+ employees

Mar 27, 2025

The problem with Apache Hadoop arose when the guys that originally set it up left the firm, and the group that later owned it didn't have enough technical resources to properly maintain it.


CM

Database Administrator at Lacoste

Jul 1, 2024

Oracle BI is difficult to integrate.


Software Development Consultant at Synechron

Jun 25, 2024

There are certain shortcomings when it comes to the product's technical support part, making it an area where improvements are required.


Kenechukwu Murphy Ezeoka

IT Support Specialist at Convergys Corporation

Sep 4, 2024

Improvements in security measures would be beneficial, given the large volumes of data handled.


Head of Data at an energy/utilities company with 51-200 employees

Jul 9, 2024

The availability of comprehensive training materials could be improved for faster onboarding and skill development among team members.


Akhilesh Chipre

Senior Associate Consultant at Applied Materials

Apr 11, 2024

Since it is an open-source product, there won't be much support.


Syed Afroz Pasha

Head Of Data Governance at Alibaba Group

Feb 27, 2024

In certain cases, the configurations for dealing with data skewness do not make any sense.


Miodrag Milojevic

Senior Data Architect at Yettel

Aug 1, 2023

The stability of the solution needs improvement.


Juliet Hoimonthi

Manager at Robi Axiata Limited

Jul 5, 2022

What could be improved in Apache Hadoop is its user-friendliness. It's not that user-friendly, but maybe it's because I'm new to it. Sometimes it feels so tough to use, but it could be because of two aspects: one is my incompetency, for example, I don't know about all the features of Apache Hadoop, or maybe it's because of the limitations of the platform. For example, my team is maintaining the business glossary in Apache Atlas, but if you want to change any settings at the GUI level, an advanced level of coding or programming needs to be done in the back end, so it's not user-friendly.


reviewer2324613

Data Architect at a computer software company with 51-200 employees

Dec 29, 2023

The main thing is the lack of community support. If you want to implement a new API or create a new file system, you won't find easy support.


Product Categories


Popular Comparisons


Dell PowerStore vs Apache Hadoop

Teradata vs Apache Hadoop

Snowflake vs Apache Hadoop

VMware Tanzu Data Solutions vs Apache Hadoop

Oracle Exadata vs Apache Hadoop

OpenText Analytics Database (Vertica) vs Apache Hadoop

Amazon Redshift vs Apache Hadoop

IBM Netezza Performance Server vs Apache Hadoop

SAP IQ vs Apache Hadoop

Oracle Database Appliance vs Apache Hadoop

Actian Ingres vs Apache Hadoop

SAP BW4HANA vs Apache Hadoop

IBM Db2 Warehouse vs Apache Hadoop

Microsoft Parallel Data Warehouse vs Apache Hadoop

Infobright DB vs Apache Hadoop


Related questions

118

Which data catalog can provide support for BI data sources such as SAP BO and Tableau?

131

Which is the best RDBMS solution for big data?

90

Apache Spark without Hadoop -- Is this recommended?

111

What is the biggest difference between Apache Hadoop and Snowflake?

87

Which solution is better for setting up a data lake: Apache Hadoop or Oracle Exadata?

108

Oracle Exadata vs. HPE Vertica vs. EMC GreenPlum vs. IBM Netezza

87

When evaluating Data Warehouse solutions, what aspect do you think is the most important to look for?

72

At what point does a business typically invest in building a data warehouse?

86

Is a data warehouse the best option to consolidate data into one location?

310

What are the main differences between Data Lake and Data Warehouse?