Try our new research platform with insights from 80,000+ expert users
Apache Hadoop Logo

Apache Hadoop pros and cons

Vendor: Apache
4.0 out of 5

Pros & Cons summary

Buyer's Guide

Get pricing advice, tips, use cases and valuable features from real users of this product.
Get the report

Prominent pros & cons

PROS

Apache Hadoop effectively centralizes data management and processing, significantly reducing maintenance and development time.
Its scalability and elastic nature allow for on-demand expansion and contraction, making it ideal for Proof of Concept projects.
The system handles vast data volumes with ease, providing powerful ingestion tools and seamless integration with a range of technologies.
Apache Hadoop's open-source nature offers cost-effectiveness, allowing users to avoid reliance on third-party vendors.
The platform's resilience and fault tolerance ensure uninterrupted operation, even in cases of hardware failure.

CONS

Its inability to handle queries with insufficient memory can be bypassed by processing data in chunks.
Real-time data processing is weak, which contributes to its difficulty in implementation and operation.
Apache Hadoop lacks robust technical support and extensive community resources for implementing new features or addressing technical issues.
The integration of Apache Hadoop with different business processes poses a challenge due to the need for significant technical expertise and configuration efforts.
Apache Hadoop's security features require enhancement to effectively manage large volumes of data.
 

Apache Hadoop Pros review quotes

RC
Senior Associate at a financial services firm with 10,001+ employees
Sep 4, 2017
As compared to Hive on MapReduce, Impala on MPP returns results of SQL queries in a fairly short amount of time, and is relatively fast when reading data into other platforms like R.
Mar 6, 2018
Initially, with RDBMS alone, we had a lot of work and few servers running on-premise and on cloud for the PoC and incubation. With the use of Hadoop and ecosystem components and tools, and managing it in Amazon EC2, we have created a Big Data "lab" which helps us to centralize all our work and solutions into a single repository. This has cut down the time in terms of maintenance, development and, especially, data processing challenges.
CB
Database/Middleware Consultant (Currently at U.S. Department of Labor) at a tech services company with 51-200 employees
Mar 13, 2018
​​Data ingestion: It has rapid speed, if Apache Accumulo is used.
Learn what your peers think about Apache Hadoop. Get advice and tips from experienced pros sharing their opinions. Updated: December 2025.
879,422 professionals have used our research since 2012.
SF
Analytics Platform Manager at a consultancy with 10,001+ employees
Aug 14, 2018
Two valuable features are its scalability and parallel processing. There are jobs that cannot be done unless you have massively parallel processing.
Jul 16, 2019
The most valuable features are the ability to process the machine data at a high speed, and to add structure to our data so that we can generate relevant analytics.
reviewer1040328 - PeerSpot reviewer
IT Expert at a tech services company with 1,001-5,000 employees
Jul 28, 2019
The best thing about this solution is that it is very powerful and very cheap.
Lucas Dreyer - PeerSpot reviewer
Data Engineer at BBD
Sep 29, 2019
What comes with the standard setup is what we mostly use, but Ambari is the most important.
YM
CEO at AM-BITS LLC
Nov 27, 2019
The ability to add multiple nodes without any restriction is the solution's most valuable aspect.
it_user1208307 - PeerSpot reviewer
Practice Lead (BI/ Data Science) at a tech services company with 11-50 employees
Dec 16, 2019
It's good for storing historical data and handling analytics on a huge amount of data.
it_user1093134 - PeerSpot reviewer
Technical Architect at RBSG Internet Operations
Dec 16, 2019
The most valuable feature is the database.
 

Apache Hadoop Cons review quotes

RC
Senior Associate at a financial services firm with 10,001+ employees
Sep 4, 2017
The key shortcoming is its inability to handle queries when there is insufficient memory. This limitation can be bypassed by processing the data in chunks.
Mar 6, 2018
Based on our needs, we would like to see a tool for data visualization and enhanced Ambari for management, plus a pre-built IoT hub/model. These would reduce our efforts and the time needed to prove to a customer that this will help them.
CB
Database/Middleware Consultant (Currently at U.S. Department of Labor) at a tech services company with 51-200 employees
Mar 13, 2018
It needs better user interface (UI) functionalities.
Learn what your peers think about Apache Hadoop. Get advice and tips from experienced pros sharing their opinions. Updated: December 2025.
879,422 professionals have used our research since 2012.
SF
Analytics Platform Manager at a consultancy with 10,001+ employees
Aug 14, 2018
I would like to see more direct integration of visualization applications.
Jul 16, 2019
We would like to have more dynamics in merging this machine data with other internal data to make more meaning out of it.
reviewer1040328 - PeerSpot reviewer
IT Expert at a tech services company with 1,001-5,000 employees
Jul 28, 2019
The upgrade path should be improved because it is not as easy as it should be.
Lucas Dreyer - PeerSpot reviewer
Data Engineer at BBD
Sep 29, 2019
In the next release, I would like to see Hive more responsive for smaller queries and to reduce the latency.
YM
CEO at AM-BITS LLC
Nov 27, 2019
There is a lack of virtualization and presentation layers, so you can't take it and implement it like a radio solution.
it_user1208307 - PeerSpot reviewer
Practice Lead (BI/ Data Science) at a tech services company with 11-50 employees
Dec 16, 2019
The solution could use a better user interface. It needs a more effective GUI in order to create a better user environment.
it_user1093134 - PeerSpot reviewer
Technical Architect at RBSG Internet Operations
Dec 16, 2019
It would be good to have more advanced analytics tools.