There are several use cases for Hadoop. Sometimes it's used for data warehousing. Other times, it's analytics. And In some cases, it's used to do transformation. For example, I have one client using it to decompress, compress, or encrypt data on ingestion. So, he used it like an ETL engine.
Apache Software and Solutions
Apache JMeter
26 reviews
10 discussions
Performance Testing Tools
Load Testing Tools
API Testing Tools
Apache Kafka
20 reviews
10 discussions
Message Queue (MQ) Software
Apache Spark
11 reviews
6 discussions
Hadoop
Compute Service
Java Frameworks
Apache Flink
9 reviews
9 discussions
Streaming Analytics
Cassandra
9 reviews
6 discussions
NoSQL Databases
Apache Airflow
6 reviews
15 discussions
Business Process Management (BPM)
Apache Hadoop
6 reviews
10 discussions
Data Warehouse
Tomcat
6 reviews
5 discussions
Application Server
Apache Spark Streaming
3 reviews
9 discussions
Streaming Analytics
Spark SQL
3 reviews
5 discussions
Hadoop
Apache NiFi
2 reviews
5 discussions
Compute Service
Solr
2 reviews
Search as a Service
ActiveMQ
1 review
8 discussions
Message Queue (MQ) Software
Apache Web Server
1 review
Application Infrastructure
CloudStack
Cloud Management
Lucene
Indexing and Search
Accumulo
NoSQL Databases
CouchDB
NoSQL Databases
Apache HBase
NoSQL Databases
Apache Subversion
Version Control
Apache Storm
Compute Service
Archiva
Repository Managers
Apache Derby
Relational Databases
MXNet
AI Development Platforms
Apache Syncope
Identity Management (IM)
Apache Reviews
Partner at a tech services company with 11-50 employees
Highly elastic and stable, but it needs better security
What is our primary use case?
What is most valuable?
Hadoop is extensible — it's elastic.
What needs improvement?
Hadoop's security could be better.
For how long have I used the solution?
I've been using Hadoop for about eight years. I'm not sure exactly.
What do I think about the stability of the solution?
Performance is one of the reasons people choose Hadoop.
What do I think about the scalability of the solution?
Scalability is one of Hadoop's strong suits.
How are customer service and support?
I've never had to use Hadoop support.
How was the initial setup?
The complexity of Hadoop's setup depends on the customer and their needs. However, most of my customers wind up using Hadoop as a service, which makes it very easy. It doesn't need much maintenance. My staff maintains multiple systems, so it's not like there would ever be somebody dedicated to one, and Hadoop is not a high-touch platform.
What other advice do I have?
I rate Hadoop seven out of 10. It's very good, but it could always be better. To anyone considering Hadoop, I recommend that you be mindful of what you're trying to achieve.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Amazon Web Services (AWS)
Disclosure: My company has a business relationship with this vendor other than being a customer: Implementer
Last updated: Oct 10, 2021
Flag as inappropriatee-Business Department Professor at MANU MEDITEC
Not dependent on third-party vendors
Pros and Cons
- "We selected Apache Hadoop because it is not dependent on third-party vendors."
- "Real-time data processing is weak. This solution is very difficult to run and implement."
What needs improvement?
Apache Hadoop's real-time data processing is weak and is not enough to satisfy our customers, so we may have to pick other products. We are continuously researching other solutions and other vendors.
Another weak point of this solution, technically speaking, is that it's very difficult to run and difficult to smoothly implement. Preparation and integration are important.
The integration of this solution with other data-related products and solutions, and having other functions, e.g. API connectivity, are what I want to see in the next release.
For how long have I used the solution?
We've started using Apache Hadoop since 2011.
Which solution did I use previously and why did I switch?
We selected Apache Hadoop because it is not dependent on third-party vendors. Previously, our main business unit was related to big vendors like IBM, Oracle, and EMC, etc. We wanted to have a competitive advantage in technology, so we selected the Apache project and used Apache open source.
What about the implementation team?
The solution was implemented through a local vendor team here in Korea.
Which other solutions did I evaluate?
We evaluated IBM, Oracle, and EMC solutions.
What other advice do I have?
My position in the company falls under the research and development of new technologies and solutions. I investigate, research, download, and read information and reports as part of my job.
Our company has a big data business division, and we propose, develop, and implement things which are related to big data projects. We are using Cloud Hadoop open source versions, distributed versions, and commercial Hadoop distributed versions. We propose all these versions to our customers from any industry.
Our focus is on the public sector. Big data is our strong point in Korea. Our company is the leader in big data technology, including infrastructure and visualization. This is a solution we provide to our customers. We are also in partnership with IBM. Our main focus is on Apache Hadoop.
We provide Apache Hadoop to our customers. I work for a systems integrator and technical consulting company.
Overall, our satisfaction with this solution is so-so. We continuously investigate new technologies and other solutions.
The Hadoop open source version was implemented in 95% of our company's customer base. Our remaining customers had the local vendor's Hadoop platform package implemented for them.
Our company is in the big data business. Before the big data business back in 1976, we implemented BI (business intelligence), DW (data warehouse), EIS, and DSS (decision support system), so we are in partnership with IBM.
I don't have advice for people looking into implementing this solution because I'm not in the business unit. I'm in the research field. My role is to plan new technology and provide consultation to our customers for big data projects in the early stages.
My rating for Apache Hadoop from a technical standpoint is eight out of ten.
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Last updated: Jan 26, 2022
Flag as inappropriateApache Projects
Check out these projects from our community members.

Managed Project Online Mark System
University Student Management System it cover may entire student life cycle management from registration to student… more »

UK-POST OFFICE
As part of its long-term strategy, the Post Office outlined plans to modernise its business to become a modern, digital… more »

Oil n Gas Entitlement Model
Heading the Technical Services team from Telesis, delivered $4+ million Oracle EBS program for a Oil conglomerate –… more »

Benchmarked solution for 7 million users
Tested and benchmarked a server solution to cater to 5000 simultaneous users. We built a messaging server for OTT mobile… more »

Realtime webservices over BigData
Using Ab Initio continuous flow and web services capabilities, I was able to build re-usable application frameworks… more »

Jenkins Pipeline Project for Auto Recycling AWS EMR Spark Cluster
In this project I created a Jenkins Pipeline that Auto Recycles our Production AWS EMR Spark Cluster once every week… more »

Security-Focused and Cost-Effective Google Cloud Infrastructure
Inventify AG is a Swiss software startup, focusing on the development of Software-as-a-Service (SaaS) and cloud… more »

Set up Internal IP Network Infrastructure NMS
Planned and Implemented two monitoring systems. One for monitoring internal IP Core, Distribution and Access Devices… more »
Apache Questions

Julia Frohwein
Content and Social Media Manager
PeerSpot (formerly IT Central Station)
May 19 2022
If you were talking to someone whose organization is considering Apache JMeter, what would you say?
How would you rate it and why? Any other tips or advice?

Miriam Tover
Senior Delivery Ops Manager
PeerSpot (formerly IT Central Station)
May 19 2022
How do you or your organization use this solution?
Please share with us so that your peers can learn from your experiences.
Thank you!

Julia Frohwein
Content and Social Media Manager
PeerSpot (formerly IT Central Station)
May 19 2022
Please share with the community what you think needs improvement with Apache JMeter.
What are its weaknesses? What would you like to see changed in a future version?

Miriam Tover
Senior Delivery Ops Manager
PeerSpot (formerly IT Central Station)
May 19 2022
Hi,
We all know it's really hard to get good pricing and cost information.
Please share what you can so you can help your peers.

Miriam Tover
Senior Delivery Ops Manager
PeerSpot (formerly IT Central Station)
May 19 2022
Hi Everyone,
What do you like most about Apache JMeter?
Thanks for sharing your thoughts with the community!

Miriam Tover
Senior Delivery Ops Manager
PeerSpot (formerly IT Central Station)
May 18 2022
If you were talking to someone whose organization is considering Cassandra, what would you say?
How would you rate it and why? Any other tips or advice?

Miriam Tover
Senior Delivery Ops Manager
PeerSpot (formerly IT Central Station)
May 18 2022
How do you or your organization use this solution?
Please share with us so that your peers can learn from your experiences.
Thank you!

Julia Frohwein
Content and Social Media Manager
PeerSpot (formerly IT Central Station)
May 18 2022
Please share with the community what you think needs improvement with Cassandra.
What are its weaknesses? What would you like to see changed in a future version?

Miriam Tover
Senior Delivery Ops Manager
PeerSpot (formerly IT Central Station)
May 18 2022
Hi,
We all know it's really hard to get good pricing and cost information.
Please share what you can so you can help your peers.

Miriam Tover
Senior Delivery Ops Manager
PeerSpot (formerly IT Central Station)
May 18 2022
Hi Everyone,
What do you like most about Cassandra?
Thanks for sharing your thoughts with the community!

Julia Frohwein
Content and Social Media Manager
PeerSpot (formerly IT Central Station)
May 17 2022
If you were talking to someone whose organization is considering Apache Kafka, what would you say?
How would you rate it and why? Any other tips or advice?

Miriam Tover
Senior Delivery Ops Manager
PeerSpot (formerly IT Central Station)
May 17 2022
How do you or your organization use this solution?
Please share with us so that your peers can learn from your experiences.
Thank you!

Miriam Tover
Senior Delivery Ops Manager
PeerSpot (formerly IT Central Station)
May 17 2022
Hi,
We all know it's really hard to get good pricing and cost information.
Please share what you can so you can help your peers.

Julia Frohwein
Content and Social Media Manager
PeerSpot (formerly IT Central Station)
May 17 2022
Hi Everyone,
What do you like most about Apache Kafka?
Thanks for sharing your thoughts with the community!

Miriam Tover
Senior Delivery Ops Manager
PeerSpot (formerly IT Central Station)
Apr 27 2022
If you were talking to someone whose organization is considering Apache Spark, what would you say?
How would you rate it and why? Any other tips or advice?

Miriam Tover
Senior Delivery Ops Manager
PeerSpot (formerly IT Central Station)
Apr 27 2022
How do you or your organization use this solution?
Please share with us so that your peers can learn from your experiences.
Thank you!

Gopi KrishnanApache Spark can be used in multiple use case in big data and in data… more »

Miriam Tover
Senior Delivery Ops Manager
PeerSpot (formerly IT Central Station)
Apr 27 2022
Please share with the community what you think needs improvement with Apache Spark.
What are its weaknesses? What would you like to see changed in a future version?

Gopi KrishnanThere is still enough space of improvement on Apache Spark in term of… more »

Miriam Tover
Senior Delivery Ops Manager
PeerSpot (formerly IT Central Station)
Apr 27 2022
Hi Everyone,
What do you like most about Apache Spark?
Thanks for sharing your thoughts with the community!

Miriam Tover
Senior Delivery Ops Manager
PeerSpot (formerly IT Central Station)
Apr 27 2022
Hi,
We all know it's really hard to get good pricing and cost information.
Please share what you can so you can help your peers.

Miriam Tover
Senior Delivery Ops Manager
PeerSpot (formerly IT Central Station)
Apr 27 2022
If you were talking to someone whose organization is considering Apache Hadoop, what would you say?
How would you rate it and why? Any other tips or advice?

Julia Frohwein
Content and Social Media Manager
PeerSpot (formerly IT Central Station)
Apr 27 2022
Please share with the community what you think needs improvement with Apache Hadoop.
What are its weaknesses? What would you like to see changed in a future version?

Miriam Tover
Senior Delivery Ops Manager
PeerSpot (formerly IT Central Station)
Apr 27 2022
Hi,
We all know it's really hard to get good pricing and cost information.
Please share what you can so you can help your peers.

Julia Frohwein
Content and Social Media Manager
PeerSpot (formerly IT Central Station)
Apr 27 2022
Hi Everyone,
What do you like most about Apache Hadoop?
Thanks for sharing your thoughts with the community!

Julia Frohwein
Content and Social Media Manager
PeerSpot (formerly IT Central Station)
Apr 27 2022
Please share with the community what you think needs improvement with Apache Kafka.
What are its weaknesses? What would you like to see changed in a future version?

Tomasz Rabong
Client Engagement Leader at Sanmargar Team
Apr 20 2022
Hello peers,
I am looking for a data catalog vendor or open-source with the following DB data sources:
Teradata
MS SQL
HANA
Hadoop/Hive
and BI data sources:
SAP BO 4.0
Tableau Server 2022.1
Can you please advise?
I appreciate the help. Read More »

Tomasz RabongDear Community,
Many thanks for yor support and help!

Leandro SodréHi Tomasz Rabong,
I believe that if you have a developer team in Amundsen it… more »

Julia Frohwein
Content and Social Media Manager
PeerSpot (formerly IT Central Station)
Apr 11 2022
Hi,
We all know it's really hard to get good pricing and cost information.
Please share what you can so you can help your peers.

Julia Frohwein
Content and Social Media Manager
PeerSpot (formerly IT Central Station)
Apr 11 2022
If you were talking to someone whose organization is considering Apache Spark Streaming, what would you say?
How would you rate it and why? Any other tips or advice?

Julia Frohwein
Content and Social Media Manager
PeerSpot (formerly IT Central Station)
Apr 11 2022
How do you or your organization use this solution?
Please share with us so that your peers can learn from your experiences.
Thank you!

Miriam Tover
Senior Delivery Ops Manager
PeerSpot (formerly IT Central Station)
Apr 11 2022
Please share with the community what you think needs improvement with Apache Spark Streaming.
What are its weaknesses? What would you like to see changed in a future version?

Miriam Tover
Senior Delivery Ops Manager
PeerSpot (formerly IT Central Station)
Apr 11 2022
Hi Everyone,
What do you like most about Apache Spark Streaming?
Thanks for sharing your thoughts with the community!

Miriam Tover
Senior Delivery Ops Manager
PeerSpot (formerly IT Central Station)
Mar 30 2022
Hi,
We all know it's really hard to get good pricing and cost information.
Please share what you can so you can help your peers.

Miriam Tover
Senior Delivery Ops Manager
PeerSpot (formerly IT Central Station)
Mar 30 2022
If you were talking to someone whose organization is considering ActiveMQ, what would you say?
How would you rate it and why? Any other tips or advice?

Miriam Tover
Senior Delivery Ops Manager
PeerSpot (formerly IT Central Station)
Mar 30 2022
How do you or your organization use this solution?
Please share with us so that your peers can learn from your experiences.
Thank you!

Miriam Tover
Senior Delivery Ops Manager
PeerSpot (formerly IT Central Station)
Mar 30 2022
Please share with the community what you think needs improvement with ActiveMQ.
What are its weaknesses? What would you like to see changed in a future version?

Julia Frohwein
Content and Social Media Manager
PeerSpot (formerly IT Central Station)
Mar 30 2022
Hi Everyone,
What do you like most about ActiveMQ?
Thanks for sharing your thoughts with the community!

Julia Frohwein
Content and Social Media Manager
PeerSpot (formerly IT Central Station)
If you were talking to someone whose organization is considering Spark SQL, what would you say?
How would you rate it and why? Any other tips or advice?

Miriam Tover
Senior Delivery Ops Manager
PeerSpot (formerly IT Central Station)
How do you or your organization use this solution?
Please share with us so that your peers can learn from your experiences.
Thank you!

Miriam Tover
Senior Delivery Ops Manager
PeerSpot (formerly IT Central Station)
Please share with the community what you think needs improvement with Spark SQL.
What are its weaknesses? What would you like to see changed in a future version?

Julia Frohwein
Content and Social Media Manager
PeerSpot (formerly IT Central Station)
Hi,
We all know it's really hard to get good pricing and cost information.
Please share what you can so you can help your peers.

Julia Frohwein
Content and Social Media Manager
PeerSpot (formerly IT Central Station)
Hi Everyone,
What do you like most about Spark SQL?
Thanks for sharing your thoughts with the community!

Netanya Carmi
Content Manager
PeerSpot (formerly IT Central Station)
Which is better and why?

Stephen W. BoydWhich is better; there be dragons. They each have strengths and weaknesses, but… more »

Arif AhmedPostman is for API verification. It can be used for inspections of API as well… more »

reviewer1650858Postman lets you easily define variables, which then get updated automatically… more »

Netanya Carmi
Content Manager
PeerSpot (formerly IT Central Station)

Netanya Carmi
Content Manager
PeerSpot (formerly IT Central Station)

Netanya Carmi
Content Manager
PeerSpot (formerly IT Central Station)

Julia Frohwein
Content and Social Media Manager
PeerSpot (formerly IT Central Station)
Hi Everyone,
What do you like most about Apache Airflow?
Thanks for sharing your thoughts with the community!

Julia Frohwein
Content and Social Media Manager
PeerSpot (formerly IT Central Station)
How do you or your organization use this solution?
Please share with us so that your peers can learn from your experiences.
Thank you!

Miriam Tover
Senior Delivery Ops Manager
PeerSpot (formerly IT Central Station)
Hello peers,
How do you or your organization use this solution?
Please share with us so that your peers can learn from your experiences.
Thank you!

NitinKumarThe primary use case is the orchestration and automation of ELT/ETL data… more »

Menachem D Pritzker
Director of Growth
PeerSpot (formerly IT Central Station)
Is one more resilient than the other?
How supportive are the communities?
Which use cases are better for each?

Jorge OlmedoHi, everyone. In my humble opinion, both Cassandra and MongoDB are great data… more »

RajneeshShukla
Solution Architect at a tech vendor with 10,001+ employees
Hi,
I am working as a solution architect for a tech vendor with 10,000+ employees.
I need to perform comparative analysis for ETL tools available to populate data from OLTP to OLAP.
The main criteria I am considering are:
Price
Functionality (transformation, error logging and handling, lo... Read More »

Stefan SchäferWe usually use Talend.
Look here:… more »

Karoly KrokovayWe have experiences only in Pentaho Data Integrator (open source competitor of… more »

Julia Frohwein
Content and Social Media Manager
PeerSpot (formerly IT Central Station)
Hi,
We all know it's really hard to get good pricing and cost information.
Please share what you can so you can help your peers.

Padmanesh NC
Big Data Solution Architect - Spatial Data Specialist at Sciera, Inc.
Hi community,
I'm aware that we can use Apache Spark with/without Hadoop.
But I am sure that the majority of people are using Apache Spark with Hadoop, and I read one article that states how using Apache Spark without Hadoop is not good for deployment, and can be usable for the development en... Read More »

NitinKumarI don't think using Apache Spark without Hadoop has any major drawbacks or… more »

Padmanesh NC
Big Data Solution Architect - Spatial Data Specialist at Sciera, Inc.
Hi community,
I'm aware that we can use Apache Spark with/without Hadoop.
But I am sure that the majority of people are using Apache Spark with Hadoop, and I read one article that states how using Apache Spark without Hadoop is not good for deployment, and can be usable for the development en... Read More »

NitinKumarI don't think using Apache Spark without Hadoop has any major drawbacks or… more »

Julia Frohwein
Content and Social Media Manager
PeerSpot (formerly IT Central Station)
Hi Everyone,
What do you like most about Apache Airflow?
Thanks for sharing your thoughts!

Miriam Tover
Senior Delivery Ops Manager
PeerSpot (formerly IT Central Station)
If you were talking to someone whose organization is considering Apache Flink, what would you say?
How would you rate it and why? Any other tips or advice?

JAMAL AL MAHAMIDToday, Flink is the fastest Streaming solution. It is the core of an Azure and… more »

Miriam Tover
Senior Delivery Ops Manager
PeerSpot (formerly IT Central Station)
How do you or your organization use this solution?
Please share with us so that your peers can learn from your experiences.
Thank you!

Miriam Tover
Senior Delivery Ops Manager
PeerSpot (formerly IT Central Station)
Please share with the community what you think needs improvement with Apache Flink.
What are its weaknesses? What would you like to see changed in a future version?

Julia Frohwein
Content and Social Media Manager
PeerSpot (formerly IT Central Station)
Hi,
We all know it's really hard to get good pricing and cost information.
Please share what you can so you can help your peers.

Miriam Tover
Senior Delivery Ops Manager
PeerSpot (formerly IT Central Station)
Hi Everyone,
What do you like most about Apache Flink?
Thanks for sharing your thoughts with the community!

Julia Frohwein
Content and Social Media Manager
PeerSpot (formerly IT Central Station)
If you were talking to someone whose organization is considering Tomcat, what would you say?
How would you rate it and why? Any other tips or advice?

Miriam Tover
Senior Delivery Ops Manager
PeerSpot (formerly IT Central Station)
Hi,
We all know it's really hard to get good pricing and cost information.
Please share what you can so you can help your peers.

Miriam Tover
Senior Delivery Ops Manager
PeerSpot (formerly IT Central Station)
How do you or your organization use this solution?
Please share with us so that your peers can learn from your experiences.
Thank you!

Julia Frohwein
Content and Social Media Manager
PeerSpot (formerly IT Central Station)
Please share with the community what you think needs improvement with Tomcat.
What are its weaknesses? What would you like to see changed in a future version?
Ts Nurul HaszeliTo have an admin console that is more user friendly and simplified like XAMPP… more »

Miriam Tover
Senior Delivery Ops Manager
PeerSpot (formerly IT Central Station)
Hi Everyone,
What do you like most about Tomcat?
Thanks for sharing your thoughts with the community!
Popular Comparisons

Red Hat

Oracle

Informatica

Akamai

Google

Amazon

MuleSoft

Eclipse Foundation

VMware

TmaxSoft

Perforce

Fortinet

Cisco

Cloudera

Rocket Software

WSO2

Micro Focus

SmartBear

Keysight Technologies