If you were talking to someone whose organization is considering Apache Hadoop, what would you say?
How would you rate it and why? Any other tips or advice?
I rate Hadoop seven out of 10. It's very good, but it could always be better. To anyone considering Hadoop, I recommend that you be mindful of what you're trying to achieve.
Usually, people need to study and prepare for a few use cases and compare multiple ecosystems before choosing one. When people think of using a big data solution, Hadoop comes to mind. For certain use cases, Hadoop is comparable with other technologies. For example, when building a sort of real-time data warehouse — an enterprise data hub —, people don't think about using Hadoop directly. People often use solutions like DROID for building. At the end of the day, you need to compare technologies — existing technologies against their use cases. You need to study your use case and select the technology inside of Hadoop that will fit your use case. You may find another ecosystem that solves your problem, just keep in mind, Hadoop is not the only solution, there are a lot of solutions. It depends on the use case. Overall, on a scale from one to ten, I would give Hadoop a rating of eight.
The solution is perfect for those dealing with a huge amount of data. Still, you need to check to make sure it meets your company's requirements. You need to understand them before actually choosing the technology you'll ultimately use. Overall, I would rate the solution at a seven out of ten.
We're just a customer. We don't have a business relationship with Hadoop. My day-to-day job is data modeling and architecting. Originally we used it as an open-source solution. We downloaded it, then we went for a commercial version of it. In terms of advice, I'd tell other potential users that whether the solution is right for them depends on a few items. If the data volume is too big, it's IoT data, or the stream of data is too much, this solution can handle it and I would definitely recommend Apache Hadoop. Recently, in the last 18 months, I've been working with the Snowflake, it's a Data Lake project, and I am really impressed with that one. I got a certification so that we started using Snowflake set for our Data Lake environment. I'd rate the solution eight out of ten.
We use the on-premises deployment model. We're more inclined towards an operational data source to fill our customer's needs. Hadoop is good for analytics and some reporting requirements. It's a good solution for those needing something for the purposes of management reporting. I'd rate the solution eight out of ten.
I've used the solution under cloud, hybrid and on-premises deployment models. I'd recommend the solution, but it depends on the company's requirements. If you don't have huge amounts of data, you probably don't need Hadoop. If you need a completely private environment, and you have lots of big data, consider Hadoop. You don't even need to invest in the infrastructure as you can just use a cloud deployment. I'd rate the solution seven out of ten. I'd rate it higher if it had a better user interface.
We use the on-premises deployment model. It's a requirement for the company we work with, which is a bank. Often customers demand we work with on-premises deployment models. I'd rate the solution seven out of ten. In terms of the ability to build middleware and offer scalability, it would be 10 out of 10 from me. However, if you take into account only the visualization, I'd only rate it at three or four out of ten.
It's good for what is meant to do, a lot of big data, but it's not as good for low latency applications. If you have to perform quick queries on naive or analytics it can be frustrating. It can be useful for what it was intended to be used for. I would rate this solution a seven out of ten.
I would give this product a rating of eight out of ten. It would not be a ten out of ten because of some problems we are having with the upgrade to the newer version. It would have been better for us if these problems were not holding us back. I think eight is good enough.
Implement for defined use cases. Don't expect it to all just work very easily. I would rate this platform a seven out of 10. On the one hand, it's the only place you can use certain functions, and on the other hand, it's not going to put any of the other ones out of business. It's really more of a complement. There is no fundamental battle between relational databases and Hadoop.
What do you like most about Apache Hadoop?
Thanks for sharing your thoughts with the community!