We performed a comparison between Apache Hadoop and IBM Db2 Warehouse on Cloud based on real PeerSpot user reviews.
Find out in this report how the two Data Warehouse solutions compare in terms of features, pricing, service and support, ease of deployment, and ROI.
"Data ingestion: It has rapid speed, if Apache Accumulo is used."
"It's primarily open source. You can handle huge data volumes and create your own views, workflows, and tables. I can also use it for real-time data streaming."
"The ability to add multiple nodes without any restriction is the solution's most valuable aspect."
"The solution is easy to expand. We haven't seen any issues with it in that sense. We've added 10 servers, and we've added two nodes. We've been expanding since we started using it since we started out so small. Companies that need to scale shouldn't have a problem doing so."
"Two valuable features are its scalability and parallel processing. There are jobs that cannot be done unless you have massively parallel processing."
"We selected Apache Hadoop because it is not dependent on third-party vendors."
"It is a file system for data collection. There are nodes in this cluster that contain all the information, directories, and other files. The nodes are based on the MySQL database."
"The most important feature is its ability to handle large volumes. Some of our customers have really large volumes, and it is capable of handling their data in terms of the core volume and daily incremental volume. So, its processing power and speed are most valuable."
"The performance is okay as long as the volume of queries is not too high."
"It will be MPP, so performance should improve."
"The way that it scales will help a lot of customers that are stuck with Netezza boxes that can't grow any larger."
"The solution could use a better user interface. It needs a more effective GUI in order to create a better user environment."
"I would like to see more direct integration of visualization applications."
"The integration of Apache Hadoop with lots of different techniques within your business can be a challenge."
"Based on our needs, we would like to see a tool for data visualization and enhanced Ambari for management, plus a pre-built IoT hub/model. These would reduce our efforts and the time needed to prove to a customer that this will help them."
"The solution needs a better tutorial. There are only documents available currently. There's a lot of YouTube videos available. However, in terms of learning, we didn't have great success trying to learn that way. There needs to be better self-paced learning."
"It could be more user-friendly."
"The upgrade path should be improved because it is not as easy as it should be."
"The main thing is the lack of community support. If you want to implement a new API or create a new file system, you won't find easy support."
"Right now, we are implementing on ESX VMware 6.0. Support for this platform is poor. Also, one of the backup/recovery options is broken and IBM is not addressing the issue."
"Tech support for dashDB is awful. We usually have tickets open for three to four weeks."
"Containers get corrupted very easily. Restoring them using GPFS can result in a lot of issues."
"Ultimately, the product itself has challenges and we are not currently satisfied with the support, either."
Apache Hadoop is ranked 5th in Data Warehouse with 33 reviews, while IBM Db2 Warehouse on Cloud is ranked 15th in Cloud Data Warehouse. Apache Hadoop is rated 7.8, while IBM Db2 Warehouse on Cloud is rated 7.6. The top reviewer of Apache Hadoop writes "Handles huge data volumes and create your own workflows and tables but you need to have deeper knowledge". On the other hand, the top reviewer of IBM Db2 Warehouse on Cloud writes "The 'prefetch' feature anticipates needed data and keeps it available. BLU acceleration determines what data is unqualified for analysis and skips it". Apache Hadoop is most compared with Azure Data Factory, Microsoft Azure Synapse Analytics, Oracle Exadata, Snowflake, and Teradata, whereas IBM Db2 Warehouse on Cloud is most compared with Amazon Redshift, IBM Db2 Warehouse, IBM Netezza Performance Server, Microsoft Azure Synapse Analytics, and Snowflake. See our Apache Hadoop vs. IBM Db2 Warehouse on Cloud report.
See our list of best Data Warehouse vendors and best Cloud Data Warehouse vendors.
We monitor all Data Warehouse reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.