Apache Hadoop vs IBM Db2 Warehouse on Cloud comparison

Cancel
You must select at least 2 products to compare!
Apache Logo
2,467 views|2,109 comparisons
87% willing to recommend
IBM Logo
294 views|268 comparisons
83% willing to recommend
Comparison Buyer's Guide
Executive Summary

We performed a comparison between Apache Hadoop and IBM Db2 Warehouse on Cloud based on real PeerSpot user reviews.

Find out in this report how the two Data Warehouse solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI.
To learn more, read our detailed Apache Hadoop vs. IBM Db2 Warehouse on Cloud Report (Updated: May 2024).
769,789 professionals have used our research since 2012.
Featured Review
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"​​Data ingestion: It has rapid speed, if Apache Accumulo is used.""It's primarily open source. You can handle huge data volumes and create your own views, workflows, and tables. I can also use it for real-time data streaming.""The ability to add multiple nodes without any restriction is the solution's most valuable aspect.""The solution is easy to expand. We haven't seen any issues with it in that sense. We've added 10 servers, and we've added two nodes. We've been expanding since we started using it since we started out so small. Companies that need to scale shouldn't have a problem doing so.""Two valuable features are its scalability and parallel processing. There are jobs that cannot be done unless you have massively parallel processing.""We selected Apache Hadoop because it is not dependent on third-party vendors.""It is a file system for data collection. There are nodes in this cluster that contain all the information, directories, and other files. The nodes are based on the MySQL database.""The most important feature is its ability to handle large volumes. Some of our customers have really large volumes, and it is capable of handling their data in terms of the core volume and daily incremental volume. So, its processing power and speed are most valuable."

More Apache Hadoop Pros →

"The performance is okay as long as the volume of queries is not too high.""It will be MPP, so performance should improve.""The way that it scales will help a lot of customers that are stuck with Netezza boxes that can't grow any larger.​"

More IBM Db2 Warehouse on Cloud Pros →

Cons
"The solution could use a better user interface. It needs a more effective GUI in order to create a better user environment.""I would like to see more direct integration of visualization applications.""The integration with Apache Hadoop with lots of different techniques within your business can be a challenge.""Based on our needs, we would like to see a tool for data visualization and enhanced Ambari for management, plus a pre-built IoT hub/model. These would reduce our efforts and the time needed to prove to a customer that this will help them.""The solution needs a better tutorial. There are only documents available currently. There's a lot of YouTube videos available. However, in terms of learning, we didn't have great success trying to learn that way. There needs to be better self-paced learning.""It could be more user-friendly.""The upgrade path should be improved because it is not as easy as it should be.""The main thing is the lack of community support. If you want to implement a new API or create a new file system, you won't find easy support."

More Apache Hadoop Cons →

"Right now, we are implementing on ESX VMware 6.0. Support for this platform is poor. Also, one of the backup/recovery options is broken and IBM is not addressing the issue.""Tech support for dashDB is awful. We usually have tickets open for three to four weeks.""Containers get corrupted very easily. Restoring them using GPFS can result in a lot of issues.""Ultimately, the product itself has challenges and we are not currently satisfied with the support, either."

More IBM Db2 Warehouse on Cloud Cons →

Pricing and Cost Advice
  • "Do take into consider that data storage and compute capacity scale differently and hence purchasing a "boxed" / 'all-in-one" solution (software and hardware) might not be the best idea."
  • "​There are no licensing costs involved, hence money is saved on the software infrastructure​."
  • "This is a low cost and powerful solution."
  • "The price of Apache Hadoop could be less expensive."
  • "If my company can use the cloud version of Apache Hadoop, particularly the cloud storage feature, it would be easier and would cost less because an on-premises deployment has a higher cost during storage, for example, though I don't know exactly how much Apache Hadoop costs."
  • "We don't directly pay for it. Our clients pay for it, and they usually don't complain about the price. So, it is probably acceptable."
  • "The price could be better. Hortonworks no longer exists, and Cloudera killed the free version of Hadoop."
  • "We just use the free version."
  • More Apache Hadoop Pricing and Cost Advice →

  • "If your going to go with warehouse DB/dashDB, use the cloud or Sailfish version."
  • More IBM Db2 Warehouse on Cloud Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which Data Warehouse solutions are best for your needs.
    769,789 professionals have used our research since 2012.
    Questions from the Community
    Top Answer:Tools like Apache Hadoop are knowledge-intensive in nature. Unlike other tools in the market currently, we cannot understand knowledge-intensive products straight away. To use Apache Hadoop, a person… more »
    Top Answer:Organizations of all sizes, especially those who are in need of powerful and elastic cloud data warehouse solutions that can help administrators maximize the efficiency of their data-based operations… more »
    Ranking
    5th
    out of 35 in Data Warehouse
    Views
    2,467
    Comparisons
    2,109
    Reviews
    11
    Average Words per Review
    573
    Rating
    7.9
    15th
    Views
    294
    Comparisons
    268
    Reviews
    0
    Average Words per Review
    0
    Rating
    N/A
    Comparisons
    Also Known As
    IBM dashDB
    Learn More
    Overview
    The Apache Hadoop project develops open-source software for reliable, scalable, distributed computing. The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high-availability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-available service on top of a cluster of computers, each of which may be prone to failures.

    IBM dashDB family offers private and public cloud database solutions for transactional and analytic workloads, with IBM fully managed or client managed options with a Common SQL engine across all deployment options.

    Sample Customers
    Amazon, Adobe, eBay, Facebook, Google, Hulu, IBM, LinkedIn, Microsoft, Spotify, AOL, Twitter, University of Maryland, Yahoo!, Cornell University Web Lab
    Copenhagen Business School, BPM Northwest, GameStop
    Top Industries
    REVIEWERS
    Financial Services Firm38%
    Comms Service Provider25%
    Hospitality Company6%
    Consumer Goods Company6%
    VISITORS READING REVIEWS
    Financial Services Firm28%
    Computer Software Company10%
    Comms Service Provider6%
    University6%
    VISITORS READING REVIEWS
    Financial Services Firm15%
    University10%
    Computer Software Company9%
    Manufacturing Company9%
    Company Size
    REVIEWERS
    Small Business34%
    Midsize Enterprise20%
    Large Enterprise46%
    VISITORS READING REVIEWS
    Small Business14%
    Midsize Enterprise11%
    Large Enterprise74%
    VISITORS READING REVIEWS
    Small Business14%
    Midsize Enterprise11%
    Large Enterprise75%
    Buyer's Guide
    Apache Hadoop vs. IBM Db2 Warehouse on Cloud
    May 2024
    Find out what your peers are saying about Apache Hadoop vs. IBM Db2 Warehouse on Cloud and other solutions. Updated: May 2024.
    769,789 professionals have used our research since 2012.

    Apache Hadoop is ranked 5th in Data Warehouse with 33 reviews while IBM Db2 Warehouse on Cloud is ranked 15th in Cloud Data Warehouse. Apache Hadoop is rated 7.8, while IBM Db2 Warehouse on Cloud is rated 7.6. The top reviewer of Apache Hadoop writes "Handles huge data volumes and create your own workflows and tables but you need to have deeper knowledge". On the other hand, the top reviewer of IBM Db2 Warehouse on Cloud writes "The "prefetch" feature anticipates needed data and keeps it available. BLU acceleration determines what data is unqualified for analysis and skips it". Apache Hadoop is most compared with Azure Data Factory, Microsoft Azure Synapse Analytics, Oracle Exadata, Snowflake and Teradata, whereas IBM Db2 Warehouse on Cloud is most compared with Amazon Redshift, IBM Db2 Warehouse, IBM Netezza Performance Server, Microsoft Azure Synapse Analytics and Snowflake. See our Apache Hadoop vs. IBM Db2 Warehouse on Cloud report.

    See our list of best Data Warehouse vendors and best Cloud Data Warehouse vendors.

    We monitor all Data Warehouse reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.