We changed our name from IT Central Station: Here's why

Apache Hadoop vs Snowflake comparison

Cancel
You must select at least 2 products to compare!
Apache Hadoop Logo
7,075 views|5,891 comparisons
Snowflake Logo
37,053 views|27,399 comparisons
Featured Review
Find out what your peers are saying about Apache Hadoop vs. Snowflake and other solutions. Updated: January 2022.
564,599 professionals have used our research since 2012.
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros
"The most valuable features are powerful tools for ingestion, as data is in multiple systems.""The solution is easy to expand. We haven't seen any issues with it in that sense. We've added 10 servers, and we've added two nodes. We've been expanding since we started using it since we started out so small. Companies that need to scale shouldn't have a problem doing so.""Hadoop is designed to be scalable, so I don't think that it has limitations in regards to scalability.""The performance is pretty good.""Hadoop is extensible — it's elastic."

More Apache Hadoop Pros →

"From a data warehouse perspective, it's an excellent all-round solution. It's very complete.""It requires no maintenance on our part. They handle all that. The speed is phenomenal. The pricing isn't really anything more than what you would be paying for a SQL server license or another tool to execute the same thing. We have zero maintenance on our side to do anything and the speed at which it performs queries and loads the data is amazing. It handles unstructured data extremely well, too. So, if the data is in a JSON array or an XML, it handles that super well.""The querying speed is fast.""The syntax is advanced which reduces the time to write code.""We find the data sharing and data marketplace aspects of Snowflake absolutely amazing.""It is a very well-distributed system. It has different data engines for different applications. Many applications can use different computational engines at the same time. In terms of data processing, the feeling was similar to working with a relational database but in a scalable way.""Its performance is most valuable. As compared to SQL Server, we are able to see a significant improvement in performance with Snowflake.""Once you have finished your designs they can be easily imported to Snowflake and the information can be readily accessed without an IT expert."

More Snowflake Pros →

Cons
"It would be helpful to have more information on how to best apply this solution to smaller organizations, with less data, and grow the data lake.""The solution is very expensive.""From the Apache perspective or the open-source community, they need to add more capabilities to make life easier from a configuration and deployment perspective.""The solution needs a better tutorial. There are only documents available currently. There's a lot of YouTube videos available. However, in terms of learning, we didn't have great success trying to learn that way. There needs to be better self-paced learning.""Hadoop's security could be better."

More Apache Hadoop Cons →

"We would like to see more security including more masking and more encryption at the database level.""There is a scope for improvement. They don't currently support integration with some of the Azure and AWS native services. It would be good if they can enhance their product to integrate with these services.""Their strategy is just to leverage what you've got and put Snowflake in the middle. It does work well with other tools. You have to buy a separate reporting tool and a separate data loading tool, whereas, in some platforms, these tools are baked in. In the long-term, they'll need to add more direct partnerships to the ecosystem so that it's not like adding on tools around Snowflake to make it work. They can also consider including Snowflake native reporting tools versus partnering with other reporting tools. It would kind of change where they sit in the market.""There are a lot of features that they need to come up with. A lot of functions are missing in Snowflake, so we have to find a workaround for those. For example, OUTER APPLY is a basic function in SQL Server, but it is not there in Snowflake. So, you have to write complex code for it.""For the Snowflake database, there should be some third-party features for the ETL. It would also be good to be able to use some kind of controls to get the data either from another database or a flat file. Its price should be improved. It should be cheaper than Microsoft.""Every product has room for improvement, although in this case, it needs some broadening of the functionality.""There could be better ELT tools that are appropriate for Snowflake. We decided on Matillion and it seemed to be the only one. There need to be better choices, it would be great if Snowflake provided an ELT solution that people could use. Additionally, if there was a pure cloud-based ELT tool it would be useful.""They need to incorporate some basic OLAP capabilities in the backend or at the database level. Currently, it is purely a database. They call it purely a data warehouse for the cloud. Currently, just like any database, we have to calculate all the KPIs in the front-end tools. The same KPIs again need to be calculated in Snowflake. It would be very helpful if they can include some OLAP features. This will bring efficiency because we will be able to create the KPIs within Snowflake itself and then publish them to multiple front-end tools. We won't have to recreate the same in each project. There should be the ability to automate raised queries, which is currently not possible. There should also be something for Exception Aggregation and things like that."

More Snowflake Cons →

Pricing and Cost Advice
Information Not Available
  • "It is not cheap."
  • "The pricing for Snowflake is competitive."
  • "On average, with the number of queries that we run, we pay approximately $200 USD per month."
  • "Pricing is approximately $US 50 per DB. Terabyte is around $US 50 per month."
  • "The price of Snowflake is very reasonable."
  • "The price of Snowflake is quite reasonable."
  • "Its price should be improved. It should be cheaper than Microsoft."
  • "It is per credit. It has a use-it-as-you-go model. We bought a chunk of 20,000 credits, and they were lasting us for at least a year. We didn't have the scale of data like a much larger company to consume more credits. For us, it was very inexpensive. Their strategy is just to leverage what you've got and put Snowflake in the middle. It doesn't make it expensive because most of the organizations already have reporting tools. Now, if you were starting from scratch, it might be cheaper to go a different way."
  • More Snowflake Pricing and Cost Advice →

    report
    Use our free recommendation engine to learn which Data Warehouse solutions are best for your needs.
    564,599 professionals have used our research since 2012.
    Questions from the Community
    Top Answer: 
    I don't think using Apache Spark without Hadoop has any major drawbacks or issues. I have used Apache Spark quite successfully with AWS S3 on many projects which are batch based. Yes for very high… more »
    Top Answer: 
    Hadoop is extensible — it's elastic.
    Top Answer: 
    Hadoop's security could be better.
    Top Answer: 
    Once you have finished your designs they can be easily imported to Snowflake and the information can be readily accessed without an IT expert.
    Top Answer: 
    For our licensing, we renew every January by $25,000 in both credits. Their pricing structure is a pay-per-second usage in terms of credits, but you can get discounts if you buy them in bulk. I think… more »
    Top Answer: 
    The documentation could improve. They should provide architecture information. There could be better integration with tools other than the common databases used to receive data. There are other tools… more »
    Ranking
    7th
    out of 30 in Data Warehouse
    Views
    7,075
    Comparisons
    5,891
    Reviews
    5
    Average Words per Review
    436
    Rating
    7.6
    1st
    out of 30 in Data Warehouse
    Views
    37,053
    Comparisons
    27,399
    Reviews
    40
    Average Words per Review
    578
    Rating
    8.4
    Comparisons
    Also Known As
    Snowflake Computing
    Learn More
    Overview
    The Apache Hadoop project develops open-source software for reliable, scalable, distributed computing. The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high-availability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-available service on top of a cluster of computers, each of which may be prone to failures.

    Snowflake provides a data warehouse built for the cloud, delivering a solution capable of solving problems for which legacy, on-premises and cloud data platforms were not designed.

    Offer
    Learn more about Apache Hadoop
    Learn more about Snowflake
    Sample Customers
    Amazon, Adobe, eBay, Facebook, Google, Hulu, IBM, LinkedIn, Microsoft, Spotify, AOL, Twitter, University of Maryland, Yahoo!, Cornell University Web Lab
    Accordant Media, Adobe, Kixeye Inc., Revana, SOASTA, White Ops
    Top Industries
    REVIEWERS
    Financial Services Firm43%
    Comms Service Provider29%
    Consumer Goods Company14%
    Government14%
    VISITORS READING REVIEWS
    Computer Software Company28%
    Comms Service Provider18%
    Financial Services Firm13%
    Energy/Utilities Company5%
    REVIEWERS
    Computer Software Company39%
    Financial Services Firm11%
    Comms Service Provider7%
    Manufacturing Company7%
    VISITORS READING REVIEWS
    Computer Software Company27%
    Comms Service Provider15%
    Financial Services Firm8%
    Insurance Company5%
    Company Size
    REVIEWERS
    Small Business38%
    Midsize Enterprise24%
    Large Enterprise38%
    REVIEWERS
    Small Business25%
    Midsize Enterprise27%
    Large Enterprise48%
    VISITORS READING REVIEWS
    Small Business16%
    Midsize Enterprise15%
    Large Enterprise68%
    Find out what your peers are saying about Apache Hadoop vs. Snowflake and other solutions. Updated: January 2022.
    564,599 professionals have used our research since 2012.

    Apache Hadoop is ranked 7th in Data Warehouse with 5 reviews while Snowflake is ranked 1st in Data Warehouse with 41 reviews. Apache Hadoop is rated 7.6, while Snowflake is rated 8.4. The top reviewer of Apache Hadoop writes "Great micro-partitions, helpful technical support and quite stable". On the other hand, the top reviewer of Snowflake writes "Good usability, good data sharing and elastic compute features, and requires less DBA involvement". Apache Hadoop is most compared with Microsoft Azure Synapse Analytics, VMware Tanzu Greenplum, Oracle Exadata, Teradata and Vertica, whereas Snowflake is most compared with Microsoft Azure Synapse Analytics, Amazon Redshift, Vertica, Teradata and Teradata Cloud Data Warehouse. See our Apache Hadoop vs. Snowflake report.

    See our list of best Data Warehouse vendors.

    We monitor all Data Warehouse reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.