Try our new research platform with insights from 80,000+ expert users

Amazon Redshift vs Apache Hadoop comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

ROI

Sentiment score
6.2
Amazon Redshift ROI varies; cloud transition boosts sales but data volume impacts cost-effectiveness compared to databases like Netezza.
Sentiment score
6.5
Apache Hadoop provides cost-effective data storage and processing, though ROI varies based on analytics use and sophistication.
 

Customer Service

Sentiment score
6.9
Amazon Redshift's customer service is praised for efficiency and professionalism, though some desire easier phone access and consistent availability.
Sentiment score
6.4
Customer service varies by Hadoop distributor, with Hortonworks rated highly; support depends on vendor, community resources, or external vendors.
Whenever we need support, if there is an issue accessing stored data due to regional data center problems, the Amazon team is very helpful and provides optimal solutions quickly.
It's costly when you enable support.
It's not structured support, which is why we don't use purely open-source projects without additional structured support.
 

Scalability Issues

Sentiment score
7.4
Redshift is popular for its easy scalability on AWS, although some users face challenges with large cluster configurations.
Sentiment score
7.6
Apache Hadoop excels in scalability, allowing easy cluster expansion and efficient data handling, ideal for varied organizational needs.
The scalability part needs improvement as the sizing requires trial and error.
It is a distributed file system and scales reasonably well as long as it is given sufficient resources.
 

Stability Issues

Sentiment score
7.4
Amazon Redshift is stable with minor scaling challenges, appreciated AWS support, and noted visibility concerns versus Snowflake.
Sentiment score
7.3
Apache Hadoop's stability, rated 8/10, improves with newer versions, though minor issues exist with memory and data processing.
Amazon Redshift is a stable product, and I would rate it nine or ten out of ten for stability.
Continuous management in the way of upgrades and technical management is necessary to ensure that it remains effective.
 

Room For Improvement

Amazon Redshift users struggle with data management, pricing, performance, integration, UI support, and compatibility with various data types.
Apache Hadoop needs improved usability, integration, security, support, and performance for efficient high-volume data processing and better community resources.
They should bring the entire ETL data management process into Amazon Redshift.
Integration with AI could be a good improvement.
The problem with Apache Hadoop arose when the guys that originally set it up left the firm, and the group that later owned it didn't have enough technical resources to properly maintain it.
 

Setup Cost

Amazon Redshift offers competitive pricing with scalable costs, ideal for large enterprises, though not as economical for smaller companies.
Enterprise Hadoop offers cost benefits but varies with deployment type and distribution, impacting smaller organizations more heavily.
It's a pretty good price and reasonable for the product quality.
The cost of technical support is high.
The pricing of Amazon Redshift is expensive.
 

Valuable Features

Amazon Redshift offers scalable, efficient, and secure data warehousing with fast processing, AWS integration, and flexible configurations for analytics.
Apache Hadoop excels with a scalable, cost-effective system handling diverse data types, integrating with tools, and supporting big data analytics.
The specific features of Amazon Redshift that are beneficial for handling large data sets include fast retrieval due to cloud services and scalability, which allows us to retrieve data quickly.
Scalability is also a strong point; I can scale it however I want without any limitations.
Amazon Redshift's performance optimization and scalability are quite helpful, providing functionalities such as scaling up and down.
If you don't do the upgrades, the platform ages out, and that's what happened to the Hadoop content.
 

Categories and Ranking

Amazon Redshift
Average Rating
7.8
Reviews Sentiment
6.9
Number of Reviews
71
Ranking in other categories
Cloud Data Warehouse (6th)
Apache Hadoop
Average Rating
7.8
Reviews Sentiment
6.7
Number of Reviews
40
Ranking in other categories
Data Warehouse (7th)
 

Featured Reviews

Sriram-Natesan - PeerSpot reviewer
The ability to create a lot of views or materialized views is beneficial
Improvement in the immediate response and the process of getting into a call could be helpful. We have had to wait for at least twenty-four hours to get a call and then wait for a couple more hours for a solution. Improved connectivity to different BI tools and already published connectors for major tools in AWS could enhance the service.
Sushil Arya - PeerSpot reviewer
Provides ease of integration with the IT workflow of a business
When working with Kafka, I saw that the data came in an incremental order. The incremental data processing part is still not very effective in Apache Hadoop. If the data is already there, it can be processed very effectively, especially if the data is coming in every second. If you want to know the location of some data every second, then such data is not processed effectively in Apache Hadoop. I can say that one of the features where improvements are required revolves around the licensing cost of the tool. If the tool can build some licensing structures in a pay-per-use manner, organizations can get the look and feel of Apache Hadoop. Apache Hadoop can offer a licensing structure of the product that can be seen as similar to how AWS operates. Apache Hadoop can look into the capability of processing incremental data. The tool's setup process can be a scope of improvement. Also, it is not very simple because while doing the setup, we need to do all the server settings, including port listing and firewall configurations. If we look at other products on the market, then they can be made simpler. There are certain shortcomings when it comes to the product's technical support part, making it an area where improvements are required. The time frame for the resolution is an area that needs to be improved. The overall communication part of the technical support team also needs improvement.
report
Use our free recommendation engine to learn which Data Warehouse solutions are best for your needs.
858,649 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Educational Organization
39%
Financial Services Firm
12%
Computer Software Company
9%
Manufacturing Company
6%
Financial Services Firm
34%
Computer Software Company
12%
University
5%
Energy/Utilities Company
5%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

How does Amazon Redshift compare with Microsoft Azure Synapse Analytics?
Amazon Redshift is very fast, has a very good response time, and is very user-friendly. The initial setup is very straightforward. This solution can merge and integrate well with many different dat...
What do you like most about Amazon Redshift?
The tool's most valuable feature is its parallel processing capability. It can handle massive amounts of data, even when pushing hundreds of terabytes, and its scaling capabilities are good.
What do you like most about Apache Hadoop?
It's primarily open source. You can handle huge data volumes and create your own views, workflows, and tables. I can also use it for real-time data streaming.
What is your experience regarding pricing and costs for Apache Hadoop?
The product is open-source, but some associated licensing fees depend on the subscription level. While it might be free for students, organizations typically need to pay for their subscriptions. Th...
What needs improvement with Apache Hadoop?
The problem with Apache Hadoop arose when the guys that originally set it up left the firm, and the group that later owned it didn't have enough technical resources to properly maintain it. This wa...
 

Overview

 

Sample Customers

Liberty Mutual Insurance, 4Cite Marketing, BrandVerity, DNA Plc, Sirocco Systems, Gainsight, Blue 449
Amazon, Adobe, eBay, Facebook, Google, Hulu, IBM, LinkedIn, Microsoft, Spotify, AOL, Twitter, University of Maryland, Yahoo!, Cornell University Web Lab
Find out what your peers are saying about Amazon Redshift vs. Apache Hadoop and other solutions. Updated: May 2025.
858,649 professionals have used our research since 2012.