Apache Hadoop vs Oracle Autonomous Data Warehouse comparison

Apache Hadoop vs. Oracle Autonomous Data Warehouse

Download the complete report

Helped 884,873 peers since 2012

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:

Categories and Ranking

Apache Hadoop

Average Rating

8.0

Reviews Sentiment

6.6

Number of Reviews

Ranking in other categories

Data Warehouse (7th)

Oracle Autonomous Data Ware...

Average Rating

8.4

Reviews Sentiment

6.9

Number of Reviews

Ranking in other categories

Cloud Data Warehouse (12th)

Featured Reviews

Nick Rapoport

Financial Advisor at a financial services firm with 10,001+ employees

Reliable performance maintained but requires ongoing management and support

Hadoop was used for years, but there were problems since the people who originally set it up left the firm. The group that owned it later didn't have the technical resources to properly maintain it. Although there was nothing wrong with Hadoop itself, issues arose without proper management and upgrades.

Read full review

Kwajah Mohiuddin

Global Head of Architecture at a financial services firm with 1,001-5,000 employees

Provides self-repair features, but the setup is complex

We use the product for online applications. We use it in the financial industry The product has self-repair features. The tool tunes itself. It separates compute from storage. We can scale storage and compute separately. The setup is complex. Oracle is a complex tool. I have been using Oracle…

Read full review

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:

Pros

"It is a reliable product."

"Hadoop can store any kind of data—structured, unstructured, and semi-structured—and presents it using the relational model through Hive."

"The most important feature is its ability to handle large volumes. Some of our customers have really large volumes, and it is capable of handling their data in terms of the core volume and daily incremental volume. So, its processing power and speed are most valuable."

"The Distributed File System, which is the base of Hadoop, has been the most valuable feature with its ability to store video, pictures, JSON, XML, and plain text all in the same file system."

"Apache Hadoop can manage large amounts and volumes of data with relative ease, which is a feature that is beneficial."

"The ability to add multiple nodes without any restriction is the solution's most valuable aspect."

"Its integration is Hadoop's best feature because that allows us to support different tools in a big data platform."

"Most valuable features are HDFS and Kafka: Ingestion of huge volumes and variety of unstructured/semi-structured data is feasible, and it helps us to quickly onboard a new Big Data analytics prospect."

More Apache Hadoop pros

"The solution is used for analytics and it works for our data security needs."

"The product has self-repair features."

"It is a stable and scalable solution."

"The solution integrates well with Power BI."

"The solution is used for analytics and it works for our data security needs."

"The solution is self-securing. All data is encrypted and security updates and patches are applied automatically both periodically and off-cycle."

"Self-patching and runs machine-learning across its logs all the time"

"I loved the simplicity of loading the data and simply relying on the self-tuning capabilities of ADW."

More Oracle Autonomous Data Warehouse pros

Cons

"Since it is an open-source product, there won't be much support."

"The stability of the solution needs improvement."

"The key shortcoming is its inability to handle queries when there is insufficient memory. This limitation can be bypassed by processing the data in chunks."

"I would like to see more direct integration of visualization applications."

"Real-time data processing is weak. This solution is very difficult to run and implement."

"Based on our needs, we would like to see a tool for data visualization and enhanced Ambari for management, plus a pre-built IoT hub/model. These would reduce our efforts and the time needed to prove to a customer that this will help them."

"It requires a great deal of learning curve to understand. The overall Hadoop ecosystem has a large number of sub-products. There is ZooKeeper, and there are a whole lot of other things that are connected. In many cases, their functionalities are overlapping, and for a newcomer or our clients, it is very difficult to decide which of them to buy and which of them they don't really need. They require a consulting organization for it, which is good for organizations such as ours because that's what we do, but it is not easy for the end customers to gain so much knowledge and optimally use it."

"Hadoop in and of itself stores data with 3x redundancy and our organization has come to the conclusion that the default 3x results in too much wasted disk space."

More Apache Hadoop cons

"The solution could be improved by allowing for migration tools from other cloud services, including migration from Amazon Redshift, RDS, and Aurora."

"One of the major problem is creating custom tablespace. The ADB serverless option doesn't support custom tablespace creation, which could cause issues during on-premise database migration that requires specifically named tablespace. There should be an option to create customized tablespace."

"The setup is complex."

"Sometimes the solution works differently between the cloud and on-premises. It needs to be more consistent and predictable."

"They should make the solution more user-friendly."

"Oracle Autonomous Data Warehouse is not available as an on-premises solution."

"It doesn't work well when you have unstructured data or you need online analytics. It is not as nice as Hadoop in these aspects."

"A lot of the tools that were previously there have now been taken away."

More Oracle Autonomous Data Warehouse cons

Pricing and Cost Advice

"The product is open-source, but some associated licensing fees depend on the subscription level."

"If my company can use the cloud version of Apache Hadoop, particularly the cloud storage feature, it would be easier and would cost less because an on-premises deployment has a higher cost during storage, for example, though I don't know exactly how much Apache Hadoop costs."

"There are no licensing costs involved, hence money is saved on the software infrastructure."

"This is a low cost and powerful solution."

"For any big enterprise the costs can be handled, and it is suitable for big enterprises because the scale of data is large. For medium and small enterprises, the tool is on the high-price side."

"We don't directly pay for it. Our clients pay for it, and they usually don't complain about the price. So, it is probably acceptable."

"The price could be better. Hortonworks no longer exists, and Cloudera killed the free version of Hadoop."

"Do take into consider that data storage and compute capacity scale differently and hence purchasing a "boxed" / 'all-in-one" solution (software and hardware) might not be the best idea."

More Apache Hadoop pricing and cost advice

"ROI is high."

"On a scale from one to ten, where one is a low price and ten is a high price, I rate the pricing an eight."

"Oracle Autonomous Data Warehouse's pricing is fair and reasonable compared to the other cloud vendors."

"We pay approximately $70,000 per month. The cost includes maintenance and support."

"The solution is expensive."

"The price depends on the configuration we choose."

"The cost is perfect with Oracle Universal credit."

"In terms of architecture and pricing structure, I feel it is a little bit costly compared to Azure. It's fine compared to RedShift, but compared to Azure, it's a bit pricey when you calculate for one TB storage plus around five hours of reporting with the frequency of 1TB data. The cost adds up, making Oracle a bit expensive."

More Oracle Autonomous Data Warehouse pricing and cost advice

See which vendors are best for you

Use our free recommendation engine to learn which Cloud Data Warehouse solutions are best for your needs.

See recommendations

884,873 professionals have used our research since 2012.

Top Industries

By visitors reading reviews

Financial Services Firm

31%

Computer Software Company

Government

University

Manufacturing Company

10%

Media Company

Computer Software Company

Insurance Company

Company Size

By reviewers

Large Enterprise

Midsize Enterprise

Small Business

By reviewers
Company Size	Count
Small Business	14
Midsize Enterprise	8
Large Enterprise	21

By reviewers
Company Size	Count
Small Business	7
Midsize Enterprise	1
Large Enterprise	11

Questions from the Community

What do you like most about Apache Hadoop?

It's primarily open source. You can handle huge data volumes and create your own views, workflows, and tables. I can also use it for real-time data streaming.

What is your experience regarding pricing and costs for Apache Hadoop?

The product is open-source, but some associated licensing fees depend on the subscription level. While it might be free for students, organizations typically need to pay for their subscriptions. Th...

What needs improvement with Apache Hadoop?

The problem with Apache Hadoop arose when the guys that originally set it up left the firm, and the group that later owned it didn't have enough technical resources to properly maintain it. This wa...

What is your experience regarding pricing and costs for Oracle Autonomous Data Warehouse?

We pay approximately $70,000 per month. The cost includes maintenance and support.

What needs improvement with Oracle Autonomous Data Warehouse?

Optimization should be better. The SQLs are sometimes very slow. I also noticed that Java is not supported, which is not ideal.

What is your primary use case for Oracle Autonomous Data Warehouse?

We are using Oracle Autonomous Data Warehouse for analytics in my company.

Snowflake vs Apache Hadoop

Comparisons

Compared 13% of the time

OpenText Analytics Database (Vertica) vs Apache Hadoop

Compared 6% of the time

Teradata vs Apache Hadoop

Compared 6% of the time

Databricks vs Apache Hadoop

Compared 6% of the time

Oracle Exadata vs Apache Hadoop

Compared 5% of the time

More Apache Hadoop Competitors

BigQuery vs Oracle Autonomous Data Warehouse

Compared 24% of the time

Snowflake vs Oracle Autonomous Data Warehouse

Compared 16% of the time

Databricks vs Oracle Autonomous Data Warehouse

Compared 13% of the time

Microsoft Azure Synapse Analytics vs Oracle Autonomous Data Warehouse

Compared 9% of the time

Microsoft Parallel Data Warehouse vs Oracle Autonomous Data Warehouse

Compared 2% of the time

More Oracle Autonomous Data Warehouse Competitors

Product Reports

Apache Hadoop

Download Apache Hadoop product report

Oracle Autonomous Data Warehouse

Download Oracle Autonomous Data Warehouse product report

Overview

The Apache Hadoop project develops open-source software for reliable, scalable, distributed computing. The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high-availability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-available service on top of a cluster of computers, each of which may be prone to failures.

Apache

Oracle Autonomous Data Warehouse is the world’s first and only autonomous database optimized for analytic workloads, including data marts, data warehouses, data lakes, and data lakehouses. With Autonomous Data Warehouse, data scientists, business analysts, and nonexperts can rapidly, easily, and cost-effectively discover business insights using data of any size and type. Built for the cloud and optimized using Oracle Exadata, Autonomous Data Warehouse benefits from faster performance and, according to an IDC report (PDF), lowers operational costs by an average of 63%.

Autonomous Database provides the foundation for a data lakehouse—a modern, open architecture that enables you to store, analyze, and understand all your data. The data lakehouse combines the power and richness of data warehouses with the breadth, flexibility, and low cost of popular open source data lake technologies. Access your data lakehouse through Autonomous Database using the world's most powerful and open SQL processing engine.

Oracle

Sample Customers

Amazon, Adobe, eBay, Facebook, Google, Hulu, IBM, LinkedIn, Microsoft, Spotify, AOL, Twitter, University of Maryland, Yahoo!, Cornell University Web Lab

Hertz, TaylorMade Golf, Outront Media, Kingold, FSmart, Drop-Tank

Apache Hadoop vs. Oracle Autonomous Data Warehouse