Apache Hadoop vs Dremio comparison

Read 11 Dremio reviews

3,725 Views
2,161 Comparison Views

100% willing to recommend

Apache Hadoop

Comparison Buyer's Guide

Download the report

Executive Summary

We performed a comparison between Apache Hadoop and Dremio based on real PeerSpot user reviews.

Find out in this report how the two Cloud Data Warehouse solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI.

To learn more, read our detailed Apache Hadoop vs. Dremio Report (Updated: March 2026).

Apache Hadoop vs. Dremio

Download the complete report

Helped 884,933 peers since 2012

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:

ROI

Sentiment score

5.4

Apache Hadoop offers cost-effective storage and processing, benefiting some with analytics and optimizing data applications for resource savings.

Sentiment score

5.6

Dremio reduces manpower costs, enhances efficiency, and eliminates infrastructure concerns, improving operations by accessing multiple data sources.

No quotes available

For more quotes and insights, download the Apache Hadoop report

Dremio surely saves time, reduces costs, and all those things because we don't have to worry so much about the infrastructure to make the different tools communicate.

For more quotes and insights, download the Dremio report

SR BI developer at BRQ Digital Solutions

Customer Service

Sentiment score

6.1

Customer service for Apache Hadoop varies, with differing satisfaction levels and reliance on external resources and forums for support.

Sentiment score

5.2

Dremio's customer service is responsive and helpful, facing staffing challenges as demand grows, requiring more integrators for support.

It's not structured support, which is why we don't use purely open-source projects without additional structured support.

For more quotes and insights, download the Apache Hadoop report

Financial Advisor at a financial services firm with 10,001+ employees

We have had to reach out for customer support many times, and they respond, so they are pretty supportive about some long-term issues.

For more quotes and insights, download the Dremio report

SR BI developer at BRQ Digital Solutions

Scalability Issues

Sentiment score

7.4

Apache Hadoop is valued for its scalability, supporting large data and users effectively, especially in cloud environments.

Sentiment score

7.1

Dremio scales well, offering flexibility and built-in capabilities, though community users face scaling limits due to licensing.

It is a distributed file system and scales reasonably well as long as it is given sufficient resources.

For more quotes and insights, download the Apache Hadoop report

Financial Advisor at a financial services firm with 10,001+ employees

Dremio's scalability can handle growing data and user demands easily.

For more quotes and insights, download the Dremio report

SR BI developer at BRQ Digital Solutions

Internally, if it's on Docker or Kubernetes, scalability will be built into the system.

KamleshPant

Senior Software Architect at USEReady

Stability Issues

Sentiment score

7.1

Apache Hadoop is stable and reliable in multi-node clusters, performing well with minimal instability during high-load operations.

Sentiment score

7.2

Dremio is generally stable, scoring high ratings with occasional performance issues, especially with large datasets, requiring maintenance restarts.

Continuous management in the way of upgrades and technical management is necessary to ensure that it remains effective.

For more quotes and insights, download the Apache Hadoop report

Financial Advisor at a financial services firm with 10,001+ employees

I rate Dremio a nine in terms of stability.

For more quotes and insights, download the Dremio report

SR BI developer at BRQ Digital Solutions

Room For Improvement

Apache Hadoop needs user-friendly enhancements, better integration, improved security, streamlined setup, and modernized features and support.

Dremio struggles with Delta connector support, performance issues, SQL limitations, high costs, and fewer connectors than competitors.

The problem with Apache Hadoop arose when the guys that originally set it up left the firm, and the group that later owned it didn't have enough technical resources to properly maintain it.

For more quotes and insights, download the Apache Hadoop report

Financial Advisor at a financial services firm with 10,001+ employees

Starburst comes with around 50 connectors now.

KamleshPant

Senior Software Architect at USEReady

It should be easier to get Arctic or an open-source version of Arctic onto the software version so that development teams can experiment with it.

SyedIsmail

Senior Consultant - Data Analytics at a comms service provider with 201-500 employees

I see that many times the new versions of Dremio have not fixed old bugs, and in some new versions, old problems that were previously fixed come back again, so I think the upgrade part could use improvement.

For more quotes and insights, download the Dremio report

SR BI developer at BRQ Digital Solutions

Setup Cost

Enterprise Apache Hadoop pricing varies greatly, influenced by distribution choice, deployment type, and specific usage requirements.

Dremio's pricing, though costly for scaling, is seen as valuable compared to competitors, requiring careful evaluation based on needs.

No quotes available

For more quotes and insights, download the Apache Hadoop report

No quotes available

For more quotes and insights, download the Dremio report

Valuable Features

Apache Hadoop offers scalable, cost-effective data processing, supporting diverse environments with fault tolerance, integration, and analytics tools like Hive.

Dremio offers efficient data management and visualization with seamless integration, native SQL, and role-based access control features.

Hadoop is a distributed file system, and it scales reasonably well provided you give it sufficient resources.

For more quotes and insights, download the Apache Hadoop report

Financial Advisor at a financial services firm with 10,001+ employees

I assess Apache Hadoop's fault tolerance during hardware failures positively since we have hardware failover, which works without problems.

YuQing Ding

Principle Network and Database Engr at Parsons Corporation

Having everything under one system and an easier-to-work-with interface, along with having API integrations, adds significant value to working with Dremio.

SyedIsmail

Senior Consultant - Data Analytics at a comms service provider with 201-500 employees

Dremio has positively impacted my organization as nowadays we are connected to multiple databases from multiple environments, multiple APIs, and applications, and Dremio organizes everything in an amazing way for me.

Joao Silveira

Data Analyst at a insurance company with 501-1,000 employees

You just get the source, connect the data, get visualization, get connected, and do whatever you want.

KamleshPant

Senior Software Architect at USEReady

For more quotes and insights, download the Dremio report

Categories and Ranking

Apache Hadoop

Average Rating

8.0

Reviews Sentiment

6.6

Number of Reviews

Ranking in other categories

Data Warehouse (7th)

Dremio

Average Rating

8.4

Reviews Sentiment

6.6

Number of Reviews

Ranking in other categories

Cloud Data Warehouse (5th), Data Science Platforms (11th)

Featured Reviews

Reliable performance maintained but requires ongoing management and support

Financial Advisor at a financial services firm with 10,001+ employees

Hadoop was used for years, but there were problems since the people who originally set it up left the firm. The group that owned it later didn't have the technical resources to properly maintain it. Although there was nothing wrong with Hadoop itself, issues arose without proper management and upgrades.

Read full review

Has simplified complex data integration workflows and supported consistent reporting across multiple sources

SR BI developer at BRQ Digital Solutions

We also have a close relationship with the team that does the Dremio maintenance for the database, like upgrading the versions and they know about some specific problems we had in the past, such as a memory leak. We had a memory leak on some versions, which sometimes stopped the service. Since we are using Dremio installed like a server, not a SaaS solution, many times we need to stop and restart the service to clear all the cache and all that, and this is the thing I should add. I see that many times the new versions of Dremio have not fixed old bugs, and in some new versions, old problems that were previously fixed come back again, so I think the upgrade part could use improvement. I remember using some features in the past, like pivot tables, which proved to be really difficult, but I know this is a fault also for other vendors. Pivoting, transposing, and unpivoting are often not so good. CTEs also many times prove to be not so good, so I think these two main items could be improved significantly if they standardize them.

Read full review

See which vendors are best for you

Use our free recommendation engine to learn which Cloud Data Warehouse solutions are best for your needs.

See recommendations

884,933 professionals have used our research since 2012.

Top Industries

By visitors reading reviews

Financial Services Firm

31%

Computer Software Company

Government

University

Financial Services Firm

26%

Computer Software Company

Manufacturing Company

Healthcare Company

Company Size

By reviewers

Large Enterprise

Midsize Enterprise

Small Business

By reviewers
Company Size	Count
Small Business	14
Midsize Enterprise	8
Large Enterprise	21

By reviewers
Company Size	Count
Small Business	1
Midsize Enterprise	5
Large Enterprise	5

Questions from the Community

What do you like most about Apache Hadoop?

It's primarily open source. You can handle huge data volumes and create your own views, workflows, and tables. I can also use it for real-time data streaming.

What is your experience regarding pricing and costs for Apache Hadoop?

The product is open-source, but some associated licensing fees depend on the subscription level. While it might be free for students, organizations typically need to pay for their subscriptions. Th...

What needs improvement with Apache Hadoop?

The problem with Apache Hadoop arose when the guys that originally set it up left the firm, and the group that later owned it didn't have enough technical resources to properly maintain it. This wa...

What is your experience regarding pricing and costs for Dremio?

I don't have information about pricing, setup cost, and licensing for Dremio, so I am not entitled to discuss it.

What needs improvement with Dremio?

I wouldn't say there is anything Dremio can be improved on. If I could change something, I would say many developers and programmers, when they are starting to work in this specific field or area, ...

What is your primary use case for Dremio?

I have been using Dremio for a year and a half. My main use case for Dremio is that I am able to access multiple databases and I can easily and quickly connect Dremio with my dashboards. In my rece...

Snowflake vs Apache Hadoop

Comparisons

Compared 13% of the time

OpenText Analytics Database (Vertica) vs Apache Hadoop

Compared 6% of the time

Teradata vs Apache Hadoop

Compared 6% of the time

Databricks vs Apache Hadoop

Compared 6% of the time

More Apache Hadoop Competitors

Databricks vs Dremio

Compared 23% of the time

Snowflake vs Dremio

Compared 11% of the time

Starburst Enterprise vs Dremio

Compared 4% of the time

Dataiku vs Dremio

Compared 4% of the time

Amazon SageMaker vs Dremio

Compared 4% of the time

More Dremio Competitors

Product Reports

Apache Hadoop

Download Apache Hadoop product report

Download Dremio product report

Also Known As

No data available

Dremio AWS - BYOL

Overview

The Apache Hadoop project develops open-source software for reliable, scalable, distributed computing. The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high-availability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-available service on top of a cluster of computers, each of which may be prone to failures.

Apache

Dremio offers a comprehensive platform for data warehousing and data engineering, integrating seamlessly with data storage systems like Amazon S3 and Azure. Its main features include scalability, query federation, and data reflection.

Dremio's core strength lies in its ability to function as a robust data lake query engine and data warehousing solution. It facilitates the creation of complex queries with ease, thanks to its support for Apache Airflow and query federation across endpoints. Despite challenges with Delta connector support, complex query execution, and expensive licensing, users find it valuable for managing ad-hoc queries and financial data analytics. The platform aids in SQL table management and BI traffic visualization while reducing storage costs and resolving storage conflicts typical in traditional data warehouses.

What are Dremio's most valuable features?

Native Error Interfaces: Simplifies error detection and troubleshooting
Integration with Apache Airflow: Enhances workflow automation capabilities
Scalability: Effortlessly handles growing data demands
Role-Based Access Management: Provides secure data access control
Data Reflection: Optimizes query performance automatically
Query Federation: Supports queries across multiple endpoints seamlessly
Data Cataloging and Virtualization: Offers streamlined data organization

What benefits and ROI should users look for?

Reduced Data Storage Costs: Efficiently manages data to minimize storage expenses
Improved Query Performance: Enhances processing speed for large datasets
Flexibility in Integration: Easily connects with storage solutions like Oracle and MySQL
Enhanced Data Management: Supports comprehensive data analysis workflows

Dremio is primarily implemented in industries requiring extensive data engineering and analytics, including finance and technology. Companies use it for constructing data frameworks, efficiently processing financial analytics, and visualizing BI traffic. It acts as a viable alternative to AWS Glue and Apache Hive, integrating seamlessly with multiple databases, including Oracle and MySQL, offering robust solutions for data-driven strategies. Despite some challenges, its ability to reduce data storage costs and manage complex queries makes it a favorable choice among enterprise users.

Sample Customers

Amazon, Adobe, eBay, Facebook, Google, Hulu, IBM, LinkedIn, Microsoft, Spotify, AOL, Twitter, University of Maryland, Yahoo!, Cornell University Web Lab

UBS, TransUnion, Quantium, Daimler, OVH

Apache Hadoop vs. Dremio