Dremio: 2,683 views | 2,043 comparisons | 100% willing to recommend
H2O.ai: 1,962 views | 1,376 comparisons | 100% willing to recommend
Comparison Buyer's Guide
Executive Summary

We performed a comparison between Dremio and H2O.ai based on real PeerSpot user reviews.

Find out what your peers are saying about Databricks, Microsoft, Alteryx and others in Data Science Platforms.
To learn more, read our detailed Data Science Platforms Report (Updated: May 2024).
771,157 professionals have used our research since 2012.
Quotes From Members
We asked business professionals to review the solutions they use.
Here are some excerpts of what they said:
Pros

Dremio:
  • "Dremio enables you to manage changes more effectively than any other data warehouse platform. There are two things that come into play. One is data lineage. If you are looking at data in Dremio, you may want to know the source and what happened to it along the way or how it may have been transformed in the data pipeline to get to the point where you're consuming it."
  • "Everyone uses Dremio in my company; some use it only for the analytics function."
  • "Dremio allows querying the files I have on my block storage or object storage."
  • "Dremio gives you the ability to create services which do not require additional resources and sterilization."
  • "We primarily use Dremio to create a data framework and a data queue."
  • "The most valuable feature of Dremio is it can sit on top of any other data storage, such as Amazon S3, Azure Data Factory, SGFS, or Hive. The memory competition is good. If you are running any kind of materialized view, you'd be running in memory."

H2O.ai:
  • "Fast training, memory-efficient DataFrame manipulation, well-documented, easy-to-use algorithms, ability to integrate with enterprise Java apps (through POJO/MOJO) are the main reasons why we switched from Spark to H2O."
  • "It is helpful, intuitive, and easy to use. The learning curve is not too steep."
  • "One of the most interesting features of the product is their driverless component. The driverless component allows you to test several different algorithms along with navigating you through choosing the best algorithm."
  • "The most valuable features are the machine learning tools, the support for Jupyter Notebooks, and the collaboration that allows you to share it across people."
  • "The ease of use in connecting to our cluster machines."
  • "AutoML helps in hands-free initial evaluations of efficiency/accuracy of ML algorithms."

Cons

Dremio:
  • "We've faced a challenge with integrating Dremio and Databricks, specifically regarding authentication. It is not shaking hands very easily."
  • "It shows errors sometimes."
  • "Dremio doesn't support the Delta connector. Dremio writes the IT support for Delta, but the support isn't great. There is definitely room for improvement."
  • "Dremio takes a long time to execute large queries or the executing of correlated queries or nested queries. Additionally, the solution could improve if we could read data from the streaming pipelines or if it allowed us to create the ETL pipeline directly on top of it, similar to Snowflake."
  • "They have an automated tool for building SQL queries, so you don't need to know SQL. That interface works, but it could be more efficient in terms of the SQL generated from those things. It's going through some growing pains. There is so much value in tools like these for people with no SQL experience. Over time, Dremio will make these capabilities more accessible to users who aren't database people."
  • "I cannot use the recursive common table expression (CTE) in Dremio because the support page says it's currently unsupported."

H2O.ai:
  • "The model management features could be improved."
  • "Referring to bullet-3 as well, H2O DataFrame manipulation capabilities are too primitive."
  • "I would like to see more features related to deployment."
  • "The interpretability module has room for improvement. Also, it needs to improve its ability to integrate with other systems, like SageMaker, and the overall integration capability."
  • "It lacks the data manipulation capabilities of R and Pandas DataFrames. We would kill for dplyr offloading H2O."
  • "On the topic of model training and model governance, this solution cannot handle ten or twelve models running at the same time."
  • "It needs a drag and drop GUI like KNIME, for easy access to and visibility of workflows."

Pricing and Cost Advice

Dremio:
  • "Right now the cluster costs approximately $200,000 per month and is based on the volume of data we have."
  • "Dremio is less costly competitively to Snowflake or any other tool."

H2O.ai:
  • "We have seen significant ROI where we were able to use the product in certain key projects and could automate a lot of processes. We were even able to reduce staff."

    Questions from the Community
    Top Answer: Dremio allows querying the files I have on my block storage or object storage.
    Top Answer: Every tool has a value based on its visualization, and the pricing is worth its value.
    Top Answer: Dremio's interface is good, but it has a few limitations. I cannot do a lot of things with ANSI SQL or basic SQL. I cannot use the recursive common table expression (CTE) in Dremio because the support…
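For readers unfamiliar with the feature the last answer refers to, here is a minimal sketch of what a recursive CTE does. It runs against SQLite through Python's standard library, not against Dremio, and the `staff` table, its rows, and the function names are hypothetical. The second function shows one possible client-side fallback (a level-by-level loop) for engines that reject `WITH RECURSIVE`; that workaround is an assumption for illustration, not something the reviewer describes.

```python
import sqlite3

# Hypothetical org-chart rows: (employee, manager). Illustration only.
ROWS = [
    ("ceo", None),
    ("vp_eng", "ceo"),
    ("vp_sales", "ceo"),
    ("dev_lead", "vp_eng"),
    ("dev", "dev_lead"),
]

# A recursive CTE: the anchor query selects the root, and the recursive
# query repeatedly joins the table to the rows found so far.
RECURSIVE_CTE = """
WITH RECURSIVE chain(employee, depth) AS (
    SELECT employee, 0 FROM staff WHERE manager IS NULL
    UNION ALL
    SELECT s.employee, c.depth + 1
    FROM staff AS s JOIN chain AS c ON s.manager = c.employee
)
SELECT employee, depth FROM chain ORDER BY depth, employee
"""


def depths_with_recursive_cte(rows):
    """Compute each employee's depth in the hierarchy inside the SQL engine."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE staff (employee TEXT, manager TEXT)")
    conn.executemany("INSERT INTO staff VALUES (?, ?)", rows)
    result = conn.execute(RECURSIVE_CTE).fetchall()
    conn.close()
    return result


def depths_with_loop(rows):
    """Client-side fallback: expand one hierarchy level per pass."""
    depth = {emp: 0 for emp, mgr in rows if mgr is None}
    frontier = set(depth)
    while frontier:
        next_frontier = set()
        for emp, mgr in rows:
            if mgr in frontier and emp not in depth:
                depth[emp] = depth[mgr] + 1
                next_frontier.add(emp)
        frontier = next_frontier
    return sorted(depth.items(), key=lambda item: (item[1], item[0]))


if __name__ == "__main__":
    print(depths_with_recursive_cte(ROWS))
    print(depths_with_loop(ROWS))
```

Both functions return the same (employee, depth) pairs. The loop version issues no recursive SQL, at the cost of pulling the rows to the client.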

    Ranking
    Metric                      Dremio    H2O.ai
    Rank                        9th       20th
    Views                       2,683     1,962
    Comparisons                 2,043     1,376
    Reviews                     6         0
    Average Words per Review    530       0
    Rating                      8.7       N/A
    Comparisons
    Dremio is compared with:
    • Databricks: 43% of the time
    • Snowflake: 17% of the time
    • Starburst Enterprise: 12% of the time
    • Amazon Redshift: 4% of the time
    H2O.ai is compared with:
    • Databricks: 21% of the time
    • Amazon SageMaker: 17% of the time
    • Dataiku: 14% of the time
    • KNIME: 10% of the time
    Dremio
    Overview

    Dremio is a data analytics platform designed to simplify and expedite the data analysis process by enabling direct querying across multiple data sources without the need for data replication. This solution stands out due to its approach to data lake transformation, offering tools that allow users to access and query data stored in various formats and locations as if it were all in a single relational database.

    At its core, Dremio facilitates a more streamlined data management experience. It integrates easily with existing data lakes, allowing organizations to continue using their storage of choice, such as AWS S3, Microsoft ADLS, or Hadoop, without data migration. Dremio supports SQL queries, which means it seamlessly integrates with familiar BI tools and data science frameworks, enhancing user accessibility and reducing the learning curve typically associated with adopting new data technologies.

    What Are Dremio's Key Features?

    • Data Reflections: Reduces query times by creating optimized representations of source data, which can accelerate performance without the complexity of traditional data warehousing solutions.
    • Semantic Layer: Allows users to define business metrics and dimensions centrally, ensuring consistency and governance across all analytics tools.
    • Built-in Security Features: Provides robust security measures, including column- and row-level security, ensuring compliance with data governance and privacy standards.
    • Support for Multiple Data Formats and Sources: Enables querying directly against a variety of data formats (Parquet, JSON, etc.) and sources without the need for conversion or replication.

    What Benefits Should Users Expect?

    When evaluating Dremio, potential users should look for feedback on its query performance, especially in environments with large and complex data sets. Reviews might highlight the efficiency gains from using Dremio’s data reflections and its ability to integrate with existing BI tools without significant changes to underlying data structures. Also, check how other users evaluate its ease of deployment and scalability, particularly in hybrid and cloud environments.

    How is Dremio Implemented Across Different Industries?

    Dremio is widely applicable across various industries, including finance, healthcare, and retail, where organizations benefit from rapid, on-demand access to large volumes of data spread across disparate systems. For instance, in healthcare, Dremio can be used to analyze patient outcomes across different data repositories, improving treatment strategies and operational efficiencies.

    What About Dremio’s Pricing, Licensing, and Support?

    Dremio offers a flexible pricing model that caters to different sizes and types of businesses, including a free community version for smaller teams and proof-of-concept projects. Their enterprise version is subscription-based, with pricing varying based on the deployment scale and support needs. Customer support is comprehensive, featuring dedicated assistance, online resources, and community support.

    H2O.ai
    Overview

    H2O is a fully open-source, distributed in-memory machine learning platform with linear scalability. H2O supports the most widely used statistical and machine learning algorithms, including gradient boosted machines, generalized linear models, deep learning, and more. H2O also has industry-leading AutoML functionality that automatically runs through all the algorithms and their hyperparameters to produce a leaderboard of the best models. The H2O platform is used by over 14,000 organizations globally and is extremely popular in both the R and Python communities.

    Sample Customers
    Dremio: UBS, TransUnion, Quantium, Daimler, OVH
    H2O.ai: poder.io, Stanley Black & Decker, G5, PWC, Comcast, Cisco
    Top Industries
    VISITORS READING REVIEWS (Dremio)
    • Financial Services Firm: 31%
    • Computer Software Company: 11%
    • Manufacturing Company: 8%
    • Retailer: 4%
    VISITORS READING REVIEWS (H2O.ai)
    • Financial Services Firm: 19%
    • Computer Software Company: 11%
    • Manufacturing Company: 9%
    • Insurance Company: 6%
    Company Size
    VISITORS READING REVIEWS
    • Small Business: 16%
    • Midsize Enterprise: 11%
    • Large Enterprise: 73%
    REVIEWERS
    • Small Business: 22%
    • Midsize Enterprise: 22%
    • Large Enterprise: 56%
    VISITORS READING REVIEWS
    • Small Business: 18%
    • Midsize Enterprise: 12%
    • Large Enterprise: 69%

    Dremio is ranked 9th in Data Science Platforms with 6 reviews while H2O.ai is ranked 20th in Data Science Platforms. Dremio is rated 8.6, while H2O.ai is rated 7.6. The top reviewer of Dremio writes "It enables you to manage changes more effectively than any other platform". On the other hand, the top reviewer of H2O.ai writes "It is helpful, intuitive, and easy to use. The learning curve is not too steep". Dremio is most compared with Databricks, Snowflake, Starburst Enterprise, Amazon Redshift and Microsoft Azure Synapse Analytics, whereas H2O.ai is most compared with Databricks, Amazon SageMaker, Dataiku, Microsoft Azure Machine Learning Studio and KNIME.


    We monitor all Data Science Platforms reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.