We have been using it to build one of our frameworks. We primarily use Dremio to create a data framework and a data queue. It's being used in combination with DBT and Databricks.
Sr Manager at a transportation company with 10,001+ employees
Offers smooth installation and cloud/on-prem flexibility, but faces integration challenges with Databricks
Pros and Cons
- "We primarily use Dremio to create a data framework and a data queue."
- "We've faced a challenge with integrating Dremio and Databricks, specifically regarding authentication. It is not shaking hands very easily."
What is our primary use case?
What is most valuable?
We're still in the exploration phase with Dremio, so it's a bit early to determine its most valuable feature. We're currently deploying it across different departments for various use cases and learning from these internal applications.
What needs improvement?
We've faced a challenge with integrating Dremio and Databricks, specifically regarding authentication. It is not shaking hands very easily. We had to set up two different VMs and execute them in a different manner and integrate them.
For how long have I used the solution?
I've been using Dremio for about two to three months now. However, one of our teams has been using it for the past year.
Buyer's Guide
Dremio
December 2025
Learn what your peers think about Dremio. Get advice and tips from experienced pros sharing their opinions. Updated: December 2025.
879,422 professionals have used our research since 2012.
What do I think about the stability of the solution?
From my three months of experience, I haven't noticed any stability issues with Dremio.
What do I think about the scalability of the solution?
In my department, which focuses on data and AI, we have about 538 people. I'm not sure how many are actively using Dremio.
How was the initial setup?
The installation process was quite smooth and didn't present any issues.
We currently have Dremio on the cloud. For proof of concept (POC) purposes, we are using it on-premises.
Which other solutions did I evaluate?
What other advice do I have?
We are currently evaluating Dremio against other similar products. But at first glance, I would recommend using Dremio.
Considering my limited access and experience over these three months, I would rate Dremio around a seven out of ten.
Which deployment model are you using for this solution?
Hybrid Cloud
Disclosure: My company has a business relationship with this vendor other than being a customer. customer/partner
Senior Data Engineer at a marketing services firm with 1,001-5,000 employees
Stable self-service data tool used for service to service integration and to manage simple ad hoc queries
Pros and Cons
- "Dremio gives you the ability to create services which do not require additional resources and sterilization."
- "Dremio doesn't support the Delta connector. Dremio writes the IT support for Delta, but the support isn't great. There is definitely room for improvement."
What is our primary use case?
I have used this solution as an ETL tool to create data marks on data lakes for bridging. I have used it as a greater layer for ad-hoc queries and for some services which do not require sub-second latency to credit data from very big data lakes. I have also used it to manage simple ad-hoc queries similar to Athena, Presto or BigQuery.
We do not have a large number of people using this solution because it's mainly setup as a service to service integration. We integrated a big workload when we started using Dremio and this was very expensive. The migration is still in progress. As soon as this migration is finished, we plan to migrate ad-hoc queries from our analytical team.
What is most valuable?
The native error interface has been most valuable. With Apache Airflow, which Dremio supports, you don't need to digitalize data. Dremio gives you the ability to create services which do not require additional resources and serialization.
What needs improvement?
Dremio doesn't support the Delta connector. Dremio writes the IT support for Delta, but the support isn't great. There is definitely room for improvement.
The community version lacks information or documentation. If you want to create a more dynamic scaling policy for your cluster, it's always a problem because you need an additional custom code to be sure that Dremio shuts down gracefully and does not get interrupted.
The support of the Delta I/O connector could be improved because the current support is very basic and the performance isn't great.
For how long have I used the solution?
I have used this solution for three years in a non-production environment and for nine months in a production environment.
What do I think about the stability of the solution?
This is a stable solution.
How are customer service and support?
I have used the community forum to get most of my answers for Dremio. This is best suited for simple queries but does not work as well for advanced questions that need input from a developer.
How was the initial setup?
The complexity of the setup depends on the use case. If you are working with a cluster that you plan to shut down after a week, the setup is simple. For more complicated scenarios where you need to connect parquet table, it is more complicated and the documentation to support this setup is not comprehensive.
Experienced DevOps teams need to spend at least three to five days to understand the level of ability between multiple masters and how the scheduling works.
I tried to integrate my Dremio metrics and logs to DataDog and it wasn't simple.
What's my experience with pricing, setup cost, and licensing?
Right now the cluster costs approximately $200,000 per month and is based on the volume of data we have.
Which other solutions did I evaluate?
We considered Apache Drill, Apache Presto, Flink and Spark. The main reason we chose Dremio was cost and multitenancy.
What other advice do I have?
I would advise others not try to use Dremio as an ETL framework from day one. I also recommend do not use reflections, or any other tool which make dremio cluster statefull.
The installation for a production environment is complex and I would not recommend this solution for small businesses.
I would rate this solution a seven out of ten.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Buyer's Guide
Dremio
December 2025
Learn what your peers think about Dremio. Get advice and tips from experienced pros sharing their opinions. Updated: December 2025.
879,422 professionals have used our research since 2012.
Database Engineer at a tech services company with 201-500 employees
Beneficial memory competition, good support, and price well
Pros and Cons
- "The most valuable feature of Dremio is it can sit on top of any other data storage, such as Amazon S3, Azure Data Factory, SGFS, or Hive. The memory competition is good. If you are running any kind of materialized view, you'd be running in memory."
- "Dremio takes a long time to execute large queries or the executing of correlated queries or nested queries. Additionally, the solution could improve if we could read data from the streaming pipelines or if it allowed us to create the ETL pipeline directly on top of it, similar to Snowflake."
What is our primary use case?
We were using Dremio as a data lake query engine tool. We were creating our PDSs and VDSs on top of our S3 buckets, and our data lake and the data-scientist teams were using the data for further processing. We didn't use it for any ETL jobs. We were using it as a data-lake tool.
What is most valuable?
The most valuable feature of Dremio is it can sit on top of any other data storage, such as Amazon S3, Azure Data Factory, SGFS, or Hive. The memory competition is good. If you are running any kind of materialized view, you'd be running in memory.
What needs improvement?
Dremio takes a long time to execute large queries or the executing of correlated queries or nested queries. Additionally, the solution could improve if we could read data from the streaming pipelines or if it allowed us to create the ETL pipeline directly on top of it, similar to Snowflake.
For how long have I used the solution?
I have been using Dremio for approximately two years.
What do I think about the stability of the solution?
Dremio is highly stable.
What do I think about the scalability of the solution?
Dremio is scalable which is a benefit of the solution. You can scale up to the number of instances you want in case you are feeling the load, and in case you feel your query is running low or you are receiving extra traffic.
You can set the configuration while installing and while setting it up in the Dremio. In the configuration file, you can set up a lot of settings, such as what time.
We have approximately 18 people using the solution in my organization.
How are customer service and support?
We have been in touch with the support from Dremio when we had some internal issues. This happened approximately two times. The support is good.
Which solution did I use previously and why did I switch?
This is the first tool in this category that I have used.
How was the initial setup?
Dremio's initial setup took one or two days, one day is sufficient and typical.
What about the implementation team?
There was one DevOps person used for the deployment and maintenance of the solution.
What's my experience with pricing, setup cost, and licensing?
Dremio is less costly competitively to Snowflake or any other tool.
What other advice do I have?
My advice to others is if they are creating a data lake for a customer, Dremio would be useful for a data engineering team. If they're willing to create a data lake and they wanted to use it with the cloud-agnostic tool, then it is a good choice. If this solution meets their requirement they should try it out.
I rate Dremio an eight out of ten.
Which deployment model are you using for this solution?
Public Cloud
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Amazon Web Services (AWS)
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
Buyer's Guide
Download our free Dremio Report and get advice and tips from experienced pros
sharing their opinions.
Updated: December 2025
Popular Comparisons
Databricks
Teradata
KNIME Business Hub
Snowflake
Amazon SageMaker
Dataiku
Alteryx
OpenText Analytics Database (Vertica)
Microsoft Azure Synapse Analytics
IBM Watson Studio
BigQuery
Amazon Redshift
Oracle Autonomous Data Warehouse
Amazon EMR
Starburst Enterprise
Buyer's Guide
Download our free Dremio Report and get advice and tips from experienced pros
sharing their opinions.
Quick Links
Learn More: Questions:
- Which are the best end-to-end data science platforms?
- What enterprise data analytics platform has the most powerful data visualization capabilities?
- What Data Science Platform is best suited to a large-scale enterprise?
- When evaluating Data Science Platforms, what aspect do you think is the most important to look for?
- How can ML platforms be used to improve business processes?
- Why is Data Science Platforms important for companies?















