Venkatesh Kollana - PeerSpot reviewer
Associate Software Engineer at Systech Solutions
Real User
Top 10
Can store different types of data (structured, unstructured, and semi-structured) in one lake
Pros and Cons
  • "The tool's best feature is that it can store different types of data (structured, unstructured, and semi-structured) in one lake. We can use the required data for analytics and dashboard design."
  • "I suggest enhancing the connectors for improvement. Some software, like HubSpot and Xero, has connectors, but they're limited to a few fields. That's why we use REST API calls instead. It would be better if the connectors could retrieve all data."

What is our primary use case?

We use Azure Data Lake Storage for sources like HubSpot (CRM software) and Xero (invoice software). We call their APIs, get the data, and store it in the product. From there, we use it to get the responses and load them into Azure SQL DB.
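The flow described above (call an API, land the raw response in the lake, then read it back and load it into SQL) can be sketched roughly as follows. This is a minimal illustration, not the reviewer's actual pipeline: a plain dict stands in for the lake, and the path layout, the `results` key, and every function name are assumptions made for the example. A real pipeline would swap the dict for a storage client and pass the resulting rows to a database driver.

```python
import json
from datetime import date


def raw_path(source: str, entity: str, day: date) -> str:
    """Build a date-partitioned path for one raw API response (hypothetical layout)."""
    return f"raw/{source}/{entity}/{day:%Y/%m/%d}/response.json"


def land_response(lake: dict, source: str, entity: str, day: date, payload: dict) -> str:
    """Store the untouched API response as JSON text and return the path it was written to."""
    path = raw_path(source, entity, day)
    lake[path] = json.dumps(payload)
    return path


def rows_for_sql(lake: dict, path: str, columns: list) -> list:
    """Read a landed response back and flatten its records into row tuples.

    Assumes the records sit under a top-level "results" key; adjust per API.
    """
    records = json.loads(lake[path]).get("results", [])
    return [tuple(record.get(column) for column in columns) for record in records]


# A plain dict stands in for the lake so the sketch runs without any cloud account.
lake = {}
payload = {
    "results": [
        {"id": 1, "email": "a@example.com"},
        {"id": 2, "email": "b@example.com"},
    ]
}
path = land_response(lake, "hubspot", "contacts", date(2024, 5, 1), payload)
rows = rows_for_sql(lake, path, ["id", "email"])
```

Keeping the raw response untouched in the lake means the SQL load can be re-run later with a different column list without calling the API again.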

What is most valuable?

The tool's best feature is that it can store different types of data (structured, unstructured, and semi-structured) in one lake. We can use the required data for analytics and dashboard design.

What needs improvement?

I suggest enhancing the connectors for improvement. Some software, like HubSpot and Xero, has connectors, but they're limited to a few fields. That's why we use REST API calls instead. It would be better if the connectors could retrieve all data.

For how long have I used the solution?

I have been using the product for a year. 

Buyer's Guide
Azure Data Lake Storage
July 2025
Learn what your peers think about Azure Data Lake Storage. Get advice and tips from experienced pros sharing their opinions. Updated: July 2025.
864,053 professionals have used our research since 2012.

What do I think about the stability of the solution?

The tool is a stable product; they've been improving it recently.

What do I think about the scalability of the solution?

About 400 to 500 people (40 to 50 percent of employees) use Azure Data Lake Storage for ETL or development. It's scalable - we can add or remove users and change permissions within minutes.

How are customer service and support?

I haven't talked directly to Microsoft support, but I use blogs and forums to find answers. 

How was the initial setup?

Setting up and deploying Azure Data Lake Storage is easy. We use Azure DevOps to connect and deploy all our services.

What's my experience with pricing, setup cost, and licensing?

The tool is cheap, depending on the services and requirements you need.

What other advice do I have?

I use Azure Data Lake Storage in a cloud-only setup, not on-premises. We receive API calls and store the responses in the product. Then, we process these files using the tool.

For first-time users, I recommend learning from Microsoft materials or YouTube videos before using the tool. It is better to gain some knowledge before using it. It's easy for beginners to learn and use, especially compared to AWS and other services.

I'd rate Azure Data Lake Storage eight out of ten. I find it user-friendly as a fresher with about 2.8 years in my tech career. I started my career with this tool, gained much knowledge, and now I can lead a team.

Disclosure: My company has a business relationship with this vendor other than being a customer. customer/partner
PeerSpot user
MatsHagberg Olsson2 - PeerSpot reviewer
Senior Solutions Architect at EQ2 Technology
Real User
Top 5
Offers good safety to users and is impossible to hack into
Pros and Cons
  • "The tool is very safe to use. It is impossible to hack the product."
  • "If tools like Azure Data Lake Storage are enabled within the tool named Azure Storage Explorer, then it would be of tremendous help, but it can be really tricky."

What is our primary use case?

From my perspective, it is secure transfer storage out there.

In terms of the use of Azure Data Lake Storage by customers for data analytics and processing workflows, I would say that my role is to convince customers that it is the safest tool for the storage of data. You can securely connect to remote regions with the tool Azure Storage Explorer, which gives the options and possibilities to safely transfer your data from your existing storage premises and send it to Azure Cloud.

What is most valuable?

The solution's most valuable features are the tools and functions, which are primarily hosted in Azure Storage Explorer. However, you can also facilitate them from within the backup. The tool is very safe to use. It is impossible to hack the product.

What needs improvement?

Some customers residing in former Eastern European countries operate in an independent and very weak IT environment. If tools like Azure Data Lake Storage are enabled within the tool named Azure Storage Explorer, then it would be of tremendous help, but it can be really tricky. It would be great if some of the aforementioned features could be enabled, but I fully understand the complexities involved.

I liked Azure Data Lake Storage before, and I like it even more now that it is improving rapidly. Microsoft is investing heavily in assets, resources, top personnel, and a lot of money to make Azure's storage offering a bigger concept. Azure Data Lake Storage can be a threat to the large storage products.

For how long have I used the solution?

I have been using Azure Data Lake Storage for a couple of years. I am an Azure solution architect. I work with Azure Data Lake Storage Gen2.

What do I think about the stability of the solution?

I am very confident that it is a stable solution. Stability-wise, I rate the solution a ten out of ten.

There was a major issue during mid-October, which affected many global businesses just for a few hours. It was the biggest issue with the tool I had been involved in for many years.

What do I think about the scalability of the solution?

Scalability-wise, I rate the solution a ten out of ten.

Azure Data Lake Storage's scalability has had a major impact on our customers' data storage strategy, since it offers options to choose the disk, scale-out, and DRC options, making everything fantastic.

All customers I have worked with over the last year are using the tool and assigning me to look after it, so it could be seven or eight businesses over the last three years, some of which are global leaders in the market, having over 12,000 employees globally.

How was the initial setup?

On a scale of one to ten, where one is difficult and ten is easy, I rate the product's initial setup phase as ten. You have to understand what to do since you can be lucky and just go and click a few buttons to do the setup process. Knowing the tool's setup phase can make the product cost-effective, but if you don't know about it, then it can be costly. Combining the tool with the features of Azure Cost Management can make things much easier for you. The upcoming edition of Microsoft Copilot should make everything in the tool way easier and also other things not so expensive.

I was not directly involved in the product's deployment process, but I am subscribed to all channels associated with the deployment part, and I have many friends in Microsoft in Northern Europe, and in Sweden, where I live. Storage is one of my top skills, and my friends want to help me become a champion.

The solution is deployed on the cloud and in the on-premises version.

What's my experience with pricing, setup cost, and licensing?

From one to ten, where one is cheap and ten is expensive, I rate the product price as five. It costs money, but it is cheaper by at least thirty percent if you consider the other equivalent solutions from AWS. Considering the aforementioned perspective, the tool is cheap, but you have to pay a certain amount.

There are options to choose from depending on the subscription you have and the amount of features you consume from Microsoft, so it can vary quite a bit.

What other advice do I have?

For my organization, the most valuable part of the tool stems from a variety of features within SQL tables and also data, which is a combination of Azure Data Lake Storage and Azure Blob Storage. Azure Data Lake Storage Gen2 is the best, and I think it is fantastic. From my point of view, the tool is considered to be very competitive here.

I have been working with storage tools for over twenty years, so I am developing my skill sets related to cloud solution providers, mainly on Azure since I began with that in 2008.

Take a deep dive into all the possibilities and options you get because you won't be disappointed. You need to do comparisons with other CSPs and other storage vendors, like NetApp and Dell EMC.

I rate the tool a ten out of ten.

Which deployment model are you using for this solution?

Hybrid Cloud
Disclosure: My company has a business relationship with this vendor other than being a customer. consultant
PeerSpot user
MACIEJPOLAKOWSKI - PeerSpot reviewer
Senior Manager at IT Squad
Real User
Top 20
A cost-effective solution to store data and allows flexible capacity management
Pros and Cons
    • "The version was a bit outdated compared to the newer Microsoft Data Fabric offerings."

    What is our primary use case?

    We use the solution for storing data but don’t use Synapse to store data directly in it. Instead, Azure Synapse Analytics is utilized to analyze and process data in Data Lake Storage. Data Lake Storage is a large, scalable solution that handles extensive volumes of structured and unstructured data rather than a direct disk storage system.

    What needs improvement?

    In Azure Data Lake Storage, the tool we're using, Spark, handles the management, storage, retrieval, and organization of data. Spark employs its algorithms to abstract the underlying complexities. We don’t work with a large amount of data. If we were to handle larger datasets, we would need to focus more on optimizing storage and retrieval processes, as the efficiency of these operations would become more critical.

    The version was a bit outdated compared to the newer Microsoft Data Fabric offerings. For instance, the directory services are already available in Data Fabric, so I don't think adding them to Azure Data Lake Storage would be necessary. For example, Snowflake, a cloud data analytics platform, adds its capabilities and optimizations to Azure Data Lake Storage, such as improved performance or easier integration with SQL. Compared to other similar services, Azure Data Lake Storage remains very competitive.

    For how long have I used the solution?

    I have been using Azure Data Lake Storage for over a year.

    What do I think about the stability of the solution?

    Azure is a stable platform. Interruptions are relatively rare and usually last only a few minutes. It is good for data-oriented applications that don’t require continuous online processing.

    These brief outages do not significantly impact the quality of service. We haven’t experienced major stability issues with Azure Storage. 

    What do I think about the scalability of the solution?

    It is scalable.

    How are customer service and support?

    Any issues are handled by the team responsible for managing the platform.

    Which solution did I use previously and why did I switch?

    We primarily use Azure Synapse, which integrates with Azure Data Lake Storage. Synapse leverages the storage provided by Data Lake Storage, so both are part of the Azure ecosystem but remain distinct services.

    Another integration involves SQL Server, which serves data to various consumers as an SQL database. The main consumer is Power BI, which provides extensive reporting capabilities. Additionally, Azure Functions integrates with internal systems at the client’s end.

    What's my experience with pricing, setup cost, and licensing?

    It is a cost-effective solution.

    What other advice do I have?

    Using a cloud platform generally allows for flexible capacity management, meaning you can use and pay for resources only when needed. This is particularly useful for our customers, who can run Spark clusters in serverless mode. They only pay for the time they use the service, which is cost-effective since they don’t need constant access to high power and typically run jobs for shorter periods, like half an hour.

    It is available continuously and supports data archiving. However, since the current volume of data is not large, the client doesn’t need to focus on archiving or optimization. As their data grows and becomes more historical, they may need to optimize storage and archiving practices.

    The other team manages the integration tasks. The process is straightforward as long as the systems, functions, or other components interact with external systems. The ease of integration can depend on the intensity of the integration requirements.

    Overall, I rate the solution an eight out of ten.

    Disclosure: My company does not have a business relationship with this vendor other than being a customer.
    PeerSpot user
    Richard Mottershead - PeerSpot reviewer
    Enterprise Architect at a non-profit with 501-1,000 employees
    Real User
    Top 5 Leaderboard
    Able to partition data into various datasets using a directory hierarchy
    Pros and Cons
    • "The most valuable feature of Azure Data Lake Storage is the ability to partition data into various datasets using a directory hierarchy. This folder structure is key for any delivery. Currently, we're not doing much with the data in the tool, but when Databricks comes along, we'll convert it to Parquet format. It's a two-step process: raw data is moved to Parquet, which Databricks can manipulate easily."
    • "One improvement I'd suggest is the out-of-the-box conversion of input data, like spreadsheet or table data, to various formats. We'll be using Parquet, which enables transactional integrity."

    What is most valuable?

    The most valuable feature of Azure Data Lake Storage is the ability to partition data into various datasets using a directory hierarchy. This folder structure is key for any delivery. Currently, we're not doing much with the data in the tool, but when Databricks comes along, we'll convert it to Parquet format. It's a two-step process: raw data is moved to Parquet, which Databricks can manipulate easily.
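The directory-hierarchy idea above can be illustrated with a small path-building sketch. It only shows how a folder layout might separate datasets and map a raw file to its future Parquet location; it does not perform any conversion. The zone names (`raw`, `parquet`), the `year=`/`month=` folder convention, and the function names are assumptions chosen for the example, not something the reviewer specified.

```python
from collections import defaultdict
from pathlib import PurePosixPath


def partition_path(zone: str, dataset: str, year: int, month: int, filename: str) -> str:
    """Build a zone/dataset/year=YYYY/month=MM/filename path (hypothetical convention)."""
    return str(
        PurePosixPath(zone, dataset, f"year={year:04d}", f"month={month:02d}", filename)
    )


def to_parquet_path(raw_file: str) -> str:
    """Map a file in the raw zone to the matching location in the parquet zone."""
    parts = PurePosixPath(raw_file).parts
    # Keep the dataset and partition folders, swap the zone and the file extension.
    return str(PurePosixPath("parquet", *parts[1:]).with_suffix(".parquet"))


def group_by_dataset(paths):
    """Group paths by their dataset folder, which is the second path segment."""
    groups = defaultdict(list)
    for path in paths:
        groups[PurePosixPath(path).parts[1]].append(path)
    return dict(groups)


raw_file = partition_path("raw", "donations", 2024, 3, "export.csv")
parquet_file = to_parquet_path(raw_file)
groups = group_by_dataset([raw_file, partition_path("raw", "members", 2024, 3, "list.csv")])
```

Because the dataset and partition folders are identical in both zones, a later job can locate the Parquet copy of any raw file from its path alone.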

    What needs improvement?

    One improvement I'd suggest is the out-of-the-box conversion of input data, like spreadsheet or table data, to various formats. We'll be using Parquet, which enables transactional integrity.

    For how long have I used the solution?

    I have been using the product for a year. 

    What do I think about the stability of the solution?

    Stability is good if you build your Azure Data Lake Storage well in the first place.

    What do I think about the scalability of the solution?

    Scalability depends on process complexity: it is high for simple processes and low for complex ones. This is due to the architecture of a data lake, but once converted to a data lakehouse, scalability is high across the board. I think Azure Data Lake Storage would suit medium to large enterprises.

    How are customer service and support?

    Microsoft's documentation is superb, and support is good, especially if you have a relevant intermediate supplier.

    Which solution did I use previously and why did I switch?

    We haven't compared Azure Data Lake Storage with products from other vendors because we're an Azure shop. We did check that the Azure product was good enough for our needs, and it was, so we didn't explore alternatives like AWS, Google, or Snowflake.

    How was the initial setup?

    The initial setup is fairly complex, but if you get your data architecture right from the start, it's not a problem. We're using a totally cloud-based deployment with Azure.

    What other advice do I have?

    Integration capabilities are fairly smooth and comparable to AWS in terms of cloud integration. Some might say it's slightly better, others slightly worse, but I think it's good. I'd rate Azure Data Lake Storage an eight out of ten. However, it's important to note that it's only eventually consistent, so don't expect immediate consistency when changes are made. It works well as a data storage bucket for future use, but it's unsuitable for transactional work. You need to use a data lakehouse like Databricks for transactional processes, which can handle transactional work once the data is in the correct format (like Parquet). The tool is great for storing data you want to put into a data lakehouse, but not for frequent transactions. It's suitable for daily archiving, but anything more frequent than that might cause issues.

    If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

    Microsoft Azure
    Disclosure: My company does not have a business relationship with this vendor other than being a customer.
    PeerSpot user
    Data Architecture and Engineering Specialist at coprocenva
    User
    Top 5 Leaderboard
    Manages large data volumes and has user-friendly automation
    Pros and Cons
    • "Azure Data Lake Storage is user-friendly and easy to use."
    • "Maybe the solution could be a bit more user-friendly."
    • "The scalability is limited. However, it's easy to set up."

    What is our primary use case?

    We use Azure Data Lake Storage for managing large data volumes in our big data projects.

    How has it helped my organization?

    I have configured the tool to automate the deletion of data and transfer data from one repository to another automatically.

    What is most valuable?

    Azure Data Lake Storage is user-friendly and easy to use. It effectively manages large data volumes and allows for automated configuration of data operations such as deletion and transfer between repositories.
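One way to picture the automated deletion and transfer mentioned above is as an age-based rule applied to each file. The sketch below is an assumption about how such a rule could look, not the reviewer's configuration: the 30-day and 365-day thresholds, the `LakeFile` type, and `plan_actions` are all hypothetical, and the code only produces a plan rather than touching any storage.

```python
from datetime import datetime, timedelta
from typing import NamedTuple


class LakeFile(NamedTuple):
    """Minimal file metadata needed for an age-based rule (hypothetical type)."""
    path: str
    last_modified: datetime


def plan_actions(files, now, transfer_after_days=30, delete_after_days=365):
    """Return {path: action}, where action is "keep", "transfer", or "delete".

    The thresholds are illustrative defaults, not recommended values.
    """
    plan = {}
    for item in files:
        age = now - item.last_modified
        if age > timedelta(days=delete_after_days):
            plan[item.path] = "delete"
        elif age > timedelta(days=transfer_after_days):
            plan[item.path] = "transfer"
        else:
            plan[item.path] = "keep"
    return plan


now = datetime(2025, 1, 1)
files = [
    LakeFile("landing/a.csv", datetime(2024, 12, 20)),  # 12 days old
    LakeFile("landing/b.csv", datetime(2024, 10, 1)),   # 92 days old
    LakeFile("landing/c.csv", datetime(2023, 6, 1)),    # more than a year old
]
plan = plan_actions(files, now)
```

Separating the plan from its execution makes the rule easy to review before anything is moved or deleted.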

    What needs improvement?

    Maybe the solution could be a bit more user-friendly.

    What do I think about the stability of the solution?

    It is very stable and reliable. It is a good solution that doesn't crash.

    What do I think about the scalability of the solution?

    The scalability is limited. However, it's easy to set up.

    How are customer service and support?

    The support from Microsoft for Azure products is good. It's timely.

    How would you rate customer service and support?

    Positive

    How was the initial setup?

    The initial setup is very easy with this tool.

    What's my experience with pricing, setup cost, and licensing?

    I am not familiar with the pricing.

    What other advice do I have?

    Overall, I would rate the Azure Data Lake Storage as nine out of ten.

    Disclosure: My company does not have a business relationship with this vendor other than being a customer.
    PeerSpot user
    Kerols Alfons - PeerSpot reviewer
    BI & Data Engineering Manager at a sports company with 10,001+ employees
    Real User
    Top 20
    Allows for seamless integration and secure storage with decent pricing

    What is our primary use case?

    When dealing with data from various sources or servers, Azure Data Lake Storage allows for seamless integration and secure storage through features like Azure BitLocker encryption. This ensures data integrity and protection, regardless of its origin or destination.

    How has it helped my organization?

    We focus on regional messaging for our data. Housing the data within it ensures better data management and quality control.

    What is most valuable?

    The price for Azure is decent. However, the reliability of the provider and their efficient collaboration are positive aspects. Using Azure Data services, we've successfully deployed support systems that facilitate seamless data confirmation across various resources within our company.

    Azure offers a comprehensive suite of tools beneficial for both development and delivery. Performance, usability, and scalability are strong points. Recently, I acquired a new tool called AWS Storage Gateway for seamless data transfer between AWS storage and Azure Data Lake, enhancing database operations and developer workflows.

    What needs improvement?

    One feature that could be added is the ability to create and manage files within the same storage using serverless query integration with Data Lake Analytics.

    When comparing your account with AWS, the financial aspect will be the deciding factor for the customer. If it's a favorable response, we will continue working with Azure. We were awaiting the HTTPS integration.

    There's a strong community for learning how to use Azure Data Lake. We've encountered an issue while testing AzureProtect.

    For how long have I used the solution?

    I have been using Azure Data Lake Storage for a year.

    What do I think about the stability of the solution?

    The solution is stable.

    What do I think about the scalability of the solution?

    The product’s scalability is good. 45 users are using this solution.

    How was the initial setup?

    The initial setup is easy. Our team initially consisted of two individuals, with the option for a third support member, which made it easier to utilize Infiniti. It takes four months to deploy.

    We have ongoing plans for building more deposits. Additionally, they can send project details via calls, as all projects are managed within the same system.

    Four members were involved during deployment.

    What's my experience with pricing, setup cost, and licensing?

    The pricing is reasonable.

    What other advice do I have?

    We're planning to utilize machine learning techniques in Azure Data Lake.

    Maintenance is easy.

    Overall, I rate the solution a nine out of ten.

    Disclosure: My company does not have a business relationship with this vendor other than being a customer.
    PeerSpot user
    Mukesh  Kumar - PeerSpot reviewer
    Senior Software Engineer at a tech consulting company with 10,001+ employees
    Real User
    Top 5
    Easily integrates with a company's current workflow
    Pros and Cons
    • "The response time and quality offered by the support team are good."
    • "The high price of the product is an area of concern where improvements are required."

    What is our primary use case?

    I use the solution in my company as per our project requirements. In my company, we are only putting data from on-premises RDBMS into Azure Data Lake Storage Gen2, and then the file is stored in parquet format. After the aforementioned process is followed, my company has another data engineering team, which reads those data further.

    What is most valuable?

    For writing data to data lake, my company uses Oracle GoldenGate for Big Data. With Oracle GoldenGate for Big Data, my company had to use Handler (Java Platform SE 8), but now we use HDFS Handler, and then using it, we have to configure some files and open some ports between a bank's private network to Azure Data Lake Storage Gen2. After opening all the aforementioned areas, my company is able to push the data to Azure Data Lake Storage Gen2.

    What needs improvement?

    In my company, we are not facing any slowness or other kinds of issues with the product. Each day in my company, we create new directories and put the current files into them, so there is the segregation part that is taken care of, and because of this, there are no issues with the tool.

    In our company, one of the teams uses Azure Databricks to read data from the Azure Data Lake Storage account and, as per the business use case, they move the data or take it further. The project I am currently doing has only limited work. I haven't explored all the points associated with the tool.

    The high price of the product is an area of concern where improvements are required.

    For how long have I used the solution?

    I have been using Azure Data Lake Storage for a year.

    What do I think about the stability of the solution?

    The product's stability is good.

    What do I think about the scalability of the solution?

    The scalability part of the product is very good, and my company has not faced any issues with it.

    At present, 15 to 16 percent of the company uses the tool, but it will increase by a percent in the future.

    How are customer service and support?

    The response time and quality offered by the support team are good. I rate the technical support as nine out of ten.

    How would you rate customer service and support?

    Positive

    How was the initial setup?

    The product's initial setup phase is not too simple, and it can be described as a moderate process.

    The solution is deployed on the cloud.

    What's my experience with pricing, setup cost, and licensing?

    I rate the product price as two or three, where one is high, and ten is low. The product's price is really high.

    What other advice do I have?

    Integrating Azure Data Lake Storage into my company's current workflow was easy.

    I recommend the product to those who plan to use it.

    I rate the tool an eight to nine out of ten.

    Disclosure: My company does not have a business relationship with this vendor other than being a customer.
    PeerSpot user
    CurtisFisher - PeerSpot reviewer
    Data Manager at National Committee for Quality Assurance
    Real User
    Top 20
    User-friendly and helps aggregate all information into one particular environment
    Pros and Cons
    • "Azure Data Lake Storage is a user-friendly and easy-to-learn solution."
    • "Some terminology is not easy to understand when using some interfaces."

    What is our primary use case?

    We use the tool to aggregate all our information into one particular environment. We have separate environments, but we aggregate the important pieces into one environment so they can be used for reporting and analytics.

    What is most valuable?

    Azure Data Lake Storage is a user-friendly and easy-to-learn solution.

    What needs improvement?

    The solution's instructions could be improved. Some terminology is not easy to understand when using some interfaces. Microsoft must provide more examples because everybody uses the tool differently.

    For how long have I used the solution?

    I have been using Azure Data Lake Storage for 5 years.

    What do I think about the stability of the solution?

    I rate the solution’s stability ten out of ten.

    What do I think about the scalability of the solution?

    Around 300 users use the solution in our organization, including developers and people who use the data through report interfaces.

    I rate the solution ten out of ten for scalability.

    How was the initial setup?

    The solution’s initial setup was straightforward.

    What about the implementation team?

    We implemented the solution through an in-house team. Certain features are turned on to deploy the tool. You go in there and set up what you need, like the database or data factory. Once they turn it on and give you permission, you go in there and set it up yourself.

    What was our ROI?

    The tool is worth the money because I've worked in other environments, and this one seems to be the most flexible and easy to use.

    What other advice do I have?

    I use Azure Data Lake Storage version 2. We integrate the solution with other internal and external sources through REST and SQL. We have external sites that we integrate or pull the information into. We can see all the data in one place and report off of it.

    The tool can be expanded at a moment's notice. Once I'm done, I can lower the threshold back down. The solution's flexibility allows me to increase my CPU or database size at a moment's notice.

    Overall, I rate the solution 10 out of 10.

    Disclosure: My company does not have a business relationship with this vendor other than being a customer.
    PeerSpot user