PeerSpot user
Enterprise Architect at TechnipEnergies
Real User
Feature-rich, scales well, and it provides good extract, transform, and load functionality
Pros and Cons
  • "The best part of this product is the extraction, transformation, and load."
  • "The need to work more on developing out-of-the-box connectors for other products like Oracle, AWS, and others."

What is our primary use case?

We are not using this product specifically as a data factory. We have taken Synapse Analytics as the entire component for the data warehousing solution. Azure Data Factory is one of the components of that, and we are using it for ETL. 

How has it helped my organization?

Prior to this, we did not have a proper data warehousing solution. Instead, we had segregation between different tools like Oracle Data Warehouse, Exadata, and other products. Now, most of the tools that we have are from Microsoft, including Power BI, which has been rolled out throughout the organization. Synapse was the better choice for us to implement, as it has a lot of out-of-the-box connectors that we can utilize for data transformation and organization. 

What is most valuable?

The best part of this product is the extraction, transformation, and load. In fact, we have found that the three of them work quite well. We are implementing the cloud-based system right now.

We see a lot of improvement with the most recent version of this solution. Some of the new features are very important to us.

What needs improvement?

The need to work more on developing out-of-the-box connectors for other products like Oracle, AWS, and others.

Buyer's Guide
Azure Data Factory
April 2024
Learn what your peers think about Azure Data Factory. Get advice and tips from experienced pros sharing their opinions. Updated: April 2024.
768,857 professionals have used our research since 2012.

For how long have I used the solution?

We have been working with Azure Data Factory for approximately six months. We are still implementing and it is not live, yet, but we expect it to be in 2021.

What do I think about the stability of the solution?

I have found it to be quite stable. Here and there, there could be some issues and problems but overall, I'm okay with the product.

What do I think about the scalability of the solution?

Scalability is one of the points that we were looking for because we are hosting approximately two terabytes of data and we expect that it will grow at least five times over the next two years. This is one of the reasons that we adopted this solution.

In perhaps a year, we will increase our usage.

How are customer service and support?

The technical support from Microsoft is quite good. if you get good resources and they can provide you with free consulting, then it is quite good. However, when you purchase paid consulting and dedicated support, it is quite costly compared to the market.

How was the initial setup?

I don't think that the initial setup was very complex. We have quite an advanced IT infrastructure team and the Microsoft FastTrack team also helped us a lot during the programming of the development and setup.

What's my experience with pricing, setup cost, and licensing?

I would not say that this product is overly expensive. It is competing with the other providers, and they have almost the same pricing model.

What other advice do I have?

In general, I would recommend this product. However, it depends on the target IT ecosystem. If they are utilizing a lot of Microsoft products like Power BI, Office, Project, SharePoint, and so forth, it's better to implement Data Factory because it will reduce a lot of effort spent to consume data from other sources.

At this point, I can only rate based on my pre-implementation experience, so I would rate this solution an eight out of ten.

Which deployment model are you using for this solution?

Private Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Microsoft Azure
Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Richard Griffin - PeerSpot reviewer
Manager Data & Analytics at Fletcher Building
Real User
Top 10
Simple to use, good performance, and competitive pricing
Pros and Cons
  • "The most valuable feature of this solution would be ease of use."
  • "It does not appear to be as rich as other ETL tools. It has very limited capabilities."

What is our primary use case?

I am a manager of a team that uses this solution.

Azure Data Factory is primarily used for data integration, which involves moving data from sources into a data lake house called Delta Lake.

What is most valuable?

It's fairly simple to use. The most valuable feature of this solution would be ease of use.

What needs improvement?

It does not appear to be as rich as other ETL tools. It has very limited capabilities. It simply moves data around. It's not very good after that because it's taking the data to the next level and modeling it.

For how long have I used the solution?

I have been working with Azure Data Factory for less than a year.

I would say that we are working with the latest version.

What do I think about the stability of the solution?

The stability of Azure Data Factory is good. The performance is good.

What do I think about the scalability of the solution?

I haven't had to scale this solution as of yet.

We have six people in our company who use this solution.

Increasing the usage is not on our strategy pathway.

How are customer service and support?

I have not contacted technical support. I have not required any yet.

I have had very little contact with Microsoft support, but it's been good.

Which solution did I use previously and why did I switch?

I have also worked with Talend. I didn't switch products, but rather companies.

Talend is a more robust enterprise-wide solution that can handle everything from start to finish, whereas Azure Data Factory is more of an ingestion tool.

How was the initial setup?

I was not involved with the initial setup.

What about the implementation team?

We are an enterprise that uses an integrator.

It does not require any maintenance, it's simple.

What's my experience with pricing, setup cost, and licensing?

I don't see a cost; it appears to be included in general support. I have been told that you have to be very careful because it can blow out. I have not experienced it yet, but I've been warned that as Azure ingestion increases, the costs can rise.

In my opinion, the price is competitive.

What other advice do I have?

It's a good tool, a good product that does what it's supposed to do well, which is ingesting data from a source to your target, to another cloud, to another source. Just be conscious to monitor your costs.

I would rate Azure Data Factory an eight out of ten.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Microsoft Azure
Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Buyer's Guide
Azure Data Factory
April 2024
Learn what your peers think about Azure Data Factory. Get advice and tips from experienced pros sharing their opinions. Updated: April 2024.
768,857 professionals have used our research since 2012.
Chief Technology Officer at cornerstone defense
Real User
Easy to bring in outside capabilities, flexible, and works well
Pros and Cons
  • "It is very modular. It works well. We've used Data Factory and then made calls to libraries outside of Data Factory to do things that it wasn't optimized to do, and it worked really well. It is obviously proprietary in regards to Microsoft created it, but it is pretty easy and direct to bring in outside capabilities into Data Factory."
  • "There is always room to improve. There should be good examples of use that, of course, customers aren't always willing to share. It is Catch-22. It would help the user base if everybody had really good examples of deployments that worked, but when you ask people to put out their good deployments, which also includes me, you usually got, "No, I'm not going to do that." They don't have enough good examples. Microsoft probably just needs to pay one of their partners to build 20 or 30 examples of functional Data Factories and then share them as a user base."

What is our primary use case?

Our customers use it for data analytics on a large volume of data. So, they're basically bringing data in from multiple sources, and they are doing ETL extraction, transformation, and loading. Then they do initial analytics, populate a data lake, and after that, they take the data from the data lake into more on-premise complex analytics.

Its version depends on a customer's environment. Sometimes, we use the latest version, and sometimes, we use the previous versions.

What is most valuable?

It is very modular. It works well. We've used Data Factory and then made calls to libraries outside of Data Factory to do things that it wasn't optimized to do, and it worked really well. It is obviously proprietary in regards to Microsoft created it, but it is pretty easy and direct to bring in outside capabilities into Data Factory.

It is very flexible. You can build any features you want.

What needs improvement?

There is always room to improve. There should be good examples of use that, of course, customers aren't always willing to share. It is Catch-22. It would help the user base if everybody had really good examples of deployments that worked, but when you ask people to put out their good deployments, which also includes me, you usually got, "No, I'm not going to do that." They don't have enough good examples. Microsoft probably just needs to pay one of their partners to build 20 or 30 examples of functional Data Factories and then share them as a user base.

For how long have I used the solution?

I have been using this solution for the last five years, but probably, the last three years have been significant.

What do I think about the stability of the solution?

It has been stable. I have not experienced any issues.

What do I think about the scalability of the solution?

It is decent for most things. I'm not sure if it is necessarily intended for large volume and high-speed streams of data. By large, I mean really big, but for pretty much anything that most users would want to do, including ourselves, it is fine. Our clients are large government organizations.

It scales fine within its environment. You can literally throw another Data Factory in or replicate one and do things pretty quickly. So, it is not at all hard to increase your processing footprint, but you have to pay for it. It doesn't end up being quite expensive. Although I haven't really done it, I would suspect that if I did the equivalent in AWS, Azure would be more expensive than AWS because of the way they price data.

How are customer service and technical support?

They're all right. I would rate them a seven out of 10. They do fine, but there is a lot that they don't do.

I'm not sure if even Microsoft has enough SMEs from a user point of view. They are helpful for getting it set up, making it work, and helping you figure out why it doesn't work. If you want to ask them about something that you are trying to do, they'll try to direct you to a partner, which is fine, but the partners also don't necessarily have an experience. It is Catch-22. There aren't a lot of people out there with Azure experience because Azure started to be in demand only over the last two years.

Which solution did I use previously and why did I switch?

The customer used a lot of homebrew stuff. They were doing a lot of internal stuff and some Oracle stuff. They were doing things, and they made a workaround and said, "Okay, we'll bring it into Oracle Database, and then we'll do all these things to it." We're like, "Okay, that works, but then you're taking it out of that database and putting it over into the data lake. I don't understand why are you doing that?" That's what they were doing.

How was the initial setup?

It is pretty straightforward. Devil is in the details, but you can easily get up and running in a day with Data Factory. Anybody who is comfortable in Azure can set up Data Factory, but it takes experience to know what it can and can't do or should and shouldn't do.

What other advice do I have?

It is proven, and it works. Make sure you have a well-defined use case and build a quick prototype to ensure that it, in fact, does what you need. Give yourself some benchmarks. That's exactly what we did. We defined the use case, and then we set up Data Factory. We found a couple of things that it didn't do. We figured out a way to work around those things and have it do those things. After that, we confirmed it. It is operational, and it is doing its job. It has been pretty much error-free since then.

It would become easier to use as more people become Azure-capable. If I want to find an AWS SME, I can get tons. They're expensive, but I have them. If I want to find an Azure SME, I usually have to create them. Azure was later to market than AWS. So, there are fewer people who are experts in Azure, and they are in high demand.

I would rate Azure Data Factory a nine out of 10. They just don't have enough good examples out there of things.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Microsoft Azure
Disclosure: My company has a business relationship with this vendor other than being a customer: Partner
PeerSpot user
Abdelmonem Metwally - PeerSpot reviewer
ETL/BI Senior Consultant at Qrious
Consultant
Top 20
A good data migration tool that has strong security features, but needs more development to be able to deal with complex data transformations
Pros and Cons
  • "This solution has provided us with an easier, and more efficient way to carry out data migration tasks."
  • "This solution is currently only useful for basic data movement and file extractions, which we would like to see developed to handle more complex data transformations."

What is our primary use case?

We mainly use this solution to carry out data movement and transformation.

What is most valuable?

This solution has provided us with an easier, and more efficient way to carry out data migration tasks.

What needs improvement?

This solution is currently only useful for basic data movement and file extractions, which we would like to see developed to handle more complex data transformations.

For how long have I used the solution?

We have been using this solution for a year.

What do I think about the stability of the solution?

We have found this to be a stable solution.

What do I think about the scalability of the solution?

This is an easily scalable product, due to it being cloud-based.

How are customer service and support?

The customer support for this solution is very good. 

How was the initial setup?

The initial setup of this product is straightforward, if you deploy the solution using a template; rather than implementing the solution first, and configuring the features afterwards.

What about the implementation team?

We implemented the product using both in-house staff, and members of a vendor team. The vendor team were very helpful, and gave good advice while we were deploying the solution.

What other advice do I have?

We would recommend this solution as it is very solid and has good security features.

I would rate this solution an eight out of 10.

Which deployment model are you using for this solution?

Public Cloud
Disclosure: My company has a business relationship with this vendor other than being a customer: Partner
PeerSpot user
Principal at a tech services company with 51-200 employees
Real User
Top 20
Good product integrations but transient issues sometimes cause pipeline failures
Pros and Cons
  • "It is beneficial that the solution is written with Spark as the back end."
  • "There are limitations when processing more than one GD file."

What is our primary use case?

Our company uses the solution for data ingestion. 

What is most valuable?

It is beneficial that the solution is written with Spark as the back end. 

The solution is cloud-based and integrates well with other Azure products such as Synapse Analytics.  

What needs improvement?

There are limitations when processing more than one GD file. 

Data ingestion pipelines sometimes fail because of transient issues that have to do with the cloud network. It takes more than six hours to process or ingest 300,000 records and that is a long time. 

For how long have I used the solution?

I have been using the solution for two years. 

What do I think about the stability of the solution?

The solution is new in the market and pretty stable because ADF is a little more codified than AWS. Synapse Analytics adds another tool for data. 

Stability is not quite at the level of Informatica or DataStage. 

What do I think about the scalability of the solution?

The solution is scalable. 

For multi-tenant applications connected to multiple databases, Microsoft recommends a share box and a cell post integration run time. But a run time connecting to multiple sources has limitations and requires multiple shares connecting to your data if you are ingesting it from on-premises. 

How are customer service and support?

Technical support is okay. Support is contracted or partnered with various companies but is fine as a first level. 

Most of the time, technical support has to connect with product engineers who troubleshoot issues. 

How was the initial setup?

The setup is not very complex but requires intake, setting up integration services, and connecting to databases like Oracle before you push it to service.

What about the implementation team?

We implemented the solution in-house. 

What's my experience with pricing, setup cost, and licensing?

The pricing is pay-as-you-go or reserve instance. Of the two options, reserve instance is much cheaper. 

In the cloud, everything is service based and expensive. Users should be knowledgeable enough to maximize the solution. 

For example, it makes no sense to run integration services all day if you are not ingesting data because you pay for that usage. It is important to understand how the product works to manage it accordingly and keep costs down. 

What other advice do I have?

I rate the solution a six out of ten. 

Which deployment model are you using for this solution?

Private Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Microsoft Azure
Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
IT Analyst at a tech vendor with 10,001+ employees
Real User
Improved data resilience, in the way that we move data from on-prem to the cloud and vice versa
Pros and Cons
  • "The most important feature is that it can help you do the multi-threading concepts."
  • "There should be a way that it can do switches, so if at any point in time I want to do some hybrid mode of making any data collections or ingestions, I can just click on a button."

What is our primary use case?

It's a PaaS service. It's a hybrid solution. The cloud provider is Microsoft.

We are not using Azure Data Factory as for users. Rather, we're using it as a process base. We're just using it for orchestration, not for any kind of ETL stuff.

We have plans to increase usage. It's going to take a major role in any kind of traditional data warehousing. It has big potential, especially as a PaaS offering.

How has it helped my organization?

There has been improvement in data resilience, in the way that we're moving the data from on-prem to cloud and vice versa.

What is most valuable?

The most important feature is that it can help you do the multi-threading concepts. It's in Informatica, but the resourcing is quite robust. You can scale up and scale down as per your needs.

What needs improvement?

There should be a way that it can do switches, so if at any point in time I want to do some hybrid mode of making any data collections or ingestions, I can just click on a button. I can change a switch and make sure a batch can be a streaming process.

For how long have I used the solution?

I've been using Azure Data Factory for more than two years.

What do I think about the stability of the solution?

The stability of Azure as a PaaS could be improved.

What do I think about the scalability of the solution?

It's scalable.

How are customer service and support?

I would rate their technical support 3 out of 5. It's not great, but it isn't bad.

How was the initial setup?

The setup is complex. It has nothing to do with the technology but with the design. We were wondering how to leverage the orchestration layer where we are having the Azure Data Factory and how to integrate with the Databricks. That's where we had some challenges in terms of choosing the right product.

What about the implementation team?

You can do deployment in-house. 

What other advice do I have?

I would rate this solution 8 out of 10.

For someone who is looking to use this solution, my advice is to do proper due diligence of your current application, know where your application is fitting, and look for the requirements. It all depends upon the current use case that you have currently in your system.

Which deployment model are you using for this solution?

Hybrid Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Microsoft Azure
Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Azure Technical Architect at a computer software company with 10,001+ employees
Real User
Has the ability to copy data to any environment
Pros and Cons
  • "From my experience so far, the best feature is the ability to copy data to any environment. We have 100 connects and we can connect them to the system and copy the data from its respective system to any environment. That is the best feature."
  • "The user interface could use improvement. It's not a major issue but it's something that can be improved."

What is our primary use case?

It's an integration platform, we migrate data across hybrid environments. We have data in our cloud environment or on-prem system so we use it for when we want to integrate data across different environments. It was a problem for us to get data from different hybrid environments.

What is most valuable?

From my experience so far, the best feature is the ability to copy data to any environment. We have 100 connectors and we can connect them to the system and copy the data from its respective system to any environment. That is the best feature. 

What needs improvement?

The user interface could use improvement. It's not a major issue but it's something that can be improved. 

It has the ability to create separate folders to organize objects, Data Factory objects. But any time that we created a folder we were not able to create objects. We had to drag and drop into the folder. There were no default options. It was manual work. We offered their team our feedback and they accepted my request.

For how long have I used the solution?

I have been using Azure Data Factory for around one year. 

What do I think about the stability of the solution?

Based on my experience with other products on the market, the stability is good. 

What do I think about the scalability of the solution?

I haven't had much experience with scalability. I know we do have scalability options though. It's used daily. 

There are around 1,000 plus users using this solution in my company. 

It requires two people for maintenance. The administrators are the ones who maintain it and give access to the engineers. They regulate who has privileges. 

How are customer service and technical support?

We have needed to contact their technical support. If we can't find the answers ourselves on the blogs, we contact them with our questions. We get most of the answers we need from the blogs but if not then we can directly speak to the Microsoft team from the Data Factory interface itself, it's really helpful.

Which solution did I use previously and why did I switch?

I have only used Data Factory for the cloud. For on-prem we have used SSIS.

How was the initial setup?

The initial setup was a bit complex but once you understand its setup, it's less complex. There are certain processes that need to be followed. Once you understand the process, it becomes easier to implement.

The implementation took a little less than one day. The planning for the deployment takes around one or two days. 

What about the implementation team?

We had a discussion with the Microsoft team about the data. We discussed how we were going to implement. Based on the discussion we were able to deploy. A Microsoft partner helped us with some parts. 

Which other solutions did I evaluate?

We also evaluated AWS.

What other advice do I have?

The advice that I would give to someone considering this solution is to have some background in data warehousing and ETL concepts. Have the background about data warehousing and ETL that extract, transform, and load. If you have the background you need, you will be successful. If not, then my advice would be to learn a little more about it before using Data Factory.

I would rate Data Factory as an eight out of ten. 

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Disclosure: My company has a business relationship with this vendor other than being a customer: Partner
PeerSpot user
Network Team Lead at a computer software company with 1,001-5,000 employees
Real User
A stable solution that helps to move data from on-premises to the cloud
Pros and Cons
  • "We use the solution to move data from on-premises to the cloud."
  • "The tool’s workflow is not user-friendly. It should also improve its orchestration monitoring."

What is our primary use case?

We use the solution to move data from on-premises to the cloud.

What needs improvement?

The tool’s workflow is not user-friendly. It should also improve its orchestration monitoring.

For how long have I used the solution?

I have been using the product since 2019.

What do I think about the stability of the solution?

The tool is a stable product.

What other advice do I have?

I would rate the solution a seven out of ten.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Buyer's Guide
Download our free Azure Data Factory Report and get advice and tips from experienced pros sharing their opinions.
Updated: April 2024
Buyer's Guide
Download our free Azure Data Factory Report and get advice and tips from experienced pros sharing their opinions.