What is our primary use case?
We use Zerto for disaster recovery (DR) purposes. We needed a tool to provide a quick resolution during a failure or problem and help us achieve our goals related to our service level agreement (SLA). We needed a tool that would help us in providing the availability of lost services within a specific time frame. We wanted to make sure that in case there is a problem and we have to execute the DR procedure, it is quick, easy, and safe. So, the main purpose for going for Zerto is related to meeting the required parameters for RPO and RTO.
We don't use Zerto for backup purposes. It is used only for virtual machines. We also have physical servers, but we have different tools for backup.
Currently, we are using it for on-premise data centers, but we also do proof of concept tests with public clouds or hybrid clouds.
How has it helped my organization?
It is very good in terms of end-user experience and functionality. The graphical user interface (GUI) is quite simple, and it shows many values related to the status of replication. You can see the current status of a specific application and the health status of infrastructure on the GUI. You can very quickly navigate the application and find very useful information related to the health status of your infrastructure. You can see if the infrastructure is working fine and if there are any bottlenecks or problems that need to be verified by the IT infra department.
It helps us with standardization. It allows us to have a unified DR approach where with one tool, we can meet the DR requirements of different systems with different levels of DR criticality and classification. We have customers for whom the availability of a particular system is crucial for business, and this system requires a very high quality of replication. At the same time, they also have systems that are not as critical. For a unified approach to the DR process, it is better to use the same DR tool for all applications with different levels of criticality. Instead of using different tools for critical and non-critical applications, it is better to use one single tool and have a unified process. Zerto helps us with that.
Different types of integration allow you to provide the tool not only to specialist teams, such as infrastructure teams, but also to the end-users to perform activities like failover, system recovery, and system protection. Service portal integration and automation integration provide a big value in terms of DR activities. You don't have to wait for VMware specialists to perform the DR failover task. You can do it on your own if you have access to the DRaaS portal, for example. DRaaS portal is DR as a service that we have implemented in our own infrastructure, and it is a part of process improvement in our organization.
It has significantly decreased our RTO for failover to the full scope of an application. You can easily measure DR activities for a specific application. If you are responsible not only for your application as an application owner but also need to provide support to many customers at the same time, such a tool is very good. When you are responsible for delivering as per the SLA for RTO to many customers at the same time, it is very helpful because you can perform required activities automatically, and you can also perform them in parallel.
We are very satisfied with the achieved RTOs. We have specific requirements based on the service delivered values and SLA contracts, and by using the tool, we are able to fully meet RTOs for specific applications, a group of applications, or the whole scope of a data center.
During our DR exercises, we try to simulate the worst-case scenario where a complete data center is unreachable, and we are able to achieve the required RTO. Zerto is able to fully meet our needs, and we are able to achieve the required RTO during our normal and yearly DR exercise. We are receiving exactly what we were promised. However, I can't provide metrics or compare it to another tool because we have been using it from the beginning. I don't have the metrics for how much time it would take if we didn't have this tool, but the values that we are receiving during our annual DR exercise are fully satisfactory. So, it is fully sufficient for us.
The time saved in a data recovery situation depends on the specific scenario and the specific system that needs to be recovered. For example, if you have 50 gigabytes built into a machine that needs to be recovered, then with a traditional backup and restore solution, the recovery is very quick and easy. It would take from minutes to an hour depending upon your infrastructure or the bottleneck in your infrastructure. The problem occurs when you have very big systems with 10, 30, or 50 terabytes to be recovered. In such a case, it doesn't matter if it is ransomware or it is an infra failure. Even though the root cause is not the same, the outcome is the same. The fact is that you don't have a working system, and you need to recover the system. Recovering a big system with the traditional approach could take you a week, which is something that businesses do not accept. With a tool like Zerto, I can fail over the system very quickly. During DR exercises, I performed DR activities for systems with many terabytes of data, and it is not a problem to recover that system and failover. I have very good experience with that. During the training or presentation for my customers, I have shown how it works and what are its advantages. One of them is the possibility of a very quick recovery irrespective of the size of the system. The approach is exactly the same irrespective of whether it is an infra issue or a ransomware issue.
What is most valuable?
The main purpose of this tool is to allow failover between different data centers and different locations. When one site is unavailable, we can start the failover activity and perform the failover task. When a primary location is unavailable, or there is some hardware or logical issue at the primary location, it allows us to resolve the problem. We are able to start services at safe locations. We handle the disaster recovery process, and this is the main function for which we are using it all the time.
The second valuable feature is related to integration. If we want to implement any tool in our company, we want to make sure that it is not sandboxed. It shouldn't be completely isolated from other systems, and it should help other systems to receive feedback. To gain advantages of having tools like Zerto on the board, we want to combine our disaster recovery with other processes, such as incident management or change management. We can integrate these processes using different tools, but usually, the best approach is related to API. So, we can integrate different systems and combine them into a big IT platform, which allows us to achieve more features that are normally not available in the tool itself.
Zerto supports different ways to integrate with or get information from the systems. GUI is one of the options, and technologies like PowerShell cmdlets or RESTful APIs are also very good to exchange data for integration or automation purposes.
We also use Zerto for compliance purposes in case we need to provide evidence. When we perform DR exercises and we have some problems with the infrastructure, we need to prove that some actions were taken. It works very well when we perform DR activities and we want to show external auditors the proof and evidence of performed actions.
What needs improvement?
It is very quickly developed, and new features are provided quite often. I don't have any input for improvement or a critical feature request at this moment. If anything, a lower price is always better.
For how long have I used the solution?
I have been using Zerto for six years.
What do I think about the stability of the solution?
Its stability is very good. Of course, bugs can be found because this is live infrastructure, which is normal. There are new VMware releases, and there are new operating system releases. If there are some problems with applications, we raise cases, and we get the required support in resolving the issue. So, my experience has been very good.
What do I think about the scalability of the solution?
Its scalability is very good. We can use it in on-prem and hybrid cloud environments. It works well with different vendors.
We are using it for different locations. We are using it for on-premise data centers, and we are also using it for all the production systems that we have. Any increase in its usage will depend on the decision of the company. If the decision is to change the platform and integrate with different vendors, we can choose additional features, but at the moment, we are using only the on-premises functionality.
How are customer service and support?
Their support is very good. We have to meet our SLA, and the infrastructure is very sophisticated and demanding. We have had different cases that need investigation and resolution, and we could always count on Zerto's support, which is available 24/7. I don't have any complaints about their support. I would rate them a ten out of ten. If possible, I would even give them eleven.
How would you rate customer service and support?
Which solution did I use previously and why did I switch?
I have not worked with any other tool previously. This is the only tool I have used.
How was the initial setup?
It was quite easy to implement and start the execution. There were no problems. It took us about three months to implement it in production.
What about the implementation team?
For implementation, we were using the services directly from Zerto's support teams. In terms of the number of people, there were two people from the DR team and two from the infrastructure team, which included the networking and VMware teams.
In terms of maintenance, every tool requires maintenance, and when you upgrade, there are some bugs or issues that need to be resolved. Currently, for the maintenance of the application for many customers, one person is enough. Based on the number of protected systems and sophisticated infrastructure that we have, it works very well. It is not something that we should complain about.
What's my experience with pricing, setup cost, and licensing?
Its price is reasonable. I have not worked with other tools, but as compared to its competitors such as VMware, its price is lower. So, in my opinion, its price is good.
What other advice do I have?
Based on our experience and the implementations that we have done successfully, it is the right tool for protecting small environments and very big environments. It fits the needs of organizations that require a few functionalities, and it also fits the needs of organizations with a sophisticated environment comprising managed systems, multiple integrations, etc.
Zerto provides near-synchronous replication. So, the RTO is near zero. It is not equal to zero. From my perspective, there are some specific IT areas where synchronous replication is a must, but in most scenarios or use cases, synchronous replication is like a trap because you need to have a single connection between two replication zones or sites, which I would refer to as a single point of failure. If you have storage that is replicated between two sites, in certain scenarios, you won't be able to perform failover activities. If storage is broken on the primary location and you have enabled synchronous replication, the replicated data is also sent to the recovery site. So, you cannot perform failover activities because you now have corrupted data at both sites or data centers. We have chosen this tool to get out of this trap and be able to failover but not to the exact point in time when the issue occurred. I have experience working with such scenarios. For a specific group of systems that require synchronous replication, I can have an additional level of protection by having other DR tools, and at the same time, I can provide replication by using tools like Zerto. So, I can enable two DR solutions on one protected system and resolve the issues related to different scenarios.
In terms of reducing the DR tasks, because I have not used other tools, I can't provide the metrics for increase or decrease in time. However, considering that the tool is implemented for the whole scope of our application, we do not have to wait and spend weeks or months preparing for the DR test. We are prepared all the time for any issue. We also perform the unknown data center DR exercise allowing us to choose the test data center just before the DR exercise. I can very quickly start the recovery operations without a long preparation phase. This is one of the main features of a DR tool that should be taken into account for a company. You should have a tool that you can use at any time. You should ensure and be confident of the fact that it will work and not create any problems during the failover.
DR exercises generally should be performed by customers and application owners. That's because they know best what the issue is and how to provide a solution. It requires synchronization of some tasks and allowing more critical systems to be failed over before the less critical ones. To perform a global DR exercise preparation and execution, very less staff is required. Communicating with customers about the agenda and defining the scope, tasks, schedule, and other things take most of the time, but the execution phase is quick. It can be executed by one operator. It can be done by an infra specialist or an application owner, but ideally, it shouldn't be done by the specialist team. It should be done by application owners because they know the best about the issue.
I would rate Zerto a ten out of ten. It is a very good tool, and we have had very good experience with it. We have no problems with recommending it to others.
Which deployment model are you using for this solution?
On-premises
*Disclosure: My company does not have a business relationship with this vendor other than being a customer.