PagerDuty Operations Cloud Valuable Features
PagerDuty Operations Cloud offers robust features including centralized alerting and incident response, on-call scheduling and escalation, automated remediation and runbooks, stakeholder communication during major incidents, and automation capability.
Everything has been working exceptionally well with PagerDuty Operations Cloud. The features have been excellent. It is a spanning tool that has enabled us to operationalize reliability. Since rolling it out three years ago, we have cut MTTR by 35% and reduced on-call noise by 70% through AIOps correlation.
For us, the main value of PagerDuty Operations Cloud lies less in paging and more in coordination, automation, and reducing the operational burden on engineers. Reducing the operational burden has improved morale among our engineers. They have been able to focus on other projects and tasks since they have saved considerable time, and consequently productivity in the organization has greatly improved. We have also seen faster incident response, lower MTTR, reduced alert noise, and decreased on-call burnout. Additionally, automation has reduced manual toil, with approximately 25 of our P2 incidents now being resolved or partially resolved by automation before an engineer touches them.
View full review »NS
Nambi Srinivasan
Senior SRE at IBM
One of the features I find valuable in PagerDuty Operations Cloud, which is part of our current migration activities, involves automating the entire incident management process by integrating all service incidents into a single incident management page. In IBM cloud, since many services and incidents occur, I utilize the runbook automation feature where I create runbooks for each service and common issues that facilitate incident management for common incidents.
The runbook automation positively impacts my team's workflow by significantly speeding up the incident resolution process. In IBM cloud, we have different services hosted such as IBM Schematics and IBM Kubernetes Service, with thousands of concurrent global users. We face several issues during multiple incidents, particularly in reliability and infrastructure side issues. Basic level zero incidents often require simple commands run in kubectl. Therefore, I created runbooks to address these common issues, allowing SRE team members to refer to the runbook and manually fix the issues. However, before using PagerDuty Operations Cloud's runbook automation, it took over 20 minutes to resolve these issues. After implementing PagerDuty Operations Cloud's runbook automation, I have reduced the response time from over 20 minutes to less than two minutes, saving approximately 80 to 90 percent of the time and making mean time to resolve significantly faster. I review the runbooks quarterly to update them with any new steps necessary.
PagerDuty Operations Cloud has greatly improved our productivity. Previously, I handled many incidents with manual automation runbooks, leading to substantial toil for the SRE teams in resolving even minor incidents and complicating our on-call schedule. Once I adopted PagerDuty Operations Cloud and heavily utilized the runbook automation, I provided a list of common incidents to PagerDuty Operations Cloud, which then did the heavy lifting in fixing basic incidents automatically. This allowed my team to focus more on development activities related to platform engineering. Overall, it has reduced our toil by at least 50 to 60 percent and improved our efficiency, enabling us to onboard more services. We increased from onboarding seven core IBM cloud services to over 28 services now hosted.
The expansion of services impacts our organization's goals and customer experience by allowing all IBM cloud internal services to be hosted on a dedicated platform engineering service called Rednote. Before using PagerDuty Operations Cloud, I utilized Nagios and Sysdig, and after migrating to PagerDuty Operations Cloud last year, I prepared a set of runbooks, automating on-call schedules and incident management. This automation has led to a significant increase in incident closure rates from around 40 to 45 percent, improving efficiency and reducing manual toil for basic incidents by about 60 percent. This enables my team to focus more on development activities as basic incidents that can be managed through simpler runbooks are now handled automatically by PagerDuty Operations Cloud. Additionally, incorporating AIOps into our on-call scheduling and notifications helps it learn from previous incidents and proactively address issues. This scalability has allowed me to grow from handling four cloud services to 28 services, increasing from 200 to 300 customers to over 1800 plus customers, thanks to PagerDuty Operations Cloud.
View full review »The best feature of PagerDuty Operations Cloud is that it easily integrates with other systems, and we can easily create and process escalation policies.
For example, at two a.m., when a Level 1 engineer receives a call but does not pick up or acknowledge the alert, they receive another call. If the Level 1 engineer still does not respond, the Level 2 engineer receives the call. If the Level 2 engineer is still not available, the Level 3 engineer receives the call. Based on the incident priority, the call shifts from Level 1 to Level 2, and from Level 2 to Level 3. In this way, we can escalate through the hierarchy and easily work on the incident. Based on the category of P1, P2, or P3, the escalation policy works accordingly. This is a very best feature as we can easily track alerts based on their priority.
View full review »Buyer's Guide
PagerDuty Operations Cloud
May 2026
Learn what your peers think about PagerDuty Operations Cloud. Get advice and tips from experienced pros sharing their opinions. Updated: May 2026.
899,917 professionals have used our research since 2012.
The best feature I liked about PagerDuty Operations Cloud is that you can set up multiple notifications. I had set up five to six notifications so that if I missed one call, after a minute the alert would trigger again. If I skipped or was away somewhere and missed it, it would call me again after two minutes, after three minutes, after five minutes over the 15 minutes. I could set multiple times of calls. This was the best feature for me because I am a deep sleeper. When I was on call over the weekend, I would sleep, and I was very concerned about missing the call. If I missed one call, I would get another call.
One time, I remember all our teammates went on vacation. We were thinking about what if a person would miss PagerDuty Operations Cloud alert or someone would run out of battery while on call. For one person, we had added four or five numbers to PagerDuty Operations Cloud so that the first minute they would get a call, the third minute someone else would get a call, the fifth minute someone else would get a call. This was a really good feature. It is not like you can only add one number. In that way, I felt it was very scalable because you can add multiple numbers.
I think PagerDuty Operations Cloud is very stable and reliable. I never had an experience where I was not paged whenever our alert triggered. For me, it was always working at 99.9% reliability. Our biggest concern was not missing any incident raised by customers when we were offline because customers are essential for any company. I think PagerDuty Operations Cloud made sure we were always connected with the customer. At the same time, we could continue our lives instead of sitting in front of the laptop 24 into 7. It was really an important key for us to maintain the product.
It was actually pretty good. More than preventing cost, it was more reliable in terms of the client. When clients were creating tickets, they were getting prompt responses from our side because of PagerDuty Operations Cloud. We were able to get the information at the same time when they were creating the ticket. So we could ensure them that we are here for them 24 into 7. This was a pretty important point because for any product-based company, it is very important to make sure that clients feel connected and that their requests are being heard, seen, and addressed.
View full review »The best features of PagerDuty Operations Cloud that I find most valuable are incident management and webhooks. The reason is that if you go for any P1 incident, somebody would usually need to do all the manual efforts, such as creating the Slack channel, informing the team leads and other team members, gathering information related to any service outage or component outage, and providing it in a text format to the Slack channel. All these manual efforts take much time, but PagerDuty does that for you. It gathers the related services and people who are already in that Slack channel and sends the alerts. It automates the whole process, so no human intervention is required.
Another very good functionality is webhooks. They work exceptionally well. If my cloud infrastructure has an issue, a person needs to investigate, and if it requires a server reboot or installation of a script, that person needs to intervene manually. However, using this webhook, we can send one alert or notification to our automated job indicating that this issue requires execution of a particular script or file on that server, which fixes the ongoing issue without requiring human intervention. If this happens at midnight, people can sleep easily without any headache, and PagerDuty will take care of the task.
PagerDuty Operations Cloud has a very good impact on our company, especially regarding business and service level agreements, as well as reducing incident resolution time. It also helps automate manual work.
View full review »AS
Aitha Shashikanth
Senior Network Operations Center Engineer at Accolite
One of the greatest features of PagerDuty Operations Cloud is that there is no need to disturb people by directly calling from personal numbers or Teams or anything else. I can just click on PagerDuty Operations Cloud. It will directly ring the person's phone, message, and Outlook, and the app directly, so that they know someone is calling and they have to attend that call in an emergency case.
View full review »AS
Aksharma Aksharma
Manager, OTT platform at a media company with 1,001-5,000 employees
The best features PagerDuty Operations Cloud offers are the policies and the insights. It provides a lot of data which is required to drill down the specific errors we have to work on. The escalation policy works well because it timely triggers the respective people required to work on those issues.
I use those insights and escalation policies in my day-to-day work to review the root causes of why we are getting those issues and why in such high numbers. It helps us to drill down the specific areas where we need to improve in order to have better robustness in terms of solution. It also creates the required awareness. We can use PagerDuty Operations Cloud to trigger the correct stakeholders who need to be involved when such issues occur, based on the definition defined.
PagerDuty Operations Cloud has positively impacted our organization by helping us save a lot of costs, and our responsiveness and actions towards any issues on the platform have improved many folds. If any issue occurs in real-time, we get paged. The correct engineer starts working on it. Considering we have thousands of customers, it is not possible to monitor all of those, but PagerDuty Operations Cloud helps in defining which ones are actionable and which ones can be ignored.
Regarding response times, it used to take around 10 to 15 minutes, and now that can be achieved within seconds. Similarly, for operations, around five to six engineers were required in a shift, and now that can be fulfilled using two to three engineers. That definitely saves a lot of effort and time and helps to improve operations as well.
View full review »PagerDuty Operations Cloud has several valuable features. We have many monitoring tools, but the major use of PagerDuty Operations Cloud is that when a production alert comes in, it previously went mostly to Outlook and Slack. Once PagerDuty Operations Cloud came into the picture, sometimes we are not in shift or on weekends, or we might be outside. During that time, we would miss most of the issues. However, if PagerDuty Operations Cloud is enabled, it will notify us on mobile. We will get SMS as well via PagerDuty Operations Cloud. We have to know the criticality of the request, and based on that, if critical servers or issues arise, we have to work and fix them immediately. PagerDuty Operations Cloud mostly notifies us for issues based on priority.
The benefits I have seen so far from using PagerDuty Operations Cloud solution include many solutions. The major use is monitoring alerts. Another benefit is on-call scheduling, which goes directly to the right person. We also have an option called the escalation policy. If I am the primary one and I don't acknowledge the call, it redirects to my secondary person. If the secondary is unresponsive, it moves to the tertiary person. This is a major feature. Once we acknowledge the alert, it sends to the next level. It helps in easily coordinating incidents, such as identifying whether it is an issue or an outage. The notification aspect is a major use I see in PagerDuty Operations Cloud.
View full review »PagerDuty Operations Cloud helps our team respond by increasing our response time. Whenever there is any incident, we will get notified and through PagerDuty Operations Cloud, we receive calls 24/7, allowing us to instantly get into a call or investigation and remediate the issue as early as possible. This way, PagerDuty Operations Cloud helps us reduce the MTTR and ensures our application is more reliable and resilient.
We have been using the Runbook automation feature for building automated flows that help us add extra monitoring for specific alerts or incidents and perform remediation tasks autonomously using this Runbook flow.
One feature I particularly appreciate about PagerDuty Operations Cloud is that it offers multiple notification options. I receive alerts via call as well as SMS, which is beneficial. If I miss the call, I may still receive the SMS and vice versa.
Through PagerDuty Operations Cloud, our MTTR has been reduced by at least 30% over the last year due to its instant notification features like SMS and calls, which help us jump on calls quickly to remediate issues. This reduction has impacted our application downtime, ensuring an uptime of approximately 99% throughout the year.
View full review »The best features of PagerDuty Operations Cloud are intelligent alerting, the code fixture, on-call scheduling, escalation capabilities, automation, runbooks, and integration. PagerDuty Operations Cloud has improved downtime mostly by 30 to 50%.
View full review »PagerDuty Operations Cloud is one of the best tools we have seen because it is already integrated with AI. We use it as a barrier tool, meaning it is the top tool that we consider and we get notified when there is an issue.
The best features include integrating with any tool and analyzing all previous alerts that have been stored. When an alert occurred on a particular day, we can immediately be notified on Slack with historical data and, since it is integrated with AI, we receive suggestions on how it can be resolved, how it was resolved earlier, and who resolved it. These are the very best features we have seen on PagerDuty Operations Cloud.
Since we have historical data showing when an alert has triggered on a particular day, we can turn it into a problem incident and work with the relevant teams to get it fixed completely so it does not reoccur. We are recording these kinds of repetitive issues using that feature.
It is very helpful that we can integrate with numerous monitoring tools such as Datadog, Splunk, and Kibana. Since we have integrated many other tools, I feel this is one of the features that PagerDuty Operations Cloud offers that makes it great.
View full review »One of the best features is the notification categorization that we can do with PagerDuty Operations Cloud, which is the incident type categorization. We can select whether it is a major incident or a minor incident and based on what we select, a dedicated Slack channel or a dedicated Teams channel is created, which is much more helpful for us to diagnose the issue.
With the incident type categorization, we are able to prioritize which issues to sort out first and which issues to sort out later. This has helped us firefight the major issues on a first come first serve basis, so categorization helps us work more efficiently.
With the help of PagerDuty Operations Cloud, we have been able to resolve a lot of issues in a much quicker fashion. Our overall sales has gone up 30% after the introduction of PagerDuty Operations Cloud, which is a major advantageous situation for us.
This was majorly due to reduced downtime and faster response, which made the customer believe us more and made the customer's entire user journey much more smoother. This has directly impacted our sales, and the customer's journey within our e-commerce platform has been very quick with the reduced downtime, so this has helped us gain more sales.
View full review »PagerDuty Operations Cloud is used for monitoring, and we upload detailed documentation for major incidents such as P2 or P1 severity. We prepare documentation about the incident including what caused it, what the resolution time was, what the impact was, and everything else, which we then put on PagerDuty Operations Cloud. Apart from this, we do not use it for any other applications; it is used exclusively for monitoring purposes and setting up alertness.
We receive many benefits as part of L1 or L2 operations running 24 hours a day. As soon as there is an issue, if I am the first point of contact and I do not receive the call, it goes to the second person, my line manager. If my line manager does not pick up the phone, it goes to the third person, the skip-level manager. This is beneficial for us; even if it is a minor outage lasting 5 or 10 minutes, we receive an alert about it. If there is a major incident, we still receive the alert. Even if we are away from the system and not actively monitoring, we get the alert as soon as there is an outage.
We have the TIBCO integration layer, which is integrated with DataDog, and DataDog is integrated with PagerDuty Operations Cloud. When we ask PagerDuty Operations Cloud how many incidents are recurring with a specific service, it provides historical data showing how many times that service was down.
View full review »PagerDuty Operations Cloud offers top-tier features that enable real-time alerting and accelerate incident response.
The solution is reliable and effective when it comes to automating routine diagnostic tasks.
Regarding how the real-time alerting and automation features have helped my team, problem-solving became automatic, and incident management becomes less complex to manage.
PagerDuty Operations Cloud has positively impacted my organization by enabling faster issue response, which helped reduce downtime, saved revenue by avoiding long outages, improved team accountability during incidents, reduced manual effort in handling alerts, and helped maintain a better customer experience.
The solution's alert reduction feature has had a major impact on preventing costly incidents in my organization. By grouping related alerts and de-duplicating noise, my team was able to spot real issues faster instead of getting buried in alerts, helping us prevent two to three potential outages because engineers responded to the root alert instead of missing it in noise.
View full review »PagerDuty Operations Cloud definitely helped us. In terms of governance, it had very good reports regarding the alerts we were receiving and calculating the times and effort. Overall, it was a very effective product and really helped the productivity of the teams.
PagerDuty Operations Cloud has correlation features which help us when we get a lot of alerts from one particular location. Based on the correlation values set for the alerts, it automatically correlates all the incidents. It also attaches the KB articles with troubleshooting steps
View full review »I appreciate the notifications in PagerDuty Operations Cloud. We get real-time updates in PagerDuty, and that is the best part. The second feature is that we get each and every detail in PagerDuty as well, including the main problem, which account is affected, and in which account, which instance or which VM is affected. We also get graphs in PagerDuty Operations Cloud, making it so that 50% of our work is done by PagerDuty itself, with the rest of the 50% being the work we have to do.
View full review »The best features PagerDuty Operations Cloud offers include automated on-call scheduling, AI-driven alert grouping, and an impressive number of integrations, as it has more than 700 integrations, which really help us.
The user interface is user-friendly for non-technical persons, and it has been maintained well over the years, making it easier for our project managers and product managers to navigate PagerDuty Operations Cloud dashboards.
PagerDuty Operations Cloud's embedded AI has helped reduce alert fatigue and lower costs from incidents, which has contributed to retaining revenue by minimizing financial losses.
The AI-driven alert grouping, particularly for incident management, helps us significantly as it streamlines our processes.
PagerDuty Operations Cloud has positively impacted our organization by accelerating incident response and reducing MTTR by up to 27%, and we are also integrating AI and ML into PagerDuty Operations Cloud.
View full review »One of the best features I would mention about PagerDuty Operations Cloud is its on-call rotational scheduling support and escalation management practices. If an engineer did not acknowledge the alert within a defined time frame, the incident was automatically escalated to the next person, support team, or manager of that specific team. Another useful feature was its integration capability. We were able to integrate PagerDuty Operations Cloud with monitoring and observability tools that allow alerts to generate automatically whenever issues were detected in the environment within a fraction of time. We also had the mobile application that was very helpful because the engineer could receive calls, notifications, and acknowledge the incident and track the updates even when they were away from their laptop.
I also valued the centralized incident management dashboard that provides visibility into active incidents, response status, escalation history, and overall operational health. I used to get all the data accumulated there through the dashboard.
PagerDuty Operations Cloud helps us manage production incidents beyond service outages, mostly high CPU utilization where we set alerts, application failures, pod issues in Kubernetes, and infrastructure-related alerts.
View full review »DS
Di Singh Negi
Lead Data Ops Engineer at Wipro Limited
The best features are those we have been using for incident management. We have been using PagerDuty Operations Cloud for on-call scheduling, escalation policies, and integration capabilities. Incident management is extremely valuable because it ensures critical alerts are delivered to the right people immediately. On-call scheduling and escalation policies are very helpful because we can define clear ownership for the services and automatically escalate incidents if they are not acknowledged within a specific timeframe. Another key strength is the integration ecosystem. We can integrate it with our monitoring stack including Prometheus, Grafana, and AWS services, which helps us automate alerts ingestion and incident creation without manual intervention. The most valuable features are automating alerts, escalations, on-call management, integrations, and incident analytics.
One example that stands out was a production incident where we experienced a sudden spike in database latency during peak gaming hours. This started impacting player transactions and causing delays in some backend services. Our Prometheus and Grafana monitoring detected this abnormal latency and error rate increase, which went beyond a threshold, and the alert was automatically routed to PagerDuty Operations Cloud. PagerDuty Operations Cloud immediately notified the on-call engineer of our team and triggered the escalation workflow based on the incident severity. Since the issue occurred during peak traffic, quick response was critical, which was maintained. PagerDuty Operations Cloud helped us coordinate multiple teams, including DataOps, application, and other infrastructure teams. The platform helped ensure everyone was engaged quickly and that no critical notifications were missed. While we were under investigation, we identified a resource bottleneck in the database layer caused by an unexpected traffic surge. With the help of the database team, we scaled the required AWS resource and optimized a few long-running queries. This restored normal performance.
View full review »The integration with ServiceNow is one of the most valuable features. I rely on PagerDuty Operations Cloud for on-call support for high severity incidents or sev zero scenarios, which is a great feature. PagerDuty Operations Cloud has been reliable as it has never gone down, and I trust it for incident response. Additionally, I find the capability to update using the mobile app extremely helpful.
Our developers noted that integrating incident bots or chatbots created with AI tools occasionally presents challenges with PagerDuty Operations Cloud. This is the only negative feedback I have encountered. However, I am very satisfied with PagerDuty Operations Cloud because my team is also pleased with it. We have onboarded multiple teams using this tool, and it functions well. From a security perspective, I believe there could be more layers. When scheduling on-call rotations for different team members, access should be restricted to specific users to prevent unauthorized changes to the on-call module. Despite this, the security features have been functioning well, and overall, I appreciate PagerDuty Operations Cloud.
View full review »Some of the best features PagerDuty Operations Cloud offers are comprehensive incident management, automation, and AI operations, all integrated into one platform. Second, it provides noise reduction and smarter alert grouping through global intelligent alert grouping that uses machine learning to group and correlate alerts across services. It also provides automation to reduce toil and speed up resolutions and artificial intelligence, including generative AI assistance, to help teams respond faster and smarter. Additionally, it has built-in workflows with standardized, repeatable processes, improved visibility, collaboration, and a unified operations view, and support for bridging customer-facing teams and engineering and the SRE teams. The last thing it provides is scalability for enterprise environments.
The AI-powered alert grouping and automation have made a difference in my day-to-day work by reducing alert noise. It automatically groups multiple related alerts into a single incident, so instead of 20 separate alerts, I get one meaningful alert, which prevents on-call engineers from being spammed. It also helps in faster root cause understanding because AI looks at patterns across systems including logs, metrics, alarms, and graphs, finally providing a broad summary about that. This cuts down the response time, helps in prioritization, and reduces the burnout of on-call teams.
PagerDuty Operations Cloud has positively impacted my organization by helping in faster incident detection and resolution with less downtime. It has reduced noise and fewer false alerts, allowing better focus for teams, meaning that on-call engineers can focus only on real and important issues rather than all the duplicate and negligible issues. It has helped with automation and efficiency, better collaboration and communication among teams, improved post-incident learning and prevention, and has not only helped in operational cost savings and better return on investment, but also in scalability and readiness for growth.
View full review »The ease of use is one of the key strengths. Creating the escalation policies and notification channels per user is straightforward, and it is not a requirement that everyone has the same notification rules. Users have flexibility in getting the communication they need. Event orchestration is the other part which works well for us.
Primarily, we were able to get the right people at the right time through our escalation mechanism, which is an automated switch from level one to level two. This helped us improve the overall MTTA, and the acknowledgment rate has drastically improved. For the major incidents, we were able to triage everything with PagerDuty Operations Cloud itself instead of switching between multiple tools such as Teams or other orchestration platforms. With one solution, we are able to do the triaging, and that definitely reduced the outage window and the average outage window.
We do have automations in two main ways. One is the incoming automation where we have multiple monitoring tools and systems that generate events. We ingest them into PagerDuty Operations Cloud and then using event orchestration, we create all of the respective incidents, whether they are PagerDuty Operations Cloud-only incidents, ServiceNow incidents, or different methods we use. The other automation method is incident workflows where we are able to call out to respective endpoints for the remaining automations. This is growing at this point in time, but event orchestration is mainly what we use for the automation of the triaging.
We used to have a two-digit figure of MTTA, and now it is reduced to less than one hour.
Getting the right people on board whenever there is a major issue and dialing them individually took a longer time. Now with PagerDuty Operations Cloud, having all of the predefined rules and the orchestrations we can create, it is definitely bringing value. Bringing the right people at the right time and improving the restoration time so that we do not impact any of the business end-user services is where PagerDuty Operations Cloud definitely plays a key role in delivering the business value.
View full review »PagerDuty Operations Cloud is part of our daily operational workflow. It sits between monitoring tools and response teams, ensuring alerts reach the right people without delay. We use it for on-call scheduling, incident escalations, and coordinating responses across teams. Having everything centralized has reduced alert fatigue and helped us respond to issues more consistently, especially during off-hours and high-priority incidents.
PagerDuty Operations Cloud offers intelligent alerting, on-call scheduling, automated escalations, and incident management as its best features. The platform makes it easy to ensure alerts reach the right person, and escalation policies prevent critical issues from being missed. We also rely heavily on its integration with monitoring and collaboration tools and its real-time visibility into operations. Together, these features help our team respond faster, stay organized during an incident, and reduce service disruptions for our customers.
View full review »PagerDuty Operations Cloud is easy to understand, allowing on-call service teams to escalate priority alerts to the required development team or service team, which helps reduce alert downtime by the SLA, benefiting the organization's requirements.
The reliability of PagerDuty Operations Cloud includes high availability architecture, multiple alert channels, strong enterprise adoption, and a reliable escalation system, making it easy to see the escalation matrix on the page dashboard.
The impact of PagerDuty Operations Cloud's alert reduction feature on our organization helps determine which incidents are business-critical, the impacted services, which teams should respond first, and what customer business impacts exist, focusing on the prioritization of critical incidents, enabling faster incident responses and reducing alert noise for better business visibility.
View full review »The best features PagerDuty Operations Cloud offers are that it is simple to set up and supports Excel sheet uploads, which was very helpful. Setting up notifications and the integration with Datadog was excellent. We can automate many things.
PagerDuty Operations Cloud has positively impacted my organization because the support team is very happy. Before, setting up everything was very difficult. Now, we don't have to think about it. We can simply set it up in PagerDuty and it works. The escalation and everything simply works with the configuration we set up six months to one year ago, and it still functions. We make only minor changes. I think a lot of manual effort has been reduced, and the system is more reliable.
Since implementing PagerDuty Operations Cloud, before the L1 team had to stay online at night, and if someone fell asleep and missed an issue, it would easily escalate to a manager or someone higher up, creating a lot of fuss. That is almost gone now. The discussion part about deciding who will be on-call and setting that up was not as foolproof when we were creating it manually, and someone had to invest a lot of time, around one or two hours weekly. Now, it takes simply less than five minutes. Every week, we simply discuss and it's done. I think a lot of time has been saved, and a lot of mental effort has been saved.
View full review »AU
Adil Ustad
Senior Software Engineer at LTM
The most valuable feature of PagerDuty Operations Cloud is that even though my device is on silent, it still rings and lets me know that something happened for the organization.
On-call schedules for team members are very helpful to find out who is currently on call to get help with incidents or to get tickets routed to them. At the same time, it pushes me notifications, gives me a call on my mobile number, and triggers emails on my email address, so the multiple notification service of PagerDuty Operations Cloud is excellent.
From a user perspective, the most valuable part of PagerDuty Operations Cloud is the notification feature that continuously contacts me until I acknowledge it. High and critical incidents are totally valuable for the organization because something is failing and I need to repair it on priority to not lose the business.
PagerDuty Operations Cloud improved my team's ability to focus on core tasks rather than routine issues by having the notification feature that was very helpful to monitor and trigger high and critical issues directly to team members.
View full review »PagerDuty Operations Cloud has improved the incident management and operational responses workflows. It has the capabilities of intelligent alerting and automated escalation policies, which help reduce manual intervention. The most useful aspects would be the noise reduction if that can be improved. Overall, PagerDuty Operations Cloud is a very strong platform to improve the incident response time and operational efficiency through automation and intelligent alert management.
View full review »One feature of PagerDuty Operations Cloud that I find valuable is the on-call schedule. We can manage our on-call scheduling, and we have various alert and notification delivery methods available, including mobile. We can receive phone calls, emails, SMS, and push notifications. For example, if someone missed the notification, they will get a phone call, which is very straightforward. We also have incident automation, making collaboration with any third-party monitoring services we use very straightforward, such as Datadog. We can seamlessly automate things with PagerDuty Operations Cloud. The AI features are also beneficial; for example, noisy alerts that trigger regularly and false positive alerts get suppressed. It checks the past month's alerts, showing us that this alert triggered 60 percent, this alert triggered 20 percent, this alert is rare, and this alert is not rare. The escalation policy is excellent as well, as if I did not pick up the call, my manager will get the call; if my manager did not pick up, then his manager gets the call. These are some of the most valuable parts we use in PagerDuty Operations Cloud.
In Datadog, we have multiple dashboards and monitoring systems where we see our spikes and alerts. When we integrated with PagerDuty Operations Cloud, we got better signal and less noise. When we are seeing a spike that is concurrent, in PagerDuty Operations Cloud, the AI feature already signifies that alert as a noisy alert, and it suppresses that alert. This significantly improves our workflow with both Datadog and PagerDuty Operations Cloud. We have faster response and faster escalation. Previously, in Datadog, we did not get notifications, and people would refresh it and check the spike every hour. Now that we integrated PagerDuty Operations Cloud, any alert triggers, and we quickly get a notification or a phone call. Therefore, we do not sit in front of a computer and refresh repeatedly. Additionally, we have a centralized incident workflow; PagerDuty Operations Cloud and Datadog feed into PagerDuty Operations Cloud incident timeline, so we see everything there. We do not need to open Datadog again and again, and if we need to deep dive into an alert from Datadog, we can click the link inside PagerDuty Operations Cloud, redirecting us to the Datadog dashboard where everything is noted down and visible.
In PagerDuty Operations Cloud, AI suppressing our alerts has helped streamline repetitive tasks. For example, very noisy alerts get suppressed automatically, aiding smarter routing. When we have new joiners in our team, they see alerts already suppressed, allowing them to focus on the critical ones instead of the lower ones. Additionally, alert prioritization is present; we receive critical alerts, high alerts, and then low alerts. The faster prioritization facilitated by AI enhances our alert management processes. Also, the root cause historical pattern assists us; if we get an alert similar to one from last month, it tells us how we resolved that alert previously. Historical patterns using AI greatly aid us in alert management.
View full review »
The best features of PagerDuty Operations Cloud include the SMS and call alerting functions, which I find very beneficial compared to other tools I have used, such as Coral Logics and Sumo Logic, which primarily focus on email alerting. PagerDuty Operations Cloud goes a level beyond by notifying users through SMS and calls, allowing us to tag the concerned teams to address specific issues promptly. In addition to those features, I also find the integration and reporting aspects of PagerDuty Operations Cloud valuable, as it records all triggered calls and incidents, enabling us to analyze patterns and identify the times when systems go down, thus assisting us in understanding and addressing the underlying causes. PagerDuty Operations Cloud has positively impacted our organization by ensuring that our banking applications, which operate 24/7, remain functional and efficient, contributing to better service availability.
View full review »
The best features PagerDuty Operations Cloud offers include Incident Workflows, which we use frequently to ease our team's work. These workflows trigger jobs in Rundeck based on certain conditions when incidents occur. We can create flows in Incident Workflow features and utilize Automation Actions, which allow us to run individual jobs in Rundeck. Additionally, Event Orchestration enables us to integrate various tools using integration keys with multiple applications. These features significantly simplify our daily operations within the team.
Integrations with other tools have been beneficial for our team as we receive requests from different teams to integrate their tools with PagerDuty Operations Cloud, enabling them to manage incidents. We have integrated AWS CloudWatch and Azure for monitoring, as well as CyberArk and Guardicore. If teams have specific requirements for integrating their tools, they approach us to create the necessary flows.
PagerDuty Operations Cloud has positively impacted our organization significantly. The response time has improved, and the team responds more quickly now. The PagerDuty Operations Cloud mobile application allows team members to acknowledge incidents via their mobile devices, where they can also receive calls when incidents trigger. The response time has become very quick.
I do not have precise numbers regarding the improvement in response time since using PagerDuty Operations Cloud, but I can share a story about a major incident with Checkmk. After we upgraded our Checkmk console, everything crashed, causing random events to be sent to PagerDuty Operations Cloud. We fixed the event flow from Checkmk using PagerDuty Operations Cloud's features. Furthermore, we have automated the restart of systems through PagerDuty Operations Cloud. If any server requires a restart, we trigger that job with just one click using Ansible, completing the task efficiently.
View full review »What I appreciate about PagerDuty Operations Cloud is its real-time alert capability, which is one of the main things. If a critical system in the production environment goes down and we receive a message via SMS, we might miss that, or if we get an email, we might miss that as well. Someone who is on call or directly providing support twenty-four hours a day and seven days a week, such as the support team, might need PagerDuty Operations Cloud support. Without PagerDuty Operations Cloud, it is difficult to say when the alert got triggered and those kinds of things. PagerDuty Operations Cloud has all the history, including who acknowledged that particular pager, the timeline when it got triggered, and in which channel it got triggered, making it easy to prepare a report for the past month on how many alerts we received for particular services or to segregate by team or by alert name. It is a kind of perfect application, but I can suggest a few more additional improvements to enhance user experience.
PagerDuty Operations Cloud's main benefits are the alerts related to our organization. Alerts are critical; we want our system to be one hundred percent available, but no system is one hundred percent reliable. We want to know whenever our system goes down or we are experiencing some latency in response time or when a certificate for our DNS expires, as these are critical issues that can be handled through PagerDuty Operations Cloud. Even if we set an email notification, individuals working in front of the system twenty-four hours a day and seven days a week may not always be available. If they go for tea, coffee, or lunch, they might miss the critical functionality. However, if you have a pager, you will receive a call, which is much more reliable. While there may be instances when multiple PagerDuty Operations Cloud events trigger and result in one call, that is not the case all the time. Most of the time, we will receive alerts through one of the three channels, and organizations will configure calls to check the logs and address the problem promptly.
PagerDuty Operations Cloud's alert reduction feature has significantly impacted my organization, preventing approximately ninety to ninety-five percent of critical incidents from occurring. From my understanding, if the system is down, people will see the alert and take the necessary resolution steps. If it does not involve actual engineering work, such as restarting a service, that can be followed through this PagerDuty Operations Cloud alert, allowing resolutions to happen as soon as possible.
View full review »What I like the most about it is that it has so many integrations like Azure integrations, AWS integrations, and Prometheus and Grafana integration for the alerting system, which makes it more convenient for us. We are using all kinds of tools like Grafana and others, which are easy to access and integrate with PagerDuty Operations Cloud. Our infrastructure is going to be more secured whenever incidents get triggered, and with the help of PagerDuty Operations Cloud, we are able to get incidents triggered automatically after alerts are triggered.
The benefits in terms of on-call are that we are getting maximum utilization of it. Previously, we were not having any alerting system for our client, and after implementing PagerDuty Operations Cloud, we started finding out the root cause and made other things easier compared to earlier. With the help of PagerDuty Operations Cloud, we are able to fix most of the issues and reduce repetitive issues in our infrastructure.
View full review »CS
Chandrakanth Sangale
Network Operations Center Engineer at HTC Global Inc
The most valuable thing I appreciate about PagerDuty Operations Cloud is the escalation process, where it automatically escalates incidents. It features a level one, level two, level three escalation process. When I engage or page someone at level one and start working on priority one or priority zero calls, and I am busy with those tasks, it automatically escalates to the next person in the chain of command, the second higher authority. There are many automation features in PagerDuty Operations Cloud.
If I complete one form, it automatically communicates with people and automatically scribes the incident in my ServiceNow incident. It automatically performs many tasks simultaneously, including sending Outlook emails. These automations are very important and well-known features in PagerDuty Operations Cloud, and they are the main reasons I value this product.
Infrastructure autonomous agents are installed in important servers and important devices in the infrastructure, and these are required components.
View full review »PagerDuty Operations Cloud is very easy to use and user-friendly.
Regarding the features that PagerDuty Operations Cloud offers, I have explored the automation area and it has a good amount of integrations. For example, the event intelligence and the noise reduction are areas where PagerDuty is really powerful. It reduces and cleans up alerts by doing alert de-duplication and alert grouping. It has also recently got machine learning capabilities, which would surely be helpful. We also have automations and runbooks in place which can help to do auto-remediation of issues or trigger scripts as per the runbooks. We haven't been using all of those things, but I know that these things are present. The incident response on-call management is very easy to use with PagerDuty. There are flexible on-call schedules, escalation policies, and the ability to set up overrides easily. There are multiple channels by which you can send alerts including SMS, calls, and notification pushes.
PagerDuty Operations Cloud also has war room features. Many emerging tools provide this as well, but since PagerDuty is a pretty established company, it has a very mature model with all of these features. The analytics and reporting are also decent.
PagerDuty Operations Cloud has improved our incident management process by ensuring that the right set of people are notified within time. The best part is that it has automated on-call schedules and escalation policies, so you don't have to set them again and again for every week or every month. Features including alert grouping, alert de-duplication, and good analytics and reporting are very helpful during incident management and also for post-incident activities.
View full review »Notifications are received very easily and quickly whenever a page or incident is triggered in PagerDuty Operations Cloud. Notifications are not limited to a single medium. If a person is unavailable on one medium or notifications are suppressed, repeated notifications arrive one after another. For example, if a person does not respond to email, a text message will follow, then a phone call, and if still unresponded, escalation occurs to another level. This is beneficial from a management perspective, but can feel overwhelming when handling on-call duties, especially during stressful periods.
Incident responses in PagerDuty Operations Cloud are not delayed, with instant notifications enabling quick reactions. Escalation policies ensure that if one person is unavailable, the alert escalates to the next level and beyond, preventing missed incidents. PagerDuty Operations Cloud is very useful for root cause analysis and audits, documenting each action including when a person viewed information on the app, received or declined calls, accepted or acknowledged alerts, the time before acting on incidents, and when incidents were resolved or closed. The mobile interface is effective, and managing escalation policies, schedules, and related items is easier compared to other tools.
PagerDuty Operations Cloud has been highly beneficial, preventing numerous outages through prompt response and incident alerting capabilities, demonstrating significant ROI.
View full review »PV
Pradeep Vijayagiri
Php Consultant at a computer software company with 1,001-5,000 employees
The best features PagerDuty Operations Cloud offers include all the required things needed, and I think I am good as it has the people and the team management, and everything looks good.
Regarding team management in PagerDuty Operations Cloud, I appreciate how it shows which people are API, people who can manage API, and who can manage the entire systems, and that's good actually.
PagerDuty Operations Cloud has positively impacted my organization as I think the mean time to resolve has reduced significantly.
The solution's alert reduction feature has had a positive impact on preventing costly incidents in our organization as we have reached our SLAs due to that, by reducing alert noise.
View full review »PagerDuty Operations Cloud offers excellent features including the alert system and automated incident escalation capability, which is effective for routing issues to the right team through mobile notifications, SMS, or phone calls.
The real-time monitoring feature in PagerDuty Operations Cloud makes the biggest difference for my team, as it is quite helpful for on-call management and day-to-day operations.
PagerDuty Operations Cloud has positively impacted my organization by working effectively with most cloud service providers like AWS or Azure, improving visibility and reliability of incident responses.
I have seen a reduction in incident response time, with MTTR efficiently reduced by 30 to 40%, better SLA compliance, and improved operational visibility through incident analytics and reporting dashboards since implementing PagerDuty Operations Cloud.
The alert reduction feature in PagerDuty Operations Cloud has minimized downtime and improved incident response efficiency in my organization.
View full review »Key features include resolving and tracking incidents in real time, 24/7 monitoring and scanning of issues, reliable alert escalation, and seamless on-call scheduling across teams.
View full review »The best features of PagerDuty Operations Cloud are that it is a fairly good tool for alerting. Here is how the process works: suppose there is an XYZ server in my environment hosting a production or development application, and a primary on-call engineer has been assigned for that particular week. We have set up monitoring and observability for that node so that if the node is not reachable, an alert is triggered and sends a notification to our integrated Slack channels with PagerDuty Operations Cloud. If the engineer is available, they can acknowledge the alert. If they fail to acknowledge it, the system calls them on their provided number. If that is also not acknowledged, it sends a text message. If those actions are not acknowledged, it sends an alert to the secondary engineer and calls them as well. This multi-channel approach makes it very difficult to miss an important alert or update. PagerDuty Operations Cloud handles this process perfectly, and we do not miss any alerts because of this system.
Regarding the stability of PagerDuty Operations Cloud, I cannot recall an incident where it was not available. I can say that it is 100 percent reliable for my needs.
View full review »The features of PagerDuty Operations Cloud that I have found the most valuable and useful include alerting, which is very reliable with minimal delays, and the escalation policies and routing rules that are more flexible. Additionally, the on-call scheduling capabilities are great, and it integrates well with any cloud platforms such as AWS, GCP, or Azure, and observability tools such as DataDog and New Relic for logging and checking out logs.
I have noticed that PagerDuty Operations Cloud influences revenue protection by reducing alert fatigue and incident costs. AIOps has helped recently in reducing noise and alert duplications, and runbook automations aid in lowering the mean time to resolve by integrating triggers to Slack and updating runbooks.
I see PagerDuty Operations Cloud as a very good incident management and on-call platform, mostly used by large-scale organizations because it comes with premium pricing, but it is very reliable with alerting and on-call scheduling, triggering incidents, escalation policies, rules, and runbooks.
View full review »PagerDuty Operations Cloud's best features include scalability, managing infrastructure, and managing other services. It helps me manage other services comprehensively, and I think it is pretty good overall. PagerDuty Operations Cloud has positively impacted my organization by being effective in terms of managing the system and in terms of scalability. A specific outcome that shows how PagerDuty Operations Cloud has helped my organization is that it has improved efficiency and helped in saving a lot of time.
View full review »SK
Sunilkumar Kumar
Senior Site Reliability Engineer at MoEngage Inc.
PagerDuty Operations Cloud offers great features, including service directories, Slack integrations, incident reports, alert suppression, orchestration, team management, and permission handling such as read or write. These are the best features according to my daily experience.
The integration part stands out for us, as we have utilized integrations like AWS, Azure, Jenkins for pipeline breaches, Slack, and New Relic, along with a number of plugins that are helpful.
In the market, when comparing PagerDuty Operations Cloud to VictorOps and other services, PagerDuty Operations Cloud offers great features, particularly the simplicity of its integration plugins, which is a significant advantage.
If there are issues in our production environment, we immediately get alerts and can take action. Even if we are busy with other issues and we cannot fully monitor our dashboards, the integration of alerts is fantastic because it notifies us via phone at any time. We can react immediately based on the alert's priority and take necessary action, leveraging our understanding of the infrastructure and the type of issues.
View full review »The aspect I appreciate most about PagerDuty Operations Cloud is certainly the incident management capabilities. We can create our rotational shifts and on-call schedules. The way it operates has become natural to us, allowing us to create schedules for separate teams and then create escalation policies. If team one misses an alert, it goes to the leads. If the lead misses acknowledging it, then it goes to the managers. It is very easy to manage in PagerDuty Operations Cloud. Recently, it has been introduced in Datadog as well, and it looks somewhat similar, but PagerDuty Operations Cloud has been our tool for so long that we have become accustomed to it.
Auto-resolve is available, so if the alert is gone, it automatically resolves. The information it provides on the incident is valuable, so we do not have to each time go to New Relic or Datadog. It brings all the information from those tools to PagerDuty Operations Cloud, allowing us to identify the issue and resolve it.
View full review »The best features that PagerDuty Operations Cloud offers include paging out as one of the best capabilities. We can respond to incidents through a message, call, mobile app, or website, which is very useful and quick for getting incident notifications.
We rely on the mobile app the most for incident notifications because it is easy to carry wherever we go. PagerDuty Operations Cloud is one of the important aspects in our organization which we use widely for incident response and quick incident resolution. When we need any alert or peer help, PagerDuty assists us and helps in managing who is on call for which team, allowing us to view the dashboard, which is very useful for us.
The alert reduction feature of PagerDuty Operations Cloud helps in reducing incident alerts, allowing similar alerts to be resolved easily. The AI functionality is very useful in reducing alerts, automating workflow, improving on-call efficiency, and enabling faster incident resolution. PagerDuty's generative AI is particularly helpful as it reduces alerts and unnecessary noise.
View full review »The best features PagerDuty Operations Cloud offers include the ability to integrate its app through various platforms such as Teams and various monitoring platforms such as New Relic and DynaTrace. It is easy to use, easy to log in and configure your on-call rotation, as well as utilizing their business services and technical services to properly configure how you want things monitored and alerted.
The integrations and easy configuration help our team by saving time and reducing errors. We use Terraform to create various modules, including integrations with PagerDuty Operations Cloud and our monitoring platform, New Relic. When a team creates a new application, we ask them to use our monitoring module to monitor their service using New Relic and PagerDuty Operations Cloud. By doing that, we save time and errors by preventing people from manually having to set up their PagerDuty Operations Cloud operations; it is all done through this module, which is easy to use.
PagerDuty Operations Cloud has positively impacted our organization by allowing us to be immediately paged when a system or service is down, enabling us to quickly respond and provide updates to the organization on issues and their resolution.
This quick response has led to measurable improvements, with reduced downtime and faster incident resolution times, as our on-call engineers are appropriately alerted when things happen. We understand based on the page what is going on and how to quickly respond to it, and if we need help, we can loop in other engineers and our managers that own the product to resolve it quicker.
View full review »I see major benefits from using PagerDuty Operations Cloud because it really helps to engage an on-call engineer. We do not have to call their phone, make numerous calls, or check their availability. If the respective engineer is not available, it triggers to the next-level management, maybe a senior lead or a senior manager who can bring in somebody instead of the engineer being unavailable.
We are using PagerDuty Operations Cloud to onboard new engineers. Recently I had a call with NAM's people who were not on a pager. In case of their unavailability during off-business hours, we took their details, email address, contact information, and everything. We have onboarded them in PagerDuty Operations Cloud, so at the end of the day, we are trying to make all engineers available in it and ensure that we can get their help whenever we need them.
PagerDuty Operations Cloud really helps because as soon as an engineer acknowledges the alert we sent, we get an email notification or confirmation. That helps us continue working on regular operations and gives us confirmation once the engineer acknowledges their availability. It does not keep me switching between tabs on PagerDuty; it runs in the background and lets me know once the process is completed.
View full review »The best feature that I like about PagerDuty Operations Cloud is whenever we page a particular team. There is a specific feature where we can directly page a person. Usually, once we trigger the alert, it goes to a particular person, and if that person does not acknowledge it, then it will go to their reporting person. Even if they also do not acknowledge it, then it will go to some other person. In that case, it tends to take a bit of time because whenever we see the alerts, the alerts will be shifting to other people. Some people might not acknowledge the alerts due to various reasons, and it may get missed. In PagerDuty Operations Cloud, there is a specific feature where we can page a specific person or a specific user. If we give the particular team name, then in the subfield, we can specifically page a person. This feature attracts me a lot.
Additionally, there is another feature where we can check the SME calendar. In my organization, for a particular week, one person will be allotted as an SME. That calendar shows which person is the SME for the particular week regarding the particular product. These are the features I enjoy the most in PagerDuty Operations Cloud.
The main benefits I can say from using PagerDuty Operations Cloud are that we can easily page them. It is also widely used in our operations team for faster incident response, leading to a reduction of the MTTR, mean time to resolution. The smart on-call management allows us to create a call for the on-call people and to involve the backup engineers as well. One special thing in PagerDuty Operations Cloud is it has time zone-based scheduling. As per that particular time zone, we can schedule them. I witnessed automated escalation, where the particular person missed acknowledging the alert in PagerDuty Operations Cloud, leading to an automated escalation to their associate director or VP. This escalation policy is also very good in PagerDuty Operations Cloud.
The impact of integrating PagerDuty Operations Cloud with Freshservice is very good because earlier, when it was not integrated, there were many problems while paging the alerts. Now, when we have integrated it to Freshservice, once the alert comes into the queue of Freshservice, automatically a PagerDuty Operations Cloud alert will be created. So automatically, it syncs. Once it gets synced, the alert will be automatically created in PagerDuty Operations Cloud and will go to that particular person who is allotted as an SME for that particular product.
The measurable benefits from PagerDuty Operations Cloud are that it has made our work easier, where the alerts will be synced and then directly create an alert to the SMEs. Instead of doing it manually, if it is automated in such a way that an alert gets triggered and routed directly to the SME, then that is a great benefit.
View full review »The best features of PagerDuty Operations Cloud include alerting, which is very important and the main reason we retain it, and scheduling as well.
Initially, we used Excel to manage our on-call engineers' schedules, but with PagerDuty Operations Cloud, it shows when you are on duty and allows other team members from different teams to check who is on duty without needing to ask. This has significantly reduced the time spent on checking who is on duty by providing visibility at each point.
Scheduling with PagerDuty Operations Cloud has reduced confusion because we set it up with a round-robin rotation, and nobody needs to update it every day unlike with Excel, where we had to create a new schedule every two months. Now we only make changes when necessary, making the process more efficient and organized for on-call engineers to know when they are on duty. The system also alerts them in advance for their upcoming shifts.
View full review »I have found the most useful and valuable features of PagerDuty Operations Cloud to be the alert notifications.
Alert grouping is also one of the valuable features. I am not using PagerDuty Autonomous AI agents.
I assess the effectiveness of PagerDuty's generative AI in providing insights for decision-making as really good, and I appreciate that.
Overall, PagerDuty's AI has positively impacted my team's productivity.
With PagerDuty Operations Cloud's alert reduction feature, I have noticed that it helped me prevent costly incidents in my organization.
View full review »The best features PagerDuty Operations Cloud offers are PagerDuty AIOps and event orchestration.
PagerDuty AIOps and event orchestration help my team day to day as they allow our teams to share their upstream downstream relationships of services and products and then allow AIOps to interlace and understand how to page during an incident.
PagerDuty Operations Cloud has positively impacted my organization by reducing the number of alerts from 60,000 alerts to every 22 alerts to one meaningful event.
The reduction in alerts has cut down our noise by 90% and allows teams to focus on mitigation and not worry about excessive paging.
View full review »PagerDuty Operations Cloud offers several best features including cloud-based hosting, reliable performance, and flexible expandability.
Regarding the flexibility and expandability, you can scale up and down the amount of employees, add different paths to contacting people, and have monitoring capabilities, which has greatly helped my team.
PagerDuty Operations Cloud has positively impacted my organization with its very good interface and centralized operation. Having a centralized interface has made things easier by providing easy access administration.
View full review »The best features that PagerDuty Operations Cloud offers are that it is user-friendly, the design is scalable, and it is quite easy to use for a newbie as well. Logging into the alerts as well as logging into any details are easy over PagerDuty Operations Cloud.
It is user-friendly because it does not need any sort of coding or any sort of scripting language to be filled in. It can just operate with mouse clicks. We need to know how to operate it, when to close, when not to close, and how to read green, yellow, red, and blue monitors or markers as per the monitoring.
It is easy to use, and I think the software also comes at a very low budget or low cost, which is very cost-effective and cost-friendly as well, rather than its alternatives. It benefits in every way, in network security, in cybersecurity, and overall company data monitoring and firewall as well.
I do not have any specific numbers because those will not be shared with us. This is confidential information. The response time is great, and after using PagerDuty Operations Cloud, false positives and false alarms, as well as security monitoring, were very strongly done.
View full review »The best features PagerDuty Operations Cloud offers include quick access to failed jobs and the ability to add descriptions about the failed job. The quick access allows us to rapidly identify which job or operation has failed because it sends the job name or the operation name that has failed.
I have heard that integration between our systems and PagerDuty Operations Cloud was easy to implement. For efficiency, we can monitor our deployment process in real time. For incident response, PagerDuty Operations Cloud creates alarms that make calls to the specific person who can handle these issues.
We have fewer missed incidents because it keeps calling regarding the incidents until they are resolved. We have also reduced downtime because we notice errors and failed jobs, and we work to fix them.
View full review »The best features of PagerDuty Operations Cloud include integration, mobile app, reporting, and analytics, which I find very useful based on the access I have.
We review the data periodically to see our performance; for example, we check for alert fatigue, how many alerts have been addressed, and our TDX metrics such as time to respond. The analytics of PagerDuty Operations Cloud is so good that it gives me good visibility with just a few clicks, which helps in discussions with the team for continuous improvements.
View full review »The most valuable features I found were the integration capabilities and notification system. We used the open-source tool Alertmanager, which triggers health metrics from Prometheus and Splunk. PagerDuty Operations Cloud allowed us to integrate alerting seamlessly and notify users effectively, which helped the business significantly. Early detection of issues leads to better service provision. PagerDuty Operations Cloud provides multiple notification channels including SMS, phone calls, and email, which I found to be the best part of the platform.
Regarding the autonomous AI agents, I have not explored them because the AI trend started recently and I have been out of touch for the last seven or eight months. However, I have read about how AI integrates with the scheduling part. Previously, we had to manually update schedules every week, but with AI integration, we can write a prompt and build MCPs. Some firms I read about integrated an MCP they built in-house, and with the MCP, they can provide an Excel sheet or image, and PagerDuty Operations Cloud API can update everything without needing to manually access the platform.
We implemented automation through PagerDuty Operations Cloud for incident response. Previously, we had to manually update service level details, SLAs, notification mechanisms, and API keys. Now we can submit an Excel sheet or CSV file and make an API call using Python, which updates everything automatically. PagerDuty Operations Cloud also helps with analytics by showing how many alerts were triggered, how many were resolved, and which person handled which alert. This visualization helps us demonstrate to clients that we managed a certain number of alerts and reduced the alert count.
View full review »The product used to be unique in getting the message to the right person to fix the job. It integrates with multiple applications and is highly customizable, with policies, escalation procedures, and an event routing tool that ensures contacting the right person. A significant advantage is that it includes a metrics overlay, which provides an overview of what is happening in the environment, and its scheduling capabilities are superior to other operations.
View full review »PR
Prathik Rokhade
Associate Sr. Manager at Financial Insight Technology, Inc.
PagerDuty helps you bifurcate teams and redirect alerts to specific teams. It has good user management and is easy to plug in. It has good APIs to integrate with our apps and send alerts, making it compatible with most applications. It becomes a powerful infrastructure when handling priority applications, so we use it during the critical phase of our engineering development.
View full review »EM
Emery Lyne Manayan
Manager, Service Delivery at Coherent Capital Advisors
The product easily integrates with other solutions. We can do automation. The product is used for alerting.
We found that the interface of PagerDuty is more friendly. It was easy to use and easy to set up. It integrated easily with our critical server and our Jira project console. It also had better features than Opsgenie. We were impressed with PagerDuty, so we decided to give it a try.
View full review »PagerDuty let us set up rosters based on our shifts. We could assign a hierarchy for how the calls should be escalated and the number of times the call will be transferred between people before it is answered. It makes it easy to access an agent via mobile phone.
View full review »The most important feature that is used is call scheduling. We are also able actually to call IT folks in the case of an emergency.
I have not heard anything bad about its stability.
It has scaled well for us.
View full review »The SMS pages and the mobile application are pretty much the top two features.
The ease of having it on a mobile phone is valuable. We can always track incidents wherever we are, even away from our computers. Additionally, the escalation feature for incidents to senior engineers or other people is very useful.
The inbound integrations that PagerDuty provides with most of the DevOps tools are valuable.
It's an incident management tool. So, its main functionality is to integrate with monitoring tools. There is a flexible and easy way of integrating with monitoring tools. That's what I have observed in most of the scenarios. It allows us to configure the integration with APIs and plugins as well.
The most valuable features of PagerDuty are customization, access, policies, and different rules regarding the path of escalation. Additionally, it's easy to use and create overrides. For example, if you all are on a call for one week each, but somebody wants to go on PTO, the team needs to swap shifts in PagerDuty. This is easy to do by creating overrides to switch up the set schedules. It's very user-friendly in that aspect. It works well for monitoring and alerting.
View full review »The solution's most valuable features are that it adds each alert as a service, has good scheduling capabilities, and includes the ability to write logic based on texts.
View full review »PagerDuty's notification process is the most valuable feature.
View full review »The most valuable feature is the phone app that allows us to send notifications without additional fees when sending by SMS or phone calls. There is also an escalation metric that allows any alert that goes unanswered to automatically move on to the next person on the list.
View full review »DM
Deepak Malik
Director at a computer software company with 1,001-5,000 employees
Alert deduplication and noise reduction for alerts are the major features that I found useful.
View full review »PagerDuty's best features are the dedicated application that allows me to reach my engineers immediately and the ability to directly assign specific tasks to individuals and have them report back.
View full review »The most valuable feature of PagerDuty is its integration with other tools, such as Amazon AWS, to receive notifications or create automatic instances.
View full review »DK
Darrin Khan
Compliance, Security & Testing Manager at a financial services firm with 11-50 employees
The scheduling feature is the main valuable one for us, because it was previously costing us time. For example, when I was doing the scheduling for the rosters, I would be spending maybe a day out of a month getting the rosters all sorted out. It was rather intense, and a fair chunk of time each month was dedicated to the schedules.
The flexibility in what we can send to it: emails, custom webhooks, and things like that.
We have our production and development environments. If an alert goes offline in the development environment, it's generally treated as a low priority. However, if anything goes down or alerts us from the production environment as a critical or high priority, then an engineer has to stop and fix it straightaway.
View full review »The alerts are immediate in this solution, which allows us to respond to errors quickly.
We are also able to set the escalations level in this solution, which is useful.
The most valuable feature is definitely the flexibility of the schedule.
The mobile app is quite also good for what we do: for receiving alerts, acknowledging, assigning, adding new responders. It has rich features for our needs.
The alerting functionality is quite good. It is one of the key parts of our on-call process, so that people can react when things are not working as expected. We trust it a lot. It definitely works well. We have never missed alerts or had situations in which we felt that we could not trust the alerting functionality of PagerDuty. Alerts in our environment could be caused by a specific part of the platform not running right. If a database goes down, for example, we will get an alert. There could also be errors in an application or a particular zone of infrastructure is down, or if service-level objectives are being affected for some reason. In all these cases, we will get alerts.
View full review »GK
Gilad Karmy
Tier 4 Support Team Leader at a comms service provider with 10,001+ employees
It's a tool for incident management, to help us understand what happened during an alert. A cool feature is that it helps us to understand the flow of the alert. If the alert was coming to the current on-call and he didn't catch the call or didn't notice it for any reason, it starts being escalated automatically, according to the escalation schedule, or to other teammates. You can see the flow very easily on your phone or via the website, if you want to do a post-mortem.
The solution’s alerting functionality is very good. It does the job. It's not that it only works sometimes. It works every time it needs to. It also knows how to close alerts that are closed from the monitoring system and you can easily close and acknowledge alerts via your phone even if you don't have the mobile app. You can do it with an SMS. So at 2:00 a.m., it's very easy to navigate an incident.
The email-for-alerting integration is also valuable. If there is a team that needs my team, they can easily send an email with the subject and why they want us to be on board and that we should start investigating an issue. Instead of how it worked in the past, when they would call the on-call number and start talking and try to explain what is going on, they just send an email and it pages the current on-call who is scheduled. It's very nice and easy.
While using PagerDuty hasn't resulted in a decrease in issues, it has allowed us, in combination with the monitoring systems, to know about issues before customers are alerting us. If a monitoring system was only sending emails, those emails could be missed among thousands of emails. But if we create alerts in New Relic, which integrates with PagerDuty, and we get a call from PagerDuty, it's much better. By not missing an email, it allows us, during working hours, to engage with other teams or to resolve the issue without causing problems to our customers. Issues can be resolved before someone notices.
It is more the monitoring systems that can point out problems to be addressed before they become worse, but those systems are not really able to do more than send us an email. Without the integration to PagerDuty, issues that are defined as critical could be missed.
View full review »MM
Michael Maack
Owner at IT Verke limited
The feature I like the most is the alarming.
View full review »Obviously, scheduling the calls is a valuable feature. We have a feature where there were three calls: primary one, primary two and primary three. If this guy doesn't pick up, it will respond a certain way; that's the most valuable feature. We use it on the weekends. If some customer escalates, I'll get a page, if my manager doesn't pick up. That's one of the great features; I love it.
View full review »Notification is the most valuable feature.
View full review »We email in, and Pager Duty finds the person on-call, alerts and escalates.
View full review »The outbound calling feature is a feature that I find a lot of value from.
View full review »Whenever we have a server that goes down, we get notified immediately which helps out our clients and all the developers, so we know when we have to log on to bring it back up.
When I first started at this company, we set up our AD for free on Mail Server, but it paged everyone who was on-call. Moreover, we couldn't make adjustments, such as adding and removing people. The result was that if you didn't respond to a page, the task wouldn't get done.
With PagerDuty, if you don't respond to a page, the next on-call person would get the page. We don't have to think about who is on-call because the process is automatic.
View full review »I've found the the versatility of notifications to be the most valuable feature. With the multiple methods of notification and the escalation policies, we can always be certain that someone will get the message.
View full review »- Easy to ACK
- Easy to read UI
- Custom notification
- Easy to override schedules for on-call swaps
- Easy integration with Nagios
- Automated scheduling
- Escalation
- Text to voice
I really like the reporting aspects of the solution; they are absolute.
View full review »Its alerting capabilities, simple user interface, stability and functionality is exactly what we need. I would also highlight a fact, they have very responsive support, and API developers support. We mainly use it at GoodData for:
A) Alert/event routing from our product/monitoring system to the responsible DevOPS engineer who is on duty to fix/workaround issues reported. We mostly escalate to mobile/cell phones.
B) We use it for on call schedule resource planning as we have around 100 people using PagerDuty (mainly from engineering, support, services). Using their system, you basically know at any time who is working, when and which service he owns/is responsible for.
C) Billing/accounting. Using their APIs we are able to easily financially compensate those 100 people for their on-call duties. This might be country/geolocation specific.
View full review »Push notifications to mobile devices combined with the in-app ability to override the system volume allows for flexibility in managing other device notifications, while high-level notifications can be alerted without worrying if your device's notification level is sufficient.
View full review »SMS + phone call alerts, multiple escalation lists, personalized notification rules, centralized administrator interface, automatic incident creation. The customization options available in PagerDuty make conformity to our needs possible.
View full review »I would say the notifications, and the information that comes along with them.
View full review »There are several valuable features that we use, among them are the escalation policy, integration of email, telephone, and mobile app.
View full review »The product has valuable on-call scheduling, escalation, and incident workflow management features.
View full review »Buyer's Guide
PagerDuty Operations Cloud
May 2026
Learn what your peers think about PagerDuty Operations Cloud. Get advice and tips from experienced pros sharing their opinions. Updated: May 2026.
899,917 professionals have used our research since 2012.































































