We mostly use it for our on-call engineers, for schedules, alerting, and critical alerts. And, of course, we use it for the management of an issue, so that people acknowledge the alerts, reassign them, etc.
PagerDuty is #2 ranked solution in top IT Alerting and Incident Management tools. PeerSpot users give PagerDuty an average rating of 8 out of 10. PagerDuty is most commonly compared to ServiceNow: PagerDuty vs ServiceNow. The top industry researching this solution are professionals from a computer software company, accounting for 26% of all views.
What is PagerDuty?
PagerDuty is an agile incident management solution that integrates with ITOps and DevOps monitoring stacks to improve operational reliability and agility. From enriching and aggregating events to correlating them into actionable alerts, PagerDuty streamlines the incident management lifecycle by reducing noise and resolution times. With hundreds of native integrations with operations tools, automated scheduling, advance reporting and guaranteed availability, PagerDuty is trusted by over 7,000 organizations globally to increase business and employee efficiency. The company is headquartered in San Francisco and backed by leading venture capital firms Andreessen Horowitz, Bessemer Venture Partners, Baseline Ventures, Ignition Partners and others. To try PagerDuty for free, or to get more information, visit www.pagerduty.com.
Download the IT Alerting and Incident Management Buyer's Guide including reviews and more. Updated: May 2022
40% of the Fortune 100 TrustPagerDuty. Customers include: Slack, Intuit, Zendesk, Panasonic, Pinterest, Airbnb, eHarmony, McKesson, Comcast
PagerDuty Pricing Advice
What users are saying about PagerDuty pricing:
- Highest Rating
- Lowest Rating
- Review Length
Showingreviews based on the current filters.
Compliance, Security & Testing Manager at a financial services firm with 11-50 employees
Reduces white noise, which has reduced engineer fatigue
Pros and Cons
- "It reduces the amount of white noise. If something comes through, then it will alert somebody. However, if it's a bit of white noise that comes through at night, then it gets dealt with the next day. Everything is visible to everybody. It's not just a single person getting an SMS, then going, "Oh, I'm not going to worry about that." The visibility to everybody on the team is one of the great things about it because it reduces the white noise."
- "Because of the way you have to structure the rosters, if an engineer has to go on leave (or something), you can't just go in and reassign/take this person out of all of the different rosters that they're in. You have to go into each of the rosters and take them out. There might be a roster for business hours, after hours rotation, and monitoring deployments. Each time we need to take an engineer out of the pool, e.g., if they're sick or on leave, then we have to go and touch all of those rosters, updating and replacing them. Whereas, if we could just take the person out and have it automatically fill in the rostering, then that would make life a lot easier for managing it."
What is our primary use case?We are a 24-hour online business. We use it for scheduling our on-call engineers and making sure that there is follow-the-sun or round-the-clock coverage for alerting and network operations. It ingests all our alert paths, i.e., anything that generates an alert of any description, such as, Splunk, AWS, and internal applications. We feed all our events into it, then it generates alerts which need a response from an engineer with a description. Another thing is it is built-in scheduling is pretty much hands-off for our on-call engineers unless somebody goes on holidays. That is the only time that we have to jump in there and make any changes.
How has it helped my organization?One of the things with our incident flow is that it generates Jira tickets for us. So, its JIRA integration is a critical thing because we need to have that logged for compliance in a separate ticketing system. Having it go into Jira is great, where we can generate hard copies of the alerts and all the events around it. Also, it has the visibility to be able to update one particular location, so you can update Jira and that information goes across to PagerDuty, or you can update PagerDuty and it goes back to Jira. The integration that they have now is great. For example, if you are in the middle of a major event, where you have multiple incidents coming at you, the way it correlates events into a single incident is great. It reduces the amount of white noise. If something comes through, then it will alert somebody. However, if it's a bit of white noise that comes through at night, then it gets dealt with the next day. Everything is visible to everybody. It's not just a single person getting an SMS, then going, "Oh, I'm not going to worry about that." The visibility to everybody on the team is one of the great things about it because it reduces the white noise.
What is most valuable?The scheduling feature is the main valuable one for us, because it was previously costing us time. For example, when I was doing the scheduling for the rosters, I would be spending maybe a day out of a month getting the rosters all sorted out. It was rather intense, and a fair chunk of time each month was dedicated to the schedules. The flexibility in what we can send to it: emails, custom webhooks, and things like that. We have our production and development environments. If an alert goes offline in the development environment, it's generally treated as a low priority. However, if anything goes down or alerts us from the production environment as a critical or high priority, then an engineer has to stop and fix it straightaway.
What needs improvement?Because of the way you have to structure the rosters, if an engineer has to go on leave (or something), you can't just go in and reassign/take this person out of all of the different rosters that they're in. You have to go into each of the rosters and take them out. There might be a roster for business hours, after hours rotation, and monitoring deployments. Each time we need to take an engineer out of the pool, e.g., if they're sick or on leave, then we have to go and touch all of those rosters, updating and replacing them. Whereas, if we could just take the person out and have it automatically fill in the rostering, then that would make life a lot easier for managing it. We have an on-call phone number. However, at the moment, it is routed to a static voicemail. We would actually like to be able to have that phone follow whoever is on-call.
IT Alerting and Incident Management
Find out what your peers are saying about PagerDuty, Everbridge, Atlassian and others in IT Alerting and Incident Management. Updated: May 2022.
598,116 professionals have used our research since 2012.
For how long have I used the solution?I have used it for five or six years, possibly longer.
What do I think about the stability of the solution?The stability is pretty good. There was one incident where push notifications stopped, but it failed over to SMS and phone calls, so it really didn't make much of a difference. Even then, because we didn't get that many alerts through at the time that they were having push notification issues, it didn't bother us. It was resolved very quickly (in about an hour). The only reason we noticed it was because they told us about it, not because we found it. We haven't had any issues where PagerDuty caused an impact to us from their maintenance. Using their product, we have been able to set our alerts into maintenance, which is good. There has been no downtime from them being offline, or anything like that. Before, we would have needed to have done a lot of alert path manual management, then going through afterward, enabling and disabling them. Whereas, within PagerDuty, it's so much easier. You just go in there and click a service on the maintenance, then it automatically does it all for you in the background. We don't have to sit there and think about it. So, it's quick and simple. This is saving us a good hour a month, because that would be one engineer sitting there going through, updating alert paths, etc. Because we are a payment processor, we can't go offline. We need to be very on the ball and on point with any issues that come up. Having PagerDuty there means we're able to do that.
What do I think about the scalability of the solution?The scalability is pretty good. I haven't seen anything that would restrict it. Because it's a SaaS platform, you can pretty much plug anything you want into it. I haven't had any restrictions on what I can feed into it from an alert perspective, and we can just keep adding more users as we see fit. It is an integral part of our operations environment, so we wouldn't want to change or reduce it in any way. If our production environment increased and we had to add more services to it, then it's easy enough to do. It's not as though that is a major problem. We have eight users in the organization. We also have a couple of stakeholder licenses where we notify stakeholders of major events. These are not actually interactive. They don't get alerts, but they'll get notifications if we allow them, such as, adding them to an incident. They will then get notifications from it, not necessarily alerts. There are internal, as we don't have external clients in that loop because the information management is a something that we keep a tight handle on and that is very manual. There are two other DevOps engineers who maintain it. There is redundancy if I'm sick. However, I still take the lead on a lot of the stuff.
How are customer service and support?We worked with technical support at one stage when we were trying to get a mail filter. We wanted to set up a complex mail filter with some rules around it. That is when we contacted them, though this is not an ongoing requirement. They were pretty good and very informative. They were to the point, without being blunt.
Which solution did I use previously and why did I switch?At the time of implementation, the solution was to replace our SMS-based solution, taking the rostering and management of the SMS rotation and making it easier. This was a bunch of homegrown shell scripts that had a little modem card, which would send SMSs to us. We switch to PagerDuter mainly because of the maintenance and inflexibility of our original solution. We had to maintain it ourselves, paying for the upkeep of the modem, SMS account etc., then making sure that we could send the information to various phones on different carriers. By going to PagerDuty, we were able to come up multiple paths to be able to get those alerts, not just by our SMS. Previously, we were manually copying and pasting the information. Per incident, it was taking us maybe half an hour, because someone would have to sit there and copy things backwards and forwards, making sure it was all in sync at the end of the incident. When we first started looking around for a product to replace the existing alerting process, we found this product where alerts were more visible. Then, based on that fact, they were more visible. After a while, this naturally reduced the quantity of alerts by making them more visible. This made it easier to deal with issues because we were able to see alerts. Also, everybody saw them, not just one person.
How was the initial setup?The initial setup was really easy. We just went in there and clicked a couple buttons, then away we went. Anytime you need to set something up, the initial setup is great, quick, and easy. It's when you get into some of the nuances, like rostering, where you have to take a person out of a roster, then put them back in. That sometimes adds a bit of complexity. However, the initial setup was one of the things that sold us on it since it was so quick and easy. That is because it is a SaaS-based solution. When we initially started it, it was like me fiddling around on one weekend. I said to the guys, "Look, I've got this going," then it pretty much went from there. So, it might have been an hour at the most. It did not take long at all.
What about the implementation team?I was the only person who deployed it.
What was our ROI?The main flexibility and return on investment we get is that we don't have to do the maintenance on the products that we previously had. It's just seamless. It's like, "Oh yeah, it's reliable. We don't have to do anything else." Whereas, previously it was, "Ah, is the pager actually working?" This reduces worry and everybody's comfortable with the fact that it's going to work. So, the return on investment is more a comfort factor, knowing that we're able to rely on it and not worry that, "Oh, hang on, the alerting's not working," then go and chase up what's wrong with the alerting as well as chase any other problems which come up.The best thing that we've had is that we get alerted before things happen rather than after the customer's having a problem or notices the problem. As a result of the reduced white noise, we have reduced engineer fatigue. This means that because the engineers are not tired, their work throughput increases. It is definitely noticeable. If our engineers is working and gets called after hours every night, then when they come in to do their shifts, they're tired because they've had interrupted sleep. Whereas, if we make sure we don't have the white noise and everything else coming through, they're still able to get through their normal workload as well.
What's my experience with pricing, setup cost, and licensing?If you add more people, then you have to pay more, which is always a thing with the SaaS solutions. PagerDuty's pricing seems competitive. At one point, we were looking at OpsGenie because part of their current pricing includes the call routing that we wanted to include. It was actually cheaper to get that plus the call routing than it is on PagerDuty at the moment. However, we would have to go and buy an extra module to go with it. What we have at the moment is solid, and it would be a hard sell to say, "We'll go to something else that we're not familiar with." If we wanted phone calls or additional SMSs, we would have to pitch up for those. They give us so many per month per user, then we have to pay extra if it goes over that.
Which other solutions did I evaluate?Over the years, we have looked at other solutions: OpsGenie and VictorOps. There was another one, but they faded away. We were also using Pingdom at one point. Some of them are still a little bit green in this space. They're definitely coming up to speed. So far, we're settled on PagerDuty because they were the leader and only one around at the time we were evaluating solutions. Since then, we've started looking at other products just to make sure that they're still on point with what we need. The alerting functionality is not too bad. I have evaluated other competitive products for the way you can set different types of alerts, e.g., for non-critical or critical. PagerDuty will alert you differently based on those settings, which is an advantage that we like. It will also try multiple paths so you can set it up to email you the alert, send you an SMS, phone you, or just a push notification to your phone. One of those four mechanisms means the engineer will get notified one way or another. If that doesn't work, it automatically escalates to the next person in the alerting path. We do have a project in the pipes for probably the beginning of next year to go through and do another review to make sure that the solution has everything there. We also want to do comparisons for what other options are available, make sure the pricing is still competitive, what's on offer, and so on.
What other advice do I have?For whatever solution you have for alerting, and it being such a critical role in incident management, you need to be able to rely on it. PagerDuty allows us to do that. Ensure you sit down and identify what you want in any alerting platform, whether it's PagerDuty or OpsGenie. Sit down and define what you want, particularly around your scheduling, what alerts you want to be able to ingest or handle, who you want to be able to process or send those alerts to, and any other possible bits and pieces in there that you may need before you sit down and look at an alerting platform of any description. Because sometimes, depending on what it is, there may be another way of doing it when you actually go and talk to the salespeople or pre-sales engineers. They'll go, "Oh, well, you can do this, this, or this." This will avoid bright light problems where, "Oh, that's a nice, shiny light. Yeah, we need that." You actually have in front of you what you need, not necessarily what they're trying to sell you. We have looked at the solution’s analytics, but haven't gone much into them. At the time that we were looking at it, we didn't see any real benefit to it since we are only a small team. If you would look at a larger organization, you would get more benefit out of it. However, because we're such a small team, everybody knows how many alerts are coming through. It's not as though we need to do a full-on detailed, analytical review of things. I would rate this solution a nine out of 10. It is a reliable solution that works.
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Helps with managing schedules, escalating issues, and adding people to an instance, but licensing is not flexible enough
Pros and Cons
- "The most valuable feature is definitely the flexibility of the schedule. The mobile app is quite also good for what we do: for receiving alerts, acknowledging, assigning, adding new responders. It has rich features for our needs."
- "The solution's analytics are okay. I don't think the features, at this point, give you a lot of insights. We have actually been trying to get insights from it but it hasn't really given us a lot of extra points to explore. We were looking at the number of alerts to see where many of the alerts were coming from. We never managed to get many insights on this."
What is our primary use case?
How has it helped my organization?
PagerDuty enables us to implement well-known techniques for on-call. It helps with managing schedules, being able to escalate issues, and being able to add people to an instance. It enables us to have multiple levels of people on-call, in a simple way. Otherwise, managing those schedules, offline, would be quite hard. Having a tool like PagerDuty to support us from that standpoint is really key, so that we can perform well when we have issues.
The fact that it doesn't have maintenance windows is definitely something that is really key. It was a key element in our choice. Having maintenance windows for a monitoring system is just not acceptable. The impact is that we really trust PagerDuty. There were situations where a specific alert didn't reach PagerDuty. We looked into finding the reason elsewhere, and not in PagerDuty, because we really trust that PagerDuty, when it catches an alert, will actually deliver it.
What is most valuable?
The most valuable feature is definitely the flexibility of the schedule.
The mobile app is quite also good for what we do: for receiving alerts, acknowledging, assigning, adding new responders. It has rich features for our needs.
The alerting functionality is quite good. It is one of the key parts of our on-call process, so that people can react when things are not working as expected. We trust it a lot. It definitely works well. We have never missed alerts or had situations in which we felt that we could not trust the alerting functionality of PagerDuty. Alerts in our environment could be caused by a specific part of the platform not running right. If a database goes down, for example, we will get an alert. There could also be errors in an application or a particular zone of infrastructure is down, or if service-level objectives are being affected for some reason. In all these cases, we will get alerts.
What needs improvement?
The solution's analytics are okay. I don't think the features, at this point, give you a lot of insights. We have actually been trying to get insights from it but it hasn't really given us a lot of extra points to explore. We were looking at the number of alerts to see where many of the alerts were coming from. We never managed to get many insights on this. The solution has not enabled us to go beyond responding to incidents and to start predicting or preventing them.
It would help if they simplified the way you try to get insights or information about instances and to improve your situation regarding the groupings of alerts.
Even if there are a lot of different functionalities, a lot of prediction abilities in place, it's not really clear what the best practices are for using PagerDuty to get the best out of the platform. The concepts are quite complex, at times, for people to understand. They need a more straightforward way to use the product and get the best out of it. With all the concepts of escalation, services, and the schedule, it gets confusing.
For how long have I used the solution?
I've been using PagerDuty for one year. I joined this company a year ago and it was already using it.
What do I think about the stability of the solution?
The stability is very good. If I remember correctly, we have only had one incident with PagerDuty over the whole period that I have been using it. I also used PagerDuty at my previous company. My experience is that it is a very stable platform.
There are also no bugs in the apps, in my experience. It is a very high-quality product.
What do I think about the scalability of the solution?
It's a SaaS.
But in my previous company and also here, I see the cost of the platform as an impediment to scaling this from a small team to the whole company. But for the platform itself, we don't have to manage anything in terms of scaling.
We have about 100 people working with PagerDuty — mostly engineers. It is used across all the different engineering teams. The adoption rate is 100 percent.
How are customer service and technical support?
We have used their technical support once. It was okay. I don't think we managed to get to the bottom of the issue we were having, but it ended up that it was not that critical at that point, so we just let it go.
Which solution did I use previously and why did I switch?
In my previous company we used a home-grown system before PagerDuty.
How was the initial setup?
I was involved in the initial setup of PagerDuty in my previous company. Depending on how big your deployment — how many services you want, how many teams — it can get complex. But in general, it's simple.
What is complex is to go from using PagerDuty as an instance manager and as a pager, to getting to the next level and the advantages that PagerDuty sells a lot: intelligence, analytics; the extra functionality that they add. That functionality comes with a high price. For me, that is the complex part. It's not the start, rather the hard part is to really get the most out of what PagerDuty offers.
What was our ROI?
The return of investment is from how quickly we engage our teams, meaning that we don't waste time and money. It has returned the price that we pay, but the price is high.
What's my experience with pricing, setup cost, and licensing?
A con, a failure, is the cost which is quite high. But if you want to get a full-featured application and you have a big team...
Some important features are closed to a group because of the licensing. For example, one of the features that I always wanted to use but never managed to is the postmortem part of PagerDuty. To me, it is important that everyone in the organization be able to read any postmortem that is produced. PagerDuty only allows you to share it with people who have accounts. It doesn't have different levels of accounts. There is only a complete account and you have to pay for it.
You really need to understand what feature functionality you want from the solution and then see what the cost-benefit is for what you want to achieve. We tried the stakeholder licenses, but we ended up never using them. They don't have a lot of flexibility on that. It's almost like one type of licensing or nothing.
Which other solutions did I evaluate?
We have done some investigation internally, but nothing really serious. We looked at Opsgenie and VictorOps. We didn't see a reason to change. We trust PagerDuty.
The main pro, of course, of PagerDuty is that it is a full-featured application, with high trust. Many companies use it, big companies that trust it. This is definitely a big plus.
One of the reasons we stayed with PagerDuty was, of course, trust, but we also see the value of the added functionality that PagerDuty gives us. But if you don't get to use those — the things that are the differentiator for PagerDuty — then the cost starts to be quite high. You need to be completely in to be able to get the full advantage of PagerDuty. Competitors will give you a little bit less functionality, fewer luxury functions, but the cost is more accessible for the feature set that you really use.
What other advice do I have?
From the get-go, approach it in a way where you can get the most out of it. Really engage with PagerDuty from the beginning to support you on that journey. Otherwise, you will only be able to use the core functionalities that you can also get from the competition. Focus on the full platform and make the decisions that will simplify that. Don't do quick wins at the beginning that will not help you to take full advantage of the platform.
PagerDuty worked with us to find the right solution that fit our needs and budget, but they didn't do so as much as I would expect or as much as I would have liked. At some point we asked, "How can you help? How can we get the most out of PagerDuty?" We had some ideas and we engaged with their Professional Services, but the cost of that was quite high. It ended up that we never did what we wanted to do with PagerDuty. It got stuck at some point. They did help but we would need to pay even more to get the help that we need.
It's really key to manage your operations life cycle. You cannot go without having something in place. And it's important to have something that will support you. Whether it's PagerDuty or something else, that is really key.
If I don't look at the cost of it, I would rate PagerDuty between eight and nine out of 10. If I include the cost, it would be a seven.
Disclosure: I am a real user, and this review is based on my own experience and opinions.
Download our free IT Alerting and Incident Management Report and find out what your peers are saying about PagerDuty, Everbridge, Atlassian, and more!
Updated: May 2022
Product CategoriesIT Alerting and Incident Management
Download our free IT Alerting and Incident Management Report and find out what your peers are saying about PagerDuty, Everbridge, Atlassian, and more!
- How do you decide about the alert severity in your Security Operations Center (SOC)?
- What is the difference between mitigation and remediation in incident response?
- What is an incident response playbook and how is it used in SOAR?
- Why a Security Operations Center (SOC) is important?
- GoDaddy has been hacked again. What can be done better?
- What tools and solutions do you use for automated incident response in an enterprise in 2022?