The main purpose of PagerDuty Operations Cloud is to receive alarms for real incident production critical issues. Whenever an incident happens, we get an alarm call or a phone call on our phone or through an application call, so that we are aware of the situation and know that we have to be vigilant about it and resolve that particular issue. This is the main use case of PagerDuty Operations Cloud, and it is the use case where most of the company is using it.
PagerDuty Operations Cloud
PagerDutyExternal reviews
External reviews are not included in the AWS star rating for the product.
PagerDuty Gets the Right Person Notified Fast
1. On-call scheduling is good and easy to track
2. Mobile alerts comes instantly most of time
3. Integrations with monitoring tools saves lot of manual effort
4. Escalation thing is very useful when someone not responding
Reduces Downtime with Easy On-Call Scheduling and Escalation Chains
Automated incident alerts have improved response time and keep critical issues under control
What is our primary use case?
What is most valuable?
PagerDuty Operations Cloud has improved the incident management and operational responses workflows. It has the capabilities of intelligent alerting and automated escalation policies, which help reduce manual intervention. The most useful aspects would be the noise reduction if that can be improved. Overall, PagerDuty Operations Cloud is a very strong platform to improve the incident response time and operational efficiency through automation and intelligent alert management.
What needs improvement?
The noise aspect of PagerDuty Operations Cloud could be better. What happens is if there is some sort of an issue occurring, it keeps on repeating and calling again and again. Once the alert is acknowledged, it should stop calling because it creates noise. When we are handling production critical issues, once we have acknowledged an alert, we are still receiving calls, which becomes a distraction because we are also checking an incident and trying to resolve that particular issue.
Generative AI must provide some significant value to the incident management workflow in PagerDuty Operations Cloud. It helps in faster incident triaging, reducing the time spent understanding the alerts and identifying the root cause. It quite helps at a certain point in time, but suppose we have an issue which is not under the capabilities of that particular functionality. In those cases, it does not help. However, if we have integrated our internal services with the system, it collaborates by providing the context in actionable information using that.
For how long have I used the solution?
I have been using PagerDuty Operations Cloud for four or five years.
What do I think about the stability of the solution?
The functionality in the application or the website of PagerDuty Operations Cloud is 100 percent very much operational all the time. There is no lag. The reliability is very good. We can rely on this application. Even if we are not on the system, we get the call on time. We get the call pretty much on time. There is never a lag on alerting or anything. This is something which we can rely on and trust the product.
What do I think about the scalability of the solution?
Scalability for PagerDuty Operations Cloud is something which is very good. We can put unlimited numbers of alerts and unlimited numbers of things integrated with this system.
How are customer service and support?
I have never contacted the technical support team of PagerDuty Operations Cloud. The application was pretty much reliable and I never had to contact them because it is operational all the time. In case if it is needed, I believe there would be great support, but that is not needed.
How was the initial setup?
I was not involved in the initial deployment of PagerDuty Operations Cloud.
What about the implementation team?
I have not been involved with the implementation at PagerDuty Operations Cloud. The reliability team, like the SRE team, used to do it. I have not done that, but I have used it, so I can only give feedback about using it.
Which other solutions did I evaluate?
I have used OpsGenie once, which is an Atlassian product. It is similar, but their user interface is not that much helpful as compared to PagerDuty Operations Cloud.
I am not aware of the pricing for PagerDuty Operations Cloud, but I believe it is quite good. The other product, OpsGenie, is helping because they are offering it for free of cost if you are using other products like Jira. I believe that is where the differentiation is happening. For PagerDuty Operations Cloud, I do not know what the pricing is; I do not have any idea about it.
What other advice do I have?
PagerDuty Operations Cloud handles maintenance themselves. I give a nine out of ten for PagerDuty Operations Cloud because everything is pretty much perfect, including scalability, as we can scale to unlimited numbers of things. Reliability is also something which we can rely on. Maintenance is solid, and the automation part is also performed very well. I deducted one point because of the noise part which can be improved in the future. Overall, this application is pretty much available all the time and it maintains the operational efficiency very well. My overall review rating for PagerDuty Operations Cloud is nine out of ten. We are users of PagerDuty Operations Cloud product.
Flexible Alerts Across Channels with Strong MRR Insights
Targeted alerting has reduced noise and now supports meaningful incident response
What is our primary use case?
My main use case for PagerDuty Operations Cloud is targeted alert paging.
A specific example of how I use targeted alert paging with PagerDuty Operations Cloud is that we were able to tie our services and use PagerDuty AIOps to understand dependencies so we get paged a minimal number of times as opposed to an alert storm.
What is most valuable?
The best features PagerDuty Operations Cloud offers are PagerDuty AIOps and event orchestration.
PagerDuty AIOps and event orchestration help my team day to day as they allow our teams to share their upstream downstream relationships of services and products and then allow AIOps to interlace and understand how to page during an incident.
PagerDuty Operations Cloud has positively impacted my organization by reducing the number of alerts from 60,000 alerts to every 22 alerts to one meaningful event.
The reduction in alerts has cut down our noise by 90% and allows teams to focus on mitigation and not worry about excessive paging.
What needs improvement?
PagerDuty Operations Cloud can be improved by being more sequenced in terms of adapting into the incident life cycle. For example, an alert cannot be an incident. An incident should be completely unrelated to an alert.
For how long have I used the solution?
I have been using PagerDuty Operations Cloud for about four and a half years.
What do I think about the stability of the solution?
PagerDuty Operations Cloud is stable.
What do I think about the scalability of the solution?
The scalability of PagerDuty Operations Cloud is pretty robust.
How are customer service and support?
The customer support of PagerDuty Operations Cloud is pretty helpful.
Which solution did I use previously and why did I switch?
I previously used Opsgenie, and I switched because it was end of life, end of support.
What's my experience with pricing, setup cost, and licensing?
My experience with pricing, setup cost, and licensing is that it's pretty reasonable. It's on a cost-seat basis and the AIOps and PagerDuty combination was reasonable.
Which other solutions did I evaluate?
Before choosing PagerDuty Operations Cloud, I did not evaluate other options.
What other advice do I have?
PagerDuty Operations Cloud is pretty robust and I recommend thinking about how to scale. I would rate PagerDuty Operations Cloud an eight on a scale of one to ten. I chose an eight because I think there is still scope for improvement in terms of how the product presents itself, but it is pretty robust in terms of how it performs.
Which deployment model are you using for this solution?
If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?
Intuitive and Stabilizing, Optimizes Internal Processes
Reliable paging that keeps our Azure on-call rotation sane
Efficient Alert System with Minor Configuration Hurdles
Seamless Integration and On-Call Management
Centralized Incident Response with Practical AI Summaries
What’s improved a lot recently is the AI side. The incident summarization and contextual insights are actually practical during active incidents, because nobody wants to sift through a flood of alerts just to figure out what changed. The AI-generated summaries help speed up triage, instead of feeling like a checkbox feature.