    PagerDuty Operations Cloud

    Sold by: PagerDuty 
    Deployed on AWS
    The PagerDuty Operations Cloud is essential infrastructure for all unplanned, time-sensitive, critical work. It automatically detects and diagnoses disruptive events, mobilizes the right team members to respond, and automates infrastructure and workflows across your digital operations. This means you can resolve unplanned, unstructured, time-sensitive, and high-impact issues quickly, with fewer escalations to your technical teams, while minimizing the impact on your customers and maintaining brand trust.
    4.5

    Overview

    High customer expectations and increasingly distributed systems mean disruptions to digital services can have catastrophic effects on sales, brand loyalty, and operating costs. The PagerDuty Operations Cloud deflects unnecessary work from teams and subject matter experts so they can focus on delivering business value. Urgent work is escalated to the right teams and routine work is made self-service. Teams can automate and accelerate issue resolution with minimal human interruption, improving system resilience and team capacity while reducing the strain of operational complexity and the unexpected.

    With more than 700 integrations, APIs, and apps for customer service, the PagerDuty Operations Cloud empowers rapid responses in any environment. And thanks to more than 10 years of data ingestion, its machine learning-powered AIOps functionality can reduce alert noise by up to 98% and drive down MTTR with critical context for faster triage and effective automation.

    PagerDuty integrates with various AWS services, including AWS CloudWatch, Amazon GuardDuty, AWS CloudTrail, AWS Personal Health Dashboard, Amazon EventBridge, AWS Security Hub, Amazon DevOps Guru, AWS Control Tower, AWS Outposts, and AWS S3 Storage Lens.

    AIOps: PagerDuty AIOps helps teams reduce noise, triage efficiently to drive the right actions towards resolution, and remove manual, repetitive work from the incident response process. Built-in noise reduction, driven by an ML model that learns and adapts based on user behavior, means teams see fewer incidents overall. Automating the toil of manual event processing improves efficiency, saving teams valuable time for innovation.

    Process Automation: PagerDuty Runbook Automation is a managed cloud service that enables DevOps teams and SREs to create and delegate operational tasks in automated runbooks to other stakeholders such as developers, NOC personnel, and incident responders. Runbook Automation provides automated workflows and task automation focused on IT and developer process automation. Examples include service provisioning, CI/CD, configuration management, incident diagnosis and remediation, and more. With PagerDuty Runbook Automation, you can resolve requests in minutes rather than days, optimize security and compliance, and give your engineers more time to spend on innovation rather than firefighting.

    Incident Response: PagerDuty helps you save time and money by bringing together the right teams with the right information to resolve incidents faster. Replace manual processes with automation to streamline incident response, freeing up time and resources for more innovation. Orchestrate end-to-end incident response with a service ownership model that only brings in the teams you need. Over 21K organizations trust PagerDuty to help them adopt DevOps best practices and build more resilient operational practices to minimize costly downtime and protect the customer experience.

    Custom Private Offer: We can create a custom offer tailored to your needs. Please contact us at aws-sales@pagerduty.com.

    Highlights

    • Incident Response - Manage incidents end-to-end
    • Process Automation - Automate and delegate business and IT processes
    • AIOps - Maximize IT capacity with fewer incidents and faster resolution

    Details

    Delivery method

    Deployed on AWS

    Features and programs

    Buyer guide

    Gain valuable insights from real users who purchased this product, powered by PeerSpot.

    Financing for AWS Marketplace purchases

    AWS Marketplace now accepts line of credit payments through the PNC Vendor Finance program. This program is available to select AWS customers in the US, excluding NV, NC, ND, TN, & VT.

    Pricing

    Pricing is based on the duration and terms of your contract with the vendor, and additional usage. You pay upfront or in installments according to your contract terms with the vendor. This entitles you to a specified quantity of use for the contract duration. Usage-based pricing is in effect for overages or additional usage not covered in the contract. These charges are applied on top of the contract price. If you choose not to renew or replace your contract before the contract end date, access to your entitlements will expire.
    Additional AWS infrastructure costs may apply. Use the AWS Pricing Calculator to estimate your infrastructure costs.

    12-month contract (11)

    Dimension | Description | Cost/12 months
    Professional | On-call and incident response for growing teams | $252.00
    Business | Streamlined incident response for the enterprise | $492.00
    CustomerServProfessional | Bi-directional comms between CS & Dev, protect SLAs, & lower MTTR | $252.00
    CustomerService Business | Bi-directional comms between CS & Dev, protect SLAs, & lower MTTR | $492.00
    Runbook Automation | Automate manual procedures in runbooks | $1,500.00
    Automation Actions | Add-on: Automate steps to diagnose & remediate incidents | $240.00
    Live Call Routing | Add-on: For on-call schedules & escalations (by line) | $1,890.00
    Runbook Auto Job Runner | Add-on: For Runbook Automation | $750.00
    Stakeholder Users | Bundle of 50 Stakeholder users | $1,800.00
    PagerDuty Status Pages | 1000 User Pack | $1,068.00

    Additional usage costs (1)

    The following dimensions are not included in the contract terms and will be charged based on your usage.

    Dimension | Cost/unit
    Additional events over contracted value | $0.06
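
    As a rough illustration of how the contract price and usage-based overages combine, the sketch below adds a 12-month contract total to overage charges. It assumes each contract price applies per purchased unit, which the listing does not state explicitly. The function name, the unit quantities, and the overage event count are hypothetical and chosen only for the example.

```python
from decimal import Decimal

# Contract prices for 12 months, copied from the pricing table above.
# Assumption: each price applies per purchased unit of that dimension.
CONTRACT_PRICES = {
    "Professional": Decimal("252.00"),
    "Business": Decimal("492.00"),
    "Automation Actions": Decimal("240.00"),
}

# Usage-based rate for each event over the contracted value.
OVERAGE_PER_EVENT = Decimal("0.06")


def estimate_annual_cost(units, overage_events=0):
    """Return contract cost plus overage charges for one 12-month term.

    `units` maps a dimension name to a purchased quantity (hypothetical input).
    `overage_events` is the number of events above the contracted value.
    """
    contract = sum(CONTRACT_PRICES[name] * qty for name, qty in units.items())
    overage = OVERAGE_PER_EVENT * overage_events
    return contract + overage


# Hypothetical order: 10 Professional units and 5,000 overage events.
total = estimate_annual_cost({"Professional": 10}, overage_events=5000)
print(total)  # 2820.00
```

    Actual charges depend on your contract terms with the vendor, and AWS infrastructure costs are not included in this estimate.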

    Vendor refund policy

    All fees are non-cancellable and non-refundable except as required by law.

    Custom pricing options

    Request a private offer to receive a custom quote.

    Legal

    Vendor terms and conditions

    Upon subscribing to this product, you must acknowledge and agree to the terms and conditions outlined in the vendor's End User License Agreement (EULA).

    Content disclaimer

    Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

    Usage information

    Delivery details

    Software as a Service (SaaS)

    SaaS delivers cloud-based software applications directly to customers over the internet. You can access these applications through a subscription model. You will pay recurring monthly usage fees through your AWS bill, while AWS handles deployment and infrastructure management, ensuring scalability, reliability, and seamless integration with other AWS services.

    Support

    Vendor support

    Our team provides multiple resources for customers to find answers to questions and get help with our product. Users may browse our integration guides (pagerduty.com/integrations) to integrate with partner tools, our knowledge base (support.pagerduty.com) to learn more about using PagerDuty, and our developer docs (developer.pagerduty.com) to use our APIs. Additionally, anyone can interact with other PagerDuty users and PagerDuty employees via the PagerDuty Community (community.pagerduty.com). Our Support team is available during regular business hours around the globe, Monday through Friday, and can be contacted by email at support@pagerduty.com or via a ticket submitted at tickets.pagerduty.com.

    AWS infrastructure support

    AWS Support is a one-on-one, fast-response support channel that is staffed 24x7x365 with experienced and technical support engineers. The service helps customers of all sizes and technical abilities to successfully utilize the products and features provided by Amazon Web Services.

    Product comparison

    Updated weekly

    Accolades

    Top 10 in Agile Lifecycle Management, IT Business Management, ML Solutions
    Top 25 in Log Analysis
    Top 50 in ELT/ETL

    Customer reviews

    Sentiment is AI generated from actual customer reviews on AWS and G2.
    [Sentiment chart: Functionality, Ease of use, Customer service, Cost effectiveness; 15 reviews; legend: positive, mixed, and negative reviews; some entries show insufficient data]

    Overview

    AI generated from product descriptions
    Alert Noise Reduction
    Machine learning-powered functionality that reduces alert noise by up to 98% through adaptive models based on user behavior patterns
    Incident Response Orchestration
    End-to-end incident management with service ownership model that mobilizes appropriate team members and automates infrastructure workflows
    Runbook Automation
    Managed cloud service enabling creation and delegation of automated operational tasks including service provisioning, CI/CD, configuration management, and incident remediation
    Multi-Platform Integration
    Over 700 integrations, APIs, and applications supporting integration with AWS services including CloudWatch, GuardDuty, CloudTrail, EventBridge, Security Hub, and DevOps Guru
    Event Detection and Diagnosis
    Automatic detection and diagnosis of disruptive events with machine learning-powered AIOps functionality leveraging over 10 years of data ingestion
    Unified Observability Platform
    Comprehensive visibility across applications, infrastructure, logs, databases, networks, and digital experiences through a single-pane-of-glass interface
    AIOps and Machine Learning
    AIOps enhanced with machine learning capabilities to simplify management of distributed environments and automatically prioritize alerts to reduce alert fatigue
    Automated Instrumentation and Dependency Mapping
    Automated instrumentation with dependency mapping and service relationship views to identify multi-level relationships across services
    Open Source and Container Support
    Support for open-source frameworks, container technologies, and third-party integrations for cloud-native environments
    Rapid Deployment and Integration
    Quick installation with automated setup and easy integration with SolarWinds Hybrid Cloud Observability for reduced time to value
    Workflow Orchestration
    Application and data workflow orchestration with scheduling, management, and monitoring capabilities for production workflows
    AIOps and Observability
    Cloud-native observability and AIOps solution with topology rationalization and root cause analysis for hybrid-cloud environments
    IT Asset Discovery and Mapping
    IT asset discovery and dependency mapping solution with dynamic topology updates and visual representation of business services
    Predictive Resource Optimization
    Predictive analytics-driven optimization for IT resources and applications including Kubernetes, microservices, containers, and multi-cloud services
    Edge Data Collection and Analysis
    Operational technology data collection, aggregation, and analysis at the edge with IT and OT data integration for failure prediction

    Contract

    Standard contract: No | No | No

    Customer reviews

    Ratings and reviews

    4.5 (931 ratings)
    5 star: 71%
    4 star: 25%
    3 star: 3%
    2 star: 0%
    1 star: 0%
    9 AWS reviews | 922 external reviews
    External reviews are from G2 and PeerSpot.
    Sachin Mohanty

    Centralized alerts have reduced incident response time and now streamline SME on-call collaboration

    Reviewed on Mar 04, 2026
    Review provided by PeerSpot

    What is our primary use case?

    In my organization, we use PagerDuty Operations Cloud to acknowledge alerts. It is set up so that it is often used to page the SMEs. Whenever we work on a task and face a critical situation that we are unable to troubleshoot from our end, we page the SMEs. Irrespective of the team, if it is an infra-related issue, we page infra. If it is related to some other product, we page that product's SMEs and bring them into a PagerDuty Operations Cloud call. We inform them about the issue, and then they acknowledge the alerts. After acknowledging the alerts, they start working on that particular error.

    PagerDuty Operations Cloud is also organized so that if there is any critical issue, it will create an alert that will go in a particular notification form to the SME with a phone call, stating that there is a critical issue which is in progress. The particular SME will acknowledge the alert and come and join the call, mentioning that they have been paged for this issue. Then we will start working with that particular team to resolve the issue from our end.

    Regarding the incident command system, we use the Freshservice tool. Freshservice and PagerDuty Operations Cloud have been synced in my organization. The incident command system is a structured way to handle major incidents. Whenever there is an outage, we use the incident command system in PagerDuty Operations Cloud to manage the communication flow. Everyone jumps into a call, and then multiple people start fixing the issues. Everyone works hard to bring that instance back online or to restore that particular environment.

    What is most valuable?

    The best feature that I like about PagerDuty Operations Cloud is whenever we page a particular team. There is a specific feature where we can directly page a person. Usually, once we trigger the alert, it goes to a particular person, and if that person does not acknowledge it, then it will go to their reporting person. Even if they also do not acknowledge it, then it will go to some other person. In that case, it tends to take a bit of time because whenever we see the alerts, the alerts will be shifting to other people. Some people might not acknowledge the alerts due to various reasons, and it may get missed. In PagerDuty Operations Cloud, there is a specific feature where we can page a specific person or a specific user. If we give the particular team name, then in the subfield, we can specifically page a person. This feature attracts me a lot.

    Additionally, there is another feature where we can check the SME calendar. In my organization, for a particular week, one person will be allotted as an SME. That calendar shows which person is the SME for the particular week regarding the particular product. These are the features I enjoy the most in PagerDuty Operations Cloud.

    The main benefits I can say from using PagerDuty Operations Cloud are that we can easily page them. It is also widely used in our operations team for faster incident response, leading to a reduction of the MTTR, mean time to resolution. The smart on-call management allows us to create a call for the on-call people and to involve the backup engineers as well. One special thing in PagerDuty Operations Cloud is it has time zone-based scheduling. As per that particular time zone, we can schedule them. I witnessed automated escalation, where the particular person missed acknowledging the alert in PagerDuty Operations Cloud, leading to an automated escalation to their associate director or VP. This escalation policy is also very good in PagerDuty Operations Cloud.

    The impact of integrating PagerDuty Operations Cloud with Freshservice is very good because earlier, when it was not integrated, there were many problems while paging the alerts. Now, when we have integrated it to Freshservice, once the alert comes into the queue of Freshservice, automatically a PagerDuty Operations Cloud alert will be created. So automatically, it syncs. Once it gets synced, the alert will be automatically created in PagerDuty Operations Cloud and will go to that particular person who is allotted as an SME for that particular product.

    The measurable benefits from PagerDuty Operations Cloud are that it has made our work easier, where the alerts will be synced and then directly create an alert to the SMEs. Instead of doing it manually, if it is automated in such a way that an alert gets triggered and routed directly to the SME, then that is a great benefit.

    What needs improvement?

    To improve PagerDuty Operations Cloud, I can mention that we can improve the escalation policies. Nowadays, many people miss the alerts. There was an issue in a particular product, and when we paged it, that particular paged alert went to other product people. I do not know how that happened in PagerDuty Operations Cloud; it might be some configuration changes or anything in the backend. The point is we can improve on this setting, where the actual PagerDuty Operations Cloud alert should be routed and assigned to the correct person of that particular product. If it gets triggered to some other person unnecessarily, even that day, the particular person came into the Slack channel asking why they got paged for a product they were not part of. This is something we can improve on.

    One feature I would like to see included in PagerDuty Operations Cloud is for a particular week, each person is assigned as an SME. It would be beneficial to add a note in the particular calendar where if this person is not available, then the backup engineer's name can be included.

    For how long have I used the solution?

    I have been working with PagerDuty Operations Cloud for four and a half years.

    What do I think about the stability of the solution?

    We have not used the real-time digital operations management feature. The advanced analytics feature is being used by another product in my cloud operations team. In my team, we have not used it.

    How are customer service and support?

    Regarding customer service and technical support teams of PagerDuty Operations Cloud, we never reached out to the technical support team. In my team, the technical support will handle the cloud-based platforms and everything. However, regarding PagerDuty Operations Cloud, in my organization, we do not have any technical team related to it.

    How would you rate customer service and support?

    Which solution did I use previously and why did I switch?

    Prior to PagerDuty Operations Cloud, I had not seen any product of the same kind in my company. We use PagerDuty Operations Cloud and also New Relic. I have not seen a similar application before.

    How was the initial setup?

    I have not found any complexity in the initial setup process of PagerDuty Operations Cloud. The deployment was already pre-deployed.

    Which other solutions did I evaluate?

    I have not come across any other options or solutions available in the market. I am not sure if the on-call policy in Splunk is similar to PagerDuty Operations Cloud.

    What other advice do I have?

    We have integrated PagerDuty Operations Cloud with the Freshservice tool. Regarding automation in PagerDuty Operations Cloud, in my team, the admin access has been given to the onshore employees, not to Indian employees. I am not sure about that because I have been requesting admin access for a long time, but I have not been granted it yet. Given my experience with PagerDuty Operations Cloud, I recommend increasing the on-call primary escalation time to ten minutes. Additionally, for one hundred alerts, if we can manage that to one particular incident for one hundred alerts, that would also be beneficial. This adjustment will help with the mean time to resolution in all organizations. My overall rating for this product is ten out of ten.

    Aashish Bhandari

    On-call automation has transformed alert handling and now creates a faster, competitive workflow

    Reviewed on Mar 03, 2026
    Review from a verified AWS customer

    What is our primary use case?

    My use case for PagerDuty Operations Cloud comes from the SRE and DevOps team. We use PagerDuty Operations Cloud for specific alerting purposes and for the pipeline process. When we build a pipeline and it suddenly fails due to a job or other issues, we receive an error. We set up PagerDuty Operations Cloud with Datadog, the monitoring service we are currently using. Datadog is connected with PagerDuty Operations Cloud, and whenever Datadog receives an alert or a spike or anything critical, it triggers an alert to PagerDuty Operations Cloud, and we quickly get a notification. We are currently using this process, and we are also maintaining our on-call shift rotation. For example, on Monday, Wednesday, and Friday, I am working as a shift lead, and then on Tuesday, Saturday, and Sunday, someone else is the shift lead. Regarding MTTR and all those statistics, we can see how many alerts we received, how many alerts we acknowledged this month, and we have a timeline as well. One of the valuable parts of PagerDuty Operations Cloud is that in our team, we can have a competitive environment. For example, if I resolved the most triggered alerts this month, then someone else can do it next month, and whoever resolves the most critical alerts on time receives appreciation every month.

    What is most valuable?

    One feature of PagerDuty Operations Cloud that I find valuable is the on-call schedule. We can manage our on-call scheduling, and we have various alert and notification delivery methods available, including mobile. We can receive phone calls, emails, SMS, and push notifications. For example, if someone missed the notification, they will get a phone call, which is very straightforward. We also have incident automation, making collaboration with any third-party monitoring services we use very straightforward, such as Datadog. We can seamlessly automate things with PagerDuty Operations Cloud. The AI features are also beneficial; for example, noisy alerts that trigger regularly and false positive alerts get suppressed. It checks the past month's alerts, showing us that this alert triggered 60 percent, this alert triggered 20 percent, this alert is rare, and this alert is not rare. The escalation policy is excellent as well, as if I did not pick up the call, my manager will get the call; if my manager did not pick up, then his manager gets the call. These are some of the most valuable parts we use in PagerDuty Operations Cloud.

    In Datadog, we have multiple dashboards and monitoring systems where we see our spikes and alerts. When we integrated with PagerDuty Operations Cloud, we got better signal and less noise. When we are seeing a spike that is concurrent, in PagerDuty Operations Cloud, the AI feature already signifies that alert as a noisy alert, and it suppresses that alert. This significantly improves our workflow with both Datadog and PagerDuty Operations Cloud. We have faster response and faster escalation. Previously, in Datadog, we did not get notifications, and people would refresh it and check the spike every hour. Now that we integrated PagerDuty Operations Cloud, any alert triggers, and we quickly get a notification or a phone call. Therefore, we do not sit in front of a computer and refresh repeatedly. Additionally, we have a centralized incident workflow; PagerDuty Operations Cloud and Datadog feed into PagerDuty Operations Cloud incident timeline, so we see everything there. We do not need to open Datadog again and again, and if we need to deep dive into an alert from Datadog, we can click the link inside PagerDuty Operations Cloud, redirecting us to the Datadog dashboard where everything is noted down and visible.

    In PagerDuty Operations Cloud, AI suppressing our alerts has helped streamline repetitive tasks. For example, very noisy alerts get suppressed automatically, aiding smarter routing. When we have new joiners in our team, they see alerts already suppressed, allowing them to focus on the critical ones instead of the lower ones. Additionally, alert prioritization is present; we receive critical alerts, high alerts, and then low alerts. The faster prioritization facilitated by AI enhances our alert management processes. Also, the root cause historical pattern assists us; if we get an alert similar to one from last month, it tells us how we resolved that alert previously. Historical patterns using AI greatly aid us in alert management.

    What needs improvement?

    I have already used PagerDuty Operations Cloud, and my previous monitoring tools were very poor for alerting. I had a good impression of PagerDuty Operations Cloud, but I believe it can improve with deeper root cause insights. I know there is automation to detect recent deployments causing incidents, but a deeper root cause analysis could provide more details. If PagerDuty Operations Cloud offers more information, we will not need to jump into the main dashboards where the alert triggered. For instance, if we get more insights directly in PagerDuty Operations Cloud, we would not need to check the Datadog dashboard. Additionally, I think a sandbox mode would be helpful for new team members, allowing us to guide them in simulating alerts, performing escalation policies, and creating PagerDuty Operations Cloud channels.

    For how long have I used the solution?

    I have been working with PagerDuty Operations Cloud for five years. I worked on two different projects, and in both projects, we use PagerDuty Operations Cloud.

    What do I think about the stability of the solution?

    In my previous project, we utilized the flexible incident command system to coordinate large-scale incidents, but in my current project with only Datadog, we have not received many alerts or incidents in the last couple of days.

    How are customer service and support?

    I do not have direct contact with PagerDuty Operations Cloud tech support or customer service teams, but my senior team members have connected with them when we received an alert related to our team failing to set it up properly. The customer support team promptly gave us insight and helped us within 24 hours.

    How would you rate customer service and support?

    Positive

    Which solution did I use previously and why did I switch?

    I am currently working with PagerDuty Operations Cloud. On my previous project, we were on BigPanda, but we faced multiple issues with it. At that time, BigPanda had no call schedule feature and no alert trigger feature. We then moved to PagerDuty Operations Cloud, and suddenly everything was smooth. We got a phone app as well; we set up PagerDuty Operations Cloud on our phones. Whenever an alert triggered for us, we would quickly check from our phone to see if it was a false positive, a true P1 or P2 alert, a major alert, or a critical alert. We would then quickly jump into the alert and work on it. PagerDuty Operations Cloud changed the process and the flow in our team very smoothly.

    How was the initial setup?

    I found the initial setup of PagerDuty Operations Cloud straightforward; I did not face any complexities during the setup for alerts or during the initial configuration.

    What's my experience with pricing, setup cost, and licensing?

    Regarding pricing for PagerDuty Operations Cloud, I am currently a software engineer and a senior software engineer, so I do not handle the pricing aspect. However, I hear from my manager that the pricing is very high for PagerDuty Operations Cloud, and only a few of us have the main business tier accounts. Many of us have low tier accounts that restrict us to acknowledging and viewing alerts, while a few have the ability to create and trigger alerts. Therefore, I do not think much about pricing, but I do believe it is somewhat high. However, I think this is valid because PagerDuty Operations Cloud provides a vast amount of benefits compared to other alerting systems.

    Which other solutions did I evaluate?

    Regarding the key differences, pros, and cons of PagerDuty Operations Cloud compared to competitors, some pros include alert grouping, AI functionality, and the ability to easily integrate with Slack for quicker resolution. Additionally, we receive phone notifications and push notifications, which many of the other competitors do not provide. The pricing of PagerDuty Operations Cloud is also reasonable for the functionalities it offers compared to its competitors. These are some benefits I see in PagerDuty Operations Cloud, including helpful alert insights and direct links to dashboards we have integrated, such as Datadog and Grafana, which allow us to resolve issues quickly.

    What other advice do I have?

    The recommendation I share, based on my experience with PagerDuty Operations Cloud, is that it is one of the best platforms for synchronizing with your monitoring tools. It will improve your flow, and your team will definitely benefit from PagerDuty Operations Cloud compared to other competitors, as it offers numerous advantages. I give this review a rating of ten out of ten.

    reviewer2804718

    Alert handling has improved and monitoring now supports faster false positive resolution

    Reviewed on Feb 23, 2026
    Review from a verified AWS customer

    What is our primary use case?

    PagerDuty Operations Cloud is used to understand whether there are false positives or false alerts because it is integrated into a system where there would be a phone number, such as anyone from the team's phone number or perhaps the shift supervisor's phone number. Once those triggers are there, it would hit that number until somebody picks it up and acknowledges or escalates the alert or threat.

    In a recent incident through PagerDuty Operations Cloud, there was an issue with one appliance that was continuously going wrong. Even though we were acknowledging it, closing it, and had marked it as a false positive, a false trigger, or a false alarm, it was still continuously hitting us. There was some issue with the appliance or with the server. Once we understood that, we escalated this to the company's SecOps team, and they had to go inside to find out more details. The PagerDuty Operations Cloud team coordinated with them. Once they were coordinated, they could dig deep and find out what the issue was. A high-priority P1 ticket was raised for that as per ITIL principles.

    PagerDuty Operations Cloud was being used inside the office only, and if we enter anybody's number, for example, it would continuously be hitting at any time whenever the alerts are there. Whoever's numbers are added, such as a shift supervisor or shift people, those would keep on hitting back.

    What is most valuable?

    The best features of PagerDuty Operations Cloud are that it is user-friendly, its design is scalable, and it is easy to use even for a newcomer. Accessing alerts and their details is easy in PagerDuty Operations Cloud.

    It is user-friendly because it does not require any coding or scripting. It can be operated with mouse clicks alone. We need to know how to operate it, when to close an alert and when not to, and how to read the green, yellow, red, and blue markers used in monitoring.

    It is easy to use, and I think the software is also low-cost, which makes it more cost-effective than its alternatives. It helps across the board: in network security, in cybersecurity, and in overall company data and firewall monitoring.

    I do not have specific numbers because they are confidential and are not shared with us. The response time is great, and after we started using PagerDuty Operations Cloud, the handling of false positives and false alarms, as well as security monitoring, became much stronger.

    What needs improvement?

    PagerDuty Operations Cloud can be improved by adding more features. Whatever manual work remains could be automated using scripting, which would make it more efficient.

    When we receive a call, all the details have to be entered in full. Once something has been entered, it should not have to be entered again. That repetition could be automated away.

    For how long have I used the solution?

    I have been working in my current field for almost eight to nine years.

    What do I think about the stability of the solution?

    PagerDuty Operations Cloud is stable.

    What do I think about the scalability of the solution?

    I rate its scalability a ten out of ten.

    How are customer service and support?

    Customer support is good.

    How would you rate customer service and support?

    Which solution did I use previously and why did I switch?

    No other options were evaluated because this is the only tool we have worked with since we started. I am not aware of what other companies use.

    What was our ROI?

    Money would be saved because, if automated, the tool can do a lot of work that employees would otherwise do manually. It would therefore save both money and time.

    What other advice do I have?

    My advice is that anyone can confidently go ahead with this product because it would be a win-win for both PagerDuty Operations Cloud and the company using it. I have rated this product a nine out of ten.

    Which deployment model are you using for this solution?

    Public Cloud

    If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

    Accounting

    Reliable Scheduling and App, but Needs More Integration Flexibility

    Reviewed on Jan 12, 2026
    Review provided by G2
    What do you like best about the product?
    The on-call scheduling and the mobile app are the standout features. The scheduling engine is incredibly robust—handling complex rotations, follow-the-sun models, and last-minute overrides is seamless once the logic is set up. The mobile app is equally impressive; it’s intuitive, reliable, and allows for quick triage or acknowledgement of incidents without needing to open a laptop. It really nails the "responder experience" by putting critical actions front and center.
    What do you dislike about the product?
    I find the API and Slack integrations to be too rigid. While the Slack integration works well for basic "ack/resolve" actions, it lacks the flexibility to customize the actual alert content or the wording of messages sent to channels. Similarly, the API often feels restricted when trying to build more complex, automated workflows or custom internal tooling. I’d like to see more "hooks" for customization so the integrations can better fit our specific team processes rather than forcing us into a standardized format.
    What problems is the product solving and how is that benefiting you?
    PagerDuty solves the problem of "alert noise" and accountability. Before using it, critical alerts would often get buried in a Slack channel or a shared inbox, leading to slow response times. By centralizing our alerting and using escalation policies, it ensures the right person is notified via the right channel (push, SMS, or call) until the issue is acknowledged.
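
    The "notify the right person via the right channel until acknowledged" behaviour described above can be pictured with a small simulation. This is only an illustrative sketch, not PagerDuty's API or implementation: `EscalationLevel`, `Incident`, `escalate`, and the `responds` callback are all hypothetical names invented for this example.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class EscalationLevel:
    responder: str        # hypothetical responder name
    channels: List[str]   # channels tried in order, e.g. push, sms, call


@dataclass
class Incident:
    title: str
    acknowledged_by: Optional[str] = None
    log: List[str] = field(default_factory=list)


def escalate(incident: Incident,
             levels: List[EscalationLevel],
             responds: Callable[[str, str], bool]) -> Incident:
    """Walk the levels in order, trying each channel, until someone acknowledges.

    `responds(responder, channel)` stands in for a real person answering;
    it returns True when that responder acknowledges on that channel.
    """
    for level in levels:
        for channel in level.channels:
            incident.log.append(f"notify {level.responder} via {channel}")
            if responds(level.responder, channel):
                incident.acknowledged_by = level.responder
                return incident
    # Nobody acknowledged; a real policy might repeat the loop or page a manager.
    return incident


# Assumed scenario: the primary misses every notification,
# and the secondary answers the phone call.
levels = [
    EscalationLevel("primary", ["push", "sms", "call"]),
    EscalationLevel("secondary", ["push", "call"]),
]
incident = escalate(
    Incident("database unreachable"),
    levels,
    lambda responder, channel: responder == "secondary" and channel == "call",
)
print(incident.acknowledged_by)
print(len(incident.log))
```

    The point of the sketch is that an alert never just sits in a channel: every unanswered notification moves on to the next channel or the next person, and the log records who was tried.
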
    Nambi Srinivasan

    Automated incident workflows have transformed on-call operations and improved response times

    Reviewed on Jan 07, 2026
    Review from a verified AWS customer

    What is our primary use case?

    I have been working in my current field for over seven years as a DevOps and site reliability engineer, and my primary experience involves managing the reliability of infrastructure platforms hosted in multi-cloud and on-premises environments. I have predominantly worked with systems hosted on AWS services, setting up infrastructure, CI/CD, and observability, and establishing the complete release process, where I use PagerDuty Operations Cloud for triage and other SRE operations.

    I have been using PagerDuty Operations Cloud for over four years, and I have used it in multiple ways. One is through enterprise services via a subscription model. I also used it in a project at Intel, where I used PagerDuty Operations Cloud from AWS for approximately one to one and a half years. Since then, I have been using it on a subscription at IBM.

    One of the main use cases for PagerDuty Operations Cloud is running the operations center, particularly resolving and triaging incidents as part of the core platform engineering team within a central IBM cloud where various IBM cloud services are hosted. To ensure continuous reliability, incidents are created automatically in PagerDuty Operations Cloud, and incident management automation is heavily used in the project. Previously, I worked on integrating PagerDuty Operations Cloud with default AWS services to create incidents for different AWS services as part of the host infrastructure at Intel. Currently, I am creating incident workflows within IBM internal cloud operations to ensure an effective incident management process, using integrations with different LLMs as part of incident management, along with agentic SRE tasks that have arisen in the project.

    Since I am part of a larger platform engineering and SRE operations team, my team handles many incidents and services. I handle over 26 IBM cloud core services hosted on our internal platform, where there have been many incidents related to service downtime, reliability issues, and update issues. A dedicated SRE team handles end-to-end incident management, and we wanted to automate that process because we receive hundreds of incidents per day, and up to thousands during critical release times of different services. The manual on-call process has therefore been automated using PagerDuty Operations Cloud.

    What is most valuable?

    One of the features I find valuable in PagerDuty Operations Cloud, which is part of our current migration activities, involves automating the entire incident management process by integrating all service incidents into a single incident management page. In IBM cloud, since many services and incidents occur, I utilize the runbook automation feature where I create runbooks for each service and common issues that facilitate incident management for common incidents.

    The runbook automation positively impacts my team's workflow by significantly speeding up incident resolution. In IBM cloud, we host services such as IBM Schematics and IBM Kubernetes Service, with thousands of concurrent global users. We face several issues during multiple incidents, particularly on the reliability and infrastructure side. Basic level-zero incidents often require only simple commands run in kubectl. I therefore created runbooks for these common issues so that SRE team members could refer to the runbook and fix the issues manually. Before using PagerDuty Operations Cloud's runbook automation, however, it took over 20 minutes to resolve these issues. After implementing it, I have reduced the response time from over 20 minutes to less than two minutes, saving approximately 80 to 90 percent of the time and making mean time to resolve significantly faster. I review the runbooks quarterly to update them with any new steps.
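
    The time saving quoted above can be sanity-checked with a short calculation. This is a minimal sketch: the 20-minute and 2-minute figures are the boundary values from the review, while the 4-minute case is an assumed value added only to show what an 80 percent saving would correspond to.

```python
def percent_reduction(before_minutes: float, after_minutes: float) -> float:
    """Return the percentage of resolution time saved."""
    if before_minutes <= 0:
        raise ValueError("before_minutes must be positive")
    return (before_minutes - after_minutes) * 100.0 / before_minutes


# Boundary values from the review: 20 minutes before, 2 minutes after.
print(percent_reduction(20, 2))   # 90.0

# Assumed value, not from the review: a 4-minute resolution gives 80 percent.
print(percent_reduction(20, 4))   # 80.0
```

    Since the review says "over 20 minutes" and "less than two minutes", the saving at those bounds is at least 90 percent, so the quoted range of 80 to 90 percent is on the conservative side.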

    PagerDuty Operations Cloud has greatly improved our productivity. Previously, I handled many incidents with manually executed runbooks, which meant substantial toil for the SRE teams in resolving even minor incidents and complicated our on-call schedule. Once I adopted PagerDuty Operations Cloud and made heavy use of the runbook automation, I provided a list of common incidents to PagerDuty Operations Cloud, which then did the heavy lifting of fixing basic incidents automatically. This allowed my team to focus more on development activities related to platform engineering. Overall, it has reduced our toil by at least 50 to 60 percent and improved our efficiency, enabling us to onboard more services. We went from onboarding seven core IBM cloud services to over 28 services now hosted.

    The expansion of services supports our organization's goals and customer experience by allowing all IBM cloud internal services to be hosted on a dedicated platform engineering service called Rednote. Before using PagerDuty Operations Cloud, I used Nagios and Sysdig, and after migrating to PagerDuty Operations Cloud last year, I prepared a set of runbooks, automating on-call schedules and incident management. This automation has led to a significant increase in incident closure rates from around 40 to 45 percent, improving efficiency and reducing manual toil for basic incidents by about 60 percent. This enables my team to focus more on development activities, as basic incidents that can be managed through simpler runbooks are now handled automatically by PagerDuty Operations Cloud. Additionally, incorporating AIOps into our on-call scheduling and notifications helps it learn from previous incidents and proactively address issues. This scalability has allowed me to grow from handling four cloud services to 28 services, and from 200 to 300 customers to over 1,800 customers, thanks to PagerDuty Operations Cloud.

    What needs improvement?

    Since I host our internal services, I want more customization relating to our specific use case.

    The needed improvements include the configuration process, as new team members face a steep learning curve in understanding the platform. We have many new members, and they need training to set up runbook workflows and event orchestration and to manage complex on-call schedules across 23 services, which is a challenge for new users. Additionally, I feel the web interface requires improvements.

    I would rate PagerDuty Operations Cloud as eight out of ten because the cons include a complex configuration process and high costs for each add-on that I try to obtain, making subscriptions costly, along with limited customization in certain incident workflows.

    The primary reasons for rating it an eight include the complex configuration which makes it challenging for new users, as well as their difficulty in setting up advanced runbook workflows and managing complex on-call setups. The web user interface also requires improvement. Although I receive alerts via the mobile app, which is beneficial for handling schedule maintenance, the same features should be added to the web interface. Customization issues persist, such as the inability to clone entire schedules as part of the workflows, and I want to keep incidents open for a specified duration, neither of which I can currently customize. Thus, I raised a ticket with PagerDuty Operations Cloud to address these concerns. Furthermore, the cost is high, making it one of the more expensive incident management solutions.

    For how long have I used the solution?

    I have been using PagerDuty Operations Cloud for over four years.

    What do I think about the stability of the solution?

    PagerDuty Operations Cloud is definitely stable, providing faster incident management and making managing our on-call roster easy along with effective escalation and notification channels.

    What do I think about the scalability of the solution?

    Regarding scalability, I do not find many issues. PagerDuty Operations Cloud effectively handles concurrent incidents, and incidents are fixed properly and on time.

    How are customer service and support?

    My interaction with PagerDuty Operations Cloud's customer support mainly focused on customizing our workflows. They understand our concerns and are willing to implement solutions that integrate into PagerDuty Operations Cloud effectively. From a reliability perspective, I have not faced any issues, and their support provides timely assistance for custom integration and workflows.

    How would you rate customer service and support?

    Positive

    Which solution did I use previously and why did I switch?

    Previously, I used RunDeck automation, which is why I switched to PagerDuty Operations Cloud after PagerDuty acquired RunDeck.

    How was the initial setup?

    Before using PagerDuty Operations Cloud, I utilized Nagios and Sysdig, and after migrating to PagerDuty Operations Cloud last year, I prepared a set of runbooks, automating on-call schedules and incident management. This automation led to a significant increase in incident closure rates from around 40 to 45 percent, improving efficiency and reducing manual toil for basic incidents by about 60 percent.

    What's my experience with pricing, setup cost, and licensing?

    I purchased PagerDuty Operations Cloud through AWS Marketplace while at Intel, and my experience has been positive regarding pricing, setup costs, and licensing. I had around seven users at a base price of around $450 per user, primarily for custom workflows and the ITSM part. Currently, I am using the runbook automation part, which costs around $2,000 per year, and in the last three months, I have also used the AIOps feature for approximately $700 to $800 per month, resulting in a cumulative cost of around $3,000 per month.

    Which other solutions did I evaluate?

    Before choosing PagerDuty Operations Cloud, I evaluated other options, particularly competitors such as ZenDuty due to cost effectiveness, but I favored PagerDuty Operations Cloud for its RunDeck features and automation capabilities for incident workflows, despite the required migration.

    What other advice do I have?

    Utilizing PagerDuty Operations Cloud allows me to save a significant amount of time, not only on routine incidents but also in focusing on onboarding additional services. This significantly aids me in spending less time on routine operational activities, quantified by the reduced personnel needed to manage routine tasks.

    I highly recommend using PagerDuty Operations Cloud if you have numerous operational incidents to handle daily, especially if you prioritize reliability, particularly in critical projects such as IBM's core cloud services where outages must be avoided to ensure compliance and reliability.

    I urge potential users to adopt PagerDuty Operations Cloud if reliability is a priority and they have a sufficient budget, as it is suited for larger infrastructures and effectively manages redundant incidents, standing out as the number one option in its market segment. I have rated PagerDuty Operations Cloud as eight out of ten overall.

    Which deployment model are you using for this solution?

    Hybrid Cloud

    If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

    Amazon Web Services (AWS)