Q: What is Amazon DevOps Guru?
A: Amazon DevOps Guru is a fully managed operations service that uses machine learning to make it easier for developers and operators to improve application availability. DevOps Guru does this by automatically detecting operational issues and recommending options for remediation or mitigation. DevOps Guru applies machine learning informed by over 20 years of Amazon and AWS operational excellence in running large, highly available applications to automatically collect and analyze data like application metrics, logs, events, and traces for identifying behaviors that deviate from normal operating patterns. Those patterns include things like under provisioned compute capacity, database I/O over-utilization, memory leaks, and more.
Q: Why should I use Amazon DevOps Guru?
A: Amazon DevOps Guru saves you hours if not days of time and effort spent detecting, debugging, and resolving operational issues and enables you to effectively monitor complex and evolving applications. It helps avoid common oversights and errors in monitoring, such as missing alarms, which cause application downtime. When operational issues occur, DevOps Guru saves debugging time by fetching relevant and specific information from a large number of data sources. DevOps Guru generates Operational Insights to alert you of the issue, with a summary of related anomalies, contextual information about why and when the issue occurred, along with recommendations on how to remediate issues and reduce application downtime.
Q: How does Amazon DevOps Guru work?
A: Amazon DevOps Guru’s machine learning models leverage over 20 years of operational expertise in building, scaling, and maintaining highly available applications for Amazon.com. DevOps Guru automatically ingests and analyzes metrics like latency, error rates, and request rates for all resources to establish normal operating bounds. DevOps Guru then uses a pre-trained machine learning model to identify deviations from the established baseline. When it identifies anomalous application behavior like increased latency, error rates, or resource constraints that could cause potential outages or service disruptions, it alerts operators with issue details like the resources involved, the issue timeline, and other related events to help them quickly understand the potential impact and likely causes of the issue. It also provides options for remediation or mitigation. Developers can then use those suggestions from DevOps Guru to reduce time to resolution when issues arise and improve application availability and reliability with no manual configuration setup – and with no machine learning expertise required. DevOps Guru can be used as a standalone service, and also integrates with partner applications from PagerDuty and Atlassian along with AWS System Manager Ops Center.
Q: How can I get started with Amazon DevOps Guru?
A: With a few clicks, you can enable Amazon DevOps Guru in the AWS Management Console. DevOps Guru provides you with an onboarding wizard that helps you quickly configure the analysis coverage for your AWS resources. Once enabled, DevOps Guru continuously analyzes the operational data for your AWS resources based on your selection and produces insights whenever it detects ongoing or emergent operational issues.
Q: How do I specify the AWS resources for DevOps Guru to analyze?
A: You can choose your analysis coverage boundary to be your entire AWS account or, you can prescribe the specific AWS CloudFormation stacks that you want DevOps Guru to analyze. Based on your selection, DevOps Guru will analyze the operational data for all supported AWS resources in the coverage boundary you choose.
When you add new resources to your coverage boundary selection, DevOps Guru will automatically start analyzing the additional resources. Similarly, DevOps Guru will stop analyzing and billing any resources when you remove them from your account or CloudFormation stack.
Q: What types of issues can Amazon DevOps Guru detect?
A: Amazon DevOps Guru can automatically detect operational issues like missing or misconfigured alarms, early warning of resource exhaustion, code and configuration changes that could lead to outages. DevOps Guru uses ML to correlate anomalies in metrics with operational events and provides you with contextual insights to help you focus on the right remediation steps. DevOps Guru also correlates and groups related application and infrastructure metrics like web application latency spikes, running out of disk space, bad code deployments, or memory leaks to reduce false and redundant alarms so you can focus on high-severity issues.
Q: What monitoring services does Amazon DevOps Guru work with?
A: At launch, Amazon DevOps Guru can use data from Amazon CloudWatch, AWS Config, AWS System Manager Ops Center, AWS CloudFormation, and AWS X-Ray. Amazon DevOps Guru is also integrated with partner operations monitoring and incident management solutions like Atlassian OpsGenie and Pager Duty.
Q: How does Amazon DevOps Guru work with other related operational services, such as AWS Systems Manager OpsCenter?
A: If you use AWS Systems Manager Ops Center, Amazon DevOps Guru Operational Insights can be surfaced directly within the OpsCenter dashboard as OpsItems.
Q: How does Amazon DevOps Guru protect my content?
A: Amazon DevOps Guru uses encryption in transit and at rest to protect your content during ingestion and data analysis.
Q: Where did AWS obtain the training data to build this service?
A: Our training data was generated by internal AWS services and infrastructure.
Q: What is an Amazon DevOps Guru Operational Insight?
A: Amazon DevOps Guru Operational Insights aggregate the information needed to investigate and remediate an operational issue directly in the DevOps Guru console. An insight is composed of three main sections. It highlights the anomalous metrics related to the operational issue, with graphs to easily visualize abnormal system and application behavior. The insight also includes contextual information such as relevant events, log snippets, etc. so you can easily understand the scope and issue timeline. Operational Insights also include recommendations on actions you can take to remediate the issue.
Q: Can I integrate Amazon DevOps Guru into my existing ticket/incident management system?
A: You can configure Amazon DevOps Guru to create an OpsItem in AWS Systems Manager OpsCenter for each insight that it generates. You also can configure DevOps Guru to deliver its insights via AWS Simple Notification Service (SNS), which you can consume into incident management tools such PagerDuty and Atlassian.
Q: After enabling the service, when will I see Operational Insights?
A: Once enabled, Amazon DevOps starts baselining your application, which may range from minutes to an hour depending on the number of resources being analyzed. After baselining, DevOps Guru will analyze your resources continuously and produces Insights when it detects anomalous behavior.
Pricing and billing
Q: How much does Amazon DevOps Guru cost?
A: With Amazon DevOps Guru, you only pay for what you use. There is no up-front commitment or minimum fee. After you enable DevOps Guru and specify the applications you want to monitor, DevOps Guru starts analyzing the operational data for the resources that these applications use. There are two components that determine your bill: charges for AWS resource analysis, and charges for DevOps Guru API calls. For more details, please refer to our pricing page.
Q: How are AWS resource analysis hours priced for different resource types?
A: The AWS resource types (e.g. Amazon S3 Bucket, Amazon EC2 Instance) analyzed by DevOps Guru are categorized into two pricing groups. The rate you’re charged for a specific AWS resource type depends on the price group: A or B.
Q: Am I charged for resources that I delete or don’t use?
A: No, You pay for the number of AWS resource hours analyzed, for each active resource. A resource is only active if it produces metrics, events, or log entries within an hour.
Q: Which AWS resources are supported for DevOps Guru analysis?
A: DevOps Guru analyzes more than 25 different AWS resource types (e.g. Amazon S3 Bucket, Amazon EC2 Instance), with support for additional resource types coming soon.
Q: Other than Amazon DevOps Guru costs, are there any other costs that might incur to my Amazon DevOps Guru usage?
A: If you configure Amazon Simple Notification Service (SNS) to receive notifications about DevOps Guru events, you will incur additional charges per standard Amazon SNS pricing. Similarly, if you configure to receive an OpsItem for DevOps Guru insights, you incur additional charges per standard AWS Systems Manager pricing.
Q: Is Amazon DevOps Guru available in AWS Free Tier?
A: Yes, AWS Free Tier includes DevOps Guru analysis of 7,200 AWS resource hours and usage of 10,000 DevOps Guru API calls per month for 3 months.
Q: In which AWS regions is Amazon DevOps Guru available?
A: Amazon DevOps Guru is available in the following AWS regions: US East (N. Virginia), US East (Ohio), US West (Oregon), Europe (Ireland), and Asia Pacific (Tokyo), with additional regions coming soon. You can also refer to the AWS Regional Services List.