AWS Cloud Operations & Migrations Blog

Tag: DevOps

AWS Systems Manager Incident Manager integration with Amazon CloudWatch Part 2

AWS Systems Manager Incident Manager integration with Amazon CloudWatch

This is the second post in a two-part series about AWS Systems Manager Incident Manager. In the first post, we covered onboarding steps like creating contacts, an escalation plan, and a response plan in Incident Manager. In this post, we discuss the integration between Incident Manager and Amazon CloudWatch and how Incident Manager components manage an […]

Using AWS CodePipeline to deploy AWS Config conformance packs created with the Rule Development Kit

As consultants, we often help customers manage AWS services using infrastructure as code (IaC). We follow DevOps practices for building, versioning, testing, and deploying services. We also use AWS Config custom and managed rules to evaluate the configuration settings of AWS resources. AWS Config continuously tracks the configuration changes that occur among AWS resources and […]

automated operations cloud operating model

Reinventing automated operations (Part I)

This is the first in a two-part series that covers lessons learned at AWS Managed Services (AMS) as we help customers and partners achieve operational excellence on AWS. To create a secure and consistent cloud operating model, you need both operational experience and AWS skills. In my conversations with customers, it is common for experienced […]

Amazon DevOps Guru sends insights to Amazon SNS and Amazon SNS forwards the insights to PagerDuty

Deliver ML-powered operational insights to your on-call teams via PagerDuty with Amazon DevOps Guru

Amazon DevOps Guru, now in preview, is an ML-powered cloud operations service that assists you in improving application availability. It’s easy to set up and use, and leverages machine learning models informed by years of operational expertise in building, scaling, and maintaining highly available applications at Amazon.com. DevOps Guru continuously analyzes streams of disparate data […]

secure devops pipeline AWS Service Catalog

Creating a secure DevOps pipeline for AWS Service Catalog

Many AWS customers I speak with want to manage their AWS services using infrastructure as code (IaC) and DevOps practices for managing, versioning, and deploying products and portfolios. A best practice is deploying infrastructure templates from a continuous delivery (CD) pipeline with validation. In so doing, you can provide the AWS services your users need […]

How to set up automatic failover for AWS OpsWorks for Chef Automate

Creating a resilient configuration management system comes with a variety of challenges. The goal of this post is to solve an important component of this: failing over to a standby AWS OpsWorks for Chef Automate server when a primary server is unavailable. With the procedure described in this post, if the main OpsWorks for Chef […]

Detect and remediate issues faster with AWS Systems Manager OpsCenter and Moogsoft AIOps

AWS Systems Manager, the operational hub for AWS and hybrid cloud deployments, recently announced the launch of OpsCenter to help you view, investigate, and resolve operational issues related to your environment from a central location. OpsCenter presents operational issues in a standardized view, along with contextually relevant data, and associated Systems Manager Automation documents, enabling […]

How Twitch monitors its services with Amazon CloudWatch

by Alex Cioc and Steve McCurry Twitch is the leading service and community for multiplayer entertainment and is owned by Amazon. Twitch also provides social and features and micro-transaction features that drive content engagement for its audiences. These services operate at a high transaction volume. Twitch uses Amazon CloudWatch to monitor its business-critical services. It […]