AWS Cloud Operations & Migrations Blog

Category: Amazon CloudWatch


What’s new in AWS Observability at re:Invent 2022

Kick off your AWS re:Invent 2022 week with a round-up of the AWS Observability launches across Amazon CloudWatch, AWS X-Ray, Amazon Managed Grafana, and Amazon Managed Service for Prometheus. From understanding the impact of internet issues on your application performance and availability with CloudWatch, to VPC support and Prometheus alerting in Managed Grafana, read on to […]

Enhancing DevOps Practices with Amazon CloudWatch Application Performance Monitoring

Organizations seeking to deliver meaningful technology services at a higher velocity to their customers have incorporated application performance monitoring (APM) into their DevOps operating models. Software development and IT operations teams that have traditionally worked in their own silos now strive to work in concert to increase organizational agility. The transformation path is unique for […]

Avoid patching failures due to low disk space with AWS Systems Manager Automation and CloudWatch alarms

Every organization must keep its fleet patched and ensure that business operations and workloads are not disrupted by patching. One of the challenges for operations teams is patching at scale without affecting production software. The most common reasons workload patching fails are insufficient disk space, a spike in […]

AWS Cloud Operations Kiosks at AWS re:Invent 2022

The Expo on Day 3 of AWS re:Invent on Wednesday, December 1, 2021, at the Venetian Resort in Las Vegas, Nevada. For most organizations, the question isn’t “if we move to the cloud” anymore; it’s “what do we move first?” and “how soon can we be operating in the cloud?” Wherever you are in your digital […]

How to develop an Observability strategy – Part 2

Your observability strategy starts with your business. “Observability” describes how well you can understand what’s happening in a system. Developing an observability strategy isn’t a one-time effort. It’s a continuous improvement effort that occurs throughout the lifecycle of your workloads. It enables your teams to determine whether or not the workloads they design and run […]

Cost Optimization recommendations for AWS Config

In this post, we’ll walk you through the various best practices and recommendations for optimizing AWS Config costs. This also provides technical guidance for looking at the rules and the recorder, how to start deleting or removing rules that aren’t needed, and then editing the Settings of Config, specifically the “Resource types to record”, to […]

Monitoring the availability and health of on-premises application using AWS CloudWatch Synthetics

Amazon CloudWatch is a monitoring and observability service that provides you with data and actionable insights to monitor your applications, respond to system-wide performance changes, and optimize resource utilization. You can utilize various CloudWatch capabilities to monitor the health of your application that is available over the internet, or resides within an Amazon Virtual Private Cloud (Amazon VPC) […]

Deciding between large accounts or micro accounts for distributed operations at AWS

When you’re starting your journey at AWS, you must define your AWS account strategy. There are many possible variations for how to organize the AWS accounts – by workload, team, specialization, business domain, functional domain, and many others. A common question from customers is: should I deploy multiple workloads into a single AWS account, or […]

Monitoring Data Ingestion Tasks with Amazon CloudWatch Metrics and Alarms

Data is produced every day in increasing volumes and varieties in on-premises and cloud environments. Data ingestion into AWS is a common task and there are many services and architecture patterns that customers use to bring in data. In this post, we provide a guide for establishing monitoring and alerting on a data ingestion workload […]

Introducing vended logs for Amazon Managed Service for Prometheus

Customers are using Amazon Managed Service for Prometheus to monitor and alert on their container metrics. Amazon Managed Service for Prometheus ships with Alert Manager, the open source alert routing component in Prometheus. Alert Manager routes alerts to Amazon Simple Notification Service (Amazon SNS). However, there are some common reasons why Alert Manager may fail […]