AWS Cloud Operations & Migrations Blog

Tag: Monitoring

Announcing Amazon Managed Grafana workspace version selection with version 9.4 support

Many customers that use Amazon Managed Grafana have requested for the ability to choose a Grafana version with the latest product features including navigation, dashboards, and visualizations. Today, we are announcing Amazon Managed Grafana workspace version selection with version 9.4 support. Since the product was launched, Amazon Managed Grafana maintained a single version offering globally. […]

How Audible used Amazon CloudWatch cross-account observability to resolve severity tickets faster

This blog was co-written with Audible’s Apurva Jatakia, Kaushik S., and David Etler. Audible’s consumption services platform serves thousands of requests every second, and each incoming request is served by a distributed set of microservices owned by different teams. An Audible team, in charge of a platform called Stagg, is responsible for five separate microservices. […]

Announcing inbound network access control in Amazon Managed Grafana

Many customers that use Amazon Managed Grafana have a need to restrict the Grafana workspace public access and enable fine-grained control to allow which traffic sources can reach the Grafana workspace. Today, we are announcing Amazon Managed Grafana’s new feature that supports inbound network access control. This enables you to secure Grafana workspaces using VPC […]

Title of blog on box image

What’s new in AWS Observability at re:Invent 2022

Kick off your AWS re:Invent 2022 week with a round-up of the AWS Observability launches across Amazon CloudWatch, AWS X-Ray, Amazon Managed Grafana, and Amazon Managed Service for Prometheus. From understanding impact of internet issues on your application performance and availability with CloudWatch, to VPC support and Prometheus alerting in Managed Grafana, read on to […]

Viewing custom metrics from statsd with Amazon Managed Service for Prometheus and Amazon Managed Grafana

Monitoring applications based on custom metrics is important for a resilient system. One of the mechanisms to generate custom metrics from applications is statsd – a NodeJs process to collect custom application performance metrics periodically. However, statsd doesn’t provide long-term storage, rich querying, visualization, or an alerting solution. Amazon Managed Service for Prometheus and Amazon […]

Viewing collectd statistics with Amazon Managed Service for Prometheus and Amazon Managed Service for Grafana

Monitoring systems are essential for a resilient solution. A popular tool to monitor Linux-based physical or virtual machines is collectd – a daemon to collect system and application performance metrics periodically. However, collectd doesn’t provide long-term storage for metrics, rich querying, visualization, or an alerting solution. The Amazon Managed Service for Prometheus is a serverless […]

Announcing AWS CloudTrail Lake – a managed audit and security Lake

Organizations managing cloud infrastructure in AWS need effective mechanisms to audit operations in their AWS accounts for security and compliance. In November 2013, we announced AWS CloudTrail as the auditing platform for AWS. Since then, millions of customers have adopted this service. We believe CloudTrail is so important to AWS customers’ success that every new […]

Delete Amazon CloudWatch Synthetics dependent resources when you delete a CloudFormation stack

Delete Amazon CloudWatch Synthetics dependent resources when you delete a CloudFormation stack

Amazon CloudWatch Synthetics allows you to monitor application endpoints more easily. It runs tests on your endpoints every minute, and alerts you if your application endpoints don’t behave as expected. These tests can be customized to check for availability, latency, transactions, broken or dead links, page load errors, load latencies for UI assets, complex wizard […]

Linux Fleet

How to create an Amazon EC2 Auto Scaling policy based on a memory utilization metric (Linux)

This is the first in a two-part series about how to create an Amazon EC2 Auto Scaling policy based on memory utilization metric. This post covers Linux OS. In part 2 I’ll cover how to create an Amazon EC2 Auto Scaling policy based on a memory utilization metric in Windows OS. Amazon EC2 Auto Scaling […]

Windows Fleet

How to create an Amazon EC2 Auto Scaling policy based on a memory utilization metric (Windows)

In the first of this two-part series, I showed you how to create an Amazon EC2 Auto Scaling policy based on a memory utilization metric for Linux OS. In this second post, I walk through how to create Amazon EC2 Auto Scaling policy based on memory utilization metric for Windows OS. I will use a […]