Management & Governance | AWS Cloud Operations Blog

Use AWS Systems Manager Automation to create input parameters that populate AWS resources as a dropdown list

As a Solution Architect at AWS, I regularly hear customers ask how to automate everyday operations within their cloud environment. Their use cases include a variety of operational needs, such as provisioning new resources within an AWS account, and patching/updating managed Amazon Elastic Compute Cloud (Amazon EC2) instances. They are also focused on cost management with […]

Customize AWS Config resource tracking in AWS Control Tower environment

[Update on Nov/24/2025] Based on customer feedback, an account inclusion option has been added to the solution, and several bug fixes and enhancements have been made and documented in the changelog. This blog has been slightly updated to align with those changes. [Update on Sep/21/2024] AWS Config recorder has recently added support for periodic recording; this captures the latest […]

How to use Resilience Hub’s Fault Injection Experiments to test application’s resilience

In this post, you’ll learn how to utilize AWS Fault Injection Simulator (AWS FIS) and AWS Resilience Hub to refactor a simple serverless application. Resilience Hub lets you define, validate, and track the resiliency of your AWS application. Resilience Hub integrates with AWS FIS, a chaos engineering service, to provide fault-injection simulations of real-world failures. These […]

Viewing Amazon CloudWatch metrics with Amazon Managed Service for Prometheus and Amazon Managed Grafana

Monitoring the AWS services that make up a customer workload with Amazon CloudWatch is important for the resiliency of that workload. Customers can bring their CloudWatch data alongside their existing Prometheus data sources to improve their ability to join or query across them for a holistic view of their systems. Amazon Managed Service for Prometheus is a serverless […]

Validating and Improving the RTO and RPO Using AWS Resilience Hub

“Everything fails, all the time” is a famous quote from Werner Vogels, VP and CTO of Amazon.com. When you design and build an application, a typical goal is to have it working; the next is to keep it running, no matter what disruptions may occur. It is crucial to achieve resiliency, but you need to consider […]

Procuring software on AWS Marketplace for customers in regulated spaces

Customers operating in highly regulated spaces often tell us about the compliance challenges that they face when procuring commercial software in the cloud. This is especially true for federal customers subject to the GSA Schedule, or state and local customers operating under NASPO ValuePoint. Procurements in this space often require negotiated purchasing agreements and […]

Use AWS RAM and AWS MGN to Govern your Migration at scale in AWS

AWS customers consider Lift & Shift as the first increment of value delivery in their cloud adoption journey. By following this strategy, customers gain speed, cost reduction, business agility, operational resiliency, and staff productivity. As part of the migration plan, they will adopt a multi-account strategy to establish their AWS foundation at […]

AWS named for the first time ever as a Challenger in 2022 Gartner Magic Quadrant for Application Performance Monitoring and Observability

This year, AWS was recognized for the first time as a Challenger in the 2022 Gartner Application Performance Monitoring and Observability (APM) Magic Quadrant. This is the first time AWS has been recognized in the report’s 12-year history. The report is published annually and assesses vendors based on their Ability to Execute and Completeness of Vision. […]

Accelerate your Monitoring and Observability foundation through AWS Managed Services

To establish a strong foundation for efficiently and safely operating your workloads in the cloud, you must consider how you will monitor the health of your workloads. As described in the AWS Well-Architected Operational Excellence pillar, one of the cloud’s design principles for operational excellence is “Anticipate Failure.” Therefore, design your cloud operations with proactive […]

How to isolate signed-in users from guest users within Amazon CloudWatch RUM

Real user monitoring (RUM) helps web application owners monitor the performance of client-side applications running on end-user devices. For example, RUM can help application owners detect when end-users are experiencing slow page load speeds, application errors, network errors, or issues with the application’s user interface. Amazon CloudWatch RUM is a managed RUM service which is […]
