AWS Cloud Operations Blog
Category: Management & Governance
Investigating Service Issues with Amazon CloudWatch Application Signals Custom Metrics
When a critical service fails, you need to know how much revenue you’re losing, not just that latency has increased. This post shows you how to integrate business metrics with CloudWatch Application Signals to see both technical performance and business impact in one unified view. With CloudWatch Application Signals, you can view metrics, traces, and […]
Cross-Region AWS PrivateLink monitoring with Amazon CloudWatch Network Synthetic Monitor
Global, distributed AWS architectures are the backbone for customers seeking high availability, resilience, and regulatory compliance. Workloads are commonly deployed across multiple AWS Regions and Availability Zones (AZs), often using AWS PrivateLink to connect services securely and privately across Amazon Virtual Private Cloud (Amazon VPC) networks. This approach enhances security and separation while requiring […]
Search and discover governance controls with Control Catalog in AWS Control Tower
As you scale your AWS environment from hundreds to thousands of AWS accounts, maintaining consistent governance standards across this expanded infrastructure requires a strategic approach. Governance controls—the automated policies and rules that enforce standards across your cloud infrastructure—are essential for managing this scale, but implementing them presents two fundamental challenges. First, without proper controls, a […]
Troubleshoot AWS Tagging Compliance with AWS Resource Explorer
With AWS Resource Explorer’s immediate resource discovery launch on October 13, 2025, customers can now discover resources from their very first search in Unified Search in the AWS Management Console or the Resource Explorer console. Operations like troubleshooting and problem resolution, making resource changes, investigating resource dependencies, identifying security risks, and optimizing costs are critical […]
Amazon CloudWatch RUM now supports mobile application monitoring
Amazon CloudWatch RUM now supports iOS and Android applications, expanding real user monitoring beyond web applications. Developers and SREs can now quickly isolate mobile application issues and improve end-user experience, with visibility into performance metrics such as screen load times, crash rates, and API latencies.
Announcing AWS CloudTrail Event Aggregation and Insights for Data Events
AWS CloudTrail records API calls and events for your AWS account, providing audit trails for governance, compliance, and operational troubleshooting. Customers can also enable data events in CloudTrail to gain deeper visibility into resource-level operations. These include Amazon S3 object-level operations (such as GetObject/PutObject) or AWS Lambda function invocations. Data events help detect unauthorized access, […]
Enforce consistent tagging across IaC deployments with AWS Organizations Tag Policies
Organizations manage thousands of AWS resources across multiple accounts and Regions to support their business operations. They want consistent tagging to support essential workflows such as attribute-based access control (ABAC), cost allocation, organizing resources by project/application/owner/environment, and triggering automated processes based on tag criteria. Many customers use Infrastructure as Code (IaC) tools like AWS CloudFormation, Terraform, and […]
How Indeed scaled Governance across 1,000+ AWS accounts with AWS Trusted Advisor
Indeed is the #1 job site¹ in the world. With 615 million Job Seeker Profiles², people in more than 60 countries across 28 languages come to Indeed to search for jobs, post resumes, and research companies. Over 3.3 million employers use Indeed to find and hire new employees. Supporting this massive scale requires resilient, well-architected […]
AWS Resource Explorer launches immediate resource discovery within a Region
AWS now provides immediate access to resource search capabilities through AWS Resource Explorer so that customers can discover resources across services in their AWS account. Operations like troubleshooting and problem resolution, making resource changes, investigating resource dependencies, identifying security risks and optimizing costs are critical everyday activities for the cloud operations team. With resource search, […]
Guide to AWS Cloud Resilience sessions at re:Invent 2025
If you’re attending AWS re:Invent with the goal of learning how to prevent costly downtime for your organization, you can look forward to more than 150 breakout sessions, workshops, chalk talks, builders’ sessions, and code talks that will help you improve the resilience of your critical applications. New this year, we’ll also be hosting two […]