AWS Cloud Operations Blog
Category: Technical How-to
Simplify AWS Control Tower governance with enhanced AWS CloudFormation Hooks
Introduction Organizations using AWS Control Tower to govern their multi-account environments face a persistent challenge: when AWS CloudFormation deployments fail due to proactive control violations, teams receive minimal information about why the failure occurred or how to fix it. This lack of visibility leads to: Delayed deployments as developers struggle to understand cryptic error messages […]
Deploying custom Terraform to LZA-Managed Accounts with AFT
As organizations scale their AWS environments, managing infrastructure consistently while enabling team autonomy becomes increasingly challenging. Landing Zone Accelerator on AWS (LZA) and AWS Account Factory for Terraform (AFT) both extend AWS Control Tower to help customers manage AWS environments at scale, offering complementary strengths. Many AWS customers struggle to balance centralized security governance with […]
Innovation sandbox on AWS with real-time analytics dashboard
How do you deploy hundreds of AWS accounts for a large-scale hackathon? Provide real-time visibility to leadership? Enable participant self-service while monitoring spending across accounts? Enterprise innovation events often lack real-time visibility into participant engagement, resource utilization, and outcomes. Leaders can’t see engagement metrics; builders can’t access accounts and information on-demand. Without observability and governance, […]
Search and discover governance controls with Control Catalog in AWS Control Tower
As you scale your AWS environment from hundreds to thousands of AWS accounts, maintaining consistent governance standards across this expanded infrastructure requires a strategic approach. Governance controls—the automated policies and rules that enforce standards across your cloud infrastructure—are essential for managing this scale, but implementing them presents two fundamental challenges. First, without proper controls, a […]
Amazon CloudWatch RUM now supports mobile application monitoring
Amazon CloudWatch RUM now supports iOS and Android applications, expanding real user monitoring beyond web applications. Developers and SREs can now quickly isolate mobile application issues and improve end-user experience, with visibility into performance metrics such as screen load times, crash rates, and API latencies.
Announcing AWS CloudTrail Event Aggregation and Insights for Data Events
AWS CloudTrail records API calls and events for your AWS account, providing audit trails for governance, compliance, and operational troubleshooting. Customers can also enable data events in CloudTrail to gain deeper visibility into resource-level operations. These include Amazon S3 object-level operations (such as GetObject/PutObject) or AWS Lambda function invocations. Data events help detect unauthorized access, […]
Improve the resiliency with AWS Fault Injection service and Amazon ARC Region switch
System impairments occur frequently in distributed cloud environments, making application resilience critical for customers. While traditional disaster recovery testing approaches are often manual and time-consuming, modern chaos engineering practices help validate your application’s ability to handle failures automatically. Amazon Application Recovery Controller (ARC) simplifies application recovery across AWS Regions and availability zones by providing readiness […]
Handling sensitive log data using Amazon CloudWatch
Introduction Efficient logging is crucial to building effective investigative and response workflows. Logs, metrics and traces offer critical value when investigating application issues, security events and debugging failures. Structured wide-event logs can provide a means to investigate application behaviour without requiring access to data stores. This level of verbosity in application logs increases the likelihood […]
Assess compliance and configuration of Kubernetes resources with AWS Config
Many customers today rely on AWS Config for recording configuration, tracking configuration history, and evaluating compliance of their AWS resources such as Amazon Elastic Compute Cloud (EC2) instances, Amazon Simple Storage Service (S3) buckets, and even Amazon Elastic Kubernetes Service (EKS) clusters. This provides them with a comprehensive view of their AWS infrastructure configuration state […]
Reimagine AIOps with Amazon CloudWatch Investigations and Amazon Nova Sonic
Reimagine AIOps with Amazon CloudWatch Investigations and Amazon Nova Sonic in Amazon Bedrock to transform how cloud operations teams handle incidents. Traditional monitoring approaches require engineers to navigate multiple complex dashboards, analyze extensive logs, and manually execute remediation steps—a process that becomes particularly challenging during after-hours incidents or when away from workstations. When minutes matter […]









