AWS Cloud Operations Blog

Category: Management Tools

Search and discover governance controls with Control Catalog in AWS Control Tower

Search and discover governance controls with Control Catalog in AWS Control Tower

As you scale your AWS environment from hundreds to thousands of AWS accounts, maintaining consistent governance standards across this expanded infrastructure requires a strategic approach. Governance controls—the automated policies and rules that enforce standards across your cloud infrastructure—are essential for managing this scale, but implementing them presents two fundamental challenges. First, without proper controls, a […]

Resolve application issues autonomously with AWS DevOps Agent (Preview) and Dynatrace

Application issues require fast resolution to maintain business continuity and customer satisfaction, but manual investigation creates delays that can cost organizations significantly in lost revenue and productivity. Last week, we launched AWS DevOps Agent (Preview), a frontier agent that resolves and proactively prevents incidents, continuously improving reliability and performance of applications in AWS, multicloud, and […]

Amazon CloudWatch RUM now supports mobile application monitoring

Amazon CloudWatch RUM now supports iOS and Android applications, expanding real user monitoring beyond web applications. Developers and SREs can now quickly isolate mobile application issues and improve end-user experience, with visibility into performance metrics such as screen load times, crash rates, and API latencies.

Prometheus MCP Server: AI-Driven Monitoring Intelligence for AWS Users

Prometheus MCP Server: AI-Driven Monitoring Intelligence for AWS Users

We recently launched the open source Prometheus Model Context Protocol (MCP) server for Amazon Managed Service for Prometheus. This new capability enables artificial intelligence (AI) code assistants such as Amazon Q Developer CLI, Cline, and Cursor to interact with your Prometheus monitoring infrastructure through natural language queries. The MCP server provides AI assistants with contextual […]

2025 Top 10 Announcements for AWS Cloud Operations

2025 Top 10 Announcements for AWS Cloud Operations

At AWS re:Invent 2025, we’re excited to share latest innovations designed to empower organizations to thrive in the transformative AI era. This year’s top Cloud Operations announcements address the most pressing challenges our customers face today—from gaining comprehensive visibility into generative AI workloads to significantly accelerating incident resolution and efficiently managing the exponential growth of […]

Announcing AWS CloudTrail Event Aggregation and Insights for Data Events

AWS CloudTrail records API calls and events for your AWS account, providing audit trails for governance, compliance, and operational troubleshooting. Customers can also enable data events in CloudTrail to gain deeper visibility into resource-level operations. These include Amazon S3 object-level operations (such as GetObject/PutObject) or AWS Lambda function invocations. Data events help detect unauthorized access, […]

Enforce consistent tagging across IaC deployments with AWS Organizations Tag Policies

Enforce consistent tagging across IaC deployments with AWS Organizations Tag Policies

Organizations manage thousands of AWS resources across multiple accounts and Regions to support their business operations. They want consistent tagging to support essential workflows such as attribute-based-access-controls (ABAC), cost allocation, organizing resources by project/application/owner/environment, and triggering automated processes based on tag criteria. Many customers use Infrastructure as Code (IaC) tools like AWS CloudFormation, Terraform, and […]

AWS X-Ray SDKs/Daemon migration to OpenTelemetry

AWS X-Ray SDKs/Daemon migration to OpenTelemetry

AWS X-Ray is transitioning to OpenTelemetry as its primary instrumentation standard for application tracing. OpenTelemetry-based instrumentation solutions are recommended for producing traces from applications and sending them to AWS X-Ray. X-Ray’s existing console experience and functionality continuous to be fully supported and remains unchanged by this transition. OpenTelemetry is the industry-wide open-source standard for tracing […]

How Indeed scaled Governance across 1,000+ AWS accounts with AWS Trusted Advisor

Indeed is the #1 job site¹ in the world. With 615 million Job Seeker Profiles², people in more than 60 countries across 28 languages come to Indeed to search for jobs, post resumes, and research companies. Over 3.3 million employers use Indeed to find and hire new employees. Supporting this massive scale requires resilient, well-architected […]

Handling sensitive log data using Amazon CloudWatch

Introduction Efficient logging is crucial to building effective investigative and response workflows. Logs, metrics and traces offer critical value when investigating application issues, security events and debugging failures. Structured wide-event logs can provide a means to investigate application behaviour without requiring access to data stores. This level of verbosity in application logs increases the likelihood […]