AWS Cloud Operations Blog

Category: Amazon CloudWatch

Investigating Service Issues with Amazon CloudWatch Application Signals Custom Metrics

Investigating Service Issues with Amazon CloudWatch Application Signals Custom Metrics

When a critical service fails, you need to know how much revenue you’re losing, not just that latency has increased. This post shows you how to integrate business metrics with CloudWatch Application Signals to see both technical performance and business impact in one unified view. With CloudWatch Application Signals, you can view metrics, traces, and […]

CrossRegionPrivateLinkNetworkSyntheticMonitor

Cross-Region AWS PrivateLink monitoring with Amazon CloudWatch Network Synthetic Monitor

Introduction Global, distributed AWS architectures are the backbone for customers seeking high availability, resilience, and regulatory compliance. Workloads are commonly deployed across multiple AWS Regions and Availability Zones (AZs), often using AWS PrivateLink to connect services securely and privately across Amazon Virtual Private Cloud (Amazon VPC) networks. This approach enhances security and separation while requiring […]

Amazon CloudWatch RUM now supports mobile application monitoring

Amazon CloudWatch RUM now supports iOS and Android applications, expanding real user monitoring beyond web applications. Developers and SREs can now quickly isolate mobile application issues and improve end-user experience, with visibility into performance metrics such as screen load times, crash rates, and API latencies.

AWS X-Ray SDKs/Daemon migration to OpenTelemetry

AWS X-Ray SDKs/Daemon migration to OpenTelemetry

AWS X-Ray is transitioning to OpenTelemetry as its primary instrumentation standard for application tracing. OpenTelemetry-based instrumentation solutions are recommended for producing traces from applications and sending them to AWS X-Ray. X-Ray’s existing console experience and functionality continuous to be fully supported and remains unchanged by this transition. OpenTelemetry is the industry-wide open-source standard for tracing […]

Handling sensitive log data using Amazon CloudWatch

Introduction Efficient logging is crucial to building effective investigative and response workflows. Logs, metrics and traces offer critical value when investigating application issues, security events and debugging failures. Structured wide-event logs can provide a means to investigate application behaviour without requiring access to data stores. This level of verbosity in application logs increases the likelihood […]

Amazon CloudWatch Application Signals new enhancements for application monitoring

Amazon CloudWatch Application Signals new enhancements for application monitoring

Today, we’re excited to announce new enhanced features in Amazon CloudWatch Application Signals that simplifies how you monitor large-scale distributed applications. Improvements to CloudWatch Application Signals application map automatically discovers and organizes services into groups based on their relationships, with support for custom grouping that aligns with your business perspective. You can now view the […]

Embracing AI- driven operations and observability at re:Invent 2025

Embracing AI- driven operations and observability at re:Invent 2025

As organizations continue to scale their cloud presence, effective operations become increasingly critical for success. AWS re:Invent 2025’s Cloud Operations track brings together industry experts, AWS leaders, and customers to share insights on modernizing monitoring & observability through This blog post will guide you through the key themes of operations and observability and highlight sessions […]

Amazon Nova Sonic in Amazon Bedrock

Reimagine AIOps with Amazon CloudWatch Investigations and Amazon Nova Sonic

Reimagine AIOps with Amazon CloudWatch Investigations and Amazon Nova Sonic in Amazon Bedrock to transform how cloud operations teams handle incidents. Traditional monitoring approaches require engineers to navigate multiple complex dashboards, analyze extensive logs, and manually execute remediation steps—a process that becomes particularly challenging during after-hours incidents or when away from workstations. When minutes matter […]

Simplifying Log Management using Amazon CloudWatch Logs Centralization

Managing logs across multiple AWS accounts and regions has always been a complex challenge for organizations. As AWS infrastructure grows to include separate accounts for production, development, and staging environments, along with regions, the complexity of log management increases exponentially. During critical incidents, especially during off-hours, teams spend valuable time, searching through multiple accounts, correlating […]

Optimizing metrics ingestion with Amazon Managed Service for Prometheus

Optimizing metrics ingestion with Amazon Managed Service for Prometheus

Managing metrics collection at scale in complex cloud environments presents significant challenges for organizations, particularly when it comes to controlling costs and maintaining operational efficiency. As the volume of metrics grows exponentially with the expansion of container deployments and other cloud-native workloads, customers often struggle to balance comprehensive monitoring with resource optimization. This can lead […]