Monitoring and Observability
Gain insights and improve the performance of your applications and infrastructure
Why Monitoring and Observability?
Benefits
Understand application health
Know what is going on anywhere and everywhere in your system to provide the best possible experience for your end users. Detect problems quickly, investigate efficiently, and remediate as soon as possible to minimize disruption for your customers and reduce Mean Time to Resolution (MTTR).
Accelerate collaboration
When application issues occur, engage the correct stakeholder for any alerts from the beginning. IT and business teams are able to automate mundane and repetitive tasks while streamlining complex ones. Working together, IT and business teams can use insights from observability data to take a more user-centric approach and deliver exceptional end user experiences.
Reduce operational cost
Across hundreds of thousands of instances, a small percentage performance improvement in how much CPU an application uses can add up to millions of dollars in savings. Similarly, by using observability to understand and predict your future capacity needs, you can take advantage of the cost savings available from reserve and spot pricing.
Increase customer satisfaction
Elevate your customer experiences and business outcomes when you improve application, infrastructure, and network availability. Reduce downtimes and build fast, seamless digital experiences for your end customers. This allows both your internal teams and the end customers to operate efficiently to develop and deploy faster.

Over the past year, CloudWatch Synthetics and a simple system based on Amazon CloudWatch Alarms, Amazon SES, and AWS Lambda functions have proactively allowed us to respond to our customer's application and infrastructure issues. With CloudWatch Synthetics, our DevOps and support teams have been able begin analyzing and resolving problems even before the client notifies us of the issue. CloudWatch Synthetics is a critical component of exceeding SLAs/SLOs for our customers and, ultimately, our success.
SVP of Data and Analytic, Lennar
Lee Slezak
We were looking for an easy and seamless integration that we could get up and running quickly to collect core web vital metrics for our products. We have been using Amazon CloudWatch RUM to monitor our website performance, specifically page load times, JavaScript errors, and other core web vital metrics. Using RUM has helped our team collect and measure real-world performance metrics of our websites, while also giving us a unified way to collect and analyze that data. What made RUM stand out was how it integrated seamlessly with our products and other parts of CloudWatch, allowing us to use collected data for further processing, without the added worry of loss of connectivity or data shortage.
SVP of Data and Analytic, Lennar
Lee Slezak
We were looking to consolidate all our monitoring, logging, metrics, and alerting under one tool. CloudWatch has helped us alleviate the operational burden to set up, configure, and learn third-party systems. Our teams use CloudWatch extensively to monitor error rates and status codes for multiple high-profile workloads. CloudWatch enables next-level automation and expands the capacity of each individual.
SVP of Data and Analytic, Lennar
Lee Slezak
HP Print Org supports over 500 services running on Amazon Elastic Kubernetes Service (EKS). The team used self-hosted Prometheus to monitor the hardware and services metrics. As the platform grew, they struggled to keep up with the monitoring, especially maintaining the self-hosted, multi-region Prometheus setup
SVP of Data and Analytic, Lennar
Lee Slezak