Amazon Web Services

In this AWS re:Invent 2023 session, David Yanacek explores how to use observability to make systems more resilient. He walks through several failure modes and demonstrates practical techniques with AWS services such as CloudWatch and X-Ray. Key topics include using dimensionality to diagnose issues, uncovering hidden problems through synthetic workloads and real user monitoring, and preventing future issues with auto-scaling and controlled experiments. Yanacek emphasizes the importance of measuring things that can fail separately and using composite alarms to reduce alert fatigue. The talk is aimed at IT professionals who want to improve their observability practices and operate resilient systems.
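
As a rough illustration of the composite-alarm idea mentioned above, the sketch below builds a CloudWatch composite alarm rule expression that fires when any one of several child alarms is in the ALARM state, so that an operator could be paged once for the group. It is not taken from the session: the helper name `composite_rule` and the child alarm names are hypothetical, and the one-alarm-per-component layout is an assumption.

```python
def composite_rule(alarm_names, operator="OR"):
    """Build a CloudWatch composite alarm rule expression from child alarm names.

    Hypothetical helper for illustration; it only assembles the rule string.
    """
    if operator not in ("AND", "OR"):
        raise ValueError("operator must be 'AND' or 'OR'")
    if not alarm_names:
        raise ValueError("at least one child alarm name is required")
    # Each term is true while the named child alarm is in the ALARM state.
    return f" {operator} ".join(f'ALARM("{name}")' for name in alarm_names)


# Hypothetical child alarms, assumed to exist already, one per component.
child_alarms = ["checkout-db-errors", "checkout-cache-errors", "checkout-api-latency"]
rule = composite_rule(child_alarms)
print(rule)
```

The resulting string could then be supplied as the `AlarmRule` argument when creating a composite alarm, for example through boto3's `put_composite_alarm`, with notifications attached to the composite alarm only.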

product-information
skills-and-how-to
resilience
devtools
mgmt-govern
