Amazon Web Services

In this AWS re:Invent 2023 session, three AWS operational leaders share valuable insights on building resilient systems at scale. They discuss five key topics: dependencies and modes, blast radius, queues, errors, and retries. The speakers emphasize the importance of thinking beyond traditional availability metrics and focus on shortening time to mitigation when unexpected issues occur. They provide real-world examples from AWS services like Route 53 and EC2, demonstrating how seemingly small changes can have significant impacts at scale. The session offers practical advice on implementing resilience strategies, including proper error classification, thoughtful retry mechanisms, and effective queue management. This talk is essential for anyone looking to improve their system's ability to recover quickly from failures in large-scale environments.

cloud-trends-and-knowledge
skills-and-how-to
resilience
arch-strategy
mgmt-govern
Show 4 more

Up Next

VideoThumbnail
30:23

T3-2 Amazon SageMaker Canvasで始めるノーコード機械学習 (Level 200)

Jun 27, 2025
VideoThumbnail
31:49

T2-3 AWS を使った生成 AI アプリケーション開発 (Level 300)

Jun 27, 2025
VideoThumbnail
26:05

T4-4: AWS 認定 受験準備の進め方 AWS Certified Solutions Architect – Associate 編 後半

Jun 26, 2025
VideoThumbnail
32:15

T3-1: はじめてのコンテナワークロード - AWS でのコンテナ活用の第一歩

Jun 26, 2025
VideoThumbnail
29:37

BOS-09: はじめてのサーバーレス - AWS Lambda でサーバーレスアプリケーション開発 (Level 200)

Jun 26, 2025