AWS Architecture Blog
A multi-dimensional approach helps you proactively prepare for failures, Part 3: Operations and process resiliency
In Part 1 and Part 2 of this series, we discussed how to build application layer and infrastructure layer resiliency. In Part 3, we explore how to develop resilient applications, and the need to test and break our operational processes and run books. Processes are needed to capture baseline metrics and boundary conditions. Detecting deviations […]
A multi-dimensional approach helps you proactively prepare for failures, Part 2: Infrastructure layer
Distributed applications resiliency is a cumulative resiliency of applications, infrastructure, and operational processes. Part 1 of this series explored application layer resiliency. In Part 2, we discuss how using Amazon Web Services (AWS) managed services, redundancy, high availability, and infrastructure failover patterns based on recovery time and point objectives (RTO and RPO, respectively) can help in […]
A multi-dimensional approach helps you proactively prepare for failures, Part 1: Application layer
Resiliency of applications surpasses everything else in building customer trust. Because of this, it cannot be an afterthought. Instead of simply reacting to a failure, why not be proactive? As your system expands, you’ll likely encounter issues that can hinder your ability to scale, like security and cost. So, it’s necessary to think about the […]


