AWS Fault Injection Simulator Documentation

AWS Fault Injection Simulator is a fault injection service that makes it easier for teams to discover an application’s weaknesses in order to improve performance, observability, and resiliency.

Setup

AWS Fault Injection Simulator supports engineering parameters to make it easier to build and run fault injection experiments, without needing to install agents. Sample experiments are available to use as a starting point. Fault injection actions define the disruption to apply, such as stopping an instance, throttling an API, or failing over a database. Fault Injection Simulator is designed to work with Amazon CloudWatch so that you can use your existing metrics to monitor Fault Injection Simulator experiments.
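
As a rough illustration of how an experiment is defined, the sketch below builds the request body for an experiment template that stops one tagged EC2 instance and restarts it after five minutes. The function name, the target and action names, the role ARN, and the tag values are hypothetical placeholders. The field names follow the CreateExperimentTemplate API as we understand it, so check them against the current API reference before use.

```python
import json
import uuid


def build_stop_instance_template(role_arn: str, tag_key: str, tag_value: str) -> dict:
    """Build a request body for a template that stops one tagged EC2 instance."""
    return {
        # Idempotency token so a retried request does not create a duplicate.
        "clientToken": str(uuid.uuid4()),
        "description": "Stop one tagged EC2 instance, then restart it",
        # IAM role that the service assumes to act on the targets.
        "roleArn": role_arn,
        "targets": {
            # "tagged-instances" is a hypothetical target name.
            "tagged-instances": {
                "resourceType": "aws:ec2:instance",
                "resourceTags": {tag_key: tag_value},
                "selectionMode": "COUNT(1)",
            }
        },
        "actions": {
            # "stop-instance" is a hypothetical action name.
            "stop-instance": {
                "actionId": "aws:ec2:stop-instances",
                # ISO 8601 duration: restart the instance after 5 minutes.
                "parameters": {"startInstancesAfterDuration": "PT5M"},
                "targets": {"Instances": "tagged-instances"},
            }
        },
        # No automatic stop condition in this minimal sketch.
        "stopConditions": [{"source": "none"}],
    }


template = build_stop_instance_template(
    "arn:aws:iam::111122223333:role/ExampleFisRole",  # placeholder role ARN
    "Environment",
    "staging",
)
print(json.dumps(template, indent=2))
# Assuming boto3 is installed and credentials are configured, the body could
# be submitted with:
#   boto3.client("fis").create_experiment_template(**template)
```

No agent is involved here: the template only names an action, a target selection, and a role.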

Run real-world scenarios

Simple scenarios can be insufficient to reproduce the real-world conditions that cause failures. AWS Fault Injection Simulator supports impairing the performance of different types of resources, APIs, services, and geographic locations. Affected resources can be randomized, and custom fault types can be created using AWS Systems Manager.
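
The two ideas in this paragraph, randomized targets and custom faults through AWS Systems Manager, might be expressed in a template roughly as follows. This is a sketch under assumptions: the helper names are hypothetical, the `PERCENT(n)` selection mode and the `aws:ssm:send-command` action are used as we understand them from the API reference, and the SSM document ARN and its parameters are placeholders for whichever document you choose to run.

```python
import json


def random_percent_target(resource_type: str, tags: dict, percent: int) -> dict:
    """Target definition that selects a percentage of the matching resources."""
    if not 1 <= percent <= 100:
        raise ValueError("percent must be between 1 and 100")
    return {
        "resourceType": resource_type,
        "resourceTags": dict(tags),
        # The service chooses which matching resources make up the percentage.
        "selectionMode": f"PERCENT({percent})",
    }


def ssm_command_action(document_arn: str, document_parameters: dict,
                       duration: str, target_name: str) -> dict:
    """Action definition that runs an SSM document on the selected instances."""
    return {
        "actionId": "aws:ssm:send-command",
        "parameters": {
            "documentArn": document_arn,
            # The document parameters are passed as a JSON-encoded string.
            "documentParameters": json.dumps(document_parameters),
            # ISO 8601 duration for how long the action runs.
            "duration": duration,
        },
        "targets": {"Instances": target_name},
    }


target = random_percent_target("aws:ec2:instance", {"Environment": "staging"}, 50)
action = ssm_command_action(
    "arn:aws:ssm:us-east-1::document/AWSFIS-Run-CPU-Stress",  # placeholder document
    {"DurationSeconds": "120"},
    "PT3M",
    "half-of-staging",  # hypothetical target name
)
print(json.dumps({"targets": {"half-of-staging": target},
                  "actions": {"cpu-stress": action}}, indent=2))
```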

Safety controls

AWS Fault Injection Simulator is designed to help you target experiments by environment, application, and other dimensions using tags. Tags act as guardrails that help keep your fault injection experiments under control. Fault Injection Simulator also has the option to set rules based on Amazon CloudWatch Alarms or other tools to stop an experiment.
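
A stop rule based on a CloudWatch alarm could be attached to a template along these lines. This is a minimal sketch: the helper name and the alarm ARN are hypothetical, the ARN check is a deliberately loose sanity test, and the `aws:cloudwatch:alarm` source value reflects the API as we understand it.

```python
def alarm_stop_conditions(alarm_arns: list) -> list:
    """Turn CloudWatch alarm ARNs into a stopConditions list for a template."""
    conditions = []
    for arn in alarm_arns:
        # Loose check only: catch obviously wrong values early.
        if ":cloudwatch:" not in arn or ":alarm:" not in arn:
            raise ValueError(f"not a CloudWatch alarm ARN: {arn}")
        conditions.append({"source": "aws:cloudwatch:alarm", "value": arn})
    # A template with no alarm-based rule declares that explicitly.
    return conditions or [{"source": "none"}]


stop_conditions = alarm_stop_conditions([
    # Placeholder alarm ARN.
    "arn:aws:cloudwatch:us-east-1:111122223333:alarm:HighErrorRate",
])
print(stop_conditions)
```

When the referenced alarm goes into the alarm state while the experiment is running, the experiment is stopped.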

Security model

AWS Fault Injection Simulator is integrated with AWS Identity and Access Management (IAM), which helps you control which users and resources have permission to access and run Fault Injection Simulator experiments, and which resources and services can be affected.
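
To make the two sides of this model concrete, the sketch below builds two IAM policy documents: a trust policy that lets the service assume an experiment role, and a permissions policy that limits that role to stopping and starting EC2 instances carrying a given tag. The function names and tag values are hypothetical, and the policies are illustrative rather than a recommended baseline. Review them against your own requirements.

```python
import json


def experiment_role_trust_policy() -> dict:
    """Trust policy allowing the fault injection service to assume the role."""
    return {
        "Version": "2012-10-17",
        "Statement": [{
            "Effect": "Allow",
            "Principal": {"Service": "fis.amazonaws.com"},
            "Action": "sts:AssumeRole",
        }],
    }


def tagged_instance_policy(tag_key: str, tag_value: str) -> dict:
    """Permissions policy limiting the role to instances with a specific tag."""
    return {
        "Version": "2012-10-17",
        "Statement": [{
            "Effect": "Allow",
            "Action": ["ec2:StopInstances", "ec2:StartInstances"],
            "Resource": "arn:aws:ec2:*:*:instance/*",
            # Only instances tagged tag_key=tag_value can be affected.
            "Condition": {
                "StringEquals": {f"aws:ResourceTag/{tag_key}": tag_value}
            },
        }],
    }


trust = experiment_role_trust_policy()
permissions = tagged_instance_policy("Environment", "staging")
print(json.dumps(trust, indent=2))
print(json.dumps(permissions, indent=2))
```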

Visibility throughout an experiment

AWS Fault Injection Simulator is designed to provide visibility throughout the stages of an experiment via the console and APIs. While an experiment is running, AWS Fault Injection Simulator helps you observe which actions have executed. After an experiment has completed, you can see details on which actions were run, whether stop conditions were triggered, how metrics compared to your expected steady state, and more. To support accurate operational metrics and effective troubleshooting, you can also identify which resources and APIs are affected by a Fault Injection Simulator experiment.
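
As one way to read experiment progress through the API, the sketch below condenses a GetExperiment-style response into a few status lines, one for the experiment and one per action. The function name is hypothetical and the sample response is hand-written for illustration. Its shape follows the API as we understand it, but it is not captured service output.

```python
def summarize_experiment(response: dict) -> list:
    """Return one status line for the experiment and one per action."""
    experiment = response["experiment"]
    lines = [f"experiment {experiment['id']}: {experiment['state']['status']}"]
    # Actions are keyed by the names given in the experiment template.
    for name, action in sorted(experiment.get("actions", {}).items()):
        status = action.get("state", {}).get("status", "unknown")
        lines.append(f"  action {name} ({action.get('actionId')}): {status}")
    return lines


# Hand-written sample response, for illustration only.
sample_response = {
    "experiment": {
        "id": "EXP-EXAMPLE",
        "state": {"status": "running"},
        "actions": {
            "stop-instance": {
                "actionId": "aws:ec2:stop-instances",
                "state": {"status": "completed"},
            },
            "cpu-stress": {
                "actionId": "aws:ssm:send-command",
                "state": {"status": "running"},
            },
        },
    }
}

summary = summarize_experiment(sample_response)
for line in summary:
    print(line)
```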

Console and programmatic access

You can use AWS Fault Injection Simulator with the AWS Management Console, AWS CLI, and AWS SDKs. The Fault Injection Simulator APIs allow you to integrate fault injection testing into your continuous integration and continuous delivery (CI/CD) pipeline and custom tooling.
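
A pipeline step could start an experiment and wait for it to finish roughly as sketched here. The function name, the timeout values, and the template ID are hypothetical. The client is passed in so the logic can be exercised without AWS access, and the `start_experiment` and `get_experiment` calls and the status names mirror the boto3 `fis` client as we understand it.

```python
import time
import uuid

# Statuses after which the experiment no longer changes (assumed set).
TERMINAL_STATUSES = {"completed", "stopped", "failed", "cancelled"}


def run_experiment(client, template_id: str, poll_seconds: float = 15.0,
                   timeout_seconds: float = 1800.0, sleep=time.sleep) -> str:
    """Start an experiment from a template and return its final status."""
    started = client.start_experiment(
        clientToken=str(uuid.uuid4()),  # idempotency token
        experimentTemplateId=template_id,
    )
    experiment_id = started["experiment"]["id"]
    waited = 0.0
    while True:
        response = client.get_experiment(id=experiment_id)
        status = response["experiment"]["state"]["status"]
        if status in TERMINAL_STATUSES:
            return status
        if waited >= timeout_seconds:
            raise TimeoutError(f"experiment {experiment_id} still {status}")
        sleep(poll_seconds)
        waited += poll_seconds


# In a pipeline step, assuming boto3 is installed and credentials are set:
#   import sys, boto3
#   status = run_experiment(boto3.client("fis"), "EXT-EXAMPLE")
#   sys.exit(0 if status == "completed" else 1)
```

Treating anything other than `completed` as a failed step is a design choice for this sketch. A pipeline might instead inspect why an experiment stopped before deciding whether to fail the build.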

Additional Information

For additional information about service controls, security features and functionalities, including, as applicable, information about storing, retrieving, modifying, restricting, and deleting data, please see https://docs.aws.amazon.com/index.html. This additional information does not form part of the Documentation for purposes of the AWS Customer Agreement available at http://aws.amazon.com/agreement, or other agreement between you and AWS governing your use of AWS’s services.