This Guidance demonstrates how customer applications can scan artifacts for Personally Identifiable Information (PII), financial information or credentials, and other sensitive information with Amazon Macie.

Architecture Diagram

Download the architecture diagram PDF 

Well-Architected Pillars

The AWS Well-Architected Framework helps you understand the pros and cons of the decisions you make when building systems in the cloud. The six pillars of the Framework allow you to learn architectural best practices for designing and operating reliable, secure, efficient, cost-effective, and sustainable systems. Using the AWS Well-Architected Tool, available at no charge in the AWS Management Console, you can review your workloads against these best practices by answering a set of questions for each pillar.

The architecture diagram above is an example of a Solution created with Well-Architected best practices in mind. To be fully Well-Architected, you should follow as many Well-Architected best practices as possible.

  • This Guidance uses Infrastructure as Code with technologies such as the AWS Cloud Development Kit (AWS CDK) and/or AWS CloudFormation templates. All Lambda function operations are logged in CloudWatch. These solutions detect the presence of sensitive information that is tracked by Macie.

    Read the Operational Excellence whitepaper 
  • For secure authentication and authorization, this Guidance requires identities to acquire temporary credentials.

    Customer managed policies must be created for the Lambda function implementation roles following the principle of least privilege.

    Macie uses VPC endpoints to access Amazon S3 whereas access to API Gateway is protected using AWS WAF.  The Amazon S3 buckets store the artifacts and the results or findings must be encrypted using AWS KMS keys.

    Read the Security whitepaper 
  • This Guidance implements a reliable application architecture by leveraging serverless technology, including Macie, that logs job status events to CloudWatch. To support data backup and recovery, DynamoDB tables must be periodically backed up and all of the Amazon S3 buckets must be replicated into a different region.

    Read the Reliability whitepaper 
  • Macie is purpose built for sensitive data discovery and classification. To meet workload requirements, highly scalable managed services such as Lambda functions, API Gateway, and DynamoDB are leveraged.  

    Read the Performance Efficiency whitepaper 
  • For this Guidance, cost optimization is achieved by leveraging serverless technology (for example: Lambda). The scale and costs are dictated by Macie to ensure that only the minimum resources are required.

    Read the Cost Optimization whitepaper 
  • To scale and continually match the load to ensure only the minimum resources are required, this Guidance utilizes serverless (Lambda) and event driven (EventBridge) technologies.

    Read the Sustainability whitepaper 

Implementation Resources

A detailed guide is provided to experiment and use within your AWS account. Each stage of building the Guidance, including deployment, usage, and cleanup, is examined to prepare it for deployment.

The sample code is a starting point. It is industry validated, prescriptive but not definitive, and a peek under the hood to help you begin.

[Content Type]


This [blog post/e-book/Guidance/sample code] demonstrates how [insert short description].


The sample code; software libraries; command line tools; proofs of concept; templates; or other related technology (including any of the foregoing that are provided by our personnel) is provided to you as AWS Content under the AWS Customer Agreement, or the relevant written agreement between you and AWS (whichever applies). You should not use this AWS Content in your production accounts, or on production or other critical data. You are responsible for testing, securing, and optimizing the AWS Content, such as sample code, as appropriate for production grade use based on your specific quality control practices and standards. Deploying AWS Content may incur AWS charges for creating or using AWS chargeable resources, such as running Amazon EC2 instances or using Amazon S3 storage.

Was this page helpful?