Guidance for Tokenization to Improve Data Security and Reduce Audit Scope on AWS
Overview
How it works
These technical details feature an architecture diagram to illustrate how to effectively use this solution. The architecture diagram shows the key components and their interactions, providing an overview of the architecture's structure and functionality step-by-step.
Well-Architected Pillars
The architecture diagram above is an example of a Solution created with Well-Architected best practices in mind. To be fully Well-Architected, you should follow as many Well-Architected best practices as possible.
Operational Excellence
AWS X-Ray and Amazon CloudWatch Logs enable visualization and logging of tokenization transactions across API Gateway, Lambda functions, and Lambda layers. By visualizing traces and collecting logs, users can more easily troubleshoot performance bottlenecks or identify failures.
Moreover, the AWS Database Encryption software development kit (SDK) for DynamoDB provides APIs for encryption, decryption, and key management, reducing overhead compared to manual service integrations and cryptographic implementations.
Lastly, the included AWS CloudFormation template automates provisioning of required resources, streamlining deployment to support users with quick experimentation, and reducing the overhead of manually configuring services.
Security
The services selected for this Guidance work in tandem to secure API access, protect the sensitive data network, enable fine-grained access control, manage encryption keys to reduce risk, and enforce mutual TLS. Specifically, AWS WAF filters incoming traffic to allow only legitimate access to tokenization APIs, preventing distributed denial of service (DDoS) attacks. Amazon VPC endpoints and AWS PrivateLink control network-level access to DynamoDB tables storing sensitive data and keys. The AWS IAM Access Analyzer provides insights to fine-tune access permissions. AWS KMS manages the encryption keys used by the tokenization Lambda function. Amazon Cognito handles user authentication and authorization for the tokenization APIs. And lastly, the Database Encryption SDK for DynamoDB generates secure data encryption keys from AWS KMS and stores encrypted data in DynamoDB.
Reliability
The API Gateway API keys help to rate limit APIs for different API clients and set burst rate limits for managing transactions per second. AWS KMS has a request per second quota on cryptographic operations, and API throttling prevents requests from exceeding the current quota limit. Lambda makes the tokenization APIs highly scalable to meet the fluctuating demands of tokenizing sensitive data, while the AWS Serverless Application Model (AWS SAM) simplifies the deployment of new code versions and automation templates.
Furthermore, the use of private subnets deployed across multiple Availability Zones (AZs), Regional services with built-in resilience and high availability, multi-AZ Amazon VPC endpoints, and Amazon DynamoDB global tables provide enhanced reliability and availability. AWS SAM also provides a higher-level abstraction on top of CloudFormation to define Lambda functions and enable local unit testing. Collectively, these services provide the framework to help ensure workloads perform their intended functions correctly and consistently, while also enabling quick recovery from failures.
Performance Efficiency
API Gateway and Lambda enable near real-time, synchronous, event-driven communication between the client (UI) and server. The Lambda function can also handle thousands of tokenization requests per second in real-time. Similarly, API Gateway can handle thousands of API requests per second in real-time to tokenize sensitive data when a user submits information on a web page.
Furthermore, DynamoDB allows for the storage of unstructured information at scale with a latency of less than a few milliseconds. Moreover, DynamoDB provides a low-latency database layer for storing encrypted sensitive information and generated tokens.
Cost Optimization
The Lambda function allows memory and CPU requirements to be optimized for price and performance using the AWS Lambda Power Tuning tool. Users can also select the Amazon DynamoDB Standard-Infrequent Access (Standard-IA) storage class for workloads that require long-term storage of infrequently accessed data, thereby optimizing storage costs. Both Lambda and DynamoDB provide on-demand and provisioned capacity options to cater to various price and performance scenarios.
Lastly, PrivateLink optimizes the data transfer costs by keeping the network traffic within the AWS network and avoiding charges for NAT gateway, a Network Address Translation (NAT) service.
Sustainability
Lambda, API Gateway, and DynamoDB are designed to scale dynamically to meet the demand for optimized resource utilization, thereby reducing the energy usage required to run the servers. These are serverless services that optimize resource utilization and dynamically scale to match the demands of the tokenization and de-tokenization APIs. The storage and compute layers scale dynamically to accommodate the incoming traffic demands, which in turn reduces the overall energy usage and environmental impacts.
Disclaimer
Did you find what you were looking for today?
Let us know so we can improve the quality of the content on our pages