AWS Solutions Library

Guidance for Creating a Personalized Avatar with Amazon SageMaker

Overview

This Guidance shows how to fine-tune and deploy a Stable Diffusion model to generate personalized avatars from a simple text prompt. Stable Diffusion is a text-to-image model, a type of generative artificial intelligence (AI) that creates images from text descriptions. Here, the models are built with Amazon SageMaker and fine-tuned with the DreamBooth approach, which uses 10-15 images of the user to capture the precise details of the subject. The resulting model generates personalized avatars that can be used in a variety of applications, including social media, gaming, and virtual events. Because avatars are generated from text prompts, users can request specific styles or scenes, which gives media and entertainment organizations more ways to develop personalized content tailored to the consumer.
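
As a minimal sketch of the inputs described above, the following Python checks that a training set falls in the 10-15 image range and builds the text prompts. The identifier token `sks`, the prompt wording, and the helper names are illustrative assumptions, not taken from the sample code.

```python
from dataclasses import dataclass

# The 10-15 range comes from the Guidance text; everything else here
# (token, prompt wording, helper names) is a hypothetical illustration.
MIN_IMAGES, MAX_IMAGES = 10, 15


@dataclass(frozen=True)
class AvatarSubject:
    token: str        # rare identifier bound to the user during fine-tuning
    class_noun: str   # broad class the subject belongs to, e.g. "person"

    def instance_prompt(self) -> str:
        """Prompt paired with the user's photos during fine-tuning."""
        return f"a photo of {self.token} {self.class_noun}"

    def avatar_prompt(self, style: str) -> str:
        """Inference-time prompt that reuses the identifier in a new style."""
        return f"{style} portrait of {self.token} {self.class_noun}"


def check_training_set(image_paths: list[str]) -> list[str]:
    """Return sorted, de-duplicated paths, or raise if the count is not 10-15."""
    unique = sorted(set(image_paths))
    if not MIN_IMAGES <= len(unique) <= MAX_IMAGES:
        raise ValueError(
            f"expected {MIN_IMAGES}-{MAX_IMAGES} images, got {len(unique)}"
        )
    return unique
```

For example, `AvatarSubject("sks", "person").avatar_prompt("watercolor")` yields a prompt that asks for the fine-tuned subject in a watercolor style.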

This Guidance provides an AI-based approach for helping media and entertainment organizations develop personalized, tailored content at scale. However, users of this Guidance should take precautions to ensure these AI capabilities are not abused or manipulated. Visit Safe image generation and diffusion models with Amazon AI content moderation services to learn about safeguarding content through a proper moderation mechanism.

How it works

The architecture diagram for this Guidance shows the key components and their interactions, and walks through the structure and function of the solution step by step.

Download the architecture diagram

Well-Architected Pillars

The architecture diagram above is an example of a Solution created with Well-Architected best practices in mind. To be fully Well-Architected, you should follow as many Well-Architected best practices as possible.

Operational Excellence

This Guidance uses SageMaker multi-model endpoints and Amazon CloudWatch to support operational excellence. SageMaker multi-model endpoints let you deploy many models behind a single endpoint, reducing the number of endpoints you need to manage. SageMaker loads and caches models based on your traffic patterns. You can add or update a model without redeploying the endpoint by uploading the model artifact to the Amazon S3 location that the endpoint serves models from. SageMaker also integrates with CloudWatch, where you can track metrics, events, and log files from your models and gain insight into their performance. You can set up alarms to detect issues before they impact the customer experience.
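
To illustrate how one endpoint can serve many per-user models, here is a hedged sketch of a multi-model invocation. `EndpointName`, `TargetModel`, `ContentType`, and `Body` are parameters of the SageMaker runtime `InvokeEndpoint` call; the endpoint name, artifact name, and JSON payload shape are assumptions, because the payload depends on the serving container.

```python
import json
from typing import Any


def build_invoke_request(
    endpoint_name: str, model_artifact: str, prompt: str
) -> dict[str, Any]:
    """Build keyword arguments for the SageMaker runtime InvokeEndpoint call.

    TargetModel names the artifact relative to the endpoint's S3 location,
    which is how a multi-model endpoint selects the model to load.
    """
    return {
        "EndpointName": endpoint_name,
        "TargetModel": model_artifact,
        "ContentType": "application/json",
        # Assumption: the container accepts a JSON body with a "prompt" key.
        "Body": json.dumps({"prompt": prompt}),
    }


def generate_avatar(runtime_client: Any, request: dict[str, Any]) -> bytes:
    """Send the request and return the raw response payload.

    runtime_client is expected to be boto3.client("sagemaker-runtime");
    it is passed in so the function can be exercised with a stub.
    """
    response = runtime_client.invoke_endpoint(**request)
    return response["Body"].read()
```

Passing the client as an argument is a design choice for testability; in a deployed function you would typically create it once at module load.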

Read the Operational Excellence whitepaper

Security

API Gateway provides built-in mechanisms to authenticate and authorize API requests, and it supports throttling to help protect your backend resources from excessive traffic, denial-of-service attacks, and other abuse. You can use Amazon Cognito user pools, OAuth 2.0, or IAM roles to control access to your APIs. To protect data in transit, API Gateway encrypts traffic to your endpoint with SSL/TLS. Consider adding AWS WAF, a web application firewall, in front of API Gateway to protect applications from web-based attacks and exploits, and AWS Shield to protect your workloads from distributed denial of service (DDoS) attacks.
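
API Gateway throttling is based on a token bucket with a steady-state rate and a burst capacity. The sketch below is a generic token bucket in Python, intended only to show the idea; it is not API Gateway's implementation, and the class and parameter names are assumptions.

```python
class TokenBucket:
    """Generic token bucket: `rate` tokens refill per second, up to `burst`."""

    def __init__(self, rate: float, burst: float) -> None:
        self.rate = rate      # steady-state requests per second
        self.burst = burst    # largest burst admitted at once
        self.tokens = burst   # start with a full bucket
        self.last = 0.0       # time of the previous request, in seconds

    def allow(self, now: float) -> bool:
        """Return True if a request arriving at time `now` is admitted."""
        elapsed = max(0.0, now - self.last)
        self.tokens = min(self.burst, self.tokens + elapsed * self.rate)
        self.last = now
        if self.tokens >= 1.0:
            self.tokens -= 1.0
            return True
        # A throttled caller would receive HTTP 429 Too Many Requests.
        return False
```

With `rate=1.0` and `burst=2.0`, two back-to-back requests are admitted, a third at the same instant is rejected, and one more is admitted a second later.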

Read the Security whitepaper

Reliability

This Guidance uses API Gateway, Lambda, and SageMaker to enhance the reliability of your workloads. API Gateway provides built-in fault tolerance and automatic scaling to handle spikes in traffic. It also integrates with Lambda and SageMaker, making it easier to build scalable, serverless APIs. SageMaker is designed to provide high reliability and availability for running machine learning workloads and serving machine learning models. It provides managed automatic scaling, fault tolerance, health checks, monitoring, and diagnostics, and it runs on distributed infrastructure spread across multiple Availability Zones. Together, these capabilities support reliable model training and inference.
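
The API Gateway, Lambda, and SageMaker integration can be sketched as a Lambda handler that validates the request and degrades gracefully when the backend fails. The event and response shapes follow the API Gateway Lambda proxy integration format; the `invoke` callable, the `prompt` field, and the error messages are hypothetical.

```python
import json
from typing import Any, Callable


def _response(status: int, payload: dict) -> dict:
    """Shape a reply in the API Gateway Lambda proxy integration format."""
    return {
        "statusCode": status,
        "headers": {"Content-Type": "application/json"},
        "body": json.dumps(payload),
    }


def make_handler(invoke: Callable[[str], str]) -> Callable[[dict, Any], dict]:
    """Create a Lambda handler that forwards a prompt to `invoke`.

    `invoke` is a hypothetical callable wrapping the SageMaker endpoint
    call; injecting it keeps the handler testable without AWS credentials.
    """
    def handler(event: dict, context: Any) -> dict:
        try:
            body = json.loads(event.get("body") or "{}")
        except json.JSONDecodeError:
            return _response(400, {"error": "body must be valid JSON"})
        prompt = body.get("prompt", "") if isinstance(body, dict) else ""
        if not isinstance(prompt, str) or not prompt.strip():
            return _response(400, {"error": "prompt is required"})
        try:
            result = invoke(prompt.strip())
        except Exception:
            # Return a generic error instead of leaking backend details.
            return _response(502, {"error": "inference backend unavailable"})
        return _response(200, {"result": result})

    return handler
```

Rejecting malformed input before calling the endpoint avoids spending inference capacity on requests that cannot succeed.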

Read the Reliability whitepaper

Performance Efficiency

This Guidance uses SageMaker to enhance performance efficiency. SageMaker provides a high-performance, low-latency inference service for hosting machine learning models. You can configure the instance type, instance count, and other deployment settings to right-size your inference workload, optimizing for latency, throughput, and cost.
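
As a sketch of where instance type and count are set, the following builds the keyword arguments for the SageMaker `CreateEndpointConfig` call. The field names are from that API; the default instance type, the variant name, and the helper itself are assumptions chosen for illustration.

```python
from typing import Any


def endpoint_config_request(
    config_name: str,
    model_name: str,
    instance_type: str = "ml.g5.2xlarge",  # assumption: a GPU instance type
    instance_count: int = 1,
) -> dict[str, Any]:
    """Build keyword arguments for the SageMaker CreateEndpointConfig call."""
    if instance_count < 1:
        raise ValueError("instance_count must be at least 1")
    return {
        "EndpointConfigName": config_name,
        "ProductionVariants": [
            {
                "VariantName": "AllTraffic",  # hypothetical variant name
                "ModelName": model_name,
                "InstanceType": instance_type,
                "InitialInstanceCount": instance_count,
            }
        ],
    }
```

The result would be passed as `boto3.client("sagemaker").create_endpoint_config(**request)`; changing `instance_type` or `instance_count` is how you trade latency and throughput against cost.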

Read the Performance Efficiency whitepaper

Cost Optimization

SageMaker multi-model endpoints provide a scalable and cost-effective way to deploy large numbers of models. These endpoints use the same container to host all of your models, which reduces the overhead of managing separate endpoints. When some models receive little traffic, they share resources with the others, which improves infrastructure utilization and lowers cost compared with running a separate endpoint for each model.
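
The saving from resource sharing can be estimated with simple arithmetic. The sketch below compares one always-on instance per model against a shared multi-model endpoint; the hourly rate, the number of models that fit on one instance, and the 730-hour month are hypothetical inputs, not AWS pricing.

```python
import math


def monthly_cost(instances: int, hourly_rate: float, hours: float = 730.0) -> float:
    """Cost of running `instances` always-on instances for one month."""
    return instances * hourly_rate * hours


def compare_hosting(
    models: int, models_per_instance: int, hourly_rate: float
) -> dict[str, float]:
    """Compare one endpoint per model against a shared multi-model endpoint.

    models_per_instance is an assumption; in practice it depends on model
    size, instance memory, and traffic patterns.
    """
    dedicated = monthly_cost(models, hourly_rate)
    shared_instances = math.ceil(models / models_per_instance)
    shared = monthly_cost(shared_instances, hourly_rate)
    return {"dedicated": dedicated, "shared": shared, "savings": dedicated - shared}
```

For instance, 100 models at a hypothetical rate of 1.0 per hour need 100 instances when hosted separately, but only 4 if 25 models can share an instance.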

Read the Cost Optimization whitepaper

Sustainability

SageMaker Asynchronous Inference queues incoming requests and processes them asynchronously. As a result, the endpoint can scale down to zero instances when there are no requests to process, saving compute resources while idle and helping to minimize the environmental impact of running your cloud workloads.
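
Scaling an asynchronous endpoint to zero is configured through Application Auto Scaling. The sketch below builds the arguments for `register_scalable_target` and `put_scaling_policy`, tracking the `ApproximateBacklogSizePerInstance` metric. The endpoint name, variant name, policy name, and target value are assumptions; scaling out again from zero typically needs an additional policy, which is omitted here.

```python
from typing import Any


def scale_to_zero_requests(
    endpoint_name: str,
    variant_name: str,
    max_instances: int,
    backlog_per_instance: float,
) -> tuple[dict[str, Any], dict[str, Any]]:
    """Build Application Auto Scaling arguments for an async endpoint.

    Returns (scalable_target_kwargs, scaling_policy_kwargs).
    """
    common = {
        "ServiceNamespace": "sagemaker",
        "ResourceId": f"endpoint/{endpoint_name}/variant/{variant_name}",
        "ScalableDimension": "sagemaker:variant:DesiredInstanceCount",
    }
    # MinCapacity of 0 is what allows the endpoint to run no instances.
    target = {**common, "MinCapacity": 0, "MaxCapacity": max_instances}
    policy = {
        **common,
        "PolicyName": f"{endpoint_name}-backlog-tracking",  # hypothetical name
        "PolicyType": "TargetTrackingScaling",
        "TargetTrackingScalingPolicyConfiguration": {
            "TargetValue": backlog_per_instance,
            "CustomizedMetricSpecification": {
                "MetricName": "ApproximateBacklogSizePerInstance",
                "Namespace": "AWS/SageMaker",
                "Dimensions": [{"Name": "EndpointName", "Value": endpoint_name}],
                "Statistic": "Average",
            },
        },
    }
    return target, policy
```

The two dictionaries would be passed to `boto3.client("application-autoscaling")` as `register_scalable_target(**target)` and `put_scaling_policy(**policy)`.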

Read the Sustainability whitepaper

Implementation Resources

The sample code is a starting point. It is industry validated, prescriptive but not definitive, and a peek under the hood to help you begin.

Open sample code on GitHub

Disclaimer

The sample code; software libraries; command line tools; proofs of concept; templates; or other related technology (including any of the foregoing that are provided by our personnel) is provided to you as AWS Content under the AWS Customer Agreement, or the relevant written agreement between you and AWS (whichever applies). You should not use this AWS Content in your production accounts, or on production or other critical data. You are responsible for testing, securing, and optimizing the AWS Content, such as sample code, as appropriate for production grade use based on your specific quality control practices and standards. Deploying AWS Content may incur AWS charges for creating or using AWS chargeable resources, such as running Amazon EC2 instances or using Amazon S3 storage.
