Amazon Web Services

This video provides an in-depth look at AWS infrastructure powering generative AI applications. Chetan from the EC2 team discusses how AWS is building highly performant and cost-effective compute, networking, and storage solutions to support large-scale AI model training and inference. He covers innovations like AWS Trainium chips, ultra-high bandwidth networking, and optimized GPU instances. Alexandru Costin from Adobe and Belinda Zeng from Amazon retail then share how they are leveraging AWS to build and deploy generative AI capabilities at scale. Alexandru discusses Adobe's Firefly models for image and design generation, while Belinda covers Amazon's work on semantic representations and AI-powered product search and recommendations. The speakers highlight how AWS's purpose-built infrastructure and managed services enable them to rapidly innovate with generative AI while optimizing performance and costs.

customer-stories
product-information
generative-ai
ai-ml
compute
Show 6 more

Up Next

VideoThumbnail
1:01:07

Accelerate ML Model Delivery: Implementing End-to-End MLOps Solutions with Amazon SageMaker

Nov 22, 2024
VideoThumbnail
15:58

Revolutionizing Business Intelligence: Generative AI Features in Amazon QuickSight

Nov 22, 2024
VideoThumbnail
2:51

How to Start, Connect, and Enroll Amazon EC2 Mac Instances into Jamf for Apple Mobile Device Management

Nov 22, 2024
VideoThumbnail
6:45

Grindr's Next-Gen Chat System: Leveraging AWS for Massive Scale and Security

Nov 22, 2024
VideoThumbnail
9:30

Deploying ASP.NET Core 6 Applications on AWS Elastic Beanstalk Linux: A Step-by-Step Guide for .NET Developers

Nov 22, 2024