Amazon Web Services

This video from AWS re:Invent 2023 explores the intersection of SaaS and generative AI, focusing on multi-tenant patterns and strategies. Presenters Tod Golding and James Jory discuss how to implement generative AI in SaaS environments, covering topics like tenant-specific customizations, data partitioning, pricing models, and architectural considerations. They delve into techniques such as Retrieval-Augmented Generation (RAG) and fine-tuning for creating targeted tenant experiences. The session also addresses challenges in isolation, throttling, and cost attribution specific to multi-tenant generative AI implementations. Practical examples and architectural patterns are provided to help SaaS providers integrate generative AI capabilities effectively.

general-independent-software-vendor-isv
product-information
skills-and-how-to
reinvent
industry-agnostic
Show 9 more

Up Next

VideoThumbnail
1:01:07

Accelerate ML Model Delivery: Implementing End-to-End MLOps Solutions with Amazon SageMaker

Nov 22, 2024
VideoThumbnail
15:58

Revolutionizing Business Intelligence: Generative AI Features in Amazon QuickSight

Nov 22, 2024
VideoThumbnail
9:30

Deploying ASP.NET Core 6 Applications on AWS Elastic Beanstalk Linux: A Step-by-Step Guide for .NET Developers

Nov 22, 2024
VideoThumbnail
47:39

Simplifying Application Authorization: Amazon Verified Permissions at AWS re:Invent 2023

Nov 22, 2024
VideoThumbnail
39:31

AWS re:Invent 2023: What's New in AWS Amplify for Full-Stack Web and Mobile App Development

Nov 22, 2024