Amazon Web Services

This video provides an overview of best practices for building generative AI applications on AWS, focusing on model selection. Dan Stair, an analytic specialist Solutions architect, introduces a four-part series covering the essentials of building a generative AI application. The video explains the concept of large language models, their functioning, and the typical journey of implementing them. It also discusses important building blocks like embedding vectors and vector databases, as well as patterns for improving model performance such as prompt engineering and retrieval augmented generation. The presentation highlights AWS services like SageMaker JumpStart and Amazon Bedrock for deploying and managing large language models, as well as various vector store options available on AWS.

product-information
skills-and-how-to
generative-ai
ai-ml
gen-ai
Show 2 more

Up Next

VideoThumbnail
15:58

Revolutionizing Business Intelligence: Generative AI Features in Amazon QuickSight

Nov 22, 2024
VideoThumbnail
1:01:07

Accelerate ML Model Delivery: Implementing End-to-End MLOps Solutions with Amazon SageMaker

Nov 22, 2024
VideoThumbnail
2:53:33

Streamlining Patch Management: AWS Systems Manager's Comprehensive Solution for Multi-Account and Multi-Region Patching Operations

Nov 22, 2024
VideoThumbnail
9:30

Deploying ASP.NET Core 6 Applications on AWS Elastic Beanstalk Linux: A Step-by-Step Guide for .NET Developers

Nov 22, 2024
VideoThumbnail
47:39

Simplifying Application Authorization: Amazon Verified Permissions at AWS re:Invent 2023

Nov 22, 2024