Amazon Web Services

This video provides a comprehensive guide on selecting the right Large Language Model (LLM) for GenAI applications on AWS. It walks through a Jupyter notebook demonstrating a systematic approach to LLM evaluation, starting from a specific business problem in financial analysis. The process includes shortlisting models, evaluating them using metrics like Jaccard similarity and cosine similarity, and considering factors such as accuracy, speed, and cost. The demonstration uses models like Claude V2, Cohere Command, and Llama 2, showcasing how to leverage AWS services like Amazon Bedrock and SageMaker for model deployment and evaluation. The video emphasizes the importance of tailoring the evaluation process to specific use cases and provides insights into balancing performance and cost considerations in LLM selection.

product-information
skills-and-how-to
generative-ai
ai-ml
gen-ai
Show 4 more

Up Next

VideoThumbnail
15:58

Revolutionizing Business Intelligence: Generative AI Features in Amazon QuickSight

Nov 22, 2024
VideoThumbnail
1:01:07

Accelerate ML Model Delivery: Implementing End-to-End MLOps Solutions with Amazon SageMaker

Nov 22, 2024
VideoThumbnail
9:30

Deploying ASP.NET Core 6 Applications on AWS Elastic Beanstalk Linux: A Step-by-Step Guide for .NET Developers

Nov 22, 2024
VideoThumbnail
47:39

Simplifying Application Authorization: Amazon Verified Permissions at AWS re:Invent 2023

Nov 22, 2024
VideoThumbnail
2:53:33

Streamlining Patch Management: AWS Systems Manager's Comprehensive Solution for Multi-Account and Multi-Region Patching Operations

Nov 22, 2024